Data Analyst Interview Fundamentals: Power BI, DAX, and SQL Essentials Quiz

Prepare for your next data analyst interview by reviewing essential concepts in Power BI, DAX, SQL, and data modeling. This quiz covers foundational knowledge, best practices, and scenario-based questions relevant for data analysis roles and business intelligence projects.

  1. Power BI Components

    Which platform is primarily used for creating reports and data models in Power BI?

    1. Power BI Desktop
    2. Power BI Service
    3. Power BI Mobile
    4. Power View

    Explanation: Power BI Desktop is designed for developing reports, performing data modeling, and creating DAX calculations. Power BI Service is mainly used for publishing, sharing, and collaborative features in the cloud. Power BI Mobile is intended for viewing reports on mobile devices. Power View is an older data visualization tool, not the main development environment.

  2. Data Modeling

    When building a data model in Power BI, why is the Star Schema preferred over the Snowflake Schema?

    1. It simplifies relationships and improves query performance
    2. It uses more storage space
    3. It requires more normalization
    4. It makes data structure more complex

    Explanation: The Star Schema allows for simpler relationships and often improves query speed in Power BI. Using this schema makes your model easier to understand and optimize. Snowflake Schema involves more normalization, leading to complex joins and slower performance. The other options either incorrectly attribute complexity or storage usage.

  3. DAX Basics

    For calculating a year-over-year sales comparison in Power BI, which DAX function is commonly used?

    1. SAMEPERIODLASTYEAR
    2. TOPN
    3. CONCATENATE
    4. COUNTROWS

    Explanation: SAMEPERIODLASTYEAR is a time intelligence DAX function that helps compare data from the same period in the previous year. TOPN is used for sorting and retrieving top rows, CONCATENATE joins text values, and COUNTROWS counts rows in a table. The distractors do not handle time-based comparisons.

  4. Optimizing Power BI Reports

    Which action is recommended to improve the performance of a slow Power BI report with a large dataset?

    1. Reduce columns and rows at the data source level
    2. Use only calculated columns
    3. Add multiple complex visuals
    4. Store all data in a flat table

    Explanation: Reducing unnecessary columns and rows at the data source limits the data Power BI needs to process, resulting in faster reports. Using only calculated columns can slow performance due to heavy calculation load. Adding complex visuals strains the report, making it slower. Flat tables increase complexity and are less efficient compared to optimized schemas.

  5. DAX Measures

    Which DAX function is used for ranking values, such as ranking products by sales?

    1. RANKX
    2. SUMMARIZE
    3. REMOVEFILTERS
    4. CALCULATETABLE

    Explanation: RANKX is designed to rank items, such as sales by product or region. SUMMARIZE is used for grouping data, REMOVEFILTERS adjusts filters, and CALCULATETABLE creates new tables with filter context applied. These other functions do not provide ranking capabilities.

  6. Row Level Security

    How does Row Level Security (RLS) help in Power BI reports shared with multiple users?

    1. By restricting data access based on user roles
    2. By encrypting all data
    3. By speeding up report refreshes
    4. By changing report themes

    Explanation: Row Level Security restricts what data users can view by assigning role-based filters, ensuring each user only sees authorized data. It does not encrypt data, which is managed separately, nor does it directly influence refresh speed or report themes.

  7. SQL In Practice

    Which SQL operation would you use to combine customer information from two tables in preparation for reporting?

    1. JOIN
    2. PRINT
    3. DELETE
    4. RAND

    Explanation: JOIN operations combine data from two or more tables based on a related column, essential for integrating datasets before loading into reporting tools. PRINT is for displaying output messages, DELETE removes records, and RAND generates random numbers—none of which combine table data.

  8. Data Cleaning

    Before loading data into Power BI, which step can help reduce model size and improve performance?

    1. Remove unnecessary columns
    2. Increase the number of visuals
    3. Split tables for every field
    4. Add redundant relationships

    Explanation: Eliminating unused columns reduces the data volume, making the Power BI model more efficient and faster. Increasing visuals or redundant relationships adds complexity without performance benefits, and splitting tables for every field causes unnecessary fragmentation.

  9. Time Intelligence Functions

    Which DAX function can help calculate the Year-To-Date total sales?

    1. TOTALYTD
    2. DISTINCTCOUNT
    3. FORMAT
    4. EARLIER

    Explanation: TOTALYTD calculates the running total for a measure from the start of the year to a specific date, commonly used for Year-To-Date analysis. DISTINCTCOUNT counts unique values, FORMAT changes the display format, and EARLIER is used in row contexts but does not aggregate year-to-date values.

  10. Explaining Dashboards

    When explaining a complex dashboard to a non-technical manager, what approach is most effective?

    1. Focus on business insights and use simple language
    2. Describe all underlying DAX formulas in detail
    3. Only show raw data tables
    4. Discuss the relationships in the data model extensively

    Explanation: Translating technical results into business value and using easy-to-understand language helps non-technical audiences grasp dashboard outcomes. Describing DAX formulas or raw data in detail may cause confusion, and discussing the data model's structure is often too technical for business-focused audiences.

  11. Incremental Data Refresh

    What is a main advantage of using Incremental Refresh in Power BI for large datasets?

    1. Only new or changed data is refreshed, not the entire dataset
    2. It automatically creates custom visuals
    3. It adds extra dashboards to your workspace
    4. It applies machine learning algorithms by default

    Explanation: Incremental Refresh allows Power BI to update just the new or modified records, saving time and computing resources. It does not create visuals, dashboards, or apply machine learning; its primary purpose is efficient data refresh management.

  12. Filter Functions

    If you want to ignore all filters on a column except one, which DAX function should you use?

    1. ALLEXCEPT
    2. ALL
    3. IF
    4. MINX

    Explanation: ALLEXCEPT keeps filters on specified columns but ignores others, useful for calculations that require maintaining certain contexts. ALL removes all filters, IF performs logical checks, and MINX is an iterator for finding minimums, so they are not suitable for this task.

  13. Data Visualization Best Practice

    Why is it important to remove unnecessary visuals from a Power BI dashboard?

    1. To enhance performance and avoid user confusion
    2. To increase the overall data volume
    3. To hide data errors from users
    4. To complicate the dashboard's design on purpose

    Explanation: Removing unnecessary visuals streamlines report performance and makes dashboards easier to interpret. Increasing data volume or intentionally complicating the design hinders usability, while hiding errors is not a legitimate practice for data visualization.

  14. Data Preparation

    Which SQL clause would most likely be used to summarize sales by region before importing into Power BI?

    1. GROUP BY
    2. ORDER BY
    3. EXCEPT
    4. HAVOC

    Explanation: GROUP BY allows you to aggregate data, such as calculating total sales per region, making it ready for dashboarding. ORDER BY only sorts results, EXCEPT compares two result sets, and HAVOC is not a valid SQL keyword.

  15. Object-Level Security

    How does Object-Level Security differ from Row Level Security in Power BI?

    1. Object-Level Security restricts access to entire tables or columns
    2. It filters data only at the row level
    3. It encrypts datasets automatically
    4. It enables dashboard sharing with anonymous users

    Explanation: Object-Level Security provides control over visibility of different tables or columns for users, while Row Level Security works at the individual row level. Encryption is not handled by this feature, and sharing with anonymous users is managed in other security settings.

  16. Scenario Handling

    If key data for a requested KPI is missing, what should a data analyst do?

    1. Inform stakeholders, suggest alternatives, and document limitations
    2. Ignore the issue and generate incomplete reports
    3. Guess the missing values and proceed
    4. Delete all data and restart the project

    Explanation: Communicating data issues, proposing workarounds, and transparently documenting limitations ensure stakeholders make informed decisions. Ignoring issues, making guesses, or starting over without rationale can lead to misleading insights and project problems.