Essential Data Analyst Interview Quiz

Test your knowledge of common data analyst interview questions with this beginner-friendly quiz. Covering Power BI, SQL, Excel, and Python basics, this quiz is perfect for anyone preparing for a data analyst role or seeking to assess their foundational data analysis skills.

  1. Fact vs Dimension in Power BI

    In Power BI, which type of table typically contains numeric values for analysis, such as sales amounts or quantities?

    1. Transaction log
    2. Fact table
    3. Dimension table
    4. Image table

    Explanation: A fact table stores measurable, quantitative data for analysis, like sales or profit figures, and is central to most data models. Dimension tables hold descriptive attributes, such as names or categories, and do not usually store numbers for calculations. Image tables are not standard terminology in Power BI modeling. A transaction log tracks changes but is not used directly for analysis in Power BI.

  2. Understanding DAX

    What is DAX in Power BI, and what is its primary purpose?

    1. A database storage system
    2. A formula language for data analysis
    3. A data visualization tool
    4. An Excel add-in

    Explanation: DAX (Data Analysis Expressions) is a formula language designed for creating calculations and aggregations in Power BI and related tools, such as Power Pivot in Excel and Analysis Services tabular models. It is not a visualization tool, nor does it store data like a database. It is a language built into these tools, not an Excel add-in.

  3. Handling Missing Data

    If you discover missing or blank values while preparing a dataset, which is a commonly recommended first step?

    1. Immediately delete all rows with missing values
    2. Ignore the missing data
    3. Identify and analyze the pattern of missing data
    4. Change the format of all columns

    Explanation: A sensible first step is to identify where data is missing and understand why before deciding on a course of action. Deleting rows right away may remove important information. Ignoring the issue can lead to inaccurate results, and simply changing column formats does not address missing data.
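
    As a rough illustration of what "identifying the pattern" can look like in Pandas, the sketch below counts missing values per column and pulls out the affected rows. The column names and values are made up for the example.

```python
import pandas as pd

# Hypothetical sample data; None marks a missing value.
df = pd.DataFrame({
    "region": ["North", "South", None, "East"],
    "sales": [120.0, None, 95.0, None],
    "units": [10, 8, 7, 5],
})

# Count of missing values per column.
missing_per_column = df.isna().sum()

# Share of missing values per column (0.0 to 1.0).
missing_share = df.isna().mean()

# Rows that contain at least one missing value, for closer inspection.
rows_with_missing = df[df.isna().any(axis=1)]

print(missing_per_column)
print(missing_share)
print(rows_with_missing)
```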

  4. SQL WHERE Clause Operators

    Which of the following operators can be used in the SQL WHERE clause to filter records?

    1. AND
    2. PIVOT
    3. PASTE
    4. SUM

    Explanation: The AND operator allows combining multiple conditions in the WHERE clause for more precise filtering. PIVOT is used for restructuring data, not filtering rows. SUM is an aggregate function, not an operator for filtering. PASTE is not an SQL function or operator.
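
    A minimal sketch of AND in a WHERE clause, run against an in-memory SQLite database from Python. The orders table and its contents are hypothetical.

```python
import sqlite3

# In-memory database with a hypothetical orders table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "North", 250.0), (2, "North", 80.0), (3, "South", 300.0)],
)

# Both conditions must hold for a row to be returned.
rows = conn.execute(
    "SELECT id FROM orders "
    "WHERE region = 'North' AND amount > 100 "
    "ORDER BY id"
).fetchall()
conn.close()

print(rows)
```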

  5. Purpose of SQL Joins

    What is the main purpose of using JOINs in SQL queries?

    1. To sort table columns alphabetically
    2. To encrypt database records
    3. To permanently delete a table
    4. To combine rows from two or more tables based on related columns

    Explanation: SQL JOINs are designed to combine records from multiple tables using common columns, supporting complex queries. Deleting tables is done with DROP, not JOIN. Sorting data is managed with ORDER BY, and encryption is a separate security feature.
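
    The sketch below shows an inner join on a shared key, again using an in-memory SQLite database. The customers and orders tables are invented for the example; note that the customer without orders does not appear in the result.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical tables related through orders.customer_id = customers.id.
conn.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
conn.execute("CREATE TABLE orders (id INTEGER, customer_id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "Ana"), (2, "Ben"), (3, "Cy")],
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(10, 1, 50.0), (11, 1, 20.0), (12, 2, 75.0)],
)

# Inner join: only customers with at least one matching order are returned.
joined = conn.execute(
    "SELECT c.name, o.amount "
    "FROM customers AS c "
    "JOIN orders AS o ON o.customer_id = c.id "
    "ORDER BY o.id"
).fetchall()
conn.close()

print(joined)
```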

  6. Differences Between RANK() and DENSE_RANK()

    In SQL, how does the DENSE_RANK() function differ from the RANK() function?

    1. DENSE_RANK() converts text to numbers
    2. DENSE_RANK() sums values instead of ranking
    3. DENSE_RANK() deletes duplicate rows
    4. DENSE_RANK() does not skip rank numbers after ties

    Explanation: DENSE_RANK() assigns consecutive ranks without gaps for rows with tied values, whereas RANK() inserts gaps after ties. The function doesn't sum or delete rows, nor does it change data types like converting text to numbers.
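
    To make the gap behavior concrete, here is a small sketch that ranks the same tied scores with both functions. It assumes the SQLite library bundled with Python is version 3.25 or later, which is when window functions were added; the scores table is hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE scores (name TEXT, score INTEGER)")
conn.executemany(
    "INSERT INTO scores VALUES (?, ?)",
    [("A", 90), ("B", 90), ("C", 80), ("D", 70)],
)

# A and B tie for first place. RANK() then jumps to 3,
# while DENSE_RANK() continues with 2.
ranked = conn.execute(
    "SELECT name, "
    "       RANK() OVER (ORDER BY score DESC) AS rnk, "
    "       DENSE_RANK() OVER (ORDER BY score DESC) AS dense_rnk "
    "FROM scores "
    "ORDER BY score DESC, name"
).fetchall()
conn.close()

for row in ranked:
    print(row)
```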

  7. Purpose of Pivot Tables in Excel

    What is the main use of Pivot Tables in Excel?

    1. To add background images
    2. To summarize and analyze large amounts of data
    3. To permanently filter out data
    4. To write database scripts

    Explanation: Pivot Tables are used to efficiently summarize, analyze, and explore large datasets in Excel. They do not serve to permanently filter out data or create scripts. Adding background images is unrelated to pivot table functionality.
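
    Excel Pivot Tables are built through the user interface, but the same kind of summary can be sketched in Pandas with pivot_table. This is an analogy rather than Excel itself, and the sales data is made up.

```python
import pandas as pd

# Hypothetical transaction-level data.
df = pd.DataFrame({
    "region": ["North", "North", "South", "North"],
    "product": ["A", "B", "A", "A"],
    "sales": [100, 50, 70, 30],
})

# Rows = region, columns = product, cell = total sales.
# Combinations with no data are filled with 0.
summary = pd.pivot_table(
    df,
    values="sales",
    index="region",
    columns="product",
    aggfunc="sum",
    fill_value=0,
)

print(summary)
```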

  8. VLOOKUP Usage in Excel

    Which formula in Excel would you use to search for a value in the first column of a table and return a value in the same row from another column?

    1. VLOOKUP
    2. AVERAGEIF
    3. COUNTBLANK
    4. CONCATENATE

    Explanation: VLOOKUP looks up a value in the first column and retrieves information from other columns in the same row. AVERAGEIF calculates averages based on a condition, COUNTBLANK counts blank cells, and CONCATENATE joins text, but none of these fetch data from tables like VLOOKUP.
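
    The lookup logic can be sketched in plain Python. The helper vlookup_exact below is a hypothetical illustration of an exact-match lookup, not an Excel or library function; it returns None when nothing matches, where Excel would show #N/A.

```python
def vlookup_exact(lookup_value, table, col_index):
    """Return the value in column `col_index` (1-based) of the first row
    whose first column equals `lookup_value`, or None if no row matches."""
    for row in table:
        if row[0] == lookup_value:
            return row[col_index - 1]
    return None


# Hypothetical product table: code, name, price.
products = [
    ("P001", "Keyboard", 25.0),
    ("P002", "Mouse", 12.5),
    ("P003", "Monitor", 140.0),
]

price = vlookup_exact("P002", products, 3)    # third column of the P002 row
missing = vlookup_exact("P999", products, 3)  # no such code in the table

print(price, missing)
```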

  9. Relative, Absolute, and Mixed References in Excel

    Which reference type in Excel changes both the row and column when a formula is copied to another cell?

    1. Sequential reference
    2. Relative reference
    3. Absolute reference
    4. Mixed reference

    Explanation: Relative references (for example, A1) adapt both the column and row when copied across cells, making formulas dynamic. Absolute references (for example, $A$1) keep the reference fixed using dollar signs. Mixed references (for example, A$1 or $A1) fix either the row or the column, so only one of them changes. Sequential reference is not an Excel term.

  10. Reading Files in Python

    Which Pandas function in Python is commonly used to read a CSV file into a DataFrame?

    1. load_excel
    2. read_csv
    3. importfile
    4. file_read

    Explanation: The read_csv function loads CSV files directly into DataFrames for analysis. load_excel is not a Pandas function (Excel files are read with read_excel), and there are no standard Pandas functions named importfile or file_read.
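
    A short sketch of read_csv follows. To keep it self-contained it reads from an in-memory string instead of a file on disk; with a real file you would pass its path. The CSV content is invented.

```python
import io

import pandas as pd

# Hypothetical CSV content; io.StringIO stands in for a file on disk.
csv_text = "name,score\nAna,90\nBen,85\n"

df = pd.read_csv(io.StringIO(csv_text))

print(df.shape)   # (rows, columns)
print(df.dtypes)  # column types inferred by Pandas
```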

  11. Difference between loc[] and iloc[] in Pandas

    In Pandas, which indexer would you use to select rows by label instead of by integer position?

    1. iloc[]
    2. loc[]
    3. find[]
    4. read[]

    Explanation: The loc[] indexer selects data using explicit labels, making it suitable for named rows. iloc[] selects by integer index, not by label. read[] and find[] are not valid Pandas indexers.
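
    The difference is easiest to see on a DataFrame with a non-numeric index. In this sketch (with made-up data), both indexers reach the same cell, but their slices behave differently: label slices with loc[] include the end label, while position slices with iloc[] exclude the end position.

```python
import pandas as pd

# Hypothetical data with string row labels.
df = pd.DataFrame({"score": [90, 85, 70]}, index=["ana", "ben", "cy"])

by_label = df.loc["ben", "score"]  # row label "ben", column label "score"
by_position = df.iloc[1, 0]        # second row, first column

label_slice = df.loc["ana":"ben"]  # includes both "ana" and "ben"
position_slice = df.iloc[0:1]      # includes position 0 only

print(by_label, by_position)
print(len(label_slice), len(position_slice))
```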

  12. Purpose of a DataFrame in Pandas

    What is a DataFrame in the context of Python’s Pandas library?

    1. A one-dimensional numeric array
    2. A file-reading function
    3. A two-dimensional labeled data structure
    4. A programming interface

    Explanation: A DataFrame is a table with labeled rows and columns, similar to a spreadsheet. A one-dimensional labeled array in Pandas is called a Series, and it is not limited to numeric values. File-reading functions and programming interfaces are unrelated to the definition of a DataFrame.
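
    As a small illustration with invented data, selecting a single column from a two-dimensional DataFrame yields a one-dimensional Series:

```python
import pandas as pd

# Hypothetical two-column table.
df = pd.DataFrame({"product": ["A", "B"], "price": [9.5, 12.0]})

# Selecting one column returns a Series.
prices = df["price"]

print(df.ndim, prices.ndim)
print(type(df).__name__, type(prices).__name__)
```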

  13. SQL Aggregate Functions

    Which SQL function calculates the total sum of values in a selected column?

    1. ROWID()
    2. SUM()
    3. GROUP BY
    4. INDEX()

    Explanation: The SUM() function calculates the total of all numeric values in a column. ROWID is a row identifier exposed by some databases (for example Oracle and SQLite), not an aggregate function. GROUP BY groups records but does not perform calculations itself, and INDEX() is not a standard SQL function.
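
    The sketch below runs SUM() on its own and then together with GROUP BY, using an in-memory SQLite database and a hypothetical sales table. It shows that GROUP BY only defines the groups, while SUM() does the calculation.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("North", 100.0), ("North", 50.0), ("South", 70.0)],
)

# Total over the whole column.
total = conn.execute("SELECT SUM(amount) FROM sales").fetchone()[0]

# One total per region: GROUP BY forms the groups, SUM() aggregates each one.
per_region = conn.execute(
    "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
).fetchall()
conn.close()

print(total)
print(per_region)
```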

  14. Purpose of Row-Level Security (RLS) in Power BI

    What does Row-Level Security (RLS) allow you to control in Power BI reports?

    1. The total size of your report file
    2. The refresh timer
    3. Which rows of data each user can view
    4. The types of charts available

    Explanation: RLS restricts data access so different users see only the rows relevant to them, enhancing security and personalization. It does not affect report file size, available chart types, or report refresh timing.

  15. Creating a Calculated Column in Power BI

    In Power BI, where is a calculated column created, and how does it differ from a measure?

    1. It is only used for conditional formatting
    2. It replaces all existing columns
    3. It is used to add images to a report
    4. It is added to a table and calculated row by row, unlike a measure, which is calculated based on filter context

    Explanation: A calculated column is created at the table level with values calculated for each row, whereas measures are dynamic and computed in the context of report filters. Conditional formatting does not create columns, and calculated columns do not replace existing columns or add images to reports.

  16. Power BI Bookmarks

    What is a Bookmark in Power BI and how is it typically used?

    1. A snapshot of a report’s current state for easy navigation or presentation
    2. A type of data encryption
    3. A tool for linking data sources
    4. A method to schedule report refresh

    Explanation: A Bookmark saves a specific view of a report, allowing users to return to that state or create interactive presentations. It does not connect data sources or provide encryption. Scheduling refresh is managed by separate options in Power BI.