Welcome to your ultimate Data Analytics & Data Science Interview Questions Prep!

This page is designed to help you practice and master the most important interview questions on Python, Pandas, NumPy, SQL, and Power BI. Whether you’re preparing for your first analytics role or aiming for a data science career upgrade, these carefully selected MCQs and scenario-based questions will sharpen your problem-solving skills and boost your confidence for real interviews.

Which of the following is used to handle missing values in Python?
Which attribute shows the number of rows and columns of a Pandas DataFrame?
In Pandas, which function is used to read a CSV file?
Which of the following is NOT a visualization in Power BI?
What is the output of bool([])?
What is the output of sorted([3,1,2])?
Which method gives the first 5 rows of a DataFrame?
Which function is used to merge two DataFrames?
What will df['col1'].unique() return?
Which Pandas function is used to combine DataFrames vertically?
In Power BI, which type of join is used by default in relationships?
NumPy (Conditional Replacement): You have a 2D NumPy array. You want to replace all negative numbers with 0, but only in rows where the row sum is negative. Which approach is correct? (5 marks)
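One way to reason about the conditional-replacement scenario above is with a broadcast boolean mask. The following is a minimal sketch, not necessarily the answer the quiz expects; the function name `zero_negatives_in_negative_rows` and the sample array are hypothetical and chosen only for illustration.

```python
import numpy as np


def zero_negatives_in_negative_rows(arr):
    """Return a copy of arr where negative entries are set to 0,
    but only in rows whose sum is negative (hypothetical helper)."""
    out = arr.copy()                          # leave the input untouched
    row_is_negative = out.sum(axis=1) < 0     # one boolean per row
    # Broadcast the per-row flag across columns and combine it
    # with the element-wise "is negative" test.
    mask = row_is_negative[:, None] & (out < 0)
    out[mask] = 0
    return out


# Illustrative data: only the middle row has a negative sum (-3).
example = np.array([[1, -2, 3],
                    [-4, 2, -1],
                    [-1, -1, 5]])
result = zero_negatives_in_negative_rows(example)
print(result)
```

Here the first and third rows keep their negative values because their sums (2 and 3) are not negative, while the middle row becomes `[0, 2, 0]`.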
Which method writes a DataFrame to Excel?
Which of the following creates an array of zeros in NumPy?
Which of the following is NOT a data visualization library in Python?
What is the output of len(set([1,2,2,3,3,3]))?
Python (Frequency Count): You have a list of numbers with millions of elements. Which method is the most efficient way to find the top 3 most frequent numbers? (5 marks)
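For the frequency-count scenario above, a common standard-library approach is `collections.Counter` with `most_common`. This is a sketch under that assumption; the helper name `top_three` and the sample list are hypothetical.

```python
from collections import Counter


def top_three(numbers):
    """Return the three most frequent values as (value, count) pairs.

    Counter counts every element in a single pass; most_common(3)
    then selects the three largest counts without sorting all keys.
    """
    return Counter(numbers).most_common(3)


# Small illustrative list; the same call works on much larger inputs.
sample = [4, 1, 4, 2, 4, 1, 3, 1, 4, 2]
top = top_three(sample)
print(top)  # [(4, 4), (1, 3), (2, 2)]
```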
Which data type does range(5) return in Python 3?
What does len("Data Science") return?
What does df.isnull().sum() return?
Which keyword is used to define a function in Python?
Which of the following is mutable in Python?
Which language is used in Power BI for data modeling?
What will print(type(lambda x: x+1)) return?
In SQL, which clause is used to filter aggregated results?
Pandas (Handling Missing Values): You want to replace missing values in some columns with the mean and in others with the mode. Which Pandas approach is most efficient? (5 marks)
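One possible approach to the missing-values scenario above is to build a dictionary of per-column fill values and pass it to `DataFrame.fillna` in a single call. The sketch below assumes a numeric column filled with its mean and a categorical column filled with its mode; the column names `age` and `city` and the data are made up for illustration.

```python
import pandas as pd

# Illustrative frame with one missing value in each column.
df = pd.DataFrame({
    "age": [25.0, None, 35.0],       # numeric: fill with the mean
    "city": ["Pune", None, "Pune"],  # categorical: fill with the mode
})

# Map each column to its own fill value.
fill_values = {
    "age": df["age"].mean(),         # mean() skips missing values by default
    "city": df["city"].mode()[0],    # mode() returns a Series; take the first entry
}

# A single fillna call applies the right value to each column.
filled = df.fillna(value=fill_values)
print(filled)
```

Passing a dictionary keeps the per-column logic in one place and avoids looping over columns with separate `fillna` calls.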
Which of the following is NOT a step in the data analysis process?
What type of data is the number of students in a class?
Which SQL keyword is used to remove duplicate values from a result set?
Which module in Python is used for regular expressions?
Which method sorts values in a Pandas DataFrame?
Which of the following SQL queries retrieves all students with marks greater than 80?
Which Pandas function replaces missing values with a given number?
Which of the following is NOT a Power BI component?
SQL (Top N per Group): You need the top 3 customers by total sales in each region. Which SQL query is correct? (5 marks)
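A typical pattern for the top-N-per-group scenario above is to aggregate first and then rank within each group using a window function. The sketch below runs such a query through Python's built-in `sqlite3` module so it can be tried without a database server. The table name `sales`, its columns, and the rows are hypothetical, and window functions require SQLite 3.25 or later.

```python
import sqlite3

# In-memory database with a hypothetical sales table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, customer TEXT, amount REAL)")
rows = [
    ("East", "A", 100), ("East", "A", 50), ("East", "B", 120),
    ("East", "C", 90), ("East", "D", 10),
    ("West", "E", 200), ("West", "F", 30),
]
conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)

query = """
WITH totals AS (
    -- Step 1: total sales per customer within each region
    SELECT region, customer, SUM(amount) AS total_sales
    FROM sales
    GROUP BY region, customer
),
ranked AS (
    -- Step 2: rank customers inside each region by total sales
    SELECT region, customer, total_sales,
           ROW_NUMBER() OVER (
               PARTITION BY region
               ORDER BY total_sales DESC
           ) AS rn
    FROM totals
)
-- Step 3: keep the top 3 per region
SELECT region, customer, total_sales
FROM ranked
WHERE rn <= 3
ORDER BY region, rn
"""
top_customers = conn.execute(query).fetchall()
for row in top_customers:
    print(row)
conn.close()
```

`ROW_NUMBER()` returns exactly three rows per region even when totals tie; `RANK()` or `DENSE_RANK()` could be substituted if tied customers should all be kept.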
What is the shape of np.array([[1,2,3],[4,5,6]])?
Power BI (YTD vs LYTD): You want to calculate Year-to-Date (YTD) sales compared to last year's YTD (LYTD). Which DAX formula is correct? (5 marks)
Which of the following is NOT a Python data structure?
What is the output of list("data")?
What will df.iloc[0, 1] return?
Which function is used to create pivot tables in Pandas?
Which of the following is used for exception handling in Python?
What does df.describe() do?
Which SQL clause is used for sorting?
Which operation does np.dot(A, B) perform?