Welcome to your ultimate Data Analytics & Data Science Interview Questions Prep!

This page is designed to help you practice and master the most important interview questions from Python, Pandas, NumPy, SQL, and Power BI. Whether you’re preparing for your first analytics role or aiming for a data science career upgrade, these carefully selected MCQs and scenario-based questions will sharpen your problem-solving skills and boost your confidence for real interviews.

Which data type does range(5) return in Python 3?
Which function is used to create pivot tables in Pandas?
Which module in Python is used for regular expressions?
What will print(type(lambda x: x+1)) return?
What will df.iloc[0, 1] return?
What is the shape of np.array([[1,2,3],[4,5,6]])?
Which method writes a DataFrame to Excel?
Python (Frequency Count) You have a list of numbers with millions of elements. Which method is the most efficient way to find the top 3 most frequent numbers? 5 Marks
What will df['col1'].unique() return?
What is the output of sorted([3,1,2])?
What does df.describe() do?
What type of data is the number of students in a class?
Which function replaces missing values with a given number?
Which SQL keyword is used to remove duplicate values from a result set?
In SQL, which clause is used to filter aggregated results?
Which attribute shows the number of rows and columns in Pandas?
Which of the following is mutable in Python?
Which keyword is used to define a function in Python?
Which function is used to merge two DataFrames?
Which operation does np.dot(A, B) perform?
Pandas (Handling Missing Values) You want to replace missing values in some columns with the mean and in others with the mode. Which Pandas approach is most efficient? 5 Marks
Which method sorts values in Pandas DataFrame?
In Pandas, which function is used to read a CSV file?
Which of the following is used for exception handling in Python?
Which of the following creates an array of zeros in NumPy?
Which of the following is NOT a data visualization library in Python?
What is the output of len(set([1,2,2,3,3,3]))?
NumPy (Conditional Replacement) You have a 2D NumPy array. You want to replace all negative numbers with 0, but only for rows where the row sum is negative. Which is correct? 5 Marks
In Power BI, which type of join is used by default in relationships?
Which of the following is not a Python data structure?
Which of the following is NOT a step in the data analysis process?
What does len("Data Science") return?
What does df.isnull().sum() return?
What is the output of list("data")?
Which SQL clause is used for sorting?
Which of the following is NOT a visualization in Power BI?
Power BI (YTD vs LYTD) You want to calculate Year-to-Date (YTD) sales compared to last year. Which DAX formula is correct? 5 Marks
Which method gives the first 5 rows of a DataFrame?
Which of the following is used to handle missing values in Python?
Which Pandas function is used to combine DataFrames vertically?
Which language is used in Power BI for data modeling?
Which of the following is NOT a Power BI component?
Which of the following SQL queries retrieves all students with marks greater than 80?
SQL (Top N per Group) You need the top 3 customers by total sales in each region. Which SQL query is correct? 5 Marks
What is the output of bool([])?