Part 2/8:
Understanding Database and Data Warehouse
One of the foundational questions is the difference between a database and a data warehouse.
Database: An organized collection of structured data used for transactional operations—typically supporting CRUD (Create, Read, Update, Delete) operations.
Data Warehouse: A centralized repository optimized for analytical processing, integrating data from multiple sources for reporting and decision-making.
Handling Missing Values in Data Sets
Missing values often present a challenge in data analysis. Candidates may be asked how they would manage such values.
Answer: Candidates can mention strategies such as:
Dropping missing values if they are not significant.
Using Python’s
DataFrame.dropna()
function.