When a dataset contains empty cells in the 'Age' column, which term best describes those empty values?
Which data cleaning step involves deleting repeated rows, such as having the exact same customer information appear twice in a table?
In a sales dataset, a single entry showing a sales value much higher than the others could indicate what?
If a column meant to store 'Yes' or 'No' contains values like 'Ye' or 'N0', what data issue is this?
What is it called when you adjust numerical values to a similar range, such as converting all ages to values between 0 and 1?
Transforming text labels like 'red', 'green', and 'blue' in a color column into numbers is best known as what?
Which method can you use to ensure all features contribute equally to analysis, such as giving equal weight to 'height' in cm and 'weight' in kg?
If some dates are formatted as '01/02/2023' and others as '2023-02-01', what type of problem does this present?
Suppose a sensor records temperature as 20, 21, 500, 22, 23; what is the term for unusually high or low values that may distort analysis?
If you fill empty cells in a 'salary' column with the average salary from the data, which technique are you using?