The process of removing corrupt or incomplete data and verifying ranges (e.g., a 13th grade appearing in a K-12 student record table, or someone's date of birth falling in the future) is called what?
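The checks described in this question can be sketched in a few lines of Python. This is only an illustration: the field names `grade` and `date_of_birth`, the function name `find_invalid_records`, and the 0 to 12 grade range (with 0 standing for kindergarten) are assumptions, not part of the question.

```python
from datetime import date


def find_invalid_records(records, today):
    """Return (record, reason) pairs for rows that fail basic checks.

    Assumed layout: each record is a dict with an integer "grade"
    (0 = kindergarten, 1-12 = grades) and a "date_of_birth" date.
    """
    problems = []
    for rec in records:
        grade = rec.get("grade")
        dob = rec.get("date_of_birth")
        if grade is None or dob is None:
            # Incomplete data: a required field is missing.
            problems.append((rec, "incomplete"))
        elif not 0 <= grade <= 12:
            # Range check: K-12 has no 13th grade.
            problems.append((rec, "grade out of range"))
        elif dob > today:
            # Range check: nobody is born in the future.
            problems.append((rec, "date of birth in the future"))
    return problems
```

Flagging the problem rows, rather than silently deleting them, leaves the decision about how to correct each one to a person.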
Question 6
6.
What do you understand by the term "data filtering"?
Question 7
7.
Question 8
8.
Question 9
9.
Question 10
10.
Convert 215 (base 10) to its binary form.
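One common way to do this conversion by hand is repeated division by 2, reading the remainders from last to first. A minimal Python sketch of that method follows; the function name `to_binary` is chosen here for illustration only.

```python
def to_binary(n):
    """Convert a non-negative integer to a binary string by repeated division by 2."""
    if n == 0:
        return "0"
    bits = []
    while n > 0:
        bits.append(str(n % 2))  # the remainder is the next bit, lowest first
        n //= 2                  # integer-divide and continue with the quotient
    return "".join(reversed(bits))  # remainders are read from last to first


print(to_binary(215))  # 11010111
```

As a check, 128 + 64 + 16 + 4 + 2 + 1 = 215, which matches the bits set in 11010111.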
Question 11
11.
Question 12
12.
Convert the three-letter word "AND" to binary.
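The question does not name a character encoding. Assuming 8-bit ASCII (A = 65, N = 78, D = 68), the conversion can be sketched in Python as follows; the function name `text_to_binary` is hypothetical.

```python
def text_to_binary(text):
    """Encode text as ASCII and write each byte as 8 binary digits."""
    # Assumption: ASCII encoding with one byte per character.
    return " ".join(format(byte, "08b") for byte in text.encode("ascii"))


print(text_to_binary("AND"))  # 01000001 01001110 01000100
```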
Question 13
13.
Question 14
14.
A major telecom company decides to collect all posts and comments on Twitter from the past two years in which the company name was tagged or mentioned within the post content. What kinds of information can the company get from this Twitter data? List at least three.
Question 15
15.
Question 16
16.
What is a primary benefit of data cleaning in statistical analysis?
Enhances data visualization aesthetics
Reduces bias and improves accuracy of results
Speeds up computer processing time
Increases data storage capacity
Which aspect of data cleaning is most crucial for ensuring reliable analysis?
Handling missing values and incomplete records
Changing the font of data entries
Creating backup copies of the dataset
Renaming all variables
Data cleaning helps improve data quality by
Deleting all outliers automatically
Adding random values to fill gaps
Converting all numbers to text format
Identifying and correcting inconsistencies in the dataset
Which is one way in which number systems are abstract?
The same number can be represented by different number representations
A number system can be best represented by one number system
Symbols in their abstract form cannot be used to add, subtract, multiply, or divide
They use constants
The process of grouping your data into categories based on common features or values is called what?
Data filtering
Data classifying
Data cleaning
Data mining
What is the information about the creation date and author of digital documents or files on your computer called?
Metadata
Content
Context
Media Assets
What does the amount of data compression an algorithm can achieve depend on?
A large file size
Several patterns in the data
No repeating parts of the file being compressed
Small file size
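To explore this question empirically, the following Python sketch compresses two inputs of the same length with the standard-library `zlib` module: one made of a repeated pattern and one made of pseudo-random bytes. The inputs, the seed, and the variable names are assumptions chosen for illustration.

```python
import random
import zlib

rng = random.Random(0)  # fixed seed so the run is repeatable

repetitive = b"ABCD" * 250                               # 1000 bytes, one repeated pattern
noisy = bytes(rng.getrandbits(8) for _ in range(1000))   # 1000 pseudo-random bytes

repetitive_size = len(zlib.compress(repetitive))
noisy_size = len(zlib.compress(noisy))

print("repetitive input compresses to", repetitive_size, "bytes")
print("noisy input compresses to", noisy_size, "bytes")
```

Both inputs start at the same size, so any difference in the compressed sizes comes from the structure of the data rather than from how large the file is.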
How can an organization begin the process of analyzing data?
By following an iterative development process
By establishing measurements the data should show
By developing hypotheses and questions to test
By checking to see if the data matches previously collected data
Metadata can be used to
Provide updates to the data
Sort the data
Brand the data
Help find and organize the data
What could be a reason to perform additional research on correlations found through data analysis?
There may not be a cause-and-effect relationship between the correlated variables
A single source may not provide enough data for a conclusion
To understand the relationship between the variables
All of the above
When is sampling needed?
Sampling is used to store analog data
Sampling is used when converting digital data to analog data
Sampling is used when converting analog data to digital data
Sampling is used to provide an approximation to real world data
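As a rough illustration of what sampling does, the Python sketch below measures a continuous signal at evenly spaced instants and keeps only those values. The 5 Hz sine wave, the 40 Hz sample rate, and the names `sample_signal` and `tone` are assumptions made for this example.

```python
import math


def sample_signal(signal, duration_s, sample_rate_hz):
    """Measure a continuous function of time at evenly spaced instants."""
    count = int(duration_s * sample_rate_hz)
    return [signal(i / sample_rate_hz) for i in range(count)]


def tone(t):
    """A 5 Hz sine wave standing in for an analog signal."""
    return math.sin(2 * math.pi * 5 * t)


samples = sample_signal(tone, duration_s=1.0, sample_rate_hz=40)
print(len(samples))  # 40
```

The continuous wave has a value at every instant, while the list holds only 40 measurements of it, so the stored data is an approximation whose fidelity depends on the sample rate.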