AP CSP DATA ANALYSIS 1
Last updated 3 months ago
12 questions
1
You were recently hired as a data analyst and need to calculate the total amount of money the store will receive if it sells all of the available Technical books. Which columns in the database can be ignored while still allowing you to perform this calculation?
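As a minimal sketch of the calculation described above, assuming each record has fields named `genre`, `price`, and `quantity` (hypothetical names and sample values; the actual database columns are not shown here), the total could be computed along these lines:

```python
# Hypothetical sample records; field names and values are illustrative only.
books = [
    {"title": "Book A", "genre": "Technical", "price": 50.0, "quantity": 4},
    {"title": "Book B", "genre": "Sci-Fi", "price": 20.0, "quantity": 10},
    {"title": "Book C", "genre": "Technical", "price": 30.0, "quantity": 2},
]


def total_revenue(records, genre):
    """Sum price * quantity over every record of the given genre."""
    return sum(r["price"] * r["quantity"] for r in records if r["genre"] == genre)


technical_total = total_revenue(books, "Technical")
print(technical_total)
```

Any field the function never reads plays no part in the result.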
1
Which of the following Boolean expressions would identify books that need reordering (quantity less than 10) AND cost more than $40?
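The compound condition in this question can be sketched in Python as follows. The parameter names `quantity` and `price` are assumptions chosen for illustration; the answer choices may use different names or notation.

```python
def needs_costly_reorder(quantity, price):
    """Return True only when quantity is below 10 AND price is above 40."""
    return quantity < 10 and price > 40


print(needs_costly_reorder(5, 45.0))   # both conditions hold
print(needs_costly_reorder(10, 45.0))  # quantity is not below 10
```

Because the two conditions are joined with `and`, a book that satisfies only one of them is not selected.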
1
If the data is stored in a list of dictionaries, which of the following would correctly access the price of the book called "Data Structures"?
Assume the list is named bookstore and the program is written in Python.
The list of dictionaries looks like this:
[ { "author": "J.K Williams", "Title": "Digital Dreams", ...}, { ...}, ...]
Here, ... means "and so on".
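As a rough sketch of how a list of dictionaries is indexed, assume the price is stored under a key named `"price"` (hypothetical; the elided fields are not shown above). The second record and all price values below are made up for illustration.

```python
# Illustrative data only: the "price" key, the prices, and the second
# record's author are assumptions, not taken from the real data set.
bookstore = [
    {"author": "J.K Williams", "Title": "Digital Dreams", "price": 24.99},
    {"author": "Example Author", "Title": "Data Structures", "price": 59.99},
]


def price_of(records, title):
    """Return the price of the first record whose Title matches, else None."""
    for book in records:
        if book["Title"] == title:
            return book["price"]
    return None


# A list is indexed by position, a dictionary by key.
print(bookstore[1]["price"])
print(price_of(bookstore, "Data Structures"))
```

Indexing by position only works if the position of the record is already known; searching by title does not depend on the order of the list.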
1
Data filtering question
A program needs to find all Sci-Fi books that are in stock (quantity > 0) and cost less than $30. What is the minimum number of comparisons needed for each book record?
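The filter described above could be sketched as follows, assuming hypothetical field names `genre`, `quantity`, and `price`. Each record is tested against three conditions. Python's `and` short-circuits, so later conditions are skipped for a record once an earlier one fails.

```python
def is_match(book):
    """True for an in-stock Sci-Fi book that costs less than $30."""
    return (
        book["genre"] == "Sci-Fi"      # condition 1: genre
        and book["quantity"] > 0       # condition 2: in stock
        and book["price"] < 30         # condition 3: price limit
    )


# Illustrative records only.
inventory = [
    {"genre": "Sci-Fi", "quantity": 3, "price": 19.99},
    {"genre": "Sci-Fi", "quantity": 0, "price": 19.99},
    {"genre": "Technical", "quantity": 5, "price": 12.50},
    {"genre": "Sci-Fi", "quantity": 2, "price": 30.00},
]

matches = [book for book in inventory if is_match(book)]
print(len(matches))
```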
1
A team of researchers wants to create a program to analyze the amount of pollution reported in roughly 3000 counties across the United States. The program is intended to combine county data and then process the data. Which of the following is MOST likely to be a challenge in creating the program?
1
You are an analyst for an ISD and need to analyze the same data from two different campuses. The data is already clean. Before you combine the data, what do you need to do?
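One check that is commonly made before merging two sources is that both use the same field names. The sketch below illustrates that idea only; it is an assumption about what a pre-merge check might look like, and the function name, field names, and sample values are all hypothetical.

```python
def same_fields(dataset_a, dataset_b):
    """True when every record in both datasets has the same set of field names."""
    field_sets = {frozenset(record) for record in dataset_a + dataset_b}
    return len(field_sets) <= 1


# Illustrative records only.
campus_a = [{"Student_ID": 1, "Total_Days": 180}]
campus_b = [{"Student_ID": 2, "Total_Days": 180}]
campus_c = [{"student id": 3, "days": 180}]  # different field names

if same_fields(campus_a, campus_b):
    combined = campus_a + campus_b
    print(len(combined))
```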
1
A well-known high school in College Station has a dataset with the following information:
Student_ID,
Total_Days,
Number_Days_Present,
Number_Days_Absent,
Attendance_Rate
Which of the following CANNOT be directly determined without additional data?
1
A well-known high school in College Station has a dataset with the following information:
Student_ID,
Total_Days,
Number_Days_Present,
Number_Days_Absent,
Attendance_Rate
Something is wrong with this dataset: it has one redundant data field that would not cause an error but consumes unnecessary memory. Can you spot it?
In a minimum of 3 full sentences, describe your observations, explain why you think the field is redundant, and state what you would advise.
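When looking for redundancy, it can help to test whether a field can be recomputed from the others. The sketch below does that for one made-up record. It assumes that Attendance_Rate is stored as the fraction of days present, which the dataset description does not confirm.

```python
# Illustrative record only; the values are made up.
record = {
    "Student_ID": 1,
    "Total_Days": 180,
    "Number_Days_Present": 171,
    "Number_Days_Absent": 9,
    "Attendance_Rate": 0.95,
}


def derived_absent(r):
    """Days absent recomputed from the total and the days present."""
    return r["Total_Days"] - r["Number_Days_Present"]


def derived_rate(r):
    """Attendance rate recomputed as days present divided by total days."""
    return r["Number_Days_Present"] / r["Total_Days"]


print(derived_absent(record) == record["Number_Days_Absent"])
print(abs(derived_rate(record) - record["Attendance_Rate"]) < 1e-9)
```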
1
A large data set contains information about all students majoring in computer science in colleges across the United States. The data set contains the following information about each student:
The student's gender
The state in which the student attends college
The student's grade point average (GPA) on a 4.0 scale
Which of the following could be answered by analyzing only the information in the data set?
1
On a video sharing platform, each video has the following metadata:
Upload date,
Duration,
Uploader's username,
View count,
Like count.
The video itself contains the actual footage. Which task would specifically require analyzing the video data rather than just the metadata?
1
An image sharing website stores photos (data) and metadata including: photographer name, camera model, location tags, and upload time. Which task would necessitate analyzing the image data itself?
1
A music streaming service stores songs (data) along with metadata including:
Artist name,
Release date,
Play count,
Genre tags, and
User playlist additions.
Which analysis would require examining the actual song data?