QA-123-4407

.pdf

School

University of the People *

*We aren’t endorsed by this school

Course

4407

Subject

Information Systems

Date

May 8, 2024

Type

pdf

Pages

8

Uploaded by AgentPower13325 on coursehero.com

************************************** UNIT 1 ********************************************* ******* DATA MINING ****** Q: Data Mining can be said to be a process designed to detect patterns in data sets. A: TRUE Q: The objective of (blank) is to identify valid novel and potentially useful, and understandable correlations and patterns in existing data. A: Data Mining ****** UNSUPERVISED ***** Q: In unsupervised learning, the learning algorithm must be trained using data attributes that have been paired with an outcome variable. A: FALSE Q: Which of the following is an example of an unsupervised learning algorithm? A: K-Means Q: Unsupervised learning involves building a statistical model for predicting, or estimating an output based upon one or more inputs. A: FALSE ******* SUPERVISED ******* Q: In a supervised learning model, Bias refers to the error that is introduced from the assumptions of the data analyst. A: FALSE Q: Regression analysis involves developing a model where one or more inputs are used to predict an output variable. Regression, in this context, represents what kind of learning. A: Supervised learning ***** Machine Learning Types ****** Q: Which of the following is NOT a machine learning technique? A: Linear Components Analytics Q: A predication outcome variable must be categorical? A: FALSE
Q: Assuming that we have a data set that includes sales data for every customer over the course of several years and we wanted to use this data to predict future sales which would be the most appropriate technique to investigate? A: Regression Q: Assume that you had a variety of data including medical history, diet, heredity factors on individuals who developed cancer and you wanted to use this data to determine whether a person is likely to develop cancer. Which technique would be the most promising to start with? A: Classification
************************************** UNIT 2 ********************************************* Question: True or False: Information Retrieval or text analytics is NOT a form of data mining. Answer: FALSE **************************************** Question: NoSQL databases provide greater performance at the expense of availability. Answer: TRUE **************************************** Question: The snowflake schema differs from the star schema in that the table holding the dimensional data are normalized. Answer: TRUE **************************************** Question: Map/Reduce refers to an optimized approach to process SQL queries. Answer: FALSE **************************************** Question: Which of the following is an example of a NOSQL Analytics database? Answer: cassandra **************************************** Question: The term OLAP stands for? Answer: Online Analytical Processing **************************************** Question: What does ETL stand for? Answer: extract transform load **************************************** Question: In a data warehouse, unidimensional data is stored in a star schema format. Answer: FALSE ****************************************
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help