Data Mining for Business Analytics: Concepts, Techniques, and Applications with XLMiner
3rd Edition
ISBN: 9781118729274
Author: Galit Shmueli, Peter C. Bruce, Nitin R. Patel
Publisher: WILEY
expand_more
expand_more
format_list_bulleted
Concept explainers
Expert Solution & Answer
Chapter 2, Problem 4P
Explanation of Solution
Given: Refer to table 2.6 as the randomly selected sample from a large bank
To find: The next step for future indication and inspect the data carefully.
Solution:
In the randomly sampled table 2.6, the column named personal loan has 4 records with value 1, which indicates that the solicitation for the personal loan was accepted...
Expert Solution & Answer
Want to see the full answer?
Check out a sample textbook solutionStudents have asked these similar questions
Draw the ER/EER model for the given scenario.
Manufacturer have unique name, an address, and a phone number.
Products have a product_id, a name and price.
Products may be classified into electronics and grocery.
Electronic products have description, type attributes.
Groceries have fssai_number, date_of manufacture, date_of_expiry.
Each product is made by one manufacturer, and different manufacturers may produce
different products..
Customers are identified by their unique social security number. They have email
addresses, and physical addresses. Several customers may live at the same (physical)
address, but we assume that no two customers have the same email address.
An order has a unique order number, and a date. An order is placed by one customer. For
each order, there are one or more products ordered, and there is a quantity for each
product on the order
Model evaluation
Create a predictions variable using your fitted model and the test dataset; call it y_pred. Then get the accuracy score of your predictions and save it in a variable called accuracy. Finally get the confusion matrix for your predictions and save it in a variable called confusion_mat.
Code:
y_pred = Noneaccuracy = Noneconfusion_mat = None
How can you tell if a model meets the requirements for proportionality and additivity?
Chapter 2 Solutions
Data Mining for Business Analytics: Concepts, Techniques, and Applications with XLMiner
Knowledge Booster
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, computer-science and related others by exploring similar questions and additional content below.Similar questions
- There are two alternative methods of representing an employee’s optional (single) phone number. The first is to use a nullable attribute and the second is to use a weak entity. 1. Given a requirement that a large number of the employees (> 80%) are assigned a phone number, which is the preferred method and why? 2. Given a requirement that only a small number of employees (< 10%) are assigned phone numbers, which is the preferred method and why? 3. How would the requirement of ‘multiple phone numbers can be assigned to an Employee’ affect your decision? Your answers should consider the performance tradeoff between number of database operations per query and record’s size.arrow_forwardFirst, perform the following tasks: • Make a linear regression model with all the features in the dataset. Use train_test_split to keep 20% of the data for testing. • Use your model to predict values for test set and print the predictions for the first 10 instances of the test data and compare them with actual values. • Print the coefficient values and their corresponding feature name (e.g. age 43, bmi 200, .) • Note that you can access feature_names from diabetes dataset directly • Calculate training-MSE, testing-MSE, and R-squared value. Compare the two models. Did using all available features improve the performance? In [ ]: # Your code goes here In [ ]: # Your code goes herearrow_forwardA derived attribute can be stored in a table or calculated on each occasion that it is required. An example of a derived attribute is your Grade Point Average which is calculated by adding together a score for each unit (e.g., HD=4, D=3, C=2, P=1, N=0) and dividing that by the number of units you have completed. Explain how the decision as to whether the Grade Point Average would be stored in a table or calculated would be made. You are administering a database in a large retail company that records sales at each of the company’s stores. Tuning the database to improve performance by adding indexes and optimising query execution is an important ongoing aspect of the physical database design. Which attributes in a database such as this (apart from the primary key) are likely to BENEFIT from being indexed? Which attributes would you generally avoid indexing, and why?arrow_forward
- Create an ER model to represent the data used by the library.The library provides books to borrowers. Each book is described by title, edition, and year of publication, and is uniquely identified using the ISBN. Each borrower is described by his or her name and address and is uniquely identified using a borrower number. The library provides one or more copies of each book and each copy is uniquely identified using a copy number, status indicating if the book is available for loan, and the allowable loan period for a given copy. A borrower may loan one or many books, and the date each book is loaned out and is returned is recorded. Loan number uniquely identifies each book loan.arrow_forwardER modeling is one approach in creating a data model. There are several approaches in creating a data model (bottom up, top down, inside out, and mix). 1. explain each of these approaches!2. Then explain when we use top down, bottom up, inside out and mix? what is the function of each of the approaches?arrow_forwardThe analytical method relies heavily on the use of data modelling. In order to build a model, we'll need some data.arrow_forward
- Question # 2: Consider the following data Classification Model where YACT is your actual observation and YPRED is the model prediction value. You have to use the data and find CONFUSION MATRIX, and using confusion matrix compute the value of the following errors: Precision value of each class Recall value of each class F-measure value of each class Model Accuracy Model Precision value Model Recall value Model F-measure value YACT R G B B R G R R G G B B R R G B R B G R B G B R R B B G G G YPRED R R R B B B R R G B B R R G G G B B R R B G B R B G B G R Rarrow_forwardStudy the details below and create ERD diagram for each question below. Drone / Video: a Drone may take many Videos; a Video is taken by one Drone. A Video instance is optional for a Drone; a Drone instance is required for a Video. A Facility may be recorded by many Videos; a Video records one Facility. A Video instance is optional for a Facility; a Facility instance is required for a Video. A Video may be the basis of many Analyses; an Analysis is based on one Video. An Analysis instance is optional for a Video; a Video instance is required for an Analysisarrow_forwardConsider the following scenario: you are interested in researching the connection that exists between the number of 'likes' one gets on Facebook and the number of 'friends' one has on the social media site. Specifically, you want to know whether there is a relationship between the two variables. Please describe the statistical strategy that you would employ using just your own words. Thank you.arrow_forward
- The only way to tell if a model is right or wrong is to see how well it works on test data. tell me more; give me more information? Explain?arrow_forwardDraw an ER model; An author writed a book . Every book has a title, ISBN, year, and price. each book may be authored by one or more author and each author may write one , or more than one book. For authors the database keeps the name, address and URL of their homepage. Each book has a publisher. The database keeps the name, address , and phone number adn the URL of their wedbsite. Each publishers publishes one or more books and each book is published by exactly one publisher. The warehouse stocks several books. Each warehouse has a code, address and phone number. The database records the number of copies stocked at various warehouse.arrow_forwardUse a simple example to explain the iterative nature of the model calibration process.arrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- Database System ConceptsComputer ScienceISBN:9780078022159Author:Abraham Silberschatz Professor, Henry F. Korth, S. SudarshanPublisher:McGraw-Hill EducationStarting Out with Python (4th Edition)Computer ScienceISBN:9780134444321Author:Tony GaddisPublisher:PEARSONDigital Fundamentals (11th Edition)Computer ScienceISBN:9780132737968Author:Thomas L. FloydPublisher:PEARSON
- C How to Program (8th Edition)Computer ScienceISBN:9780133976892Author:Paul J. Deitel, Harvey DeitelPublisher:PEARSONDatabase Systems: Design, Implementation, & Manag...Computer ScienceISBN:9781337627900Author:Carlos Coronel, Steven MorrisPublisher:Cengage LearningProgrammable Logic ControllersComputer ScienceISBN:9780073373843Author:Frank D. PetruzellaPublisher:McGraw-Hill Education
Database System Concepts
Computer Science
ISBN:9780078022159
Author:Abraham Silberschatz Professor, Henry F. Korth, S. Sudarshan
Publisher:McGraw-Hill Education
Starting Out with Python (4th Edition)
Computer Science
ISBN:9780134444321
Author:Tony Gaddis
Publisher:PEARSON
Digital Fundamentals (11th Edition)
Computer Science
ISBN:9780132737968
Author:Thomas L. Floyd
Publisher:PEARSON
C How to Program (8th Edition)
Computer Science
ISBN:9780133976892
Author:Paul J. Deitel, Harvey Deitel
Publisher:PEARSON
Database Systems: Design, Implementation, & Manag...
Computer Science
ISBN:9781337627900
Author:Carlos Coronel, Steven Morris
Publisher:Cengage Learning
Programmable Logic Controllers
Computer Science
ISBN:9780073373843
Author:Frank D. Petruzella
Publisher:McGraw-Hill Education