
Computer Networking: A Top-Down Approach (7th Edition)
7th Edition
ISBN: 9780133594140
Author: James Kurose, Keith Ross
Publisher: PEARSON
expand_more
expand_more
format_list_bulleted
Question
20
Data Mining
Suppose that we have a sample data for four people and the data has four attributes. The data is given in the following table.
Name |
Test-1 (nominal) |
Test-2 (nominal) |
Test-3 (nominal) |
Jack |
Code A |
M |
S |
Joe |
Code B |
M |
Y |
Jimmy |
Code C |
N |
M |
Jill |
Code A |
F |
Y |
Jin |
Code B |
F |
M |
Use the three nominal attributes to calculate the distance between Joe and Jill.
Expert Solution

This question has been solved!
Explore an expertly crafted, step-by-step solution for a thorough understanding of key concepts.
This is a popular solution
Trending nowThis is a popular solution!
Step by stepSolved in 4 steps

Knowledge Booster
Similar questions
- Task 4: Given the data set with two dimensions X and Y: Calculate every step and not using libraries X Y HE 1435 4232 Use a linear regression method to calculate the parameters a and ß where y = a + Bx. (Show every step and not using libraries)arrow_forwardUse the sacramento.csv file to complete the following assignment. Create a file, sacramento.py, that loads the .csv file and runs a logistic regression. The regression should predict whether or not a house has 1 or more than one bathroom based on beds, sqft, and price, in that order.You will need to create a new variable from baths, and it should make it such that those observations of 1 bath correspond to a value of 0, and those with more than 1 bath correspond to a 1. Make sure to add a constant using sm.add_constant(X) Your file should print the results in this way: print(mod.params.round(2)) print(mod.pvalues.round(2)) print('The smallest p-value is for sqft')arrow_forwardWrite your codes in a single m.file for each question Write commend to each line for explaining the linearrow_forward
- Analysts at a start-up company are analyzing 35 months of sales data. They partition the data (the first 20 observations are assigned to the training set; the most recent 15 months are in the test set). The only independent variable is T (month number, ranging from 1 to 35). Five models (polynomials of order 1 - 5) are fit to the data. The first order is just the linear model; the 2nd order polynomial is the quadratic model; order 3 is the cubic model, etc. In each case the model is fit on the training data, and scored on both the training and test data sets. The results are below. Based on this output, which is the best predictive model? Metrics AE RMSE MAE SSE Metrics AE RMSE MAE SSE Model 2 Model 3 Model 5 Model 4 Model 1 1 <0.000001 0.955978 0.792802 18.277907 1 -1.034550 1.424155 1.208991 30.423248 Training Data Scoring Models (Polynomial of order 1-5) 4 2 <0.000001 0.928791 0.759583 17.253086 3 <0.000001 <0.000001 0.928295 0.855568 0.761951 0.652212 17.234646 14.639962 Test Data…arrow_forwarda player plays in a single team in a given season. however, the player may change the teams from a season to the next. for a player we keep the name (we want to search based on the last name), the date of birth, the jersey number (which can change when the player changes the team). a coach is hired by a club in a given season. a coach coaches one or more teams per seasons, but at most 3. organize the persons associated with the league in a hierarchy. I need the ERD Model and the relationships ?arrow_forwardThe Python code for step 7 is needed. Step 6 has been attached for data referencearrow_forward
- A data race is defined as follows.arrow_forwardLeast Squares Regression 1. In MATLAB Write a function to calculate the slope and intercept for a best-fit line using the least-squares linear regression method and to calculate the R2 value. Your inputs should be arrays of X- and Y-values. Your outputs should be the slope and intercept and the R2 value. Sectionarrow_forwardPEAR: sort the data according to the employees salary (the last column). Use the Schwartzian transform method.arrow_forward
- Please label each part!!arrow_forwardUse the sacramento.csv file to complete the following assignment. Create a file, sacramento.py, that loads the .csv file and runs a logistic regression. The regression should predict whether or not a house has 1 or more than one bathroom based on beds, sqft, and price, in that order.You will need to create a new variable from baths, and it should make it such that those observations of 1 bath correspond to a value of 0, and those with more than 1 bath correspond to a 1. Make sure to add a constant using sm.add_constant(X) Your file should print the results in this way: print(mod.params.round(2)) print(mod.pvalues.round(2)) print('The smallest p-value is for sqft') Please use sm.add_constant(X)! In Jupyter Notebook please! Please use the sys module!arrow_forward
arrow_back_ios
arrow_forward_ios
Recommended textbooks for you
- Computer Networking: A Top-Down Approach (7th Edi...Computer EngineeringISBN:9780133594140Author:James Kurose, Keith RossPublisher:PEARSONComputer Organization and Design MIPS Edition, Fi...Computer EngineeringISBN:9780124077263Author:David A. Patterson, John L. HennessyPublisher:Elsevier ScienceNetwork+ Guide to Networks (MindTap Course List)Computer EngineeringISBN:9781337569330Author:Jill West, Tamara Dean, Jean AndrewsPublisher:Cengage Learning
- Concepts of Database ManagementComputer EngineeringISBN:9781337093422Author:Joy L. Starks, Philip J. Pratt, Mary Z. LastPublisher:Cengage LearningPrelude to ProgrammingComputer EngineeringISBN:9780133750423Author:VENIT, StewartPublisher:Pearson EducationSc Business Data Communications and Networking, T...Computer EngineeringISBN:9781119368830Author:FITZGERALDPublisher:WILEY

Computer Networking: A Top-Down Approach (7th Edi...
Computer Engineering
ISBN:9780133594140
Author:James Kurose, Keith Ross
Publisher:PEARSON

Computer Organization and Design MIPS Edition, Fi...
Computer Engineering
ISBN:9780124077263
Author:David A. Patterson, John L. Hennessy
Publisher:Elsevier Science

Network+ Guide to Networks (MindTap Course List)
Computer Engineering
ISBN:9781337569330
Author:Jill West, Tamara Dean, Jean Andrews
Publisher:Cengage Learning

Concepts of Database Management
Computer Engineering
ISBN:9781337093422
Author:Joy L. Starks, Philip J. Pratt, Mary Z. Last
Publisher:Cengage Learning

Prelude to Programming
Computer Engineering
ISBN:9780133750423
Author:VENIT, Stewart
Publisher:Pearson Education

Sc Business Data Communications and Networking, T...
Computer Engineering
ISBN:9781119368830
Author:FITZGERALD
Publisher:WILEY