MATLAB: An Introduction with Applications
6th Edition
ISBN: 9781119256830
Author: Amos Gilat
Publisher: John Wiley & Sons Inc
expand_more
expand_more
format_list_bulleted
Question
Problem:1 A) Crazy Dave, a well-known baseball analyst, wants to determine which variables are important in predicting a team’s wins in a given season. He has collected data related to wins, earned run average (ERA), and runs scored for the 2009 season (stored BB2009). Develop a model to predict the number of wins based on ERA and runs scored.
- State the multiple regression equation.
- Interpret the meaning of the slopes in this equation.
- Predict the number of wins for a team that has an ERA of 4.50 and has scored 750 runs.
- Perform a residual analysis on the results and determine
- Is there a significant relationship between number of wins and the two independent variables (ERA and runs scored) at the 0.05 level of significance?
- Determine the p-value in (e) and interpret its meaning.
- Interpret the meaning of the coefficient of multiple determination in this problem.
- At the 0.05 level of significance, determine whether each independent variable makes a significant contribution to the regression model. Indicate the most appropriate regression model for this set of data.
- Determine the p-values in (i) and interpret their meaning.
- Construct a 95% confidence
interval estimate of the - population slope between wins and ERA.
- Compute and interpret the coefficients of determination.
- Which is more important in predicting wins—pitching, as measured by ERA, or offense, as measured by runs scored? Explain.
Team League Wins Runs Scored Hits Allowed Walks Allowed Saves Errors E.R.A. Baltimore 0 64 741 1633 546 31 90 5.15 Boston 0 95 872 1494 530 41 82 4.35 Chicago White Sox 0 79 724 1438 507 36 113 4.14 Cleveland 0 65 773 1570 598 25 97 5.06 Detroit 0 86 743 1449 594 42 88 4.29 Kansas City 0 65 686 1486 600 34 116 4.83 Los Angeles Angels 0 97 883 1513 523 51 85 4.45 Minnesota 0 86 817 1542 466 48 76 4.50 New York Yankees 0 103 915 1386 574 51 86 4.26 Oakland 0 75 759 1486 523 38 105 4.26 Seattle 0 85 640 1359 534 49 105 3.87 Tampa Bay 0 84 803 1421 515 41 98 4.33 Texas 0 87 764 1432 531 45 106 4.38 Toronto 0 75 798 1509 551 25 76 4.47 Arizona 1 70 720 1470 525 36 124 4.42 Atlanta 1 86 735 1399 530 38 96 3.57 Chicago Cubs 1 83 707 1329 586 40 105 3.84 Cincinnati 1 78 673 1420 577 41 89 4.18 Colorado 1 92 804 1427 528 45 87 4.22 Florida 1 87 772 1425 601 45 106 4.29 Houston 1 74 643 1521 546 39 78 4.54 Los Angeles Dodgers 1 95 780 1265 584 44 83 3.41 Milwaukee 1 80 785 1498 607 44 98 4.83 New York Mets 1 70 671 1452 616 39 97 4.45 Philadelphia 1 93 820 1479 489 44 76 4.16 Pittsburgh 1 62 636 1491 563 28 73 4.59 St. Louis 1 91 730 1407 460 43 96 3.66 San Diego 1 75 638 1422 603 45 94 4.37 San Francisco 1 88 657 1268 584 41 88 3.55 Washington 1 59 710 1533 629 33 143 5.00
Problem:1 B) Suppose that in addition to using ERA to predict the number of wins, Crazy Dave wants to include the league (0=American, 1=National) as an independent variable. Develop a model to predict wins based on ERA and league.
- State the multiple regression equation.
- Interpret the slopes in (a).
- Predict the number of wins for a team with an ERA of 4.50 in the American League. Construct a 95% confidence interval estimate for all teams and a 95% prediction interval for an individual team.
- Perform a residual analysis on the results
Expert Solution
This question has been solved!
Explore an expertly crafted, step-by-step solution for a thorough understanding of key concepts.
This is a popular solution
Trending nowThis is a popular solution!
Step by stepSolved in 3 steps
Knowledge Booster
Similar questions
- Researchers at a large nutrition and weight management company are trying to build a model to predict a person’s body fat percentage from an array of variables. A variables selection method is used to build a regression model. SAS output for the final model is given in photo. Question: What percentage of the variation in percent body fat remains unexplained, even after introducing weight and abdomen circumference into the model, and then also determine the interpretation of the slope for weight?arrow_forwardWhy the regression line is a straight line rather than a curved line?arrow_forwardStatistics Canada produces consumer price indexes for several different categories. Shown here are the percentage changes in consumer price indexes over a period of 22 years for durable goods, semidurable goods, and nondurable goods. Also displayed are the percentage changes in total goods price index. Use these data and multiple regression to develop a model that attempts to predict the total goods index by the other three variables. Comment on the result of this analysis. Year TotalGoods DurableGoods SemidurableGoods NondurableGoods 1 85.7 89.4 91.9 83.2 2 86.4 90.4 92.5 83.8 3 87.8 92.5 93.4 85.2 4 86.8 96.0 94.2 81.6 5 88.4 99.0 94.9 82.9 6 89.9 100.8 95.4 84.4 7 91.2 101.6 97.0 86.0 8 91.4 101.5 97.7 86.1 9 93.1 101.5 99.2 88.4 10 96.0 100.8 99.6 93.3 11 98.4 100.1 100.3 97.4 12 100 100 100 100 13 101.9 99.2…arrow_forward
- The U.S. Postal Service is attempting to reduce the number of complaints made by the public against its workers. To facilitate this task, a staff analyst for the service regresses the number of complaints lodged against an employee last year on the hourly wage of the employee for the year. The analyst ran a simple linear regression in SPSS. The results are shown below. The current minimum wage is $5.15. If an employee earns the minimum wage, how many complaints can that employee expect to receive? Is the regression coefficient statistically significant? How can you tell?arrow_forwardWrite down the fitted regression equation. If in the family, the primary caregiver is the biological mother, whose income is $50,000 and the youth live with the family most of the time. What predicted youth absent days in school?arrow_forwardResearchers are interested in predicting the height of a child based on the heights of their mother and father. Data were collected, which included height of the child (height ), height of the mother ( mothersheight), and height of the father (fathersheight ). The initial analysis used the heights of the parents to predict the height of the child (all units are inches). The results of the analysis, a multiple regression, are presented below. . regress height mothersheight fathersheight Source Model Residual Total height mothersheight fathersheight _cons SS df 208.008457 314.295372 37 2 104.004228 8.49446952 MS 522.303829 39 13.3924059 Coef. Std. Err. .6579529 .1474763 .2003584 .1382237 9.804327 12.39987 t P>|t| 4.46 0.000 C 0.156 0.79 0.434 Number of obs = F( 2, 37) = Prob > F R-squared Adj R-squared Root MSE = .3591375 -.0797093 -15.32021 = 40 12.24 0.0001 0.3983 0.3657 2.9145 [95% Conf. Interval] .9567683 .4804261 34.92886 What is the predicted height for a child born to a mother…arrow_forward
- - X Wins and ERA Earned run Wins, x average, y 20 2.79 18 3.31 17 2.65 16 3.83 14 3.94 12 4.27 11 3.78 9 5.18 Print Donearrow_forwardThe number of Master’s degrees granted at a particular university is given for selected years in the table below. Year 2005 2010 2015 2020 Number of Master’s Degrees 58 69 76 83 Use the linear regression equation for this data to predict the number of Master’s degrees that this university will grant in the year 2021. Blank 1. Calculate the answer by read surrounding text. Round to the nearest whole number of Master's degrees.arrow_forwardCalculate two lines of regression and calculate a linear regression equation to model the data given below: years (1980,1985,1990,1995,2000) Enrolment ( 21,25,29,39,47)arrow_forward
- A researcher is investigating possible explanations for deaths in traffic accidents. He examined data from 2000 for each of the 52 cities randomly selected in the US. The variables were death and income. Deaths: The number of deaths in traffic accidents per cityIncome: The median income per city The researcher ran a simple linear regression model: Deaths = Bo+B1(Income). Results shown in photo below. Question: Please help me better understand how to use results from photo to find value of R-squared of this simple linear regression model.arrow_forwardThe table below shows the average weekly wages (in dollars) for state government employees and federal government employees for 10 years. Construct and interpret a 99% prediction interval for the average weekly wages of federal government employees when the average weekly wages of state government employees is $925. The equation of the regression line isarrow_forwardUsed cars 2010 Vehix.com offered several used ToyotaCorollas for sale. The following table displays the ages ofthe cars and the advertised prices. a) Make a scatterplot for these data.b) Do you think a linear model is appropriate? Explain.c) Find the equation of the regression line. d) Check the residuals to see if the conditions for infer-ence are met. Age (yr) Price ($) Age (yr) Price ($)1 15988 6 99951 13988 6 119882 14488 7 89903 10995 8 94883 13998 8 89954 13622 9 59904 12810 10 41005 9988 12 2995arrow_forward
arrow_back_ios
arrow_forward_ios
Recommended textbooks for you
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman
MATLAB: An Introduction with Applications
Statistics
ISBN:9781119256830
Author:Amos Gilat
Publisher:John Wiley & Sons Inc
Probability and Statistics for Engineering and th...
Statistics
ISBN:9781305251809
Author:Jay L. Devore
Publisher:Cengage Learning
Statistics for The Behavioral Sciences (MindTap C...
Statistics
ISBN:9781305504912
Author:Frederick J Gravetter, Larry B. Wallnau
Publisher:Cengage Learning
Elementary Statistics: Picturing the World (7th E...
Statistics
ISBN:9780134683416
Author:Ron Larson, Betsy Farber
Publisher:PEARSON
The Basic Practice of Statistics
Statistics
ISBN:9781319042578
Author:David S. Moore, William I. Notz, Michael A. Fligner
Publisher:W. H. Freeman
Introduction to the Practice of Statistics
Statistics
ISBN:9781319013387
Author:David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:W. H. Freeman