Applied Statistics in Business and Economics
Applied Statistics in Business and Economics
5th Edition
ISBN: 9780077837303
Author: David Doane, Lori Seward Senior Instructor of Operations Management
Publisher: McGraw-Hill Education
bartleby

Videos

Textbook Question
Book Icon
Chapter 13, Problem 38CE

Note: Exercises marked * are based on optional material.

Instructions for Data Sets: Choose one of the data sets AK below or as assigned by your instructor. Only the first three and last three observations are shown for each data set. In each data set, the dependent variable (response) is the first variable. Choose the independent variables (predictors) as you judge appropriate. Use a spreadsheet or a statistical package (e.g., MegaStat or Minitab) to perform the necessary regression calculations and to obtain the required graphs. Write a concise report answering questions 13.25 through 13.41 (or a subset of these questions assigned by your instructor). Label sections of your report to correspond to the questions. Insert tables and graphs in your report as appropriate. You may work with a partner if your instructor allows it.

If you did not already do so, request leverage statistics. Are any observations influential? Explain.

Expert Solution & Answer
Check Mark
To determine

Find leverage statistics.

Identify any of the observations are influential.

Answer to Problem 38CE

The leverage statistics are,

ObservationSales/SqFtPredictedResidualLeverage
1702505.5378196.46220.0659
2210388.0933–178.0930.0818
3365419.7986–54.79860.0789
4443419.201723.798280.0458
5399336.133962.866050.0798
6265524.3932–259.3930.0816
7572365.8614206.13860.0655
8642491.9392150.06080.0935
9461422.989238.01080.0225
10639458.365180.6350.0703
11484502.2794–18.27940.0715
12581466.1341114.86590.0478
13268432.9745–164.9740.0586
14573497.559675.440420.2737
15586525.230660.769440.1168
16369398.5007–29.50070.0584
17351498.6047–147.6050.0985
18458429.387128.612860.0535
19987614.5091372.49090.1820
20357454.3592–97.35920.0429
21406417.2942–11.29420.0250
22681391.1612289.83880.0493
23368492.4983–124.4980.2093
24304460.1672–156.1670.0604
25394415.2689–21.26890.0913
26562486.6875.319970.0580
27495423.781671.218360.0942
28310388.496–78.4960.1363
29373422.8679–49.86790.0227
30236345.16–109.160.1516
31413406.29046.7095890.1565
32625543.407581.592520.1197
33274397.4102–123.410.0526
34543558.9323–15.93230.1372
35179297.105–118.1050.0794
36375361.730813.269220.0837
37329433.9038–104.9040.0659
38297430.0182–133.0180.0682
39323455.7566–132.7570.0800
40469404.89964.1010.0291
41353497.4495–144.4490.0837
42380491.0586–111.0590.0696
43398408.7628–10.76280.0353
44312318.6083–6.608270.0574
45452432.440919.559150.0731
46699362.4679336.53210.0617
47367347.570419.429610.0801
48432380.885651.114380.0736
49367355.486311.513680.0922
50401381.55919.441020.0432
51414481.2256–67.22560.0375
52481428.100652.899390.0183
53538415.7548122.24520.0271
54330359.279–29.2790.0356
55250438.5112–188.5110.0532
56292396.9591–104.9590.0582
57517411.7635105.23650.0231
58552470.100581.899450.0275
59387361.769925.230090.0832
60427408.302218.697770.0631
61454497.6884–43.68840.0887
62512441.105270.894830.0793
63345375.7731–30.77310.1071
64234334.17–100.170.0622
65348333.453914.546130.1051
66348458.6665–110.6660.1285
67295315.655–20.6550.1077
68361376.5859–15.58590.0450
69468232.9942235.00580.2319
70404393.759410.240590.1052
71246373.6202–127.620.1022
72340403.9505–63.95050.1144
73401413.2786–12.27860.0619
74327316.562210.437850.1045

The observations 14, 19, 23 and 69 are considered to have higher leverage values.

The influential observation is 23.

Explanation of Solution

Calculation

The given information is that, the dataset of ‘Noodles & Company Sales, Seating, and Demographic data’ contains n=74 observations. The response variable is ‘annual sales per square foot’, there are k=5 predictor variables ‘Interior Seat Count, Patio Seat Count, Median HH Income, Median Age of Population, % with Bachelor's Degree’. The considered level of significance is 0.05.

Software procedure:

Step by step procedure to obtain regression output using MegaStat software is given as,

  • • Choose MegaStat >Correlation/Regression>Regression Analysis.
  • • SelectInput ranges, enter the variable range for ‘Seats-Inside, Seats-Patio, MedIncome, MedAge, BachDeg%’ as the column of X, Independent variable(s)
  • • Enter the variable range for ‘Sales/SqFt’ as the column of Y, Dependent variable.
  • • In Options> Residuals chooseDiagnostics and influential residuals.
  • • Click OK.

Output using MegaStatsoftware is given below:

Applied Statistics in Business and Economics, Chapter 13, Problem 38CE , additional homework tip  1

Influential observation:

The influential observation has a great effect on the parameters of the regression line when it is removed from the data set.

The influential observations can be identified using the leverage values. If the observation have the high leverage value, that is any leverage statistic is greater than value of 2(k+1)n, k denotes the number of predictors and n denotes the number of observations, then remove the observation from the data set redo the regression analysis, if the regression statistic changes significantly then the observation is considered as influential observation.

Substitute, n=74,k=5 in the formula,

2(k+1)n=2(5+1)74=1274=0.1622

The leverage statistics greater than 0.1622 are, 0.274 corresponding to observation 14, 0.182 corresponding to observation 19, 0.209 corresponding to observation 23 and 0.232 corresponding to observation 69

The observations 14, 19, 23 and 69 are considered to have higher leverage values.

Regression conclusion including all observations:

Let β1 is the parameter for the predictor seats-inside, β2 is the parameter for the predictor seats-Patio, β3 is the parameter for the predictor median income, β4 is the parameter for the predictor median age of population, Let β5 is the parameter for the predictor % with Bachelor's Degree.

The p-value for predictor seats-inside is 0.0733.

The p-value for predictor seats-patio is 0.2350.

The p-value for predictor MedIncome is 0.0589.

The p-value for predictor MedAge is 0.9972.

The p-value for predictor BachDeg% is 0.0015.

Null hypothesis:

H0:βj=0;j=1,2,3,4,5

The predictor variable j is not related to annual sales.

Alternative hypothesis:

H1:βj0

The predictor variable j is related to annual sales.

Rejection rules:

  • • If p-value is less than the level of significance then the null hypothesis is rejected. The predictor is significant.
  • • If p-value is greater than the level of significance then the null hypothesis is not rejected. The predictor is not significant.

Conclusion for seats-inside:

The p-value for predictor seats-inside is 0.0733.

The level of significance is 0.05.

The p-value is greater than the level of significance.

That is, p-value(=0.0733)>α(=0.05).

The null hypothesis is not rejected.

The predictor variable seats-inside is not related to annual sales.

The predictor seats-inside is not significant.

Conclusion for seats-patio:

The p-value for predictor seats-patio is 0.2350.

The level of significance is 0.05.

The p-value is greater than the level of significance.

That is, p-value(=0.2350)>α(=0.05).

The null hypothesis is not rejected.

The predictor variable seats-patio is not related to annual sales.

The predictor seats-patio is not significant.

Conclusion for median income:

The p-value for predictor median income is 0.0589.

The level of significance is 0.05.

The p-value is greater than the level of significance.

That is, p-value(=0.0589)>α(=0.05).

The null hypothesis is not rejected.

The predictor variable median income is not related to annual sales.

The predictor median income is not significant.

Conclusion for median age:

The p-value for predictor median age of population is 0.9972.

The level of significance is 0.05.

The p-value is greater than the level of significance.

That is, p-value(=0.9972)>α(=0.05).

The null hypothesis is not rejected.

The predictor variable median age of population is not related to annual sales.

The predictor median age of population is not significant.

Conclusion for ‘% with Bachelor's Degree’:

The p-value for predictor ‘% with Bachelor's Degree’ is 0.0015.

The level of significance is 0.05.

The p-value is less than the level of significance.

That is, p-value(=0.0015)<α(=0.05).

The null hypothesis is rejected.

The predictor variable ‘% with Bachelor's Degree’ is related to annual sales.

The predictor ‘% with Bachelor's Degree’of population is significant.

The p-value for ‘% with Bachelor's Degree’ indicates predictor significance at α=0.05. All the other predictor variables are not significant.

Regression analysis by removing the observation 14:

Software procedure:

Step by step procedure to obtain regression equation using MegaStat software is given as,

  • • Choose MegaStat >Correlation/Regression>Regression Analysis.
  • • SelectInput ranges, enter the variable range for ‘Seats-Inside, Seats-Patio, MedIncome, MedAge, BachDeg%’ as the column of X, Independent variable(s)
  • • Enter the variable range for ‘Sales/SqFt’ as the column of Y, Dependent variable.
  • • Click OK.

Output using MegaStatsoftware is given below:

Applied Statistics in Business and Economics, Chapter 13, Problem 38CE , additional homework tip  2

It is clear that the predictor variable ‘BachDeg%’ with p-value 0.0015 is significant at α=0.05 by removing observation 13. All the remaining predictors are not significant. Hence removing 14 did not change the significance of the predictors.

Regression analysis by removing the observation 19:

Software procedure:

Step by step procedure to obtain regression equation using MegaStat software is given as,

  • • Choose MegaStat >Correlation/Regression>Regression Analysis.
  • • SelectInput ranges, enter the variable range for ‘Seats-Inside, Seats-Patio, MedIncome, MedAge, BachDeg%’ as the column of X, Independent variable(s)
  • • Enter the variable range for ‘Sales/SqFt’ as the column of Y, Dependent variable.
  • • Click OK.

Output using MegaStatsoftware is given below:

Applied Statistics in Business and Economics, Chapter 13, Problem 38CE , additional homework tip  3

It is clear that the predictor variable ‘BachDeg%’ with p-value 0.0016 is significant at α=0.05 by removing observation 19. All the remaining predictors are not significant. Hence removing 19 did not change the significance of the predictors.

Regression analysis by removing the observation 23:

Software procedure:

Step by step procedure to obtain regression equation using MegaStat software is given as,

  • • Choose MegaStat >Correlation/Regression>Regression Analysis.
  • • SelectInput ranges, enter the variable range for ‘Seats-Inside, Seats-Patio, MedIncome, MedAge, BachDeg%’ as the column of X, Independent variable(s)
  • • Enter the variable range for ‘Sales/SqFt’ as the column of Y, Dependent variable.
  • • Click OK.

Output using MegaStatsoftware is given below:

Applied Statistics in Business and Economics, Chapter 13, Problem 38CE , additional homework tip  4

It is clear that the predictor variables ‘MedIncome’ with p-value 0.0496 and ‘BachDeg%’ with p-value 0.0016 are significant at α=0.05 because p-value is less than level of significance, by removing observation 23. All the remaining predictors are not significant. Hence removing 23has changed the significance for predictor ‘Median income’.

Regression analysis by removing the observation 69:

Software procedure:

Step by step procedure to obtain regression equation using MegaStat software is given as,

  • • Choose MegaStat >Correlation/Regression>Regression Analysis.
  • • SelectInput ranges, enter the variable range for ‘Seats-Inside, Seats-Patio, MedIncome, MedAge, BachDeg%’ as the column of X, Independent variable(s)
  • • Enter the variable range for ‘Sales/SqFt’ as the column of Y, Dependent variable.
  • • Click OK.

Output using MegaStatsoftware is given below:

Applied Statistics in Business and Economics, Chapter 13, Problem 38CE , additional homework tip  5

It is clear that the predictor variable ‘BachDeg%’ with p-value 0.0017 is significant at α=0.05 by removing observation 69. All the remaining predictors are not significant. Hence removing 69 did not change the significance of the predictors.

The significance for the regression statistics has changed when the observation 23 is removed from the data set. Hence, the influential observation is 23.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!

Chapter 13 Solutions

Applied Statistics in Business and Economics

Ch. 13.3 - Prob. 11SECh. 13.3 - A regression model to predict Y, the state...Ch. 13.4 - A regression of accountants starting salaries in a...Ch. 13.4 - An agribusiness performed a regression of wheat...Ch. 13.5 - Prob. 15SECh. 13.5 - A regression model to predict the price of...Ch. 13.5 - Prob. 17SECh. 13.5 - Prob. 18SECh. 13.6 - Prob. 19SECh. 13.6 - Prob. 20SECh. 13.7 - Prob. 21SECh. 13.7 - Using the Metals data, construct a correlation...Ch. 13.8 - Prob. 23SECh. 13.8 - Which violations of regression assumptions, if...Ch. 13 - (a) List two limitations of simple regression. (b)...Ch. 13 - (a) What does represent in the regression model?...Ch. 13 - Prob. 3CRCh. 13 - Prob. 4CRCh. 13 - Prob. 5CRCh. 13 - Prob. 6CRCh. 13 - Prob. 7CRCh. 13 - Prob. 8CRCh. 13 - Prob. 9CRCh. 13 - (a) State the formula for the standard error of...Ch. 13 - (a) What is a categorical predictor? (b) Why is a...Ch. 13 - Prob. 12CRCh. 13 - Prob. 13CRCh. 13 - (a) What is multicollinearity? (b) What are its...Ch. 13 - Prob. 15CRCh. 13 - (a) State the formula for a variance inflation...Ch. 13 - Prob. 17CRCh. 13 - Prob. 18CRCh. 13 - Prob. 19CRCh. 13 - Prob. 20CRCh. 13 - (a) Name two ways to detect autocorrelated...Ch. 13 - (a) What is a lurking variable? How might it be...Ch. 13 - Instructions for Data Sets: Choose one of the data...Ch. 13 - Note: Exercises marked are based on optional...Ch. 13 - Prob. 27CECh. 13 - Note: Exercises marked are based on optional...Ch. 13 - Note: Exercises marked are based on optional...Ch. 13 - Prob. 30CECh. 13 - Prob. 31CECh. 13 - Prob. 32CECh. 13 - Prob. 33CECh. 13 - Prob. 34CECh. 13 - Prob. 35CECh. 13 - Note: Exercises marked are based on optional...Ch. 13 - Note: Exercises marked are based on optional...Ch. 13 - Note: Exercises marked are based on optional...Ch. 13 - Prob. 39CECh. 13 - Prob. 40CECh. 13 - Prob. 41CECh. 13 - In a model of Fords quarterly revenue TotalRevenue...Ch. 13 - In a study of paint peel problems, a regression...Ch. 13 - A hospital emergency room analyzed n = 17,664...Ch. 13 - Prob. 45CECh. 13 - A researcher used stepwise regression to create...Ch. 13 - A sports enthusiast created an equation to predict...Ch. 13 - An expert witness in a case of alleged racial...Ch. 13 - Prob. 50CECh. 13 - Prob. 51CECh. 13 - Prob. 52CECh. 13 - Which statement is correct concerning one-factor...Ch. 13 - Prob. 2ERQCh. 13 - Prob. 3ERQCh. 13 - Prob. 4ERQCh. 13 - Prob. 5ERQCh. 13 - Prob. 6ERQCh. 13 - Prob. 7ERQCh. 13 - Prob. 8ERQCh. 13 - Prob. 9ERQCh. 13 - Prob. 10ERQCh. 13 - Prob. 11ERQCh. 13 - Prob. 12ERQCh. 13 - Prob. 13ERQCh. 13 - Prob. 14ERQCh. 13 - Prob. 15ERQ
Knowledge Booster
Background pattern image
Statistics
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Recommended textbooks for you
Text book image
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Text book image
Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt
Text book image
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Text book image
Functions and Change: A Modeling Approach to Coll...
Algebra
ISBN:9781337111348
Author:Bruce Crauder, Benny Evans, Alan Noell
Publisher:Cengage Learning
Hypothesis Testing using Confidence Interval Approach; Author: BUM2413 Applied Statistics UMP;https://www.youtube.com/watch?v=Hq1l3e9pLyY;License: Standard YouTube License, CC-BY
Hypothesis Testing - Difference of Two Means - Student's -Distribution & Normal Distribution; Author: The Organic Chemistry Tutor;https://www.youtube.com/watch?v=UcZwyzwWU7o;License: Standard Youtube License