Introduction To Statistics And Data Analysis
6th Edition
ISBN: 9781337793612
Author: PECK, Roxy.
Publisher: Cengage Learning,
expand_more
expand_more
format_list_bulleted
Concept explainers
Question
Chapter 14.1, Problem 6E
a.
To determine
Estimate the ecology score.
b.
To determine
Explain the change in prediction with the given situation.
c.
To determine
Calculate the estimated mean difference in ecology score for men and women.
d.
To determine
Interpret the coefficient of
e.
To determine
Comment on the coding of ideology and social class variables.
Suggest a better way of incorporating the given two variables into the model.
Expert Solution & Answer
Want to see the full answer?
Check out a sample textbook solutionStudents have asked these similar questions
Is CEO compensation related to a company's performance?
To test whether CEO compensation and a company's stock performance are related, a financial analyst collected data on
12
randomly selected, publicly traded companies. For each company, the analyst looked at two variables: the percent change in stock price over the past five years (which she denoted
x
) and the percent change in CEO compensation over the past five years (which she denoted
y
). For these
12
companies, the least-squares regression equation relating the two variables was
=y+−0.1650.046x
, and the standard error of the slope of this least-squares regression line was approximately
0.028
.
Using her information, test for a significant linear relationship between these two variables by doing a hypothesis test regarding the population slope
β1
. (Assume that the variable
y
follows a normal distribution for each value of
x
and that the other regression assumptions are satisfied.) Use the…
The relationship between total cholesterol (milligrams per deciliter) and BMI (Ratio of weight in kilograms to height in metres squared) of 20 participants
is shown in the scatterplot below along with the least squares regression line.
Which of the following statements is correct?
a) The relationship between total cholesterol and BMI is linear as can be seen by the random scatter of the data above and below the least squares regression line.
Both variables are metric and therefore it is appropriate to use Pearson's correlation to measure the linear association between the two variables.
b) The relationship between total cholesterol and BMI is non-linear and since both variables are metric it is appropriate to use Pearson's correlation to measure the linear association between the two variables.
c) The relationship between total cholesterol and BMI is non-linear as can be seen by the patterning of points around the least squares regression line and therefore it is not…
According to an article, one may be able to predict an individual's level of support for ecology based on demographic and ideological characteristics. The multiple
regression model proposed by the authors was the following.
y = 3.60-.01.x₁ +.01.x2-.07x3+.12x4+.02xs-.04x6-.01x7.04x8-.02xg+e
The variables are defined as follows.
y = ecology score (higher values indicate a greater concern for ecology)
x₁ = age times 10
x₂ = income (in thousands of dollars)
x3 = gender (1 = male, 0 = female)
X4 = race (1 = white, 0 = nonwhite)
X5 = education (in years)
x6 = ideology (4 = conservative, 3 = right of center, 2 = middle of the road, 1 = left of center, and 0 = liberal)
X7 = social class (4 = upper, 3 = upper middle, 2 = middle, 1 = lower middle, 0 = lower)
x8 = postmaterialist (1 if postmaterialist, 0 otherwise)
x9 = materialist (1 if materialist, O otherwise)
(a) Suppose you knew a person with the following characteristics: a 30 year old, white female with a college degree (20 years of…
Chapter 14 Solutions
Introduction To Statistics And Data Analysis
Ch. 14.1 - Prob. 1ECh. 14.1 - The authors of the paper Weight-Bearing Activity...Ch. 14.1 - Prob. 3ECh. 14.1 - Prob. 4ECh. 14.1 - Prob. 5ECh. 14.1 - Prob. 6ECh. 14.1 - Prob. 7ECh. 14.1 - Prob. 8ECh. 14.1 - Prob. 9ECh. 14.1 - The relationship between yield of maize (a type of...
Ch. 14.1 - Prob. 11ECh. 14.1 - A manufacturer of wood stoves collected data on y...Ch. 14.1 - Prob. 13ECh. 14.1 - Prob. 14ECh. 14.1 - Prob. 15ECh. 14.2 - Prob. 16ECh. 14.2 - State as much information as you can about the...Ch. 14.2 - Prob. 18ECh. 14.2 - Prob. 19ECh. 14.2 - Prob. 20ECh. 14.2 - The ability of ecologists to identify regions of...Ch. 14.2 - Prob. 22ECh. 14.2 - Prob. 23ECh. 14.2 - Prob. 24ECh. 14.2 - Prob. 25ECh. 14.2 - Prob. 26ECh. 14.2 - This exercise requires the use of a statistical...Ch. 14.2 - Prob. 28ECh. 14.2 - The article The Undrained Strength of Some Thawed...Ch. 14.2 - Prob. 30ECh. 14.2 - Prob. 31ECh. 14.2 - Prob. 32ECh. 14.2 - Prob. 33ECh. 14.2 - This exercise requires the use of a statistical...Ch. 14.2 - This exercise requires the use of a statistical...Ch. 14.3 - Prob. 36ECh. 14.3 - Prob. 37ECh. 14.3 - When Coastal power stations take in large amounts...Ch. 14.3 - Prob. 39ECh. 14.3 - The article first introduced in Exercise 14.28 of...Ch. 14.3 - Data from a random sample of 107 students taking a...Ch. 14.3 - Benevolence payments are monies collected by a...Ch. 14.3 - Prob. 43ECh. 14.3 - Prob. 44ECh. 14.3 - Prob. 45ECh. 14.3 - Prob. 46ECh. 14.3 - Exercise 14.26 gave data on fish weight, length,...Ch. 14.3 - Prob. 48ECh. 14.3 - Prob. 49ECh. 14.3 - Prob. 50ECh. 14.4 - Prob. 51ECh. 14.4 - Prob. 52ECh. 14.4 - The article The Analysis and Selection of...Ch. 14.4 - Prob. 54ECh. 14.4 - Prob. 55ECh. 14.4 - Prob. 57ECh. 14.4 - Prob. 58ECh. 14.4 - Prob. 59ECh. 14.4 - Prob. 60ECh. 14.4 - This exercise requires use of a statistical...Ch. 14.4 - Prob. 62ECh. 14 - Prob. 63CRCh. 14 - Prob. 64CRCh. 14 - The accompanying data on y = Glucose concentration...Ch. 14 - Much interest in management circles has focused on...Ch. 14 - Prob. 67CRCh. 14 - Prob. 68CRCh. 14 - Prob. 69CRCh. 14 - A study of pregnant grey seals resulted in n = 25...Ch. 14 - Prob. 71CRCh. 14 - Prob. 72CRCh. 14 - This exercise requires the use of a statistical...
Knowledge Booster
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.Similar questions
- A researcher interested in explaining the income levels of St. Lucian workers, developed the following multiple linear regression model: INC = a + BEDU + YEXP+ ST RN where INC = monthly income, EDU = the number of years of formal education, EXP = the number of years of workforce experience and TRN = the number of weeks spent in job training. A sample of 35 workers was processed using MINITAB and the following is an extract of the output obtained: %3D PredictorCoef StDev t-ratio Constant 3315.7 1371.6 2.41 EDU 116.53 26.07 4.47 EXP 282.96 222.8 1.27 TRN -318.12125.74-2.53 S= 2.05 R-sq = 77.3% R-sq(adj) = 75.2% The coefficient y is significant at the 5% level of significance. Select one: O True Falsearrow_forwardConsider a linear regression model that relates school expenditures and family background to student performance in Massachusetts using 224 school districts. The response variable is the mean score on the MCAS (Massachusetts Comprehensive Assessment System) exam given in May 1998 to 10th-graders. Four explanatory variables are used: (1) STR is the student-to-teacher ratio, (2) TSAL is the average teacher’s salary, (3) INC is the median household income, and (4) SGL is the percentage of single family households. The Excel Regression output for the sample regression equation is given below. (a) What proportion of the variation in MCAS score is explained by the explanatory variables? (b) At the 5% level, are the explanatory variables jointly significant in explaining MCAS score? Explain briefly. (c) At the 5% level, which variables are individually significant at predicting MCAS score? Explain briefly. (d) Suppose a second regression model (Model 2) was generated using only…arrow_forward4. Assessing bivariate correlations between the three variables used in this research problem can tell us whether there is likely to be a meaningful multiple regression model and if redundancy among predictor variables exists. Does it appear as though there may be a meaningful model that predicts dissertation stress? a. No, the bivariate correlations between the SPI Scientist and SPI Practitioner to the DSI scores were both low and no significant. b. Yes, the bivariate correlations between the SPI Scientist and SPI Practitioner to the DSI scores were both low; however, the correlation between SPI Scientist and DSI was significant. c. Yes, the bivariate correlations between the SPI Scientist and SPI Practitioner to the DSI scores were both low; however, they were both significant to DSI scores. t want predictor variables that are too higharrow_forward
- Suppose researchers are interested in exploring the factors which affect depression for Australian adults. The researchers recruited a sample of 99 Australian adults and collected data on several variables which may influence depression. Note that here depression is represented by a score, with higher values representing higher levels of depression. The variables for this study are listed below: Age Gender (0 = female, 1 = male) Stress level Anxiety level Depression Why would conducting a multiple linear regression analysis be appropriate here? Group of answer choices A) Multiple linear regression can be appropriate here because we have one metric dependent variable and several metric or dichotomous independent variables B) Multiple linear regression can be appropriate here because we have one metric dependent variable and several categorical independent variables C) Multiple linear regression can be appropriate here because we have one categorical dependent variable and…arrow_forwardAccording to an article, one may be able to predict an individual's level of support for ecology based on demographic and ideological characteristics. The multiple regression model proposed by the authors was the following. y = 3.60-.01x₁+.01.₂-.07x3+.12x4+.02xs-.04x6-01-.04.xg-.02.xg+c The variables are defined as follows. y = ecology score (higher values indicate a greater concern for ecology) X₁ = age times 10 x₂ = income (in thousands of dollars) x3 = gender (1 = male, 0 = female) X4 = race (1 = white, 0 = nonwhite) X5 = education (in years) x6 = ideology (4 = conservative, 3 = right of center, 2 = middle of the road, 1 = left of center, and 0 = liberal) X7 = social class (4 = upper, 3 = upper middle, 2 = middle, 1 = lower middle, 0 = lower) xg = postmaterialist (1 if postmaterialist, 0 otherwise) x9 = materialist (1 if materialist, 0 otherwise) (a) Suppose you knew a person with the following characteristics: a 30 year old, white female with a college degree (20 years of…arrow_forwardConsider a linear regression model that relates school expenditures and family background to student performance in Massachusetts using 224 school districts. The response variable is the mean score on the MCAS (Massachusetts Comprehensive Assessment System) exam given in May 1998 to 10th-graders. Four explanatory variables are used: (1) STR is the student-to-teacher ratio, (2) TSAL is the average teacher’s salary, (3) INC is the median household income, and (4) SGL is the percentage of single family households. The Excel Regression output for the sample regression equation is given below. (a) What proportion of the variation in MCAS score is explained by the explanatory variables? (b) At the 5% level, are the explanatory variables jointly significant in explaining MCAS score? Explain briefly. (c) At the 5% level, which variables are individually significant at predicting MCAS score? Explain briefly. (d) Suppose a second regression model (Model 2) was generated using only…arrow_forward
- 2. A study is conducted in patients with HIV. The primary outcome is CD4 cell count which is a measure of the stage of the disease. Lower CD4 counts are associated with more advanced disease. The investigators are interested in the association between vitamin and mineral supplements and CD4 count. A multiple regression analysis is performed relating CD4 count to use of supplements(coded as l ves, 0-no) and to duration of HIV, in years(ie., the number of years between the diagnosis of HIV and the study date). For the analysis, Y-CD4 count. Y 501.41 12.67 Supplements - 30.23 Duration of HIV A. What is the expected CD4 count for a patient taking supplements who has had HIV for 2.5 years? B. What is the expected CD4 count for a patient not taking supplements who was diagnosed with HIV at study enrollment? C. What is the expected CD4 count for a patient not taking supplements who has had HIV for 2.5 yearsarrow_forward2. A study is conducted in patients with HIV. The primary outcome is CD4 cell count which is a measure of the stage of the disease. Lower CD4 counts are associated with more advanced disease. The investigators are interested in the association between vitamin and mineral supplements and CD4 count. A multiple regression analysis is performed relating CD4 count to use of supplements (coded as 1=yes, 0=no) and to duration of HIV, in years (i.e., the number of years between the diagnosis of HIV and the study date). For the analysis, Y=CD4 count.Y = 501.41 + 12.67 Supplements – 30.23 Duration of HIVA. What is the expected CD4 count for a patient taking supplements who has had HIV for 2.5 years?Y = 438.505B. What is the expected CD4 count for a patient not taking supplements who was diagnosed with HIV at study enrollment?Y = 471.18C. What is the expected CD4 count for a patient not taking supplements who has had HIV for 2.5 years.= 425.835I would like to know if calculated correctly.…arrow_forwardA researcher recorded the number of e-mails received in a month and the number of online purchases made during that month for 50 people with an online presence. The resulting data were used to conduct a hypothesis test to investigate whether the slope of the population regression line relating number of e-mails received to number of online purchases is positive. What are the correct hypotheses for the test? H0:β1=0Ha:β1≠0H0:β1=0Ha:β1≠0 A H0:β1=0Ha:β1>0H0:β1=0Ha:β1>0 B H0:β1=0Ha:β1<0H0:β1=0Ha:β1<0 C H0:β1>0Ha:β1=0H0:β1>0Ha:β1=0 D H0:b1=0Ha:b1≠0 Earrow_forward
- ABC Company is exploring different prediction models that can be used to forecast indirect labor costs. One independent variable under consideration is machine hours. Following are matching observations on indirect labor costs and machine hours for the past six months: Machine hours Indirect labor costs P20,000 P24,000 P17,000 P22,000 P13,000 P14,000 Month 1 300 400 240 4 370 5 200 6 225 Using linear regression analysis, the estimated variable indirect labor cost per machine hour is closest to: P35.62 P15.44 P52.71 P58.31arrow_forwardA researcher is interested to find out how the engine displacement, vehicle weight, and the type of transmission [i.e. automatic & manual] affect the automobile gasoline mileage. Type of transmission is represented by one dummy variable: automatic (=1, if transmission is automatic; = 0, if transmission is manual) Part of the raw data and the regression output of 40 different car models are given below. Automobile Miles/gallon Displacement(cubic in) Weight(lb)) Automatic Trans. Corvette 22.50 x x x Cruiser 19.30 x x x Omega x 440 x x Nova x 514 x x Corolla E5 x x 5911 x Cougar x x 3800 x Starfire x x x 0 Cordoba x x x 1 SUMMARY OUTPUT Regression Statistics Multiple R 0.874319 R Square xxx Adjusted R Square xxx Standard Error 3.395 Observations 40…arrow_forwardA researcher is interested to find out how the engine displacement, vehicle weight, and the type of transmission [i.e. automatic & manual] affect the automobile gasoline mileage. Type of transmission is represented by one dummy variable: automatic (=1, if transmission is automatic; = 0, if transmission is manual) Part of the raw data and the regression output of 40 different car models are given below. Automobile Miles/gallon Displacement(cubic in) Weight(lb)) Automatic Trans. Corvette 22.50 x x x Cruiser 19.30 x x x Omega x 440 x x Nova x 514 x x Corolla E5 x x 5911 x Cougar x x 3800 x Starfire x x x 0 Cordoba x x x 1 SUMMARY OUTPUT Regression Statistics Multiple R 0.874319 R Square xxx Adjusted R Square xxx Standard Error 3.395 Observations 40…arrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- Calculus For The Life SciencesCalculusISBN:9780321964038Author:GREENWELL, Raymond N., RITCHEY, Nathan P., Lial, Margaret L.Publisher:Pearson Addison Wesley,Big Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin Harcourt
Calculus For The Life Sciences
Calculus
ISBN:9780321964038
Author:GREENWELL, Raymond N., RITCHEY, Nathan P., Lial, Margaret L.
Publisher:Pearson Addison Wesley,
Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt
Correlation Vs Regression: Difference Between them with definition & Comparison Chart; Author: Key Differences;https://www.youtube.com/watch?v=Ou2QGSJVd0U;License: Standard YouTube License, CC-BY
Correlation and Regression: Concepts with Illustrative examples; Author: LEARN & APPLY : Lean and Six Sigma;https://www.youtube.com/watch?v=xTpHD5WLuoA;License: Standard YouTube License, CC-BY