The file Galton on D2L contains the 928 observations Francis Galton used in 1885 to estimate the relationship between the heights of parents and the heights of their children. The column Children refers to the height (in inches) of a child, and the column Mid-Parents refers to the average height (in inches) of the mother and father of that child. You can download this file into Excel and Minitab. a. Calculate the regression Height of Children = a +b (Height of Mid-Parents). b. Calculate the average for Height of Children, and calculate the average Height of Mid-Parents. c. Create a new variable in Minitab which is the Height of Children measured in terms of deviations from its mean. Call this new variable y. Also, create a new variable in Minitab with is the Height of Mid-Parents measured in terms of deviations from its mean. Call this new variable x. Calculate the regression y = a + bx. You can create the new y and x variables in Excel of Minitab, whichever you find more convenient. If you use Minitab, click on Calc, Calculator, then fill in the boxes for Store Results In and Expression. “Store Results In” is the name you give to the new variable you are creating and “Expression” is the algebraic expression that defines the variable you are creating. d. If a person’s parents are 3 inches above average height, do you predict their children to be above or below average height? And how many inches above or below average height? e. If a person’s parents are 3 inches below average height, do you predict their children to be above or below average height? And how many inches above or below average height? f. The term “regression to the mean” comes from Galton’s work. Why do you think that term is appropriate in the context of this problem? g. Use the F statistic to test the hypothesis that there is no relation between the heights of children and the heights of their parents at the 5% level of significance. Do you reject this hypothesis or not? h. If you reject the hypothesis in the previous question, what is the probability that you are committing a Type I error (i.e., what is the probability of a false positive)? I was given an excel spreadsheet with 928 observations. The first 10 are the following, I just need to know how to calculate, I can then do the rest: Mid-Parents Children 73 72.2 73 73.2 73 73.2 73 73.2 72.5 68.2 72.5 69.2 72.5 69.2 72.5 70.2 72.5 71.2 Please show me how to solve using EXCEL
Correlation
Correlation defines a relationship between two independent variables. It tells the degree to which variables move in relation to each other. When two sets of data are related to each other, there is a correlation between them.
Linear Correlation
A correlation is used to determine the relationships between numerical and categorical variables. In other words, it is an indicator of how things are connected to one another. The correlation analysis is the study of how variables are related.
Regression Analysis
Regression analysis is a statistical method in which it estimates the relationship between a dependent variable and one or more independent variable. In simple terms dependent variable is called as outcome variable and independent variable is called as predictors. Regression analysis is one of the methods to find the trends in data. The independent variable used in Regression analysis is named Predictor variable. It offers data of an associated dependent variable regarding a particular outcome.
The file Galton on D2L contains the 928 observations Francis Galton used in 1885 to estimate the relationship between the heights of parents and the heights of their children. The column Children refers to the height (in inches) of a child, and the column Mid-Parents refers to the average height (in inches) of the mother and father of that child. You can download this file into Excel and Minitab.
a. Calculate the regression Height of Children = a +b (Height of Mid-Parents).
b. Calculate the average for Height of Children, and calculate the average Height of Mid-Parents.
c. Create a new variable in Minitab which is the Height of Children measured in terms of deviations from its mean. Call this new variable y. Also, create a new variable in Minitab with is the Height of Mid-Parents measured in terms of deviations from its mean. Call this new variable x. Calculate the regression y = a + bx. You can create the new y and x variables in Excel of Minitab, whichever you find more convenient. If you use Minitab, click on Calc, Calculator, then fill in the boxes for Store Results In and Expression. “Store Results In” is the name you give to the new variable you are creating and “Expression” is the algebraic expression that defines the variable you are creating.
d. If a person’s parents are 3 inches above average height, do you predict their children to be above or below average height? And how many inches above or below average height?
e. If a person’s parents are 3 inches below average height, do you predict their children to be above or below average height? And how many inches above or below average height?
f. The term “regression to the mean” comes from Galton’s work. Why do you think that term is appropriate in the context of this problem?
g. Use the F statistic to test the hypothesis that there is no relation between the heights of children and the heights of their parents at the 5% level of significance. Do you reject this hypothesis or not? h. If you reject the hypothesis in the previous question, what is the probability that you are committing a Type I error (i.e., what is the probability of a false positive)?
I was given an excel spreadsheet with 928 observations. The first 10 are the following, I just need to know how to calculate, I can then do the rest:
Mid-Parents | Children |
73 | 72.2 |
73 | 73.2 |
73 | 73.2 |
73 | 73.2 |
72.5 | 68.2 |
72.5 | 69.2 |
72.5 | 69.2 |
72.5 | 70.2 |
72.5 | 71.2 |
Please show me how to solve using EXCEL
Trending now
This is a popular solution!
Step by step
Solved in 4 steps with 5 images