You were asked to help with analysis of birth weights (BW) of 10,000 infants born in NYC during a certain period of time
The aim of the analysis is to see whether the birth weights of the infants are associated with mothers AGE at birth (continuous variable in years) and mothers smoking status (the maternal smoking status MSS contains 4 categories “Non-smoker”, “Past-smoker”, “Passive-smoker”, “Smoker”) and NYC boroughs (the BOROUGH variable contains 5 categories “Manhattan”, “Bronx”, “Brooklyn”, “Queens” and “Staten Island”)
Questions:
(1) How many dummy variables do you need to create to analyze the effect of BOROUGH and MSS? Select “Manhattan” and "Non-Smoker" as a reference category. Write out all the dummy variables.
(2) Write down the population model that estimates BW based on variables AGE, MSS and BOROUGH. Make sure that it is clear what each predictor means.
(3) How many parallel lines are computed by model from 1i)?
Step by stepSolved in 4 steps
- If the correlation between variable X and variable Y is perfect, what do you know about the prediction?arrow_forwardResearchers at a local public health office are interested in the difference in prevalence of sickle cell anemia by ethnicity in a population under its jurisdiction. Suppose the following table represents prevalence of sickle cell anemia reported based on a thorough survey. Ethnicity Prevalence of Sickle Cell African-American =242/1049 Other =165/1864 Calculate the risk difference between the groups.arrow_forwardCan someone please help me with part c on question 1?arrow_forward
- How is correlation defined?arrow_forwardHello! Please look at the image attached. There are 6 correlation scatter graphs which each represents a continent. Please analyse the graphs, as what they indicate and what they show as in the relationship between obesity% and life expectancy. Thank youarrow_forwardA sociologist studying the justice system has just written a paper detailing her findings after examining the records of thousands of inmates. Among other things, she looked at the time spent in prison by inmates who had been sentenced to 5- 10 years for a felony conviction. The histogram below, which appears in her paper, summarizes the time spent in prison for each of 40 such inmates. Frequency 20- 16 15- 12 10- 5- 4 4 0- 20 80 Time spent in prison (in months) 40 60 100 120 Based on this histogram, estimate the standard deviation of the sample of 40 prison terms. Carry your intermediate computations to at least four decimal places, and round your answer to at least one decimal place. (If necessary, consult a list of formulas.)arrow_forward
- Question 6 of 7 > -/1 E View Policies Current Attempt in Progress The following data set lists the number of women from each of 10 different countries who were on the Rolex Women's World Golf Rankings Top 25 list as of March 31, 2009. The data, entered in that order, are for the following countries: Australia, Brazil, England, Japan, Korea, Mexico, Norway, Sweden, Taiwan, and the United States. 2 1 1 2 9 1 1 2 2 4 a. Calculate the mean and median for these data set. Мean %3D i Median = i b. Identify the outliers in this data set. Write your answers in ascending orders. Outliers = i and i Drop the outliers and recalculate the mean and median. Mean = i Median = i Which of these two summary measures changes by a larger amount when you drop the outliers? c. Which is the better summary measure for these data, the mean or the median? Explain. The is a better measure because it as sensitive to the outliers as the other one.arrow_forwardTrue or false .If there is no linear correlation between enrollment and the number of burglaries , then those two variables are not related in anyway. Explain why ?arrow_forwardYou were asked to help with an analysis of birth weights (BW) of 10,000 infants born in NYC during a certain period of time. The aim of the analysis is to see whether the birth weights of the infants are associated with mothers’ AGE at birth (continuous variable in years), mothers’ current HOUSING status (“own” and “not-own”; “own” is the reference group), and NYC boroughs (the BOROUGH variable contains 5 categories “Manhattan”, “Bronx”, “Brooklyn”, “Queens” and “Staten Island”; “Manhattan” is the reference group). Write down the population model that estimates the effect of mother’s AGE on BW while adjusting for mother’s current HOUSING status. If you use dummy or binary variables, specify what they mean. Write down the population model that estimates the effect of mother’s AGE on BW while adjusting for mother’s current HOUSING status and BOROUGH. If you use dummy or binary variables, specify what they mean. Write down the population model that estimates the effect of mother’s AGE…arrow_forward
- Suppose you have access to a database with the variables listed below. Using the two variables BMI(the predictor) and diabetes(the outcome) Name two types of descriptive statistics you could report specifying the variables BMI and diabetes.arrow_forwardHelp me pleasearrow_forwardIdentifying individuals with a high risk of Alzheimer's disease usually involves a long series of cognitive tests. However, researchers have developed a 7-Minute Screen, which is a quick and easy way to accomplish the same goal. The question is whether the 7-Minute Screen is as effective as the complete series of tests. To address this question, Ijuin et al. (2008) administered both tests to a group of patients and compared the results. The following data represent results similar to those obtained in the study. 7-Minute Screen Cognitive Series 3 11 8 19 10 22 8 20 4 14 7 13 4 9 5 20 14 25 Which statistical test would you select: Answer H0: Answer H1: Answer What is the coefficient that indicates the strength of the relationship? Answer =Answer What is the standard error? Answer What is the appropriate 95% CI? [ Answer , Answer ] What is the d-effect size? Answer What is the t-observed? Answer What is the actual p-value? Answer THREE decimal places…arrow_forward
- MATLAB: An Introduction with ApplicationsStatisticsISBN:9781119256830Author:Amos GilatPublisher:John Wiley & Sons IncProbability and Statistics for Engineering and th...StatisticsISBN:9781305251809Author:Jay L. DevorePublisher:Cengage LearningStatistics for The Behavioral Sciences (MindTap C...StatisticsISBN:9781305504912Author:Frederick J Gravetter, Larry B. WallnauPublisher:Cengage Learning
- Elementary Statistics: Picturing the World (7th E...StatisticsISBN:9780134683416Author:Ron Larson, Betsy FarberPublisher:PEARSONThe Basic Practice of StatisticsStatisticsISBN:9781319042578Author:David S. Moore, William I. Notz, Michael A. FlignerPublisher:W. H. FreemanIntroduction to the Practice of StatisticsStatisticsISBN:9781319013387Author:David S. Moore, George P. McCabe, Bruce A. CraigPublisher:W. H. Freeman