Lab Assignment #1

School

University of Lethbridge

Course

HLSC-3450

Subject

Statistics

Date

Jan 9, 2024


APPLIED CLINICAL STATS: HLSC 3450
Lab Homework #1
Due: Monday - October 23rd & Wednesday - October 11th, 2023 (at the start of lab)

The following questions require you to use the LabPractice dataset on Moodle.

1) Report the sample size and number of variables in the dataset. Sample size – 75, Number of variables – 73

2) Obtain information on the appropriate variables to answer the following questions:
a) What percentage of this sample is male (gender)? 45.3%
b) What is the mode for religion (religion)? Protestant
c) What is the cumulative percentage for 2 years vocational mothers’ education (maed)? 73.3%
d) What proportion of participants have BC grades in high school (grades)? 16
e) How many and what percentage of students took geometry (geo)? 36 students, accounting for 48%
f) What is the mode, median, and mean for the variable “visualization test” (visual)? Mode – 1, Median – 4.75, Mean – 5.2433
g) What is the average score for math achievement (mathach)? 12.5645
h) What is the mode (modal category) for ethnicity (ethnic)? What percentage does this group represent? Euro-Amer., 56.2%
i) What percentage of students have less A-B and most A-B math grades (mathgr)? How many students belong to each group? Less A-B – 44 students, 58.7%; Most A-B – 31 students, 41.3%
j) What is the average age in the dataset? How does this compare to the median and modal age? Average age – 19.1467, which is higher than the median and mode, both of which are 18.
k) Use the variables “visualization test” (visual) and the “visualization retest” (visual2) to answer these questions:
   i. Which variable has a larger standard deviation? The visualization test has the larger standard deviation.
   ii. What does the standard deviation tell you about the distribution of the values of the variable? The standard deviation measures how far values typically fall from the mean. The larger the standard deviation, the greater the spread of values in the data; with more variability, estimates from the data are less precise, which weakens inferential statistical analysis.
l) If you are 1 standard deviation (SD) above the mean for the visualization test (visual), what would be your visualization performance score? 9.15533
m) A math achievement score has a mean of 12.5645 and a SD of 6.67031.
   i. What percentage of the scores is 1 SD above the mean? 34.1% (the share of scores between the mean and 1 SD above it, assuming a normal distribution)
   ii. What percentage is 1 SD below the mean? 34.1% (the share of scores between the mean and 1 SD below it)
n) Scores on the visualization retest are distributed with a mean of 4.5467 and a SD of 3.01816. About 68% of these visualization retest scores fall between …. and ….? 1.52854 and 7.56486
o) Determine the three measures of central tendency (mean, median, & mode) for the variables “visualization test” (visual) and “visualization retest” (visual2). Based on this information, would you describe the distribution of these variables as skewed? If yes, in what direction? Hint: use the normal distribution curve.
   Yes, the distributions of both variables are positively skewed, because the mean of each variable is higher than its median and its mode.

3) Below are some survey questions and responses. What type of data (variable) would they produce based on their respective responses?
a) Do you think the flu vaccine offered this winter will protect you from avian flu?
   o Response 1 (Yes)
   o Response 2 (No)
   o Response 3 (Unsure)
   o Response 4 (Don’t know)
   Answer: Nominal
b) How many times a week do you exercise for more than 1/2 hour?
   o Response 1 (Once)
   o Response 2 (Twice)
   o Response 3 (Three times)
   o Response 4 (Four times or more)
   o Response 5 (Never)
   Answer: Ordinal
c) How old were you when you started studying at the University of Lethbridge?
   o ………………………………….
   Answer: Scale

4) When choosing the appropriate level of measurement for a variable . . .
a) We use a scale level of measurement for a variable with what type of data? Continuous data.
b) We use an ordinal level of measurement for a variable with what type of data? Data that can be categorized AND ordered.
c) We use a nominal level of measurement for a variable with what type of data?
Data that can be categorized but CANNOT be ordered.
d) Give five examples of each of the following variables.
   I. Scale variable – Age, weight, height, temperature, income
   II. Ordinal variable – Likert scale, grades using letters, ranking of athletes, satisfaction survey, happiness rating
   III. Nominal variable – Gender, ethnicity, eye colour, religion, birth month

5) Obtain the appropriate visual plots for the variables “gender” and “ethnicity” (ethnic).
a) What is the modal category for each variable? Gender – Female, Ethnicity – Euro-Amer.
b) How many people are in each category? Gender – 41, Ethnicity – 41

6) Obtain the appropriate visual plots for the variables “math achievement test” (mathach) and “mosaic pattern test” (mosaic). Place a normal curve over the chart. Describe the distribution of each variable, reporting:
a) Mean – Math – 12.5645, Mosaic – 27.413
b) Median – Math – 13, Mosaic – 27
c) Mode – Math – 14.33, Mosaic – 25
d) Standard deviation – Math – 6.67031, Mosaic – 9.5738
e) Based on the above information, do you think the distributions of these variables are normal or skewed? Explain.
   Both are slightly skewed: math is skewed in a negative direction (mean below median below mode), while mosaic is skewed in a positive direction (mean above median above mode). This is determined by comparing the mean, median, and mode. Because these three values are not equal for either variable, each distribution shows some degree of skewness.

7) A recent news article by the Lethbridge Herald reports that less sleep or too much sleep may increase the risk of developing depression, according to a survey of 100,000 randomly selected college students. Compared to students with an average of eight hours of sleep per night, those who slept five or fewer hours were 50% more likely to develop depression. Researchers could not explain this intriguing finding, but suggested that students who sleep fewer hours per night might have underlying illnesses. Match each of the following with the corresponding elements of the study.
a) Study population – College students
b) Sample size – 100,000
c) Dependent variable – Development of depression (yes or no)
d) Independent variable – Number of hours slept per night
e) Type of study – Observational
f) Sampling method – Simple random sampling (a probability sampling method)

8) Identify the level of measurement for each of the following questions/variables:
a) A survey asks, “In a typical week, how many times do you eat at the University of Lethbridge canteen?” – Scale
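
The ±1 SD answers in question 2 (parts m and n) and the count-to-percentage answers (parts e and i) can be checked with a short script. This is only a sketch, not the method used in the lab: it assumes the scores are approximately normal, it uses Python's standard-library `statistics.NormalDist` rather than the course software, and the function names (`one_sd_band`, `share_between_mean_and_one_sd`, `percent`) are hypothetical helpers introduced for illustration.

```python
from statistics import NormalDist


def one_sd_band(mean: float, sd: float) -> tuple[float, float]:
    """Return (mean - sd, mean + sd).

    Under a normal distribution, about 68% of scores fall in this interval.
    """
    return (mean - sd, mean + sd)


def share_between_mean_and_one_sd() -> float:
    """Area under the standard normal curve between z = 0 and z = 1."""
    z = NormalDist()  # standard normal: mean 0, SD 1 (normality is an assumption)
    return z.cdf(1.0) - z.cdf(0.0)


def percent(count: int, total: int) -> float:
    """Percentage of `total` represented by `count`, rounded to one decimal."""
    return round(100 * count / total, 1)


if __name__ == "__main__":
    # Question 2n: visualization retest, mean 4.5467 and SD 3.01816
    low, high = one_sd_band(4.5467, 3.01816)
    print(f"{low:.5f} to {high:.5f}")  # 1.52854 to 7.56486

    # Question 2m: share of scores between the mean and 1 SD on one side
    print(f"{100 * share_between_mean_and_one_sd():.1f}%")  # 34.1%

    # Questions 2e and 2i: counts out of a sample of 75
    print(percent(36, 75), percent(44, 75), percent(31, 75))  # 48.0 58.7 41.3
```

Because the normal curve is symmetric, the same 34.1% applies on each side of the mean, which is where the "about 68%" in part n comes from.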