2023/11/3 08:04
Homework 9
file:///F:/UIUC/STAT420/Homework9.html
1/21
Homework 9
Wenbin Nie
Due 11/2/2023
Homework Instructions
Make sure to add your name to the header of the document. When submitting the assignment on
Gradescope, be sure to assign the appropriate pages of your submission to each Exercise.
The point value for each exercise is noted in the exercise title.
For questions that require code, please create or use the code chunk directly below the question and type your
code there. Your knitted pdf will then show both the code and the output, so that we can assess your
understanding and award any partial credit.
For written questions, please provide your answer after the indicated Answer
prompt.
You are encouraged to knit your file as you work, to check that your coding and formatting are done appropriately. This will also help you identify and locate any errors more easily.
Homework Setup
We’ll use the following packages for this homework assignment. We’ll also read in data from a csv file. To
access the data, you’ll want to download the dataset from Canvas and place it in the same folder as this R
Markdown document. You’ll then be able to use the following code to load in the data.
library(ggplot2)
library(faraway)
library(ISLR)

## Warning: package 'ISLR' was built under R version 4.3.2

library(car)

## Warning: package 'car' was built under R version 4.3.2
## Loading required package: carData
## Warning: package 'carData' was built under R version 4.3.2
##
## Attaching package: 'car'
## The following objects are masked from 'package:faraway':
##
##     logit, vif
Exercise 1: Formatting [5 points]
The first five points of the assignment will be earned for properly formatting your final document. Check that
you have:
included your name on the document
properly assigned pages to exercises on Gradescope
selected page 1 (with your name)
and this page for this exercise (Exercise 1)
all code is printed and readable for each question
all output is printed
generated a pdf file
Exercise 2: Scottish Hill Races [30 points]
For this exercise, we'll use the races.table dataset that includes information on record-winning times (minutes) for 35 hill races in Scotland, as reported by Atkinson (1986). The additional variables record the overall distance travelled (miles) and the height climbed in the race. Below, we are reading in the data from an online source. We do correct one error reported by Atkinson before beginning our analysis and adjust the height climbed to be recorded in thousands of feet.
Source: Atkinson, A. C. (1986). Comment: Aspects of diagnostic regression analysis (discussion of paper by Chatterjee and Hadi). Statistical Science, 1, 397-402.
url = 'http://www.statsci.org/data/general/hills.txt'
races.table = read.table(url, header=TRUE, sep='\t')
races.table[18,4] = 18.65
races.table$Climb = races.table$Climb / 1000
head(races.table)
##          Race Distance Climb   Time
## 1 Greenmantle      2.5 0.650 16.083
## 2    Carnethy      6.0 2.500 48.350
## 3 CraigDunain      6.0 0.900 33.650
## 4      BenRha      7.5 0.800 45.600
## 5   BenLomond      8.0 3.070 62.267
## 6    Goatfell      8.0 2.866 73.217
part a
Create a scatterplot matrix of the quantitative variables contained in the races.table dataset. Interpret this scatterplot matrix. Which variable do you think will be more important in predicting the record time of that race?
# Use this code chunk for your answer.
for_matrix1 = races.table[, 2:4]
pairs(for_matrix1)
Answer:
Distance appears to have the stronger linear relationship with Time, so it seems like Distance will be more important in predicting the record time.
part b
Fit a multiple regression model predicting the record time of a race from the distance travelled, the height climbed, and an interaction of the two variables. Report the summary of the model. What is the R^2 for this model? What does this suggest about the strength of the model?
# Use this code chunk for your answer.
lm1 = lm(Time ~ Distance * Climb, data = races.table)
summary(lm1)
##
## Call:
## lm(formula = Time ~ Distance * Climb, data = races.table)
##
## Residuals:
##      Min       1Q   Median       3Q      Max
## -23.3078  -2.8309   0.7048   2.2312  18.9270
##
## Coefficients:
##                Estimate Std. Error t value Pr(>|t|)
## (Intercept)     -0.3532     3.9122  -0.090 0.928638
## Distance         4.9290     0.4750  10.377 1.32e-11 ***
## Climb            3.5217     2.3686   1.487 0.147156
## Distance:Climb   0.6731     0.1746   3.856 0.000545 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 7.35 on 31 degrees of freedom
## Multiple R-squared:  0.9806, Adjusted R-squared:  0.9787
## F-statistic: 521.1 on 3 and 31 DF,  p-value: < 2.2e-16
Answer:
The R^2 value of 0.9806 suggests that the model is strong: it explains 98.06% of the variance in record time based on the predictors, which include Distance, Climb, and their interaction.
part c
Interpret the first-order coefficient pertaining to Distance. Then, calculate the slopes for Distance for a race whose Climb is 0.3 (300 feet) and again for a race whose Climb is 3 (3000 feet).
# Use this code chunk for your answer.
coef_distance = 4.9290
coef_interaction = 0.6731
climb_value_1 = 0.3
slope_1 = coef_distance + (coef_interaction * climb_value_1)
climb_value_2 = 3
slope_2 = coef_distance + (coef_interaction * climb_value_2)
slope_1
## [1] 5.13093
slope_2
## [1] 6.9483
Answer:
The first-order coefficient for Distance is 4.929, indicating that for a race with no climb (Climb = 0), the record time increases by 4.929 minutes for each additional mile of distance. The slopes for Distance for a race whose Climb is 0.3 (300 feet) and for one whose Climb is 3 (3000 feet) are 5.13093 and 6.9483, respectively.
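As a check, these slopes can be recovered directly from the fitted model: with the interaction term, the slope for Distance at a fixed Climb equals the change in predicted Time for a one-mile increase in Distance. A minimal sketch, assuming lm1 from part b is in the workspace (the baseline distance of 10 miles and the helper name slope_at_climb are arbitrary choices of mine):

```r
# Distance slope at a fixed Climb = difference in predicted Time for a
# 1-mile increase in Distance (any baseline distance gives the same answer).
slope_at_climb = function(model, climb) {
  preds = predict(model, newdata = data.frame(Distance = c(10, 11), Climb = climb))
  unname(diff(preds))
}
slope_at_climb(lm1, 0.3)  # should match 5.13093
slope_at_climb(lm1, 3)    # should match 6.9483
```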
part d
Identify any influential points as defined in the lecture. Which of these observations, if any, are especially
influential based on their values? For these influential points, do they have high leverage, high standardized
residual, both, or neither?
# Use this code chunk for your answer.
cooks.distance(lm1)[cooks.distance(lm1) > (4/35)]

##        7       11       35
## 3.758307 2.704165 1.805942

hatvalues(lm1)[hatvalues(lm1) > (8/35)]

##         7        11        33        35
## 0.5207512 0.7182517 0.2379383 0.3261854

rstandard(lm1)[abs(rstandard(lm1)) > 2]

##         7        11        35
##  3.719559  2.059866 -3.862957
Answer:
Observations 7, 11, and 35 are influential based on their Cook's distances. These observations also exhibit both high leverage and high standardized residuals, which makes them particularly influential in the model.
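For comparison, R's built-in influence.measures() runs a battery of diagnostics (DFBETAS, DFFITS, covratio, Cook's distance, hat values) in one call; note that its default cutoffs differ from our course thresholds, so this is a cross-check rather than the course method. A sketch, assuming lm1 from part b:

```r
# $is.inf is a logical matrix: TRUE where a diagnostic flags an observation.
im = influence.measures(lm1)
which(apply(im$is.inf, 1, any))  # rows flagged by at least one measure
```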
part e
Refit the model from part b without any points that you identified as influential. Note: this is not something that
we should automatically do, but we will do it for now as a demonstration of how much our model may be
affected by these points! Print the coefficients for this model. How do they compare to the coefficients from the
model in part b?
Hint: Create a subset of your data that only includes those points that are not influential before fitting your data.
# Use this code chunk for your answer.
remove_obs = races.table[-c(7, 11, 35),]
lm2 = lm(Time ~ Distance * Climb, data = remove_obs)
summary(lm2)
##
## Call:
## lm(formula = Time ~ Distance * Climb, data = remove_obs)
##
## Residuals:
##      Min       1Q   Median       3Q      Max
## -11.1574  -2.7089   0.3387   2.2074  10.3180
##
## Coefficients:
##                Estimate Std. Error t value Pr(>|t|)
## (Intercept)      0.6141     3.3490   0.183 0.855828
## Distance         5.1003     0.6079   8.390 3.98e-09 ***
## Climb            1.8117     1.7531   1.033 0.310273
## Distance:Climb   0.7105     0.1663   4.272 0.000202 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 4.877 on 28 degrees of freedom
## Multiple R-squared:  0.9778, Adjusted R-squared:  0.9754
## F-statistic: 411.1 on 3 and 28 DF,  p-value: < 2.2e-16
Answer:
After removing the influential points, the new model (lm2) still fits well and predicts race times accurately. The changes in the coefficients show that these points did affect the results, but the refitted model is still reliable and valid for making predictions. We can see that the coefficient for Distance increased slightly (from about 4.93 to 5.10), the coefficient for Climb decreased substantially, and the intercept became positive.
part f
How much does this updated model affect our actual predictions for the response? Let’s create a scatterplot
that compares our fitted values from our original model to those from our newer model (influential points
removed).
Calculate and save each of the fitted values (for the original model and for the newer model) to their own named object in R. Note: If you are using the predict function, you can supply as an argument newdata = races.table since we will use all of the variables and all of the data.
Then, create a dataframe in R by providing your two named objects with fitted values as two arguments inside the data.frame function, and save the result to a new named object in R.
Now, create a scatterplot to compare the fitted values for each model. Include an appropriate title and axes
labels. All other formatting is optional and up to you!
It might be helpful to add a line with intercept 0 and slope 1 to represent what perfect matching would look like.
Finally, briefly comment on what this plot reveals. Would you say there are big differences in the predictions
made by each model, or would you say the predictions by each model are quite similar? Is this what you would
expect from the results in part d?
# Use this code chunk for your answer.
old = fitted(lm1)
old = old[-c(7, 11, 35)]
new = fitted(lm2)
df = data.frame(old, new)
ggplot(df, aes(x = old, y = new)) +
  geom_point() +
  labs(x = 'Old fitted values', y = 'New fitted values',
       title = 'Comparison of Fitted Values') +
  geom_abline(slope = 1, intercept = 0, color = 'red')
Answer:
The two models produce nearly the same predictions for the response variable, so I wouldn't say there is a big difference between the two models. This is what I expected, since they have similar R^2 values.
Exercise 3: Hospital SUPPORT Data: Unusual Observations [29 points]
For this exercise, we will use the data stored in hospital.csv
on Canvas. It contains a random sample of 580
seriously ill hospitalized patients from a famous study called “SUPPORT” (Study to Understand Prognoses
Preferences Outcomes and Risks of Treatment). As the name suggests, the purpose of the study was to
determine what factors affected or predicted outcomes, such as how long a patient remained in the hospital.
The variables in the dataset are:
Days - Day to death or hospital discharge
Age - Age on day of hospital admission
Sex - Female or male
Comorbidity - Patient diagnosed with more than one chronic disease
EdYears - Years of education
Education - Education level; high or low
Income - Income level; high or low
Charges - Hospital charges, in dollars
Care - Level of care required; high or low
Race - Non-white or white
Pressure - Blood pressure, in mmHg
Blood - White blood cell count, in gm/dL
Rate - Heart rate, in bpm
part a
Fit a model with Charges as the response, and with predictors of EdYears, Pressure, and Age.
# Use this code chunk for your answer.
hospital = read.csv("F:\\UIUC\\STAT420\\hospital.csv")
lm3 = lm(Charges ~ Age + EdYears + Pressure, data = hospital)
summary(lm3)
##
## Call:
## lm(formula = Charges ~ Age + EdYears + Pressure, data = hospital)
##
## Residuals:
##    Min     1Q Median     3Q    Max
## -70326 -41609 -26872   5233 477250
##
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)
## (Intercept)  79567.4    21731.1   3.661 0.000274 ***
## Age           -643.0      211.1  -3.047 0.002421 **
## EdYears       1407.7      906.4   1.553 0.120937
## Pressure       -33.8      126.6  -0.267 0.789536
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 79640 on 576 degrees of freedom
## Multiple R-squared:  0.02279, Adjusted R-squared:  0.0177
## F-statistic: 4.478 on 3 and 576 DF,  p-value: 0.00405
part b
Calculate the leverages for each observation in the dataset. How many observations have leverages above our
course threshold? Make a histogram of all leverages for the dataset. Does the course threshold seem to fall at
a good cutoff for this model?
# Use this code chunk for your answer.
n = length(resid(lm3))
p = length(coef(lm3))
n
## [1] 580
p
## [1] 4
2 * p/n
## [1] 0.0137931
hatvalues = hatvalues(lm3)
above = hatvalues[hatvalues > (2 * p/n)]
above

##          2         10         11         15         19         23         27
## 0.01596837 0.04627055 0.01570829 0.01542179 0.01583249 0.01598336 0.01543236
##         57        118        130        131        153        180        198
## 0.01794143 0.02054003 0.02644400 0.01671699 0.01481928 0.01456018 0.02423647
##        201        209        222        224        265        282        289
## 0.01524729 0.01754252 0.01489532 0.01509594 0.01745183 0.01441938 0.01450108
##        298        317        341        349        368        402        407
## 0.02035020 0.02056658 0.01486582 0.01575459 0.02041636 0.02053973 0.02234620
##        423        443        469        474        499        511        513
## 0.02178170 0.03114276 0.01465574 0.01648916 0.02107845 0.02138013 0.01414533
##        550        556        575        580
## 0.01689769 0.01688995 0.01496846 0.01570199
length(above)
## [1] 39
hist(hatvalues, breaks = 20)
Answer:
39 observations are above the threshold. Based on the histogram, the threshold seems to fall at a reasonable cutoff for this model.
part c
Calculate the standardized residuals for each observation in the dataset. How many observations are
designated as having a high standardized residual based on our course threshold? Generate a histogram of all
standardized residuals for the dataset. What is the shape of this histogram?
# Use this code chunk for your answer.
sr = rstandard(lm3)
above_sr = rstandard(lm3)[abs(rstandard(lm3)) > 2]
hist(rstandard(lm3))
length(above_sr)
## [1] 32
Answer:
32 observations are above our course threshold. The histogram is right-skewed.
part d
Calculate the Cook’s distance for each observation in the dataset. Print only those observations that are above
the threshold defined in lecture. After looking through these Cook’s distances by eye, the Cook’s distance for
what specific observations, if any, appear to be especially large? Finally, what is Cook’s distance used to
measure?
# Use this code chunk for your answer.
above_c = cooks.distance(lm3)[cooks.distance(lm3) > (4/n)]
above_c
##           2           3          14          15          16          24
## 0.030335886 0.022467402 0.014247049 0.017506191 0.049720108 0.007418391
##          26          34          35          38          39          53
## 0.049569896 0.015919473 0.039607688 0.012293791 0.025586612 0.060476097
##          58          67          74          75          77         111
## 0.045286259 0.019589190 0.010380495 0.009795234 0.007830997 0.015021677
##         191         197         204         205         218         224
## 0.035650610 0.007857909 0.045482460 0.010155139 0.013588456 0.014673965
##         249         252         257         290         327         351
## 0.008886250 0.036517316 0.007009659 0.012064199 0.007414654 0.011625540
##         368         402         479
## 0.038139902 0.055833796 0.025653720
hist(cooks.distance(lm3))
Answer:
Observations 53, 58, and 252, among others, appear especially large. Cook's distance measures the overall influence of an observation: how much the fitted values change when that observation is removed, combining its leverage and its residual.
part e
Generate the default plots in R. Then, interpret each of these plots.
# Use this code chunk for your answer.
plot(lm3)
Answer:
In the residuals vs. fitted plot, the residuals do not look randomly scattered around zero, so I don't think the linearity assumption is valid. The Q-Q plot suggests that the normality assumption is not valid. In the scale-location plot, the residuals are not randomly spread but accumulate around the red line, so constant variance is questionable. In the residuals vs. leverage plot, one observation is far beyond the Cook's distance lines, and several other observations have large Cook's distances too.
part f
In order to assess the fit of this model, calculate the value of the RMSE using leave one out cross validation.
# Use this code chunk for your answer.
sqrt(mean((resid(lm3)/(1-hatvalues(lm3)))^2))
## [1] 79951.01
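The one-liner above relies on the identity that a linear model's leave-one-out residual is e_i / (1 - h_i). As a sanity check, a brute-force version that actually refits the model n times should produce the same RMSE. This is a sketch with a hypothetical helper name (loocv_rmse), assuming the hospital data frame from part a:

```r
# Brute-force LOOCV: drop each row, refit, predict the held-out response.
# For lm fits this reproduces the hat-value shortcut exactly.
loocv_rmse = function(formula, data, response) {
  errs = sapply(seq_len(nrow(data)), function(i) {
    fit = lm(formula, data = data[-i, , drop = FALSE])
    data[[response]][i] - predict(fit, newdata = data[i, , drop = FALSE])
  })
  sqrt(mean(errs^2))
}
loocv_rmse(Charges ~ Age + EdYears + Pressure, hospital, "Charges")
```

This should agree with the 79951.01 printed above, at the cost of n model fits.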
Exercise 4: Hospital SUPPORT Data, Days Variable [21 points]
For this exercise, we will continue analyzing the hospital
dataset. We will focus in particular on whether we
should add the Days variable to the model from Question 3 (predicting Charges from EdYears, Pressure, and
Age).
part a
Calculate the R^2 measure of collinearity for the Days variable. What does this information tell us?
# Use this code chunk for your answer.
lm4 = lm(Charges ~ Age + EdYears + Pressure + Days, data = hospital)
vif_lm4 = vif(lm4)
vif_days = vif_lm4["Days"]
vif_days
##     Days
## 1.018054
lm_dropday = lm(Charges ~ . - Days, data = hospital)
summary(lm_dropday)$r.squared
## [1] 0.2551996
r2 = 1-1/vif_days
r2
##       Days
## 0.01773431
Answer:
The VIF for Days is close to 1, which suggests that I can confidently include the Days variable in the model without concern about multicollinearity. The R^2 measure of collinearity tells me the percent of the variation in the dropped variable that is explained by its linear relationship with the other predictors; it also tells us how much of the information in the dropped variable is already carried by the rest of the model.
part b
In this question, we'll create the partial correlation coefficient and the variable added plot for adding the Days variable to the Question 3 model.
To start, create and save the residuals for the two models needed for this calculation. Save both of the residuals to their own R objects.
Calculate the partial correlation coefficient for the considered predictor variable Days. Then, generate the variable added plot for this considered predictor variable. If you aren't sure how to create the variable added plot with R code, refer to the last part of Textbook Section 15.2.1 (just before the end of the section) for a model of the code. Make sure to include an appropriate title and axes labels.
What do the partial correlation coefficient and the variable added plot indicate about adding the Days variable to the model?
# Use this code chunk for your answer.
residuals_no_days = resid(lm3)
residuals_with_days = resid(lm4)
partial_corr_coef = cor(residuals_no_days, residuals_with_days)
partial_corr_coef
## [1] 0.7305195
avPlots(lm4)
Answer:
My partial correlation coefficient suggests a moderately strong positive relationship between the residuals of the model without the Days variable and the residuals of the model with the Days variable, while controlling for the other predictors in the model. From the plot we can see that the slope is positive and the points are scattered around the line; the slope of the line is also relatively steep.
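For reference, the textbook-style construction of the variable added plot regresses both the response and the candidate predictor on the remaining predictors and compares the two sets of residuals; the correlation between those residuals is the partial correlation of Charges and Days given the other predictors. A sketch, assuming the hospital data frame from Exercise 3 (the axis labels are my own choice):

```r
# Residuals of Charges and of Days, each adjusted for the other predictors.
res_y    = resid(lm(Charges ~ Age + EdYears + Pressure, data = hospital))
res_days = resid(lm(Days ~ Age + EdYears + Pressure, data = hospital))
cor(res_y, res_days)  # partial correlation of Charges and Days

plot(res_days, res_y,
     main = "Variable Added Plot for Days",
     xlab = "Days | Age + EdYears + Pressure",
     ylab = "Charges | Age + EdYears + Pressure")
abline(lm(res_y ~ res_days), col = "red")
```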
part c
Fit a linear model to the variable added plot. Hint: you can use the residuals directly in the lm function in the y and x locations without needing to create a new data frame.
Is the slope for this linear model significantly different from 0? What does that suggest in terms of adding the Days variable to the model?
# Use this code chunk for your answer.
lm_variable_added = lm(residuals_with_days ~ residuals_no_days)
summary(lm_variable_added)
##
## Call:
## lm(formula = residuals_with_days ~ residuals_no_days)
##
## Residuals:
##     Min      1Q  Median      3Q     Max
## -325244   -7587    3664   12132  198241
##
## Coefficients:
##                    Estimate Std. Error t value Pr(>|t|)
## (Intercept)       1.637e-11  1.647e+03    0.00        1
## residuals_no_days 5.337e-01  2.075e-02   25.72   <2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 39660 on 578 degrees of freedom
## Multiple R-squared:  0.5337, Adjusted R-squared:  0.5329
## F-statistic: 661.4 on 1 and 578 DF,  p-value: < 2.2e-16
Answer:
The slope coefficient for residuals_no_days is estimated to be approximately 0.5337, and based on a p-value far below any typical significance level, the slope is highly significant. Overall, this suggests that adding the Days variable to the model would substantially improve the model's fit.
part d
Generate an ANOVA test between our two models, and print the resulting table. Which model do you prefer, and what does that indicate about the slope for the Days variable?
# Use this code chunk for your answer.
anova(lm3,lm4)
##   Res.Df          RSS Df    Sum of Sq        F     Pr(>F)
## 1    576 3.653322e+12 NA           NA       NA         NA
## 2    575 1.949627e+12  1 1.703695e+12 502.4676 1.88366e-80
Answer:
The F-statistic is large, with a p-value of about 1.9e-80, indicating that model 2 (with Days) is a significantly better fit than model 1. I prefer model 2, which indicates that the slope for the Days variable is significantly different from 0.
part e
Calculate the Variance Inflation Factors for the four predictor variables, including Days. What do we use the
Variance Inflation Factor to help identify? What variables (if any) indicate a cause for concern? Explain.
# Use this code chunk for your answer.
vif_lm4
##      Age  EdYears Pressure     Days
## 1.028723 1.024975 1.015065 1.018054
Answer:
We use the VIF to help identify multicollinearity. All four VIFs are close to 1, so there is no variable to be concerned about regarding multicollinearity in this model.
Exercise 5: Credit Data [15 points]
For this exercise, use the Credit data in the ISLR package. Use the following line of code to remove the ID variable, which is not useful for modeling.

data(Credit)
Credit = subset(Credit, select = -c(ID))

Use ?Credit to learn about this dataset.
Our goal is to try to predict how much credit card Balance an individual has based on other information about them and their credit levels.
We will take a very systematic approach – it's not necessarily a "correct" approach, but it should help us make appropriate modeling decisions.
Do the following:
First, let's create a full model that includes all predictors.
Then compute the VIFs of the predictors in this model.
You should notice there is clear collinearity between two predictors; run two more models, one with one of these predictors removed, and the other with the other predictor removed.
Using R^2 from these models, determine which of these two collinear predictors offers the weaker contribution. Identify in the white space below which of these two predictors you are dropping from the model.
Finally, calculate the R^2 measure of collinearity for each of these two variables, first from their VIFs and then by fitting two more models, predicting the variable of interest from the other predictor variables.
# Use this code chunk for your answer.
full_model = lm(Balance ~ ., data = Credit)
#summary(full_model)
vif_full = vif(full_model)
#vif_full
model_without_Limit = lm(Balance ~ . - Limit, data = Credit)
model_without_Rating = lm(Balance ~ . - Rating, data = Credit)
#summary(model_without_Limit)
#summary(model_without_Rating)
vif(model_without_Limit)
## GVIF Df GVIF^(1/(2*Df))
## Income 2.784966 1 1.668822
## Rating 2.730561 1 1.652441
## Cards 1.019639 1 1.009772
## Age 1.051135 1 1.025249
## Education 1.013503 1 1.006729
## Gender 1.005848 1 1.002920
## Student 1.022092 1 1.010986
## Married 1.032237 1 1.015991
## Ethnicity 1.027285 2 1.006753
vif(model_without_Rating)
## GVIF Df GVIF^(1/(2*Df))
## Income 2.774623 1 1.665720
## Limit 2.709488 1 1.646052
## Cards 1.008299 1 1.004141
## Age 1.051328 1 1.025343
## Education 1.013501 1 1.006728
## Gender 1.005827 1 1.002909
## Student 1.022750 1 1.011311
## Married 1.032113 1 1.015929
## Ethnicity 1.026691 2 1.006607
vif_rating = 2.730561
vif_limit = 2.709488
r2_collinearity_limit = 1 - 1/vif_limit
r2_collinearity_rating = 1 - 1/vif_rating
r2_collinearity_limit
## [1] 0.6309266
r2_collinearity_rating
## [1] 0.6337749
model_limit = lm(Limit ~ Income + Rating + Cards + Age + Education + Gender + Student + Married + Ethnicity, data = Credit)
summary(model_limit)
##
## Call:
## lm(formula = Limit ~ Income + Rating + Cards + Age + Education +
##     Gender + Student + Married + Ethnicity, data = Credit)
##
## Residuals:
##     Min      1Q  Median      3Q     Max
## -394.10 -100.71   10.46  104.20  340.12
##
## Coefficients:
##                     Estimate Std. Error t value Pr(>|t|)
## (Intercept)        -367.2272    52.1086  -7.047 8.35e-12 ***
## Income                0.1493     0.3622   0.412   0.6804
## Rating               14.8891     0.0817 182.237  < 2e-16 ***
## Cards               -72.0724     5.6333 -12.794  < 2e-16 ***
## Age                  -0.1450     0.4547  -0.319   0.7499
## Education             3.7662     2.4643   1.528   0.1272
## GenderFemale         -0.3002    15.3350  -0.020   0.9844
## StudentYes          -48.7662    25.7481  -1.894   0.0590 .
## MarriedYes          -34.4446    15.9339  -2.162   0.0312 *
## EthnicityAsian       25.9677    21.7997   1.191   0.2343
## EthnicityCaucasian    2.8399    18.8858   0.150   0.8805
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 152.8 on 389 degrees of freedom
## Multiple R-squared:  0.9957, Adjusted R-squared:  0.9956
## F-statistic:  9065 on 10 and 389 DF,  p-value: < 2.2e-16
model_rating = lm(Rating ~ Income + Limit + Cards + Age + Education + Gender + Student + Married + Ethnicity, data = Credit)
summary(model_rating)
##
## Call:
## lm(formula = Rating ~ Income + Limit + Cards + Age + Education +
##     Gender + Student + Married + Ethnicity, data = Credit)
##
## Residuals:
##      Min       1Q   Median       3Q      Max
## -22.9337  -7.1775  -0.5242   6.3086  27.8718
##
## Coefficients:
##                      Estimate Std. Error t value Pr(>|t|)
## (Intercept)        26.6256084  3.4394669   7.741 8.59e-14 ***
## Income              0.0307343  0.0241424   1.273   0.2038
## Limit               0.0663856  0.0003643 182.237  < 2e-16 ***
## Cards               4.8756963  0.3740563  13.035  < 2e-16 ***
## Age                 0.0053018  0.0303635   0.175   0.8615
## Education          -0.2515127  0.1645509  -1.528   0.1272
## GenderFemale        0.0939119  1.0239560   0.092   0.9270
## StudentYes          3.1405884  1.7198352   1.826   0.0686 .
## MarriedYes          2.3115248  1.0638932   2.173   0.0304 *
## EthnicityAsian     -1.8296226  1.4553333  -1.257   0.2094
## EthnicityCaucasian -0.1910695  1.2610642  -0.152   0.8796
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 10.2 on 389 degrees of freedom
## Multiple R-squared:  0.9958, Adjusted R-squared:  0.9957
## F-statistic:  9136 on 10 and 389 DF,  p-value: < 2.2e-16
Answer:
There is clear collinearity between Limit and Rating. Comparing the two reduced models, the model without Rating offers the weaker contribution.
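Since the reduced-model summaries were commented out in the chunk above, the R^2 values that this comparison rests on can be printed directly (assuming full_model, model_without_Limit, and model_without_Rating from that chunk):

```r
# The reduced model whose R^2 falls furthest below the full model lost the
# stronger of the two collinear predictors; the other one is the weaker.
summary(full_model)$r.squared
summary(model_without_Limit)$r.squared
summary(model_without_Rating)$r.squared
```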