Assignment 2 Solved(1)
docx
School
University Of Connecticut *
*We aren’t endorsed by this school
Course
5604
Subject
Statistics
Date
Feb 20, 2024
Type
docx
Pages
8
Uploaded by EarlMusicCrab33
Part 1
– Refer to page 89 of your textbook and answer the following problems. Use the attached RidingMowers and LaptopSalesJanuary2008 datasets to answer the following questions. You may complete this part in either JMP or Python.
3.2 a – With most of the owners (as indicated by the blue dots) occurring in the top right area of the data space, it seems that people are people are more likely own a riding lawnmower then they have a larger lot and a higher income. This makes sense because riding lawnmowers are more expensive than push mowers so a higher income would be required to decide to make that
expensive purchase. Also, riding lawnmowers are more useful on a larger lot. If a lot is too small, it might be hard to navigate a riding lawnmower in the small space. In fact, when the lot is smaller than 16,000 sqft, nobody owns a riding lawnmower. When the lot is large, you can relax and ride your mower around instead of doing the hard work of pushing it across the yard. The data show that everyone with a lot size over 21,000 sqft owns a riding lawnmower.
3.3 a – The store in postcode N17 6OA has the highest average retail price of 495. The store in postcode W4 3PH has the lowest retail price of 481.
Part 2
– Continue working in the LaptopSalesJanuary2008 dataset to answer the following questions. You may complete this part in either JMP or Python. 1.
Assuming the dataset includes all laptop models sold by the stores, would you want to use the Screen Size column to predict Retail Price? Why or why not? Justify your choice with an appropriate visualization.
When I look at the distribution of the screen size column, I see that all values in that column are the same, exactly 15 inches. Both the max and min values are 15 inches. Because there is no variability in the data for screen size, it can’t impact the price. When a column is all one value, we don’t use it for modeling. 2.
What has a bigger impact on Retail Price – RAM or Processor Speed? Make comparative box plots to support your answer. Include screen shots of the visualizations you made. (Hint: For boxplots you should have one continuous variable and one categorical variable.)
When looking at the boxplots of the distribution of retail price subsetted by RAM, we can see that laptops with 2 GB of RAM are generally priced higher than laptops with 1 GB of RAM. The median price of laptops with 2GB of RAM is 500 compared to 470 when there’s 1 GB of RAM. The quartiles, min, and max are all higher when the RAM is 2 GB.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
When looking at the boxplots of the distribution of retail price subsetted by processor speed, we
can see that the medians are almost the same. They only differ by 15. The higher processor speed is higher at each key point in the box plot, but only by a little. When comparing the two variables as potential predictors, the RAM drives more differentiation in price than the processor speed. So I would expect the RAM to be a better predictor. 3.
Check the correlations among Configuration, Retail Price, and CustomerStoreDistance. Make three observations about the correlations beyond just the numerical value. What do the numbers and patterns indicate about the meaning of the data? Include a screenshot of the correlations and scatterplot matrices.
Here are some observations that occur to me from the output above:
- The configuration variable is not truly continuous. That is why you see the banding in the scatterplots that include the configuration column. There is data in the ranges from 1-80, 145-
224, and 289-368.
- As the configuration number increases, the laptop probably also has higher end features which drive up price. A little bit of extra exploration shows that the configuration ranges are based off of battery life. - At first I thought that customers seem to be a little bit more willing to travel further to purchase
the higher battery life laptop configurations. This is a very weak pattern though, as evidenced by
the 0.0021 correlation. I’m just noticing that there seems to be a bit more dots on the right side of the scatterplot for the highest group of configurations. So I binned the configurations in the ranges that indicate the different battery life and RAM groupings. I then looked at the distribution of the distance traveled for each of these. There is no noteworthy pattern here. So the configuration does not influence the distance people travel to purchase.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
- Price doesn’t seem to be a factor in how far people travel to purchase a laptop. They will travel
about the same distances for all different price points. To further show this relationship, I binned
the distances (assuming that within each range the person’s perception of the distance isn’t that different) and then looked at the distribution of the price for each distance. The only variability that appears is at the upper end where the data gets very thin. NOTE: In the cases where I did an extra vis to justify my answers for this question, that isn’t necessary for this assignment. Just interpreting the correlations and scatterplot matrices is
enough. I just wanted to go ahead with the next step to show you how I approach what I’m seeing.
Related Documents
Related Questions
The entirety of the data set will be in the two pictures
arrow_forward
Please use the given info to answer the subquestion Part B
arrow_forward
Thank you for any feedback on this one.
arrow_forward
Please use the given info to answer the subquestion Part A
arrow_forward
The whole data set will be in the two pictures
arrow_forward
Recently, management at Oak Tree Golf Course received a few complaints about the condition of the greens. Several players complained that the greens are too fast. Rather than react to the comments of just a few, the Golf Association conducted a survey of 100 male and
100 female golfers. The survey results are summarized here.
Excel File: data02-31.xlsx
Male Golfers
Green Condition
Gender Too Fast
Male
Handicap
Under 15
15 or more
25
25
a. Complete the crosstabulation shown below.
Green Condition
Female
Too Fast
10
Fine
Fine
40
Female Golfers
Total
Green Condition
Handicap
Under 15
15 or more
Too Fast
1
Fine
9
39 51
Total
Which group shows the highest percentage saying that the greens are too fast?
- Select your answer -
b. Refer to the initial crosstabulations. For those players with low handicaps (better players), which group (male or female) shows the highest percentage saying the greens are too fast?
For the low handicappers, the - Select your answer - have a higher percentage who…
arrow_forward
Spend at least 20 minutes looking at a few of the different unique data visualization ideas foundat this blog: http://flowingdata.com/. Discuss one of the posts in a few sentences, copying inany appropriate (and appropriately resized) graphics.
arrow_forward
The r code for side by side boxplot of vitamind v newage and vitamin d v country.
Scatterplot code for relationship between vitamin d level and age.
arrow_forward
please. help me answer this question. thank you
arrow_forward
tion 2 of 15
Last summer, the Smith family drove through seven different states and visited various popular landmarks. The prices of gasoline
in dollars per gallon varied from state to state and are listed below.
$2.34, $2.75, $2.48, $3.58, $2.87, $2.53, $3.31
Click to download the data in your preferred format.
CrunchIt! CSV Excel JMP Mac Text Minitab PC Text R SPSS TI Calc
Calculate the range of the price of gas. Give your solution to the nearest cent.
range:
dollars per gallon
DELL
&
4.
7
8.
arrow_forward
The Conch Café, located in Gulf Shores, Alabama, features casual lunches with a great view of the Gulf of Mexico. To accommodate the increase in business during the summer vacation season, Fuzzy Conch, the owner, hires a large number of servers as seasonal help. When he interviews a prospective server, he would like to provide data on the amount a server can earn in tips. He believes that the amount of the bill and the number of diners are both related to the amount of the tip. He gathered the following sample information.
Customer
Amount of Tip
Amount of Bill
Number of Diners
Customer
Amount of Tip
Amount of Bill
Number of Diners
1
$
7.00
$
48.97
5
16
$
3.30
$
23.59
2
2
4.50
28.23
4
17
3.50
22.30
2
3
1.00
10.65
1
18
3.25
32.00
2
4
2.40
19.82
3
19
5.40
50.02
4
5
5.00
28.62
3
20
2.25
17.60
3
6
4.25
24.83
2
21
5.50
44.47
4
7
0.50
6.24
1
22
3.00
20.27
2…
arrow_forward
The Conch Café, located in Gulf Shores, Alabama, features casual lunches with a great view of the Gulf of Mexico. To accommodate the increase in business during the summer vacation season, Fuzzy Conch, the owner, hires a large number of servers as seasonal help. When he interviews a prospective server, he would like to provide data on the amount a server can earn in tips. He believes that the amount of the bill and the number of diners are both related to the amount of the tip. He gathered the following sample information.
Customer
Amount of Tip
Amount of Bill
Number of Diners
Customer
Amount of Tip
Amount of Bill
Number of Diners
1
$
6.05
$
73.22
1
16
$
3.30
$
23.59
2
2
4.50
28.23
4
17
3.50
22.30
2
3
1.00
10.65
1
18
3.25
32.00
2
4
2.40
19.82
3
19
5.40
50.02
4
5
5.00
28.62
3
20
2.25
17.60
3
6
4.25
24.83
2
21
1.40
41.80
5
7
.50
6.25
1
22
3.00
20.27
2…
arrow_forward
In IBM SPSS, what does clicking on this icon do?
arrow_forward
You will use the following data set to answer all parts of the project. This data set is the number of students enrolled at CCA from 2015 to 2019 by semester
Fall 2015
6933
Summer 2015
2495
Spring 2015
7518
Fall 2016
7386
Summer 2016
2301
Spring 2016
8056
Fall 2016
8025
Summer 2016
2235
Spring 2016
8725
Fall 2018
7982
Summer 2018
2140
Spring 2018
8436
Fall 2019
5859
Summer 2019
2089
Spring 2019
9048
1) Find the mean, median, and mode for CCA Student enrollment data set. Then using the formulas for samples, find the variance and standard deviation.
2) Organize the data set on student enrollment by creating a frequency distribution and include the relative frequency. Group the data into seven logical equal intervals starting with 2,000 ≤ x < 10,000 and so on.
x
f
relative f…
arrow_forward
The basketball coach at a local college believes that his team scores more points at home games when more people show up. Below is a list of all home games last year with scores and corresponding attendance. Use Excel, SPSS, or work by hand to show your work finding r. Show your work on the attached pages.
Score
Attendance
Score
Attendance
54
380
67
410
57
350
78
215
59
320
67
113
80
478
56
250
82
451
85
450
75
250
101
489
73
489
99
472
53
451
a. What is the correlation between Score and Attendance rounded to 2 decimals?
b. In terms of strength and direction, how would you describe this correlation?
c. What is the obtained t-score for this correlation?
d. What is the critical t-score for a two-tailed test with a ?
e. Is this correlation significant based on the t-scores?
f. Based on Table 10.4, approximately how many cases would you expect to need to…
arrow_forward
The basketball coach at a local college believes that his team scores more points at home games when more people show up. Below is a list of all home games last year with scores and corresponding attendance. Use Excel, SPSS, or work by hand to show your work finding r. Show your work on the attached pages.
Score
Attendance
Score
Attendance
54
380
67
410
57
350
78
215
59
320
67
113
80
478
56
250
82
451
85
450
75
250
101
489
73
489
99
472
53
451
What is the obtained t-score for this correlation?
What is the critical t-score for a two-tailed test with a = 0.05?
Is this correlation significant based on the t-scores?
arrow_forward
The basketball coach at a local college believes that his team scores more points at home games when more people show up. Below is a list of all home games last year with scores and corresponding attendance. Use Excel, SPSS, or work by hand to show your work finding r. Show your work on the attached pages.
Score
Attendance
Score
Attendance
54
380
67
410
57
350
78
215
59
320
67
113
80
478
56
250
82
451
85
450
75
250
101
489
73
489
99
472
53
451
g. What is the coefficient of determination for this relationship?
h. Interpret for these two variables presuming a causal relationship was expected.
arrow_forward
An business reviews data on the daily amount of calls it receives. Are the data discrete or continous?
arrow_forward
Recently, management at Oak Tree Golf Course received a few complaints about the condition of the greens. Several players complained that the greens are too fast. Rather than react to the comments of just a few, the Golf Association conducted a survey of 100 male and
100 female golfers. The survey results are summarized here.
Excel File: data02-31.xlsx
Male Golfers
Male
Green Condition
Handicap
Under 15
15 or more
25
25
a. Complete the crosstabulation shown below.
Green Condition
Gender Too Fast Fine
Female
35
40
Too Fast
10
65
60
Fine
40
Total
100
100
Female Golfers
200
Green Condition
Handicap
Under 15
15 or more
Too Fast
1
Note: This exercise is an example of Simpson's Paradox.
39
Fine
9
Total
75
125
Which group shows the highest percentage saying that the greens are too fast?
Females, at 40%
b. Refer to the initial crosstabulations. For those players with low handicaps (better players), which group (male or female) shows the highest percentage saying the greens are too fast?
For…
arrow_forward
I need help with this problem please.
arrow_forward
You may need to use the appropriate technology to answer this question.
A poll tracks the favorite sport of Americans who follow at least one sport. Results of the poll show that professional football is the favorite sport of 33% of Americans who follow at least one sport, followed by baseball at 15%, men's college football at 10%, auto racing at 6%, men's professional basketball at 5%, and ice hockey at 5%, with other sport at 26%. Consider a survey in which 344 college undergraduates who follow at least one sport were asked to identify their favorite sport produced the following results:
Professional
Football Baseball Men's
College Football Auto
Racing Men's
Professional
Basketball Ice
Hockey Other
Sports
113 37 48 13 8 19 106
Do college undergraduate students differ from the general public with regard to their favorite sports? Use a = 0.05.
State the null and alternative hypotheses.
* Ho: The population proportion concerning favorite sport is the same for undergraduate students…
arrow_forward
You may need to use the appropriate technology to answer this question.
A poll tracks the favorite sport of Americans who follow at least one sport. Results of the poll show that professional football is the favorite sport of 33% of Americans who follow at least one sport, followed by baseball at 15%, men's college football at 10%, auto racing at 6%, men's professional basketball at 5%, and ice
hockey at 5%, with other sport at 26%. Consider a survey in which 344 college undergraduates who follow at least one sport were asked to identify their favorite sport produced the following results:
Professional
Football
113
Baseball
37
Men's
College
Football
48
Auto
Racing
12
Men's
Professional
Basketball
8
Ice
Hockey
19
Other
Sports
Find the p-value. (Round your answer to four decimal places.)
p-value =
107
Do college undergraduate students differ from the general public with regard to their favorite sports? Use a = 0.05.
State the null and alternative hypotheses.
O Ho: Undergraduate students…
arrow_forward
Continue monitoring the process. A second ten days of data have been collected, see table labeled “2nd 10 Days of Monitoring Reservation Processing Time” in the Data File.
Develop Xbar and R charts for the 2nd 10 days of monitoring. Plot the data for the 2nd 10 days on the Xbar and R charts.
Is the reservation process for the 2nd 10 days of monitoring in control? If the control chart indicates an out-of-control process, note which days, the pattern, and whether it is the Xbar or R chart.
Based on the X-bar and R Charts that you developed for the 2nd 10 days of data, is the process in control?
Group of answer choices
No. The X-bar and R Charts are both out of control.
No. The X-bar Chart is in control, but the R Chart is out of control.
No. The R Chart is in control, but the X-bar Chart is out of control.
Yes. The X-bar and R Charts are both in control.
arrow_forward
A popular summer event is Skee-Ball. For $2, a customer purchases three balls and attempts to roll each ball into a central target. The customer wins their $2 back and wins an additional $1 if they hit the target once, an additional $3 if they hit the target twice and an additional $5 if they hit the target three times. If the customer does not hit the target at all, they lose their initial $2. Access the data set labeled ”Skee-Ball” which reports the outcomes of 2,500 games for a single day’s operation of a Skee-Ball booth.
(a) Define a random variable C (Customer Score) equal to the number of times a customer hits the target in each set of three rolls. How many possible outcomes of C are there? Report your answer as an integer.
(b) Report the net profit ($) that the Skee-Ball booth achieved for the day. Report your answer as an integer. Hint: Estimate the PDF of C based on the relative frequencies: P(C = c) = Frequency of C = c / 2,500
(c) Report P(C = 0). Round your answer to three…
arrow_forward
SEE MORE QUESTIONS
Recommended textbooks for you

Elementary Geometry for College Students
Geometry
ISBN:9781285195698
Author:Daniel C. Alexander, Geralyn M. Koeberlein
Publisher:Cengage Learning

Mathematics For Machine Technology
Advanced Math
ISBN:9781337798310
Author:Peterson, John.
Publisher:Cengage Learning,
Related Questions
- Please use the given info to answer the subquestion Part Aarrow_forwardThe whole data set will be in the two picturesarrow_forwardRecently, management at Oak Tree Golf Course received a few complaints about the condition of the greens. Several players complained that the greens are too fast. Rather than react to the comments of just a few, the Golf Association conducted a survey of 100 male and 100 female golfers. The survey results are summarized here. Excel File: data02-31.xlsx Male Golfers Green Condition Gender Too Fast Male Handicap Under 15 15 or more 25 25 a. Complete the crosstabulation shown below. Green Condition Female Too Fast 10 Fine Fine 40 Female Golfers Total Green Condition Handicap Under 15 15 or more Too Fast 1 Fine 9 39 51 Total Which group shows the highest percentage saying that the greens are too fast? - Select your answer - b. Refer to the initial crosstabulations. For those players with low handicaps (better players), which group (male or female) shows the highest percentage saying the greens are too fast? For the low handicappers, the - Select your answer - have a higher percentage who…arrow_forward
- Spend at least 20 minutes looking at a few of the different unique data visualization ideas foundat this blog: http://flowingdata.com/. Discuss one of the posts in a few sentences, copying inany appropriate (and appropriately resized) graphics.arrow_forwardThe r code for side by side boxplot of vitamind v newage and vitamin d v country. Scatterplot code for relationship between vitamin d level and age.arrow_forwardplease. help me answer this question. thank youarrow_forward
- tion 2 of 15 Last summer, the Smith family drove through seven different states and visited various popular landmarks. The prices of gasoline in dollars per gallon varied from state to state and are listed below. $2.34, $2.75, $2.48, $3.58, $2.87, $2.53, $3.31 Click to download the data in your preferred format. CrunchIt! CSV Excel JMP Mac Text Minitab PC Text R SPSS TI Calc Calculate the range of the price of gas. Give your solution to the nearest cent. range: dollars per gallon DELL & 4. 7 8.arrow_forwardThe Conch Café, located in Gulf Shores, Alabama, features casual lunches with a great view of the Gulf of Mexico. To accommodate the increase in business during the summer vacation season, Fuzzy Conch, the owner, hires a large number of servers as seasonal help. When he interviews a prospective server, he would like to provide data on the amount a server can earn in tips. He believes that the amount of the bill and the number of diners are both related to the amount of the tip. He gathered the following sample information. Customer Amount of Tip Amount of Bill Number of Diners Customer Amount of Tip Amount of Bill Number of Diners 1 $ 7.00 $ 48.97 5 16 $ 3.30 $ 23.59 2 2 4.50 28.23 4 17 3.50 22.30 2 3 1.00 10.65 1 18 3.25 32.00 2 4 2.40 19.82 3 19 5.40 50.02 4 5 5.00 28.62 3 20 2.25 17.60 3 6 4.25 24.83 2 21 5.50 44.47 4 7 0.50 6.24 1 22 3.00 20.27 2…arrow_forwardThe Conch Café, located in Gulf Shores, Alabama, features casual lunches with a great view of the Gulf of Mexico. To accommodate the increase in business during the summer vacation season, Fuzzy Conch, the owner, hires a large number of servers as seasonal help. When he interviews a prospective server, he would like to provide data on the amount a server can earn in tips. He believes that the amount of the bill and the number of diners are both related to the amount of the tip. He gathered the following sample information. Customer Amount of Tip Amount of Bill Number of Diners Customer Amount of Tip Amount of Bill Number of Diners 1 $ 6.05 $ 73.22 1 16 $ 3.30 $ 23.59 2 2 4.50 28.23 4 17 3.50 22.30 2 3 1.00 10.65 1 18 3.25 32.00 2 4 2.40 19.82 3 19 5.40 50.02 4 5 5.00 28.62 3 20 2.25 17.60 3 6 4.25 24.83 2 21 1.40 41.80 5 7 .50 6.25 1 22 3.00 20.27 2…arrow_forward
arrow_back_ios
SEE MORE QUESTIONS
arrow_forward_ios
Recommended textbooks for you
- Elementary Geometry for College StudentsGeometryISBN:9781285195698Author:Daniel C. Alexander, Geralyn M. KoeberleinPublisher:Cengage LearningMathematics For Machine TechnologyAdvanced MathISBN:9781337798310Author:Peterson, John.Publisher:Cengage Learning,

Elementary Geometry for College Students
Geometry
ISBN:9781285195698
Author:Daniel C. Alexander, Geralyn M. Koeberlein
Publisher:Cengage Learning

Mathematics For Machine Technology
Advanced Math
ISBN:9781337798310
Author:Peterson, John.
Publisher:Cengage Learning,