The squared distance from any sample point to the origin has a x² distribution with mean d. Consider a prediction point x₁ drawn from this distribution, and let a = Xo/|xo| be an associated unit vector. Let zi aTx; be the projection of each of the training points on this - direction. (a). Show that the z; are distributed N(0, 1) with expected squared distance from the origin 1, while the target point has expected squared distance d from the origin. (b). For d = 10 show that the expected distance of a test point from the centre of the training data is 3.1 standard deviations, while all the training points have expected distance 0.80 along direction a. So most prediction points see themselves as lying on the edge of the training set. Note: for this question you need to use a result for the expected value of a squared root of a chi-squared distribution. Either find such a result, or obtain your answer by simulation.

The squared distance from any sample point to the origin has a x² distribution with mean d. Consider a prediction point x₁ drawn from this distribution, and let a = Xo/|xo| be an associated unit vector. Let zi aTx; be the projection of each of the training points on this - direction. (a). Show that the z; are distributed N(0, 1) with expected squared distance from the origin 1, while the target point has expected squared distance d from the origin. (b). For d = 10 show that the expected distance of a test point from the centre of the training data is 3.1 standard deviations, while all the training points have expected distance 0.80 along direction a. So most prediction points see themselves as lying on the edge of the training set. Note: for this question you need to use a result for the expected value of a squared root of a chi-squared distribution. Either find such a result, or obtain your answer by simulation.

Calculus For The Life Sciences

2nd Edition

ISBN:9780321964038

Author:GREENWELL, Raymond N., RITCHEY, Nathan P., Lial, Margaret L.

Publisher:GREENWELL, Raymond N., RITCHEY, Nathan P., Lial, Margaret L.

Chapter13: Probability And Calculus

Section13.2: Expected Value And Variance Of Continuous Random Variables

Problem 10E

See similar textbooks

Related questions

Q: The weight percent of silicon in six different rock samples, each containing different amounts of…

A: The question related to paired t-test.The data on the weight percent of silicon in different 6 rocks…

Q: OP(AnB) OP (Sample Sapce) = 1 OP(A or B) = P (A) + P (B) ○ P(A) = 1 – P (Aº) = P (A) × P (B) if…

A: It is given that the probability rules on all the parts. Here, need to find out the wrong statement.

Q: Two wine tasters rate each wine they taste on a scale of 1 to 5. From data on their ratings of a…

A: The question is about probability.Given :

Q: The box-and-whisker plots below (sometimes called boxplots) summarize the noon temperatures for each…

A: Interpreting a box and whisker plot involves understanding the distribution, central tendency,…

Q: The weights of bags of baby carrots are normally distributed, with a mean of 33ounces and a…

A: Suppose the random variable x defines the weight of a bag of baby carrots.The random variable x…

Q: Here are the shopping times (in minutes) for each of eighteen shoppers at a local grocery store.…

A: Given data,24, 38, 23, 34, 33, 30, 22, 26, 17, 31, 21, 19, 27, 32, 20, 28, 30, 15

Q: A random sample of daily high temperatures in January and February are listed. At an alpha equal to…

A: The objective of this question is to determine if there is a significant difference in the variances…

Q: If we flip a coin three times, find the probability of getting less than 2 tails? a What is the…

A: From the provided information,We flip a coin three times then the possible outcomes are as…

Q: Parking at a XYZ university has become a very big problem. University administrators are interested…

A: Comment: As per the our company guidelines we are supposed to answer only three subparts. Kindly…

Q: A survey by the Arthur Andersen Enterprise attempted to determine what the leading challenges are…

A: Since you have posted a question with multiple sub-parts, we will solve the first four sub-parts for…

Q: Let W₁ < W₂ < ... < Wn be the order statistics of n independent observations from a U(0, 1)…

A: the order statistic of independent observation form U(0, 1) distribution.

Q: X and Y are random variables such that E(X) = 5, Var(X) = 4, E(Y) = 3, Var(Y) = 1, and Corr(X,Y) =…

A: X and Y are random variables such that,

Q: A scientist is studying the effect of a new type of exercise program on cardiovascular health. The…

A: The objective of the question is to identify the type of variable that the exercise program…

Q: Records show that the lifetimes of batteries manufactured by a certain company have a mean of 610…

A: Given that the lifetime of batteries of a certain company has a mean of 610 hours and a standard…

Q: 17. Each full carton of Grade A eggs consists of 1 randomly selected empty cardboard container and…

A: The given data is as follows:Population mean, Population standard deviation,

Q: A user is allowed to create an 8-digit password, but it must satisfy the following requirements: The…

A: The number of digits in the password is 8.The requirements of the password are: The first 2…

Q: When females were finally allowed to become pilots of fighter jets, engineers needed to redesign the…

A: The question is about normal distribution.Given :Population mean weight of females ( ) = 174…

Q: If y is the dependent variable and is the independent variable. And if the sum of the cross product…

A: According to the given information,Sample size, n = 20

Q: We play a game with two tetrahedral dice, red and blue, each with faces labeled 1-4. In each turn,…

A: The game is played with tetrahedral dice colored red and blue.Both the dice have faces labeled…

Q: The table below shows the number of one company's stores located in each of 50 regions. Complete…

A: The data given is,…

Q: [defn) The Gallup News Service sent out 2,000 questionnaires for survey about climate change. 1,004…

A: The question is related to the sample survey. Gallup New Services sent out 2000 questionnaires for a…

Q: Find the population mean or sample mean as indicated. Sample: 18, 15, 7, 11, 19 Select the correct…

A: Sample: The population mean or sample mean value value must be calculated.

Q: Show that fx(x) = nfx, + ... + xn(nx) holds even if the MGF of X does

A: are i.i.d distributed random variables(rv) with pdf and denote the sample mean.It is required to…

Q: Το which of the following is Σ#1 (X; – X)X, equal? a) Σ1 (X; – X2 b) (Σ;, X?) – x? ©) Σ (X - XX d)…

A: It is required to identify the correct expression to which, the given expression, equals.

Q: The annual income of workers in a particular industry is $49, 400 with a standard deviation of…

A: It is given thatPopulation mean = 49400Standard deviation = 7340

Q: The coach wants to take the top 20% of the team and train them for the World Sit-up Championships.…

A: Scoresf90-99180-89470-79960-691250-591740-491330-391020-29710-1930-91Objective:- To find the raw…

Q: Here are yesterday's high temperatures (in Fahrenheit) in 11 U.S. cities. 46, 47, 62, 62, 64, 67,…

A: The objective of the question is to find the five-number summary and the interquartile range for the…

Q: 1. Please sort the data set below from low score to high score. 2. X 45-49 1 21 18 27 1 0-4 1 f 49 1…

A: Given data set is,Here, total frequency N = 25

Q: 3. Consider the simple linear regression model: y = Bo+ Bi +, where the intercept term 30 is known.…

A: Given information is,Simple linear regreesion mdel:y=β0+β1x+ϵ1Known intercept term:β0Errors have…

Q: 6. According to Bernoulli's distribution, determine the probability of tossing seven or less heads…

A: To determine the probability of getting seven or fewer heads from 20 tosses and the relative error…

Q: Assume that the daily maximum wind speed at a given location follows a Gaussian distribution with a…

A: The given data is as follows:Population mean, Coefficient of variation,

Q: A random variable X has the following PMF: [refer to image] a) Find E(X) b) If Y = g(X) = X^2,…

A: The solution to the problem you provided involves several key concepts from probability…

Q: In a Math 131 class at a University, the grades on the first exam are shown in the table below.…

A: Here the given data values are : 92 , 40 , 70 , 95 , 92 , 40 , 50 , 79 , 42 , 65 ,80 , 82 , 74 , 78…

Q: Question 3: In the following marketing set, we have 9 years with the sales in 10 million euro and…

A: Given the marketing set, which contains 9 years with sales in 10 million euro and the advertising…

Q: Given the following table of 28 observations, calculate the following percentiles. Do not round your…

A: The objective of the question is to calculate the first quartile (Q1), the 35th percentile, and the…

Q: You are given: L is the force of mortality for a life age z under the Standard Ultimate Life Table.…

A: Givenwhere is the force of mortality for a life aged x under the Standard Ultimate Life table.

Q: QUESTION 3 The following table contains yearly data on sales (S) and advertising (A) from 2020 to…

A: Since you have posted a question with multiple sub-parts, we will provide solutions to only the…

Q: A hot tub manufacturer advertises that with its heating equipment, a temperature of 100°F can be…

A: The question is about hypothesis testing.Given :Randomly selected no. of tubs ( n ) = 25Sample mean…

Q: Which type of sampling method is the following scenario? A researcher selects every 10th name of a…

A: The type of sampling method described in the scenario where a researcher selects every 10th name of…

Q: Grace and colleagues studied the population history of termite colonies. They noted an initial…

A: The data on the average weight of individual foraging workers (x) and the population of the colony…

Q: For small samples, when we make inferences about Population Means, what assumptions do we need to…

A: Here are the assumptions for making inferences about a population Mean:1) Random: The data come from…

Q: Did Americans work less than 40 hours a week on average in 1980? In 1980, the GSS included questions…

A: Given that,

Q: The table next to this contains data on the duration of recovery time for 3 brands of headache…

A: Given the data on the duration of recovery time for 3 brands of headache medicine given to 15 fever…

Q: Hello, I would like to check my work, could you please help with this? A runner for team 1 can run…

A: The objective of the question is to calculate the z-scores for the runners of team 1 and team 2 and…

Q: 4. The accompanying table shows the manufacturer's suggested retail price (MSRP), the highway miles…

A: Dataset:

Q: Two algebra classes with the same number of students took a final exam. The box-and-whisker plots…

A: A box plot helps in understanding the distribution of the data.

Q: Here are the scores of 13 students on a geography test. 59, 60, 61, 61, 67, 71, 73, 74, 85, 85, 86,…

A: The objective of this question is to calculate the five-number summary and the interquartile range…

Q: A plastic manufacturing company intends to create a control chart for the upcoming period by…

A: The data on the number of defects of the company performs 25 observations, taking 50 samples for…

Q: About 31% of a population are of a particular ethnic group. 130 people are randomly selected from…

A: Sample of size(n)=130and p=31%

Q: (b) Suppose that the measurement 126 (the smallest measurement in the data set) were replaced by 48.…

A: Mean, median, and mode are measures of central tendency used to describe the center of a data set.…

Question

The problem with KNN is that in high dimensions, most points tend to lie on the boundary of the data space. Consider explanatory variables drawn from a spherical multinormal distribution x ~ N(0, I), where x is a random d-vector, and I is a d x d identity matrix.

The squared distance from any sample point to the origin has a x² distribution with mean
d. Consider a prediction point xo drawn from this distribution, and let a = Xo/||xo|| be an
associated unit vector. Let z; = aTx; be the projection of each of the training points on this
direction.
(a). Show that the z; are distributed N(0, 1) with expected squared distance from the origin
1, while the target point has expected squared distance d from the origin.
(b). For d = 10 show that the expected distance of a test point from the centre of the
training data is 3.1 standard deviations, while all the training points have expected
distance 0.80 along direction a. So most prediction points see themselves as lying on
the edge of the training set. Note: for this question you need to use a result for the
expected value of a squared root of a chi-squared distribution. Either find such a result,
or obtain your answer by simulation.

Quantities that have magnitude and direction but not position. Some examples of vectors are velocity, displacement, acceleration, and force. They are sometimes called Euclidean or spatial vectors.

Expert Solution