You are hired by a bank to help determine its lending standards. You decide to start by estimating a simple linear regression model to help predict whether a loan will become past-due (i.e. if any of the payments are tate). You ask the bank to give you data on the following two random variables: PD is an indicator variable equal to 1 if the loan is past-due, and zero if all payments have been made on time. credit_score is the client's credit score at the time the loan was originated. The table below shows sample descriptive statistics for these variables: Variable PD Sample Mean 0.1 Sample Standard Deviation 0.08 credit_score 550 25
Correlation
Correlation defines a relationship between two independent variables. It tells the degree to which variables move in relation to each other. When two sets of data are related to each other, there is a correlation between them.
Linear Correlation
A correlation is used to determine the relationships between numerical and categorical variables. In other words, it is an indicator of how things are connected to one another. The correlation analysis is the study of how variables are related.
Regression Analysis
Regression analysis is a statistical method in which it estimates the relationship between a dependent variable and one or more independent variable. In simple terms dependent variable is called as outcome variable and independent variable is called as predictors. Regression analysis is one of the methods to find the trends in data. The independent variable used in Regression analysis is named Predictor variable. It offers data of an associated dependent variable regarding a particular outcome.
Step by step
Solved in 4 steps with 2 images