Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Simple Linear Regression: Understanding the Relationship between Two Variables, Schemes and Mind Maps of Probability and Statistics

The concept of regression analysis, focusing on simple linear regression where the relationship between a dependent variable and an independent variable is linear. It covers the least square approach to find the constants 'a' and 'b' in the equations Y = a + bX and X = c + dY, which represent the lines of regression for predicting Y from X and X from Y, respectively. The document also includes formulas for regression coefficients and remarks on the significance of correlation between variables.

Typology: Schemes and Mind Maps

2021/2022

Uploaded on 09/27/2022

thecoral
thecoral ๐Ÿ‡บ๐Ÿ‡ธ

4.4

(29)

401 documents

1 / 6

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
Simple Linear Regression
Meaning of Regression analysis: -
โžข Regression analysis means the estimation or prediction of the unknown value of one
variable from the known values of one or more variables. It attempts to establish the
โ€œnature of relationshipโ€ between variables.
โžข Dependent/Explained variable -The variable whose value is to be predicted is called
dependent or explained variable.
โžข Independent/Explanatory variable โ€“ The variables which are used to predict the
values of a dependent variable are called independent or explanatory variables.
โžข Simple regression โ€“ study of only two variables, a dependent and an independent
variable is called simple regression.
โžข Simple linear regression โ€“ when the relationship between the dependent and
independent variable is linear.
Lines of Regression โ€“ THE LEAST SQUARE APPROACH
โžข The least square line of regression of Y on X: This equation is used to estimate
value of Y for a given value of X.
Y = a + bX
Where,
a and b are constants
The value of constant a and b can be find out with the help of two normal equations.
The two normal equations are as follows:
โˆ‘Y = na + b โˆ‘X
โˆ‘XY = a โˆ‘X + b โˆ‘๐‘‹2
b = it is called the regression coefficient of Y on X and is denoted by bYX.
It measures the change in Y corresponding to a unit change in X. Thus
bYX = Slope of the line of regression of Y on X and given by
bYX= ๐‘›โˆ‘๐‘‹๐‘Œโˆ’(โˆ‘๐‘‹)(โˆ‘๐‘Œ)
๐‘›โˆ‘๐‘‹2โˆ’(โˆ‘๐‘‹)2 = b
The line of regression of Y on X passes through the point (๐‘‹,๐‘Œ) and hence the
equation of the line of regression of Y on X (Y = a + bX) can also be written as
Y - ๐‘Œ = bYX (X- ๐‘‹ )
pf3
pf4
pf5

Partial preview of the text

Download Simple Linear Regression: Understanding the Relationship between Two Variables and more Schemes and Mind Maps Probability and Statistics in PDF only on Docsity!

Simple Linear Regression

Meaning of Regression analysis: - โžข Regression analysis means the estimation or prediction of the unknown value of one variable from the known values of one or more variables. It attempts to establish the โ€œnature of relationshipโ€ between variables. โžข Dependent/Explained variable - The variable whose value is to be predicted is called dependent or explained variable. โžข Independent/Explanatory variable โ€“ The variables which are used to predict the values of a dependent variable are called independent or explanatory variables. โžข Simple regression โ€“ study of only two variables, a dependent and an independent variable is called simple regression. โžข Simple linear regression โ€“ when the relationship between the dependent and independent variable is linear. Lines of Regression โ€“ THE LEAST SQUARE APPROACH โžข The least square line of regression of Y on X : This equation is used to estimate value of Y for a given value of X.

Y = a + bX

Where, a and b are constants The value of constant a and b can be find out with the help of two normal equations. The two normal equations are as follows: โˆ‘Y = na + b โˆ‘X โˆ‘XY = a โˆ‘X + b โˆ‘๐‘‹^2 b = it is called the regression coefficient of Y on X and is denoted by bYX. It measures the change in Y corresponding to a unit change in X. Thus bYX = Slope of the line of regression of Y on X and given by

bYX=

๐‘›โˆ‘๐‘‹๐‘Œโˆ’(โˆ‘๐‘‹)(โˆ‘๐‘Œ) ๐‘›โˆ‘๐‘‹^2 โˆ’(โˆ‘๐‘‹)^2

= b

The line of regression of Y on X passes through the point (๐‘‹,๐‘Œ) and hence the equation of the line of regression of Y on X (Y = a + bX) can also be written as

Y - ๐‘Œ = bYX (X- ๐‘‹ )

โžข The least square line of regression of X on Y : This equation is used to estimate a value of X for a given value of Y.

X = c + dY

where, c and d are constants The two normal equations for estimating c and d are given by โˆ‘X = nc + dโˆ‘Y โˆ‘XY = cโˆ‘Y + dโˆ‘Y^2 d = it is called the regression coefficient of X on Y and is denoted by bXY. It measures the change in X corresponding to a unit change in Y. Thus bXY= Slope of the line of regression of X on Y and given by

bXY =

๐‘›โˆ‘๐‘‹๐‘Œโˆ’(โˆ‘๐‘‹)(โˆ‘๐‘Œ) ๐‘›โˆ‘๐‘Œ^2 โˆ’(โˆ‘๐‘Œ)^2 The line of regression of X on Y passes through the point (๐‘‹,๐‘Œ) and hence the equation of the line of regression of X on Y (X = c + dY) can also be written as X- ๐‘‹ = bXY(Y - ๐‘Œ) Remarks:

1. It may be remarked that there are always two lines of regression, one of Y on X and the other X on Y. Y on X = to predict value of Y from known values of X X on Y = to predict value of X from known values of Y 2. Since the two lines of regression of passes through the point (๐‘‹,๐‘Œ), the mean values (๐‘‹,๐‘Œ) can be obtained as the point of intersection of the two regression lines. Regression Coefficient โ€“ Some Formulas

  1. Formulas for regression Coefficients in terms of Covariance and Variance:

bYX =

๐ถ๐‘œ๐‘ฃ(๐‘‹,๐‘Œ) ๐œŽ๐‘‹^2

bXY =

๐ถ๐‘œ๐‘ฃ(๐‘‹,๐‘Œ) ๐œŽ๐‘Œ^2

(We can also assume X= maintenance cost and Y = age of cars and then solve the question accordingly) X (Age) Y(Cost) X^2 Y^2 XY 2 10 4 100 20 4 20 16 400 80 6 25 36 625 150 8 30 64 900 240 โˆ‘X = 20 โˆ‘Y = 85 โˆ‘X^2 = 120 โˆ‘Y^2 = 2025 โˆ‘XY= 490 (a) Regression equation for Costs(Y) related to age(X): It means we have to find out equation Y on X, which is given as

Y - ๐‘Œ = bYX (X- ๐‘‹ )

And bYX =

๐‘›โˆ‘๐‘‹๐‘Œโˆ’(โˆ‘๐‘‹)(โˆ‘๐‘Œ) ๐‘›โˆ‘๐‘‹^2 โˆ’(โˆ‘๐‘‹)^2

bYX =

4 ร— 490 โˆ’( 20 )( 85 ) 4 ร— 120 โˆ’( 20 )^2

1960 โˆ’ 1700 480 โˆ’ 400

260 80

bYX = 3.

๐‘‹ = โˆ‘๐‘‹ ๐‘›

20 4 ๐‘‹ = 5 Regression equation Y on X:

Y - ๐‘Œ = bYX (X- ๐‘‹ )

Y โ€“ 21.25 = 3.25 (X โ€“ 5)

Y โ€“ 21.25 = 3.25X โ€“ 16.

Y = 5 + 3.25X, which is in the form of Y = a + bX

(b) Regression equation for Age (X) related to Cost(Y): It means we have to find out equation X on Y, which is given as

X- ๐‘‹ = bXY(Y - ๐‘Œ)

And, bXY =

๐‘›โˆ‘๐‘‹๐‘Œโˆ’(โˆ‘๐‘‹)(โˆ‘๐‘Œ) ๐‘›โˆ‘๐‘Œ^2 โˆ’(โˆ‘๐‘Œ)^2

bXY =

4 ร— 490 โˆ’( 20 )( 85 ) 4 ร— 2025 โˆ’( 85 )^2

1960 โˆ’ 1700 8100 โˆ’ 7225

260 875

bYX =.

โˆ‘๐‘Œ ๐‘›

85 4 ๐‘Œ = 21. Regression equation X on Y:

X - ๐‘‹ = bXY (Y- ๐‘Œ )

X โ€“ 5 = .297 (Y โ€“ 21.25)

Y โ€“ 5 = .297Y โ€“ 6.

X = - 1.31125 + .297Y, which is in the form of X = c + dY

(c) Estimate the annual cost for a ten-year-old car: It means what will be the value of Y, if value of X = 10 (X = 10, Y =?) That is if the age of car(X) is 10, what is the cost (Y) for such a car. To find out value of Y for the given value of X, we will use regression equation Y on X.

Y = a + bX

Y = 5 + 3.25X, as obtained in part (a)

Substituting X = 10 in above equation, the estimated annual maintenance cost for a ten-year-old car is:

Y = 5 + 3.25ร—

Y = 5 + 32.

Y = 37.5 (in Rs. hundred)

(d) Estimate the age(X) of a car whose maintenance cost(Y) is 50.