Least-Squares Regression Line

In linear regression, a least-squares regression line is used to assess the relationship between two quantitative variables. It estimates the value of the response variable for a given value of the explanatory variable. Four requirements must be satisfied before a valid least-squares regression line can be calculated:

  1. The two variables are quantitative.
  2. A linear association exists between the two variables.
  3. There are no outliers.
  4. The residuals have roughly equal spread across all values of the explanatory variable (checked with a residual plot).

A least-squares regression line \hat y is of the form \hat y=b_0+b_1x, with b_0=\bar y-b_1\bar x and b_1=r\left(\frac{s_y}{s_x}\right), where r is the correlation coefficient between x and y, \bar x and \bar y are the means of x and y, respectively, and s_x and s_y are the standard deviations of x and y, respectively. For more on calculating a least-squares regression line, visit this website.
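As a minimal sketch of the formulas above, the following Python computes b_1 and b_0 from the correlation, means, and standard deviations. The function name `least_squares_line` and the five data points are made up for illustration; they do not come from any real data set.

```python
from statistics import mean, stdev


def least_squares_line(xs, ys):
    """Return (b0, b1) for the line y_hat = b0 + b1 * x."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)  # sample standard deviations

    # Sample correlation coefficient r
    r = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / ((n - 1) * s_x * s_y)

    b1 = r * (s_y / s_x)      # slope
    b0 = y_bar - b1 * x_bar   # intercept
    return b0, b1


# Hypothetical data, for illustration only
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]

b0, b1 = least_squares_line(xs, ys)
print(f"y_hat = {b0:.2f} + {b1:.2f} x")
```

For these made-up points the means are 3 and 4, so the fitted line passes through (3, 4), as every least-squares regression line passes through (\bar x, \bar y).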

It is important to be sure the fourth requirement for a least-squares regression line is satisfied. If it is not satisfied but a regression line is still calculated, the strength of the relationship between the two variables may be misjudged. When a least-squares regression line is not appropriate, there are other options to model the data. The article “Misuse of correlation and regression in three medical journals” covers real-world examples of ways least-squares regression lines are misused.
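To check the fourth requirement, the residuals (observed y minus predicted \hat y) are computed and then plotted against x. The sketch below shows only the computation; the data points and the coefficients b_0 = 2.2 and b_1 = 0.6 are illustrative assumptions, and the helper name `residuals` is hypothetical.

```python
def residuals(xs, ys, b0, b1):
    """Return the residual y - y_hat for each point, where y_hat = b0 + b1 * x."""
    return [y - (b0 + b1 * x) for x, y in zip(xs, ys)]


# Hypothetical data and an assumed fitted line, for illustration only
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
b0, b1 = 2.2, 0.6

res = residuals(xs, ys, b0, b1)

# Pair each x with its residual; these pairs are the points of a residual plot
for x, e in zip(xs, res):
    print(f"x = {x}: residual = {e:+.2f}")
```

In a residual plot of these pairs, a roughly even band of points around zero is consistent with the requirement, while a fan or curve shape suggests a least-squares regression line is not appropriate.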

