登录

An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.

When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two variables. The slope tells us how the dependent variable (y) changes for every one unit increase in the independent (x) variable, on average. The y-intercept describes the dependent variable when the independent variable equals zero. A regression line, or a line of best fit, can be drawn on a scatter plot and used to predict outcomes for the x and y variables in a given data set or sample data.

The difference between the observed sample value, y, and the predicted value,Equation1from the regression equation, is known as unexplained deviation. Whereas the difference between the predicted value and the sample mean, y̅, is called the explained deviation. The difference between the observed value, y, and the sample mean, , is the total deviation.

If you add the squares of the explained deviations for all data points, we get the explained variation. In the same way, if we add the squares of the unexplained deviations for all data points, we get the unexplained variation. Also, if we add the squares of the total deviations for all data points, we get the total variation. Dividing the explained variation by the total deviation gives us the value of the coefficient of determination, r2, which represents the percent of the variation in the dependent variable y that can be explained by variation in the independent variable x using the regression line.

This text is adapted from Openstax, Introductory Statistics, Section 12, Linear Regression and Correlation.

Tags
VariationData SetsStandard DeviationVarianceScatter PlotSlopeIndependent VariableDependent VariableRegression LinePredicted ValueUnexplained DeviationExplained DeviationTotal DeviationExplained VariationUnexplained VariationCoefficient Of Determination

来自章节 11:

article

Now Playing

11.8 : Variation

Correlation and Regression

6.0K Views

article

11.1 : 相关

Correlation and Regression

10.7K Views

article

11.2 : 相关系数

Correlation and Regression

5.7K Views

article

11.3 : 计算和解释线性相关系数

Correlation and Regression

5.2K Views

article

11.4 : 回归分析

Correlation and Regression

5.2K Views

article

11.5 : 异常值和影响点

Correlation and Regression

3.8K Views

article

11.6 : Residuals 和 Least-Squares 属性

Correlation and Regression

6.6K Views

article

11.7 : 残差图

Correlation and Regression

3.9K Views

article

11.9 : 预测区间

Correlation and Regression

2.1K Views

article

11.10 : 多元回归

Correlation and Regression

2.8K Views

JoVE Logo

政策

使用条款

隐私

科研

教育

关于 JoVE

版权所属 © 2025 MyJoVE 公司版权所有,本公司不涉及任何医疗业务和医疗服务。