What is a 'Coefficient of Determination'

The coefficient of determination is a measure used in statistical analysis to assess how well a model explains observed outcomes and, by extension, how well it might predict future ones. It indicates the share of the variability in the data set that the model explains. The coefficient of determination, also commonly known as "R-squared," is used as a guideline for how well the model fits the data. One way of interpreting this figure is to say that the variables included in a given model explain approximately x% of the observed variation. So, if the R-squared is 0.50, then approximately half of the observed variation can be explained by the model.

BREAKING DOWN 'Coefficient of Determination'

The coefficient of determination is used to express how much of the variability of one factor can be explained by its relationship to another factor. It is relied on heavily in trend analysis and is represented as a value between 0 and 1. The closer the value is to 1, the better the fit, or relationship, between the two factors. In a simple linear regression, the coefficient of determination is the square of the correlation coefficient, also known as "R," which is why it reflects the degree of linear correlation between two variables.

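This measure is often described as the "goodness of fit." A value of 1.0 indicates a perfect fit: the model explains all of the variation observed in the sample. Such a model is a strong candidate for forecasting, although a perfect in-sample fit does not by itself guarantee accurate predictions on new data. A value of 0, on the other hand, indicates that the model explains none of the observed variation. For a model with several variables, such as a multiple regression model, the adjusted R-squared is a better coefficient of determination, because it penalizes the addition of variables that do not improve the fit. As a rough rule of thumb in economics, an R-squared above 0.60 is often seen as worthwhile, though what counts as acceptable varies by field and by type of data.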
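The adjustment for the number of variables can be illustrated with a short sketch. It uses the standard textbook formula, adjusted R-squared = 1 - (1 - R-squared)(n - 1)/(n - p - 1), where n is the number of observations and p the number of explanatory variables. The function name and the sample figures are hypothetical, chosen only for illustration.

```python
def adjusted_r_squared(r2, n_obs, n_predictors):
    """Adjusted R-squared for a model with an intercept.

    r2           -- ordinary R-squared of the fitted model
    n_obs        -- number of observations (n)
    n_predictors -- number of explanatory variables (p), excluding the intercept
    """
    dof = n_obs - n_predictors - 1
    if dof <= 0:
        raise ValueError("need more observations than estimated parameters")
    return 1 - (1 - r2) * (n_obs - 1) / dof


# Illustrative figures: the same R-squared of 0.50 on 21 observations,
# reached with one explanatory variable versus five.
one_variable = adjusted_r_squared(0.50, 21, 1)
five_variables = adjusted_r_squared(0.50, 21, 5)
print(round(one_variable, 4), round(five_variables, 4))
```

With the same R-squared, the model that needed five variables receives a noticeably lower adjusted value than the model that needed only one.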

Analyzing the Coefficient of Determination

The coefficient of determination is the square of the correlation between the scores predicted by a model and the actual scores in the data set. In a simple linear regression, it can also be expressed as the square of the correlation between the X and Y scores, with X being the independent variable and Y being the dependent variable.
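The equivalence of these representations can be checked numerically. The sketch below fits a least-squares line to a small made-up data set and computes R-squared three ways: from the residuals (1 minus the residual sum of squares over the total sum of squares), from the correlation between X and Y, and from the correlation between predicted and actual Y. The data and all function names are assumptions for illustration only.

```python
from math import sqrt


def mean(values):
    return sum(values) / len(values)


def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return my - slope * mx, slope


def pearson_r(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    ma, mb = mean(a), mean(b)
    cov = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    var_a = sum((p - ma) ** 2 for p in a)
    var_b = sum((q - mb) ** 2 for q in b)
    return cov / sqrt(var_a * var_b)


def r_squared(actual, predicted):
    """R-squared as 1 - SS_res / SS_tot."""
    m = mean(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - m) ** 2 for a in actual)
    return 1 - ss_res / ss_tot


# Made-up sample data, roughly linear.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

intercept, slope = fit_line(x, y)
predicted = [intercept + slope * v for v in x]

r2_from_residuals = r_squared(y, predicted)
r2_from_xy = pearson_r(x, y) ** 2
r2_from_predictions = pearson_r(predicted, y) ** 2
print(r2_from_residuals, r2_from_xy, r2_from_predictions)
```

All three figures agree up to floating-point rounding. The agreement with the X-Y correlation holds for a least-squares line with an intercept and a single independent variable; with several independent variables, only the residual-based and predicted-versus-actual forms apply.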

Regardless of representation, an R-squared equal to 0 means that the dependent variable cannot be predicted using the independent variable. Conversely, if it equals 1, it means that the dependent variable is always predicted exactly by the independent variable. A coefficient of determination that falls between these two values measures the extent to which the dependent variable is predicted by the independent variable. An R-squared of 0.20, for example, means that 20% of the variance in the dependent variable is explained by the independent variable.

What Is the Goodness of Fit?

The goodness of fit, or the degree of linear correlation, measures the distance between a fitted line on a graph and the data points scattered around it. A tight set of data will have a regression line that is very close to the points and a high level of fit, meaning that the distance between the line and the data is very small. A good fit has an R-squared that is close to 1.

However, R-squared cannot determine whether the data points or predictions are biased. It also doesn't tell the analyst or user whether the coefficient of determination value is good or not. A low R-squared is not necessarily bad, for example, and it is up to the analyst to judge the R-squared number in the context of the problem.
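A small constructed example shows how a high R-squared can coexist with biased predictions. The sketch below fits a straight line to data that actually follows a curve (y = x squared). The data set and helper names are hypothetical and chosen only to make the pattern easy to see.

```python
def fit_line(x, y):
    """Ordinary least squares fit of y = intercept + slope * x."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return my - slope * mx, slope


# Curved data: a straight line is the wrong model here.
x = list(range(1, 11))
y = [v ** 2 for v in x]

intercept, slope = fit_line(x, y)
predicted = [intercept + slope * v for v in x]
residuals = [actual - fitted for actual, fitted in zip(y, predicted)]

mean_y = sum(y) / len(y)
ss_res = sum(r ** 2 for r in residuals)
ss_tot = sum((v - mean_y) ** 2 for v in y)
r2 = 1 - ss_res / ss_tot

print(round(r2, 3))
print([round(r, 1) for r in residuals])
```

The R-squared comes out at roughly 0.95, yet the residuals are positive at both ends of the range and negative in the middle. The line systematically underpredicts at the extremes and overpredicts in between, which the single R-squared figure does not reveal. Plotting or listing the residuals is one common way to catch this.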

The coefficient of determination should not be interpreted naively. For example, if a model’s R-squared is reported at 75%, the variance of its errors is 75% less than the variance of the dependent variable, but the standard deviation of its errors is only 50% less than the standard deviation of the dependent variable. In other words, the standard deviation of the model’s errors is one-half the size of the standard deviation of the errors that you would get with a constant-only model. An R-squared of about 90% is needed before the standard deviation of the errors falls to roughly one-third of that size.
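The link between R-squared and the size of the errors can be worked through directly. Since R-squared is the fraction of variance explained, the unexplained fraction of variance is 1 - R-squared, and the ratio of the error standard deviation to the standard deviation of the dependent variable is the square root of that. The sketch below ignores degrees-of-freedom adjustments, and the function name is hypothetical.

```python
from math import sqrt


def error_sd_ratio(r2):
    """Ratio of the model's error standard deviation to the standard
    deviation of the dependent variable (the constant-only model's errors),
    ignoring degrees-of-freedom adjustments."""
    return sqrt(1 - r2)


for r2 in (0.20, 0.50, 0.75, 0.90):
    ratio = error_sd_ratio(r2)
    print(f"R-squared {r2:.2f}: error SD is {ratio:.3f} of the baseline, "
          f"a reduction of {100 * (1 - ratio):.1f}%")
```

Measured this way, an R-squared of 0.50 shrinks the typical error by less than 30%, which is one reason a seemingly large R-squared can correspond to a modest practical gain in precision.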

Finally, even if an R-squared value is large, the explanatory variables in a model may not be statistically significant, or the effect size of these variables may be very small in practical terms.
