The correlation coefficient (r) measures the strength and direction of the linear relationship between two variables. It ranges from −1 to +1, where −1 is a perfect negative correlation, 0 is no correlation, and +1 is a perfect positive correlation. It is one of the most used (and misused) statistics in science, business, and everyday reasoning.

The Correlation Coefficient

Interpreting r-values
r = +1.0: Perfect positive (both increase together) r = +0.7 to +0.9: Strong positive r = +0.4 to +0.7: Moderate positive r = +0.1 to +0.4: Weak positive r = 0.0: No linear relationship Negative values: same scale, opposite direction

The sign indicates direction; the magnitude indicates strength.

r-squared: The Explained Variance

Squaring r gives r², the coefficient of determination. This tells you what percentage of variation in one variable is explained by the other. If r = 0.80, then r² = 0.64, meaning 64% of the variation is explained by the relationship. The remaining 36% is due to other factors.

r valueInterpretation
±0.900.8181% of variance explained — very strong
±0.700.4949% of variance explained — strong
±0.500.2525% of variance explained — moderate
±0.300.099% of variance explained — weak

Calculate correlation for your own data with the Correlation Calculator.

Correlation Does Not Imply Causation

This is the most important principle in statistics. A strong correlation between two variables does not mean one causes the other. Ice cream sales and drowning deaths are positively correlated — not because ice cream causes drowning, but because both increase in summer (a confounding variable). Always look for confounders, reverse causation, and coincidence before drawing causal conclusions.

Key Takeaways

  • r ranges from −1 to +1 measuring strength and direction of linear relationship.
  • r² tells you the percentage of variation explained by the relationship.
  • Correlation does not imply causation — always consider confounders.
  • r only measures linear relationships — it misses curved or complex patterns.

Frequently Asked Questions

What is a strong correlation?

In most fields, r above 0.70 (or below -0.70) is considered strong. In social sciences, where human behavior is noisy, r above 0.50 may be considered strong. In physics and engineering, r below 0.95 may be considered weak. Context matters.

Can correlation be negative?

Yes. A negative correlation means as one variable increases, the other decreases. For example, hours of exercise per week and body fat percentage have a negative correlation. The strength is still measured by the magnitude (closer to -1 is stronger).

What is the difference between correlation and regression?

Correlation measures the strength of a relationship (r-value). Regression builds a predictive equation (y = mx + b) that lets you predict one variable from the other. Regression uses correlation as part of its calculation but provides more practical predictive power.

Looking for more? Browse all free resources including guides, comparisons, and glossary terms.