Pearson Correlation Calculator

Calculate Pearson's r to measure the strength and direction of linear relationships with scatter plot and step-by-step workings

About the Pearson Correlation Calculator

The Pearson Correlation Calculator is a statistical tool used to quantify the linear relationship between two continuous variables. Known formally as the Pearson Product-Moment Correlation (PPMC), this metric produces a value between -1 and +1, where the sign indicates the direction of the relationship and the magnitude indicates the strength. Researchers, data scientists, and students use this calculator to determine if an increase in one variable is consistently associated with an increase or decrease in another, helping to validate hypotheses in fields ranging from psychology to finance.

To use the tool, users input two sets of paired data. The calculator then computes the mean of each set, the deviations from those means, and the final r value. It is an essential first step in bivariate analysis, providing a mathematical basis for what might be observed visually in a scatter plot. By calculating Pearson's r, you can transition from subjective observations to objective statistical evidence regarding the connectivity of your data points.

Formula

r = Σ((x - x̄)(y - ȳ)) / √[Σ(x - x̄)² * Σ(y - ȳ)²]

In this formula, r represents the Pearson correlation coefficient. The numerator is the sum of the products of the deviations of each pair of scores (x and y) from their respective means (x-bar and y-bar). This captures how the variables vary together.

The denominator is the product of the square roots of the sum of squared deviations for each variable. This normalizes the result by the total variation in the data, ensuring the final r value stays between -1 and 1. Essentially, the calculation divides the covariance of the two variables by the product of their standard deviations.

Worked examples

Example 1: A study tracks the hours spent studying (X) and exam scores (Y) for three students: (2, 70), (4, 85), and (6, 98).

1. Find Mean X: (2+4+6)/3 = 4. Mean Y: (70+85+98)/3 = 84.33.\n2. Calculate deviations for X (x - x̄): -2, 0, 2. Square them: 4, 0, 4. Sum = 8.\n3. Calculate deviations for Y (y - ȳ): -14.33, 0.67, 13.67. Square them: 205.35, 0.45, 186.87. Sum = 392.67.\n4. Calculate product of deviations: (-2*-14.33) + (0*0.67) + (2*13.67) = 28.66 + 0 + 27.34 = 56.\n5. Apply formula: 56 / √(8 * 392.67) = 56 / √3141.36 = 56 / 56.05 = 0.999 (rounded).

Result: r = 0.98. This indicates a very strong positive linear relationship between study hours and exam scores.

Common use cases

Pitfalls and limitations

Frequently asked questions

what does it mean if pearson correlation is zero

A Pearson r value of zero indicates there is no linear relationship between the variables. However, it does not mean there is no relationship at all; the variables could still have a strong non-linear or curvilinear relationship, such as a U-shape.

what is the range of pearson correlation coefficient

The Pearson correlation coefficient ranges from -1 to +1. A value of +1 represents a perfect positive linear relationship, -1 represents a perfect negative linear relationship, and 0 indicates no linear association between the datasets.

difference between pearson and spearman correlation

Pearson correlation measures linear relationships between interval or ratio data, while Spearman's rank correlation measures monotonic relationships using ranked data. If your data has outliers or is non-linear but consistently increasing, Spearman is often preferred.

is 0.7 a strong pearson correlation

While interpretations vary by field, generally a coefficient between 0.1 and 0.3 is considered weak, 0.4 to 0.6 is moderate, and 0.7 to 0.9 is strong. A value of 1.0 is a perfect correlation, which is rare in real-world social sciences.

can outliers affect pearson correlation results

Yes, Pearson's r is highly sensitive to outliers because it uses the mean and standard deviation in its calculation. A single extreme data point can significantly inflate or deflate the correlation coefficient, potentially leading to misleading conclusions.

Related calculators

5 Number Summary Calculator
Calculate the five-number summary (min, Q1, median, Q3, max) and visualize with a box plot
Absolute Uncertainty Calculator
Calculate absolute and relative uncertainty for measurements and experimental data
Average Rating Calculator
Calculate the weighted average star rating from individual vote counts for reviews and feedback
Accuracy Calculator
Calculate accuracy, precision, and error rates for statistical analysis
Adjusted R-Squared Calculator
Calculate adjusted R² to account for the number of predictors in regression models
AIC/BIC Calculator
Compare statistical models using Akaike and Bayesian Information Criteria for model selection
Accuracy Calculator
Calculate accuracy, precision, and error rates for statistical analysis
ANOVA Calculator
Perform one-way Analysis of Variance to test if group means differ significantly