One-way ANOVA tests whether the means of three or more independent groups differ by more than random sampling variation alone. This calculator computes the F-statistic, its p-value, eta-squared as an effect size, and Bonferroni-corrected pairwise comparisons so you can identify which specific groups drive any significant overall difference.
How ANOVA works
ANOVA partitions the total variability in your data into two pieces: variability between groups (how far each group mean sits from the grand mean) and variability within groups (how far individual observations sit from their own group mean). If group membership matters, the between-groups piece should dwarf the within-groups piece. The F-statistic is exactly that ratio — between-groups mean square divided by within-groups mean square — and the F-distribution gives the probability of seeing an F that large by chance if every group truly had the same mean. A small p-value (smaller than your chosen alpha) means that's unlikely, so you reject the null hypothesis of equal means.
Why ANOVA instead of multiple t-tests
Running pairwise t-tests across k groups means k(k-1)/2 separate comparisons, each at alpha = 0.05. With four groups that's six tests, and the chance of at least one false positive climbs to roughly 26%. ANOVA performs a single omnibus test that holds the family-wise error rate at alpha for the overall question of whether any means differ. Only after that test rejects do you drop into pairwise comparisons — and even then you apply a multiplicity correction like Bonferroni so the overall false-positive risk stays controlled.
Limits and edge cases
One-way ANOVA assumes the groups are independent, the residuals are roughly normally distributed within each group, and the groups have roughly equal variances (homoscedasticity). The F-test is fairly robust to mild violations, but if variances differ by a factor of more than 3 or 4, use Welch's ANOVA. If the data are highly skewed or ordinal, use the non-parametric Kruskal-Wallis test instead. ANOVA also can't tell you about repeated measurements on the same subjects — that calls for a repeated-measures or mixed-effects model. Finally, eta-squared is the simplest effect-size measure but slightly biased upward in small samples; report omega-squared for publication-quality work.