§ art-015-statistical-rigor · April 17, 2026 · RESEARCH-NOTES

Statistical Rigor in Small-Scale ML Experiments

Harikumar · 7 min read

Three random seeds and a mean is not a confidence interval. A practical guide to bootstrap CIs, effect sizes, power analysis, and the statistical mistakes that plague ML papers.

statistics · bootstrap · effect sizes · reproducibility
