Benchmarks¶

The skein-glm benchmark surface has two layers:

1. Per-scenario notes (this directory)¶

Hand-curated v1 benchmark pages from benches/, each with detailed headline numbers and correctness summaries for one (datafit, penalty) combination. These are stable and won’t be rewritten:

mcp_ls — MCP / Gaussian-LS vs skglm + ncvreg
scad_ls — SCAD / Gaussian-LS vs ncvreg
lasso_ls_correctness — Lasso / LS cross-package agreement

2. Publication-quality bundle (`benches/v2/`)¶

The full benchmark suite backing the software paper lives in benches/v2/ (see benches/v2/README.md in the repo). It produces ten figures (F1…F10) and five tables (T1…T6) committed to paper/ (see paper/README.md) — every artifact regenerable from a clean checkout via Snakemake:

pip install -e '.[bench]'
maturin develop --release
cd benches/v2 && snakemake --profile profiles/m1-headline

The bundle covers:

Coverage matrix (F1, T1) — every public estimator vs every comparator
Headline timings + scaling curves (F2, F3, T2)
Cross-package agreement (F4) — per-λ Jaccard / sign / rel-L2
Recovery on synthetic truth (F5, T4) — support F1, β-RMSE
Real-dataset case studies (F6, T5) — Riboflavin, Leukemia, PBC, Birthwt
IC selection (F7) and stability selection (F8)
Screening + parallelism ablation (F9) — from criterion microbenches
CV threading speedup (F10) — validates the M5.x-c 2.3-2.5× claim

See benches/v2/README.md for the design rationale (why this suite exists, what’s in it, how to run, reproducibility contract) and paper/BUNDLE.md for the artifact provenance manifest after a run.

CI¶

A lightweight regression canary (.github/workflows/bench-smoke.yml) runs two cells per PR to catch pipeline breakage. The full headline matrix is a maintainer-driven overnight job on a release build, not a per-PR check.

Benchmarks¶

1. Per-scenario notes (this directory)¶

2. Publication-quality bundle (benches/v2/)¶

CI¶

2. Publication-quality bundle (`benches/v2/`)¶