← Niche Catalog

page_classify_short

Modality: llm_chat · full deep dive — every ranked model, test result, and artifact.

4
Models
0
Benchmark Results
0
Media Artifacts
0
Resolutions

Ranked Models

Ranked by confidence-adjusted score (single/zero-sample, non-curated scores floored; curated empirical scores trusted as-is).

#ModelProviderAdj. ScoreRawEvidence
1claude-sonnet-4-6anthropic0.9981.000n=2892
2claude-haiku-4-5-20251001anthropic0.1230.820provisional (n≤1)
3gpt-4o-miniopenai0.1110.740provisional (n≤1)
4gemini-2.5-flashgoogle_gemini0.1050.700provisional (n≤1)

Test Results

No benchmark outputs recorded for this niche yet.