← Niche Catalog

general

Modality: llm_chat · full deep dive — every ranked model, test result, and artifact.

5
Models
0
Benchmark Results
0
Media Artifacts
0
Resolutions

Ranked Models

Ranked by confidence-adjusted score (single/zero-sample, non-curated scores floored; curated empirical scores trusted as-is).

#ModelProviderAdj. ScoreRawEvidence
1deepseek-chatdeepseek0.7061.000n=12
2gpt-4oopenai0.6950.698n=1063
3gemini-2.5-progoogle_gemini0.4441.000n=4
4claude-haiku-4-5-20251001anthropic0.2120.424n=5
5gemini-2.5-flashgoogle_gemini0.0590.133n=4

Test Results

No benchmark outputs recorded for this niche yet.