classification

Modality: llm_chat, modality_arm · full deep dive — every ranked model, test result, and artifact.

6
Models
0
Benchmark Results
0
Media Artifacts
0
Resolutions

Ranked Models

Ranked by confidence-adjusted score: non-curated scores based on zero or one sample are floored, while curated empirical scores are trusted as-is.

#  Model                    Provider     Adj. Score  Raw    Evidence
1  deepseek-chat            deepseek     0.976       1.000  n=205
2  claude-haiku-4-5         anthropic    0.850       0.850  curated
3  gemini-2.5-flash         google       0.820       0.820  curated
4  gpt-4o-mini              openai       0.800       0.800  curated
5  deberta-v3-large         huggingface  0.750       0.750  curated
6  distilbert-multilingual  huggingface  0.700       0.700  curated
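
The page does not state how the adjusted score is computed, so the following is only a minimal sketch of one way the ranking rule could work. Everything in it is an assumption made for illustration: the `Entry` type, the `FLOOR` value, the `PSEUDO_COUNT` value, and the shrinkage formula itself are hypothetical and not taken from the catalog. The sketch trusts curated scores as-is, caps non-curated scores with zero or one sample at a fixed floor, and shrinks other empirical scores toward zero in proportion to how few samples back them.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical constants: the catalog does not publish its real values.
FLOOR = 0.5         # assumed cap for non-curated scores with n <= 1
PSEUDO_COUNT = 5.0  # assumed shrinkage strength for empirical scores


@dataclass(frozen=True)
class Entry:
    """One ranked model. `n` is the sample count; None marks a curated score."""
    model: str
    raw: float
    n: Optional[int] = None

    @property
    def curated(self) -> bool:
        return self.n is None


def adjusted_score(entry: Entry) -> float:
    """Illustrative confidence adjustment (not the catalog's actual formula)."""
    if entry.curated:
        # Curated scores are trusted as-is.
        return entry.raw
    if entry.n <= 1:
        # Zero- or single-sample scores are capped at the assumed floor.
        return min(entry.raw, FLOOR)
    # Shrink toward zero: the fewer the samples, the larger the penalty.
    return entry.raw * entry.n / (entry.n + PSEUDO_COUNT)


entries = [
    Entry("gpt-4o-mini", 0.800),
    Entry("deepseek-chat", 1.000, n=205),
    Entry("claude-haiku-4-5", 0.850),
]

# Rank by adjusted score, highest first.
ranked = sorted(entries, key=adjusted_score, reverse=True)
for rank, e in enumerate(ranked, start=1):
    print(rank, e.model, f"{adjusted_score(e):.3f}", f"{e.raw:.3f}")
```

With the assumed pseudo-count of 5, the deepseek-chat row works out to 1.000 × 205 / 210 ≈ 0.976, which matches the value in the table. That agreement does not confirm the formula, since other adjustments (for example a Wilson-style lower bound with a suitable z value) could yield the same figure.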

Test Results

No benchmark outputs recorded for this niche yet.