← Niche Catalog

trm_structured_reasoning

Modality: modality_arm · full deep dive — every ranked model, test result, and artifact.

4
Models
0
Benchmark Results
0
Media Artifacts
0
Resolutions

Ranked Models

Ranked by confidence-adjusted score (single/zero-sample, non-curated scores floored; curated empirical scores trusted as-is).

#ModelProviderAdj. ScoreRawEvidence
1claude-opus-4-7anthropic0.9200.920curated
2gemini-2.5-progoogle0.8800.880curated
3gpt-4oopenai0.8500.850curated
4deepseek-v3deepseek0.8200.820curated

Test Results

No benchmark outputs recorded for this niche yet.