← Niche Catalog

code_refactor

Modality: llm_chat · full deep dive — every ranked model, test result, and artifact.

3
Models
0
Benchmark Results
0
Media Artifacts
0
Resolutions

Ranked Models

Ranked by confidence-adjusted score (single/zero-sample, non-curated scores floored; curated empirical scores trusted as-is).

#ModelProviderAdj. ScoreRawEvidence
1claude-sonnet-4-6anthropic0.9200.920curated
2claude-opus-4-6anthropic0.8400.840curated
3deepseek-reasonerdeepseek0.7000.700curated

Test Results

No benchmark outputs recorded for this niche yet.