Modality: modality_arm · full deep dive — every ranked model, test result, and artifact.
Ranked by confidence-adjusted score: non-curated scores backed by zero or one sample are floored, while curated empirical scores are trusted as-is.
| # | Model | Provider | Adj. Score | Raw | Evidence |
|---|---|---|---|---|---|
| 1 | openai-text-embedding-3-large | openai | 0.920 | 0.920 | curated |
| 2 | voyage-large-2 | voyage | 0.900 | 0.900 | curated |
| 3 | openai-text-embedding-3-small | openai | 0.850 | 0.850 | curated |
| 4 | jina-embeddings-v3 | jina | 0.850 | 0.850 | curated |
| 5 | bge-m3 | huggingface | 0.800 | 0.800 | curated |
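
The ranking rule stated above the table can be sketched as follows. This is only one possible reading, not the report's actual implementation: the floor value `ASSUMED_FLOOR`, the `Entry` type, the `samples` field, and the interpretation of "floored" as "capped at the floor" are all assumptions introduced for illustration.

```python
from dataclasses import dataclass

# Hypothetical floor; the report does not state the real value.
ASSUMED_FLOOR = 0.5


@dataclass
class Entry:
    model: str
    raw: float
    evidence: str      # "curated" or any other label
    samples: int = 0   # number of benchmark samples behind the raw score


def adjusted_score(entry: Entry, floor: float = ASSUMED_FLOOR) -> float:
    """Return the confidence-adjusted score under the assumed rule."""
    if entry.evidence == "curated":
        # Curated empirical scores are trusted as-is.
        return entry.raw
    if entry.samples <= 1:
        # Assumption: a zero/single-sample score is capped at the floor.
        return min(entry.raw, floor)
    return entry.raw


def rank(entries: list[Entry]) -> list[Entry]:
    """Sort by adjusted score, highest first; ties keep input order."""
    return sorted(entries, key=lambda e: -adjusted_score(e))


# Rows copied from the table above (all curated, so adjusted == raw).
TABLE = [
    Entry("openai-text-embedding-3-large", 0.920, "curated"),
    Entry("voyage-large-2", 0.900, "curated"),
    Entry("openai-text-embedding-3-small", 0.850, "curated"),
    Entry("jina-embeddings-v3", 0.850, "curated"),
    Entry("bge-m3", 0.800, "curated"),
]

if __name__ == "__main__":
    for position, entry in enumerate(rank(TABLE), start=1):
        print(position, entry.model, f"{adjusted_score(entry):.3f}")
```

Because every row in the table is curated, the adjusted and raw columns match under this sketch. A non-curated entry with a single sample would be capped at the assumed floor instead.
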

No benchmark outputs recorded for this niche yet.