Modality: modality_arm, video_generation · full deep dive — every ranked model, test result, and artifact.
Ranked by confidence-adjusted score: non-curated scores backed by at most one sample are floored, while curated empirical scores are trusted as-is.

| # | Model | Provider | Adj. Score | Raw | Evidence |
|---|---|---|---|---|---|
| 1 | dall-e-3 | openai | 0.850 | 0.850 | curated |
| 2 | flux-pro | fal | 0.850 | 0.850 | curated |
| 3 | flux-schnell | fal | 0.800 | 0.800 | curated |
| 4 | ideogram-v2 | replicate | 0.800 | 0.800 | curated |
| 5 | stable-diffusion-3 | fal | 0.750 | 0.750 | curated |
| 6 | sdxl | fal | 0.700 | 0.700 | curated |
| 7 | stability/stable-image-sd3-large | stability | 0.150 | 1.000 | provisional (n≤1) |
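
The adjustment behind the "Adj. Score" column can be sketched roughly as follows. This is a minimal illustration, not the ranking system's actual code: the `Entry` type, the `adjusted_score` and `rank` functions, and the `PROVISIONAL_SCORE` constant are all hypothetical names, and the value 0.150 is simply read off row 7 of the table. The sample count of 1 for the provisional row is also an assumption (the table only says n≤1).

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumption: the value applied to provisional entries, inferred from row 7.
PROVISIONAL_SCORE = 0.150


@dataclass
class Entry:
    model: str
    provider: str
    raw: float
    curated: bool
    n_samples: Optional[int] = None  # None = not reported in the table


def adjusted_score(entry: Entry) -> float:
    """Return the confidence-adjusted score for one entry (hypothetical rule)."""
    if entry.curated:
        # Curated empirical scores are trusted as-is.
        return entry.raw
    if entry.n_samples is not None and entry.n_samples <= 1:
        # Single/zero-sample, non-curated scores are limited to the provisional value.
        return min(entry.raw, PROVISIONAL_SCORE)
    return entry.raw


def rank(entries: List[Entry]) -> List[Entry]:
    """Sort by adjusted score, highest first; ties keep their input order."""
    return sorted(entries, key=adjusted_score, reverse=True)


entries = [
    Entry("dall-e-3", "openai", 0.850, curated=True),
    Entry("flux-pro", "fal", 0.850, curated=True),
    Entry("flux-schnell", "fal", 0.800, curated=True),
    Entry("ideogram-v2", "replicate", 0.800, curated=True),
    Entry("stable-diffusion-3", "fal", 0.750, curated=True),
    Entry("sdxl", "fal", 0.700, curated=True),
    # Assumption: n_samples=1 stands in for the table's "n≤1".
    Entry("stability/stable-image-sd3-large", "stability", 1.000,
          curated=False, n_samples=1),
]

for position, entry in enumerate(rank(entries), start=1):
    print(position, entry.model, f"{adjusted_score(entry):.3f}", f"{entry.raw:.3f}")
```

Under these assumptions, the entry with a raw score of 1.000 but only provisional evidence sorts below every curated entry, which matches its position in the table.
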

No benchmark outputs have been recorded for this niche yet.

Requests semantically routed into this niche:

| Requested | Method | Confidence | When |
|---|---|---|---|
| image_synthesis | semantic | 0.622 | 2026-05-20 17:26 |