← Niche Catalog

coding

Modality: llm_chat · full deep dive — every ranked model, test result, and artifact.

24
Models
342
Benchmark Results
0
Media Artifacts
3
Resolutions

Ranked Models

Ranked by confidence-adjusted score (single/zero-sample, non-curated scores floored; curated empirical scores trusted as-is).

#ModelProviderAdj. ScoreRawEvidence
1deepseek-chatdeepseek0.9960.997n=12836
2claude-sonnet-4-6anthropic0.9200.920curated
3claude-opus-4-6anthropic0.8400.840curated
4deepseek-ai/DeepSeek-R1together0.7530.753curated
5command-r-08-2024cohere0.7530.753curated
6Qwen/Qwen2.5-7B-Instruct-Turbotogether0.7530.753curated
7deepseek-ai/DeepSeek-V3.1together0.7530.753curated
8claude-haiku-4-5-20251001anthropic0.7340.734n=16760
9deepseek-reasonerdeepseek0.7000.700curated
10magistral-small-latestmistral0.5830.583curated
11mistral-large-latestmistral0.4170.417curated
12claude-sonnet-4-5-20250929anthropic0.3630.363n=12137
13gpt-4.1-miniopenai0.3440.344curated
14gpt-4o-miniopenai0.3440.344curated
15grok-3-minixai0.1780.400n=4
16sonarperplexity0.1670.377n=4
17sonar-properplexity0.1670.377n=4
18grok-3xai0.1670.377n=4
19gpt-4oopenai0.1500.150curated
20meta-llama/Llama-3.3-70B-Instruct-Turbotogether0.1500.150curated
21gemini-2.5-progoogle_gemini0.1090.109curated
22gemini-2.5-flashgoogle_gemini0.1020.102curated
23o3openai0.0000.000provisional (n≤1)
24o3-miniopenai0.0000.000provisional (n≤1)

Test Results

Showing top 40 of 342 benchmark outputs by weighted score.

deepseek-chat · deepseek · weighted 10.0 · syntactic_validity=10.0 · $8e-06 · 787ms

```python def add(a, b): return a + b ```

claude-haiku-4-5-20251001 · anthropic · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000238 · 633ms

```python class UserProfile: def __init__(self): self.email = None def update_email(self, new_email): self.email = new_email ```

mistral-small-latest · mistral · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 899ms

```python def add(a, b): return a + b ```

deepseek-chat · deepseek · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $1.2e-05 · 979ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

gpt-4o · openai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000485 · 744ms

```python class UserProfile: def __init__(self, email): self.email = email def update_email(self, new_email): self.email = new_email ```

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0 · $6.6e-05 · 633ms

```python def add(a, b): return a + b ```

Qwen/Qwen2.5-7B-Instruct-Turbo · together · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 657ms

```python class UserProfile: def __init__(self, email): self.email = email def update_email(self, new_email): self.email = new_email ``` This class includes an initializer to set the initial email and a method to update the email.

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0 · $6.6e-05 · 684ms

```python def add(a, b): return a + b ```

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0 · $6.6e-05 · 863ms

```python def add(a, b): return a + b ```

command-r-08-2024 · cohere · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 963ms

```python def add(a, b): return a + b ```

sonar · perplexity · weighted 10.0 · syntactic_validity=10.0 · $4.5e-05 · 6096ms

```python def add(a, b): return a + b ```[1][3][5]

mistral-small-latest · mistral · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 896ms

```python def add(a, b): return a + b ```

gpt-4o-mini · openai · weighted 10.0 · syntactic_validity=10.0 · $1.4e-05 · 894ms

```python def add(a, b): return a + b ```

gpt-4o-mini · openai · weighted 10.0 · syntactic_validity=10.0 · $1.4e-05 · 745ms

```python def add(a, b): return a + b ```

command-r-08-2024 · cohere · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 671ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

claude-sonnet-4-6 · anthropic · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1420ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

mistral-large-latest · mistral · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 745ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

command-r-08-2024 · cohere · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 769ms

```python def add(a, b): return a + b ```

deepseek-reasoner · deepseek · weighted 10.0 · syntactic_validity=10.0 · $0.000123 · 1284ms

```python def add(a, b): return a + b ```

grok-3 · xai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000396 · 1812ms

class UserProfile: def update_email(self, new_email): self.email = new_email

grok-3-mini · xai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $2.3e-05 · 4812ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

deepseek-chat · deepseek · weighted 10.0 · syntactic_validity=10.0 · $8e-06 · 872ms

```python def add(a, b): return a + b ```

sonar-pro · perplexity · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000513 · 2247ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```[2][5]

claude-haiku-4-5-20251001 · anthropic · weighted 10.0 · syntactic_validity=10.0 · $0.000107 · 881ms

```python def add(a, b): return a + b ```

mistral-large-latest · mistral · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 819ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

gpt-4.1-mini · openai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1556ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

gpt-4o-mini · openai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $2.9e-05 · 939ms

```python class UserProfile: def __init__(self, email): self.email = email def update_email(self, new_email): self.email = new_email ```

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000205 · 668ms

```python class UserProfile: def __init__(self, username, email): self.username = username self.email = email def update_email(self, new_email): self.email = new_email ```

claude-opus-4-6 · anthropic · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.00327 · 1714ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

meta-llama/Llama-3.3-70B-Instruct-Turbo · together · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1254ms

```python class UserProfile: def __init__(self, email): self.email = email def update_email(self, new_email): self.email = new_email ```

deepseek-ai/DeepSeek-V3.1 · together · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1402ms

```python class UserProfile: def __init__(self): self.email = None def update_email(self, new_email): self.email = new_email ```

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000205 · 910ms

```python class UserProfile: def __init__(self, username, email): self.username = username self.email = email def update_email(self, new_email): self.email = new_email ```

claude-haiku-4-5-20251001 · anthropic · weighted 10.0 · syntactic_validity=10.0 · $0.000107 · 709ms

```python def add(a, b): return a + b ```

deepseek-reasoner · deepseek · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.001169 · 7214ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

gemini-2.5-pro · google_gemini · weighted 10.0 · syntactic_validity=10.0 · $0.004649 · 6438ms

```python def add(a, b): return a + b ```

claude-sonnet-4-6 · anthropic · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 1479ms

```python def add(a, b): return a + b ```

sonar · perplexity · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $5.4e-05 · 2068ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

mistral-small-latest · mistral · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1790ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

sonar · perplexity · weighted 10.0 · syntactic_validity=10.0 · $4.3e-05 · 2045ms

```python def add(a, b): return a + b ```[3][5]

grok-3-mini · xai · weighted 10.0 · syntactic_validity=10.0 · $1.4e-05 · 2937ms

def add(a, b): return a + b

Resolved Here

Requests semantically routed into this niche.

RequestedMethodConfidenceWhen
code_writinglexical0.992026-05-20 17:25
code_refactorlexical0.992026-05-20 17:25
code_refactorlexical0.992026-05-20 17:24