← Niche Catalog

python_code_generation

Modality: llm_chat · full deep dive — every ranked model, test result, and artifact.

30
Models
342
Benchmark Results
0
Media Artifacts
0
Resolutions

Ranked Models

Ranked by confidence-adjusted score (single/zero-sample, non-curated scores floored; curated empirical scores trusted as-is).

#ModelProviderAdj. ScoreRawEvidence
1deepseek-reasonerdeepseek0.3560.800n=4
2grok-3-minixai0.3560.800n=4
3claude-opus-4-6anthropic0.3560.800n=4
4gemini-2.5-progoogle_gemini0.3460.778n=4
5gemini-2.5-flashgoogle_gemini0.3380.760n=4
6gpt-4oopenai0.3360.756n=4
7meta-llama/Llama-3.3-70B-Instruct-Turbotogether0.3360.756n=4
8deepseek-ai/DeepSeek-R1together0.3350.753n=4
9sonarperplexity0.3350.753n=4
10sonar-properplexity0.3350.753n=4
11grok-3xai0.3350.753n=4
12gpt-4.1-miniopenai0.3350.753n=4
13command-r-08-2024cohere0.3350.753n=4
14Qwen/Qwen2.5-7B-Instruct-Turbotogether0.3350.753n=4
15gpt-4o-miniopenai0.3350.753n=4
16deepseek-ai/DeepSeek-V3.1together0.3350.753n=4
17magistral-small-latestmistral0.2590.583n=4
18mistral-large-latestmistral0.1850.417n=4
19claude-sonnet-4-6anthropic0.1200.803provisional (n≤1)
20claude-haiku-4-5-20251001anthropic0.1200.803provisional (n≤1)
21claude-opus-4-7anthropic0.1130.753provisional (n≤1)
22grok-4xai0.1130.753provisional (n≤1)
23gpt-4.1openai0.1130.753provisional (n≤1)
24gpt-5.1openai0.1130.753provisional (n≤1)
25o4-miniopenai0.1130.753provisional (n≤1)
26gpt-5-nanoopenai0.1130.753provisional (n≤1)
27deepseek-chatdeepseek0.0000.000n=4
28mistral-small-latestmistral0.0000.000n=4
29gpt-5openai0.0000.000provisional (n≤1)
30gpt-5-miniopenai0.0000.000provisional (n≤1)

Test Results

Showing top 40 of 342 benchmark outputs by weighted score.

deepseek-chat · deepseek · weighted 10.0 · syntactic_validity=10.0 · $8e-06 · 787ms

```python def add(a, b): return a + b ```

claude-haiku-4-5-20251001 · anthropic · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000238 · 633ms

```python class UserProfile: def __init__(self): self.email = None def update_email(self, new_email): self.email = new_email ```

mistral-small-latest · mistral · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 899ms

```python def add(a, b): return a + b ```

deepseek-chat · deepseek · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $1.2e-05 · 979ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

gpt-4o · openai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000485 · 744ms

```python class UserProfile: def __init__(self, email): self.email = email def update_email(self, new_email): self.email = new_email ```

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0 · $6.6e-05 · 633ms

```python def add(a, b): return a + b ```

Qwen/Qwen2.5-7B-Instruct-Turbo · together · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 657ms

```python class UserProfile: def __init__(self, email): self.email = email def update_email(self, new_email): self.email = new_email ``` This class includes an initializer to set the initial email and a method to update the email.

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0 · $6.6e-05 · 684ms

```python def add(a, b): return a + b ```

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0 · $6.6e-05 · 863ms

```python def add(a, b): return a + b ```

command-r-08-2024 · cohere · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 963ms

```python def add(a, b): return a + b ```

sonar · perplexity · weighted 10.0 · syntactic_validity=10.0 · $4.5e-05 · 6096ms

```python def add(a, b): return a + b ```[1][3][5]

mistral-small-latest · mistral · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 896ms

```python def add(a, b): return a + b ```

gpt-4o-mini · openai · weighted 10.0 · syntactic_validity=10.0 · $1.4e-05 · 894ms

```python def add(a, b): return a + b ```

gpt-4o-mini · openai · weighted 10.0 · syntactic_validity=10.0 · $1.4e-05 · 745ms

```python def add(a, b): return a + b ```

command-r-08-2024 · cohere · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 671ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

claude-sonnet-4-6 · anthropic · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1420ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

mistral-large-latest · mistral · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 745ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

command-r-08-2024 · cohere · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 769ms

```python def add(a, b): return a + b ```

deepseek-reasoner · deepseek · weighted 10.0 · syntactic_validity=10.0 · $0.000123 · 1284ms

```python def add(a, b): return a + b ```

grok-3 · xai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000396 · 1812ms

class UserProfile: def update_email(self, new_email): self.email = new_email

grok-3-mini · xai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $2.3e-05 · 4812ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

deepseek-chat · deepseek · weighted 10.0 · syntactic_validity=10.0 · $8e-06 · 872ms

```python def add(a, b): return a + b ```

sonar-pro · perplexity · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000513 · 2247ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```[2][5]

claude-haiku-4-5-20251001 · anthropic · weighted 10.0 · syntactic_validity=10.0 · $0.000107 · 881ms

```python def add(a, b): return a + b ```

mistral-large-latest · mistral · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 819ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

gpt-4.1-mini · openai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1556ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

gpt-4o-mini · openai · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $2.9e-05 · 939ms

```python class UserProfile: def __init__(self, email): self.email = email def update_email(self, new_email): self.email = new_email ```

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000205 · 668ms

```python class UserProfile: def __init__(self, username, email): self.username = username self.email = email def update_email(self, new_email): self.email = new_email ```

claude-opus-4-6 · anthropic · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.00327 · 1714ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

meta-llama/Llama-3.3-70B-Instruct-Turbo · together · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1254ms

```python class UserProfile: def __init__(self, email): self.email = email def update_email(self, new_email): self.email = new_email ```

deepseek-ai/DeepSeek-V3.1 · together · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1402ms

```python class UserProfile: def __init__(self): self.email = None def update_email(self, new_email): self.email = new_email ```

gemini-2.5-flash · google_gemini · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.000205 · 910ms

```python class UserProfile: def __init__(self, username, email): self.username = username self.email = email def update_email(self, new_email): self.email = new_email ```

claude-haiku-4-5-20251001 · anthropic · weighted 10.0 · syntactic_validity=10.0 · $0.000107 · 709ms

```python def add(a, b): return a + b ```

deepseek-reasoner · deepseek · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.001169 · 7214ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

gemini-2.5-pro · google_gemini · weighted 10.0 · syntactic_validity=10.0 · $0.004649 · 6438ms

```python def add(a, b): return a + b ```

claude-sonnet-4-6 · anthropic · weighted 10.0 · syntactic_validity=10.0 · $0.0 · 1479ms

```python def add(a, b): return a + b ```

sonar · perplexity · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $5.4e-05 · 2068ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

mistral-small-latest · mistral · weighted 10.0 · syntactic_validity=10.0, naming_conventions=10.0 · $0.0 · 1790ms

```python class UserProfile: def update_email(self, new_email): self.email = new_email ```

sonar · perplexity · weighted 10.0 · syntactic_validity=10.0 · $4.3e-05 · 2045ms

```python def add(a, b): return a + b ```[3][5]

grok-3-mini · xai · weighted 10.0 · syntactic_validity=10.0 · $1.4e-05 · 2937ms

def add(a, b): return a + b