TAAS Routing & Audit

How requests are mapped to the right niche and model, the empirical evidence behind those choices, and the compliance side-flow that keeps every tool routing through the model router. The audit is the product.

Snapshot 2026-05-21 18:03 UTC · auto-refreshed

Router Compliance: 99.92%
Niche Resolutions: 415
Routed Calls: 961,178

Empirical Model Evidence

Head-to-head results that drive niche scores: actual outputs verified by TAAS, not vendor claims.

Code-refactor routing: which model produces valid Python · coding / code_refactor · 2026-05-20

| Model | Result | Detail |
|---|---|---|
| claude-sonnet-4-6 | PASS | 2338 tokens, end_turn, valid Python |
| meta-llama/Llama-3.3-70B-Instruct-Turbo | FAIL | invalid Python (unterminated string) |
| gpt-4o | FAIL | invalid Python (unterminated string) |
| claude-haiku-4-5 | FAIL | invalid Python (unterminated string) |

→ Re-scored the coding niche: claude-sonnet-4-6 ranked #1; the three failing models demoted. code_refactor added as a first-class scored niche.
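The snapshot does not show how TAAS verifies an output, so the following is only a minimal sketch of one way a "valid Python" check could work, assuming validity means the output parses with the standard library's `ast.parse`. The function name `check_python` and the sample strings are hypothetical.

```python
import ast


def check_python(source: str) -> tuple[bool, str]:
    """Return (is_valid, detail) for a candidate model output.

    Assumption: "valid" means the text parses as Python source.
    This does not execute the code or check that it behaves correctly.
    """
    try:
        ast.parse(source)
    except SyntaxError as exc:
        # exc.msg holds the parser's description, e.g. for an unterminated string.
        return False, f"invalid Python ({exc.msg})"
    return True, "valid Python"


# Hypothetical outputs: the second one never closes its string literal.
good = 'def greet(name):\n    return "hello " + name\n'
bad = 'def greet(name):\n    return "hello + name\n'

print(check_python(good))
print(check_python(bad))
```

A parse-only check is cheap enough to run on every head-to-head output, but it says nothing about whether the refactor preserved behavior; that would need tests on top.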

Smart Niche Resolution

Unknown or newly phrased niches are resolved to the nearest scored niche (semantic + lexical matching), or flagged as genuinely new and queued for benchmarking, instead of falling back to a blind default.

| Requested | Resolved → niche | Method | Confidence | When |
|---|---|---|---|---|
| educational_chat | chat (chat) | semantic | 0.593 | 2026-05-21 18:01 |
| expert_evaluation | NEW, queued for scoring | new_niche | | 2026-05-21 17:54 |
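The resolution logic above can be sketched as a nearest-niche lookup with a confidence threshold. This is an illustration under stated assumptions, not the TAAS resolver: the token-overlap cosine is a toy stand-in for a real embedding similarity, the 0.5 threshold is hypothetical, and the names `resolve`, `semantic_score`, and `lexical_score` are invented here. Its scores will not reproduce the 0.593 shown in the table.

```python
import math
from collections import Counter
from difflib import SequenceMatcher
from typing import NamedTuple, Optional

# Niche names that appear in this snapshot; the real scored set is larger.
SCORED_NICHES = ["chat", "coding", "code_refactor"]


class Resolution(NamedTuple):
    requested: str
    niche: Optional[str]
    method: str
    confidence: Optional[float]


def semantic_score(a: str, b: str) -> float:
    """Toy stand-in for embedding similarity: cosine over underscore-split tokens."""
    ta, tb = Counter(a.lower().split("_")), Counter(b.lower().split("_"))
    dot = sum(ta[t] * tb[t] for t in ta)
    norm = math.sqrt(sum(v * v for v in ta.values())) * math.sqrt(
        sum(v * v for v in tb.values())
    )
    return dot / norm if norm else 0.0


def lexical_score(a: str, b: str) -> float:
    """Character-level similarity from the standard library."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def resolve(requested: str, scored: list[str], threshold: float = 0.5) -> Resolution:
    """Map a requested niche to the nearest scored niche, or flag it as new."""
    if requested in scored:
        return Resolution(requested, requested, "exact", 1.0)
    best: Optional[tuple[float, str, str]] = None
    for niche in scored:
        for method, score in (
            ("semantic", semantic_score(requested, niche)),
            ("lexical", lexical_score(requested, niche)),
        ):
            if best is None or score > best[0]:
                best = (score, niche, method)
    if best is None or best[0] < threshold:
        # Hypothetical: the caller would queue this niche for benchmarking.
        return Resolution(requested, None, "new_niche", None)
    score, niche, method = best
    return Resolution(requested, niche, method, round(score, 3))


print(resolve("educational_chat", SCORED_NICHES))
print(resolve("expert_evaluation", SCORED_NICHES))
```

Returning an explicit `new_niche` result instead of a default model keeps the unresolved case visible in the log, which is what allows it to be queued for scoring.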

Model-Router Compliance Side-Flow

Every tool must route LLM/AI calls through the model router. The nightly audit detects drift, auto-fixes via the dev system, escalates what it can't fix, and quarantines as a last resort.

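| Run | Scanned | Compliant | Violators | Fixed | Escalated | When |
|---|---|---|---|---|---|---|
| v5_audit_20260521_030005 | 6385 | 6380 | 5 | 2 | 3 | 2026-05-21 03:00 |
| v5_audit_20260520_042942 | 6341 | 6341 | 0 | 0 | 0 | 2026-05-20 04:29 |
| v5_audit_20260520_041625 | 6334 | 6331 | 3 | 1 | 0 | 2026-05-20 04:16 |
| v5_audit_20260520_030001 | 6311 | 6310 | 1 | 0 | 0 | 2026-05-20 03:00 |
| v5_audit_20260519_232355 | 6282 | 6282 | 0 | 0 | 0 | 2026-05-19 23:23 |
| v5_audit_20260519_232335 | 6282 | 6282 | 0 | 0 | 0 | 2026-05-19 23:23 |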
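As a worked check on these figures, the sketch below computes a compliance percentage from one audit row. It assumes the headline Router Compliance value is compliant / scanned for the latest run, which the snapshot does not state; the `AuditRun` class and its methods are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AuditRun:
    run_id: str
    scanned: int
    compliant: int
    violators: int

    def is_consistent(self) -> bool:
        # Every scanned tool is either compliant or a violator.
        return self.scanned - self.compliant == self.violators

    def compliance_pct(self) -> float:
        # Assumption: compliance is the share of scanned tools that were compliant.
        return round(100.0 * self.compliant / self.scanned, 2)


# Values taken from the v5_audit_20260521_030005 run.
latest = AuditRun("v5_audit_20260521_030005", scanned=6385, compliant=6380, violators=5)

print(latest.is_consistent())
print(latest.compliance_pct())
```

Under that assumption, 6380 / 6385 rounds to 99.92%, which agrees with the headline figure.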

Auto-Fix Attempts

| Tool | Outcome | Violations | Attempts | When |
|---|---|---|---|---|
| image.fal_recraft_v3_v1 | fixed | | 1 | 2026-05-21 03:05 |
| forces.factory.generate_course_deck_v1 | needs_escalation | rest_anthropic, model_literal | 3 | 2026-05-21 03:03 |
| forces.content.generate_metaphors_v1 | capped | | | 2026-05-21 03:03 |
| forces.content.generate_card_art_v1 | fixed | | 1 | 2026-05-21 03:03 |
| aletheion.validate_field_intel_v1 | needs_escalation | rest_anthropic, model_literal | 3 | 2026-05-21 03:00 |
| test.v5_autofix_probe | fixed | | 1 | 2026-05-20 17:31 |
| forces.content.generate_metaphors_v1 | needs_escalation | rest_anthropic, model_literal | 2 | 2026-05-20 04:23 |
| forces.content.generate_metaphors_v1 | needs_escalation | rest_anthropic, model_literal | 2 | 2026-05-20 04:21 |
| expert.run_v1 | needs_escalation | rest_anthropic, model_literal | 2 | 2026-05-20 04:20 |
| forces.content.generate_metaphors_v1 | needs_escalation | rest_anthropic, model_literal | 3 | 2026-05-20 04:17 |
| forces.content.generate_card_art_v1 | fixed | | 1 | 2026-05-20 04:17 |
| expert.run_v1 | needs_escalation | rest_anthropic, model_literal | 3 | 2026-05-20 04:16 |
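The fix-then-escalate pattern in these rows can be sketched as a bounded retry loop. This is a hypothetical illustration, not the dev system's implementation: the function `run_autofix`, the `attempt_fix` callback, and the default limit of 3 attempts are assumptions, and the `capped` outcome and the quarantine step are not modeled because the snapshot does not define them.

```python
from typing import Callable

# An attempt_fix callback takes (tool, violations) and returns the violations
# that remain after one fix attempt. Both fixers below are hypothetical stubs.
FixFn = Callable[[str, list[str]], list[str]]


def run_autofix(
    tool: str,
    violations: list[str],
    attempt_fix: FixFn,
    max_attempts: int = 3,  # assumed limit; the real cap is not documented here
) -> dict:
    """Retry a fix until the tool is clean or the attempt budget is spent."""
    remaining = list(violations)
    attempts = 0
    while remaining and attempts < max_attempts:
        attempts += 1
        remaining = attempt_fix(tool, remaining)
    outcome = "fixed" if not remaining else "needs_escalation"
    return {
        "tool": tool,
        "outcome": outcome,
        "violations": remaining,
        "attempts": attempts,
    }


def fixer_that_works(tool: str, violations: list[str]) -> list[str]:
    return []  # pretend every violation was rewritten to use the router


def fixer_that_fails(tool: str, violations: list[str]) -> list[str]:
    return violations  # pretend nothing could be changed


print(run_autofix("demo.tool_a", ["model_literal"], fixer_that_works))
print(run_autofix("demo.tool_b", ["rest_anthropic", "model_literal"], fixer_that_fails))
```

Recording the remaining violations alongside the attempt count mirrors the shape of the rows above: fixed tools show no violations, while escalated tools carry the violations that survived every attempt.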