Format
{domain}_{entity}
Separator: _
Examples
corpus_personscorpus_documentscorpus_atlasops_executionstoolsDomains
corpus
World data — everything we know. People, places, organizations, documents, research, government data, financial data, patents, education, entities. The information itself.
Any data that represents knowledge about the real or virtual world. If you could query it to answer a question about reality, it belongs here.
corpus.persons — Unified person profiles (identity-resolved from all sources)corpus.documents — All documents, research, intelligence, knowledge assetscorpus.atlas — World data — World Bank, IMF, census, county, state, federal datasetscorpus.atlas_catalog — Schema registry for atlas data sources with parameter mappingscorpus.education — Schools, districts, enrollment, assessments, graduation, civil rightscorpus.entities — Government entities, local entities, organizations, companiescorpus.patents — Patent families, claims, prior art, FTO, moat strategies, filingscorpus.ventures — Venture profiles, evaluations, lifecycle, assessments, org chartscorpus.social — LinkedIn activity, outreach, social media, network datacorpus.web — Pages, templates, layouts, themes, share sites, DSR contentcorpus.har — HAR captures, endpoints, journeys, site profiles, auth patternscorpus.catalogs — Product catalogs, dataset catalogs, model catalogs, inventoriescorpus.grants — Grant opportunities, applications, funder profiles, alignmentscorpus.comms — Email, messages, notifications, correspondence, campaignstools
The capability layer — what we can do. Tool registry is a single collection. Capabilities link tools to what they enable.
Tool definitions, capability registry, tool KPIs, tool healing records. NOT execution records (those are ops).
tools — The tool registry (existing, stays as-is)tools.capabilities — Capability registry linking tools to ventures and gapstools.gaps — Detected capability gapstools.audit — Capability extraction audit trailanalyses
Outputs from applying tools to corpus data. Investigations, 10-K reports, performance reports, evaluations, assessments. The conclusions.
Any document that represents an analytical output — something generated by examining corpus data. If a human or model drew a conclusion, it belongs here.
analyses.all — Unified analyses — tagged by analysis_type (agi_evaluation, financial_analysis, content_quality, governance, outcome_analysis, investigation, etc.)connectors
Data acquisition infrastructure — how we get data into the corpus. Harvest configs, pipeline state, ingestion logs, connector configs.
Anything related to acquiring external data. Connector configs, harvest jobs, pipeline DAGs, ingestion runs. The plumbing.
connectors.all — Unified connectors — tagged by connector_type (harvest, pipeline, ingestion, connector_config, harvested_model, harvested_source, harvested_tool)ops
Operational machinery — execution records, traces, logs, configs, sessions, caches, security. The runtime.
System state that supports execution but is not corpus data, not analysis output, and not tool definition. Logs, configs, sessions, execution records.
ops.executions — Tool execution recordsops.traces — Execution traces and spansops.security — Security audit log, alerts, events, controlsops.sessions — User sessions, chat sessions, auth sessionsops.configs — System configs, environment configs, feature flagsops.caches — Cache tables, materialized viewsops.uptime — Health checks, uptime monitoringops.threads — Dev threads, thread learnings, agent tasksops.selection — Model selection events, MAB statearchived
Superseded collections. Data migrated to a corpus/tools/analyses/connectors/ops collection. Prefixed with archived_ for discoverability.
Any collection whose data has been fully migrated to a unified collection. Never deleted — renamed with archived_ prefix and migration metadata.
Enforcement
- method
- contract_precondition
- description
- Any tool that creates or renames a collection must validate the name against this standard. The contract evaluator checks the domain prefix before allowing execution.
- violations
- {'wrong_prefix': 'Collection name does not match any defined domain prefix', 'missing_provenance': 'Migrated documents missing required provenance fields', 'orphan_collection': 'Collection exists outside any domain prefix and is not archived'}