Autonomous discovery + ingest via daedarch.pipeline.state_data_scavenger.
Updated 2026-04-24 02:08 UTC. Source of truth: internal gold layer with full lineage.
| State | Sources | Gold | Valid % | Owner % | Value % | Geom % | Area (sq mi) |
|---|---|---|---|---|---|---|---|
| TN | 92 | 540,765 | 95% | 95% | 94% | 99% | 41,235 |
| VT | 1 | 315,170 | 94% | 94% | 94% | 99% | 9,217 |
| UT | 10 | 7,529 | 57% | 3% | 0% | 100% | 82,170 |
| NM | 6 | 3,752 | 7% | 0% | 0% | 51% | 121,298 |
| WA | 10 | 3,386 | 17% | 7% | 0% | 73% | 66,455 |
| NY | 8 | 3,193 | 42% | 57% | 56% | 100% | 47,126 |
| MS | 8 | 2,956 | 99% | 96% | 17% | 99% | 46,923 |
| WI | 11 | 2,737 | 21% | 8% | 0% | 100% | 54,158 |
| NJ | 7 | 2,100 | 42% | 28% | 28% | 100% | 7,354 |
| DE | 7 | 2,081 | 42% | 14% | 0% | 100% | 1,949 |
| KS | 6 | 1,913 | 31% | 46% | 0% | 100% | 81,759 |
| TX | 7 | 1,876 | 52% | 59% | 0% | 61% | 261,232 |
| IA | 7 | 1,800 | 33% | 49% | 16% | 100% | 55,857 |
| AR | 4 | 1,611 | 44% | 3% | 0% | 100% | 52,035 |
| RI | 5 | 1,609 | 25% | 79% | 0% | 100% | 1,034 |
| FL | 4 | 1,498 | 100% | 66% | 66% | 100% | 53,625 |
| CO | 8 | 1,438 | 58% | 78% | 0% | 79% | 103,642 |
| CT | 3 | 1,400 | 35% | 78% | 0% | 100% | 4,842 |
| MN | 8 | 1,287 | 0% | 65% | 0% | 100% | 79,627 |
| ME | 6 | 1,282 | 26% | 0% | 0% | 99% | 30,843 |
| MD | 9 | 1,200 | 25% | 49% | 0% | 75% | 9,707 |
| OH | 7 | 1,200 | 60% | 60% | 0% | 99% | 40,861 |
| OR | 5 | 1,200 | 0% | 0% | 0% | 100% | 95,988 |
| AK | 11 | 1,200 | 75% | 74% | 20% | 100% | 570,641 |
| GA | 6 | 1,200 | 100% | 49% | 24% | 100% | 57,513 |
| CA | 9 | 1,200 | 50% | 25% | 0% | 100% | 155,779 |
| HI | 7 | 1,200 | 0% | 0% | 0% | 100% | 6,422 |
| ND | 10 | 1,200 | 50% | 47% | 0% | 100% | 69,001 |
| PA | 8 | 1,198 | 49% | 49% | 24% | 100% | 44,743 |
| MO | 8 | 1,198 | 100% | 49% | 100% | 100% | 68,742 |
| AZ | 12 | 1,196 | 49% | 49% | 0% | 100% | 113,594 |
| MI | 7 | 1,194 | 49% | 31% | 0% | 100% | 56,539 |
| ID | 7 | 1,194 | 100% | 17% | 0% | 100% | 82,643 |
| VA | 8 | 1,191 | 74% | 48% | 0% | 100% | 39,490 |
| IN | 4 | 1,186 | 74% | 24% | 0% | 100% | 35,826 |
| KY | 5 | 1,164 | 100% | 25% | 0% | 100% | 39,486 |
| WY | 6 | 1,161 | 23% | 25% | 0% | 100% | 97,093 |
| WV | 7 | 1,120 | 73% | 97% | 0% | 100% | 24,038 |
| NC | 7 | 1,077 | 0% | 94% | 0% | 72% | 48,618 |
| OK | 6 | 1,012 | 0% | 29% | 0% | 100% | 68,595 |
| AL | 3 | 990 | 67% | 67% | 0% | 99% | 50,645 |
| DC | 10 | 925 | 67% | 64% | 32% | 99% | 61 |
| NV | 8 | 920 | 97% | 32% | 0% | 97% | 109,781 |
| NE | 3 | 900 | 65% | 32% | 28% | 100% | 76,824 |
| IL | 8 | 900 | 33% | 100% | 0% | 100% | 55,519 |
| NH | 7 | 900 | 33% | 33% | 0% | 33% | 8,953 |
| MA | 7 | 881 | 65% | 59% | 0% | 100% | 7,800 |
| MT | 8 | 797 | 97% | 0% | 37% | 98% | 145,546 |
| SD | 6 | 718 | 58% | 97% | 55% | 78% | 75,811 |
| SC | 2 | 599 | 100% | 100% | 0% | 100% | 30,061 |
| LA | 6 | 504 | 98% | 80% | 0% | 99% | 43,204 |
AGOL-search patterns dominate. Tavily patterns get deprioritized as their loss count grows.
| Backend | Query template | Wins | Losses | Win rate |
|---|---|---|---|---|
| agol_search | {state_name} standardized parcels | 239 | 0 | 100% |
| agol_search | {state} parcels title:parcels | 219 | 1 | 99% |
| agol_search | {state_name} construction permits | 173 | 1 | 99% |
| agol_search | {state} building permits title:permits | 126 | 3 | 97% |
| tavily | {state_name} city open data portal building permits Socrata dataset | 82 | 4 | 94% |
| tavily | {state_name} statewide tax parcels ArcGIS FeatureServer REST endpoint | 30 | 29 | 50% |
| tavily | {state_name} construction permits ArcGIS FeatureServer | 25 | 41 | 37% |
| tavily | {state_name} county assessor parcel data portal ArcGIS open data | 1 | 50 | 2% |
daedarch.pipeline.state_data_scavenger — Tavily + AGOL content-search + Data.gov + direct URL patterns. Per-state bbox rejection for cross-state false positives. Pattern win/loss tracked in state_scavenger_patterns for self-improvement.daedarch.pipeline.harvest_source_to_bronze — paginated FeatureServer pull → trellison_parcels_bronze with provenance (source_key, source_url, fetched_at, discovery_run_id).daedarch.pipeline.promote_bronze_to_gold v1.2 — auto-learned field map per source (regex-scored heuristics across 9 canonical buckets), validation status (valid / warnings / failed), full lineage (bronze_row_id + source_url + license_flag + transformation_version)..gov, .state.us, AGOL public tenants → gov_open_data). Vendor portals → subscription_internal_only.
All data sourced from public government open-data portals and AGOL public tenants.
Every gold row carries _source_url, _license_flag, _fetched_at, _promoted_at.
To add a new state: POST /api/v4/execute {"tool_id":"daedarch.pipeline.state_data_pipeline","inputs":{"state":"XX"}}
Feedback from visitors, translated into business terminology and listed below. Use the assistant in the corner to add a comment.