OpenGov Technical

Technical Document: How OpenGov Encyclopedia Would Work


== 1. High-Level Architecture ==
 
External Sources (Public Only)


  └── Audit / revision tags table


== 2. Seeding the Initial Content (Phase 1: Weeks 1–4) ==
 
=== Step 1: Infrastructure Setup (Week 1) ===
* Deploy MediaWiki with the Cargo extension and a USWDS-aligned skin on FedRAMP hosting.
* Define the core Cargo tables (Agency, Program, Organization, Topic).
* Configure the Grok bot account via GSA OneGov.


=== Step 2: Seed Agencies & Major Organizations (Weeks 2–3) ===
 
* Use the official USA.gov A-Z Agency Index + agency “About” pages as the seed list (~150–200 URLs).
* Grok batch job (one-time run):
* All items route to the Clearance Dashboard for batch attestation (high-confidence items auto-pass after validation).
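
The batch attestation routing described above can be sketched roughly as follows. This is an illustration only: the <code>DraftItem</code> record, the <code>route</code> function, the 0–1 confidence scale, and the 0.9 cutoff are all assumptions, since the plan does not specify how "high-confidence" is measured.

```python
from dataclasses import dataclass

# Hypothetical cutoff; the plan does not state a threshold.
AUTO_PASS_THRESHOLD = 0.9


@dataclass
class DraftItem:
    """A generated page draft awaiting attestation (illustrative shape)."""
    title: str
    source_url: str
    confidence: float  # assumed to be a score in [0, 1]
    validated: bool    # True once structural validation has succeeded


def route(items):
    """Split drafts into an auto-pass batch and a manual-review batch.

    A draft auto-passes only if it was validated AND its confidence
    meets the threshold; everything else goes to a human reviewer.
    """
    batches = {"auto_pass": [], "manual_review": []}
    for item in items:
        if item.validated and item.confidence >= AUTO_PASS_THRESHOLD:
            batches["auto_pass"].append(item)
        else:
            batches["manual_review"].append(item)
    return batches
```

Both batches would still appear on the Clearance Dashboard; the split only decides which items need a reviewer to look at them individually.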


=== Step 3: Seed Programs & Initiatives (Weeks 3–4) ===
 
* Sources: Federal Register notices, USAspending.gov API, curated agency program pages.
* Grok prompt: “Create Program page from this source. Fill Cargo: sponsor, purpose, start_date, duration, funding, related agencies. Output wikitext + Cargo. Cite source.”
* Review via Clearance Dashboard.
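
As a rough sketch of the "Output wikitext + Cargo" step, the helper below renders a <code><nowiki>{{#cargo_store:...}}</nowiki></code> call for the Program table from extracted values. The field names follow the prompt above; the helper name <code>render_program_store</code>, the underscore spelling <code>related_agencies</code>, and the comma-separated list format are assumptions rather than a confirmed schema.

```python
# Field names taken from the Grok prompt; "related_agencies" is an assumed spelling.
PROGRAM_FIELDS = (
    "sponsor", "purpose", "start_date",
    "duration", "funding", "related_agencies",
)


def render_program_store(values):
    """Render a Cargo #cargo_store call for one Program page.

    Raises ValueError if any expected field is missing, so that an
    incomplete extraction never reaches the review queue silently.
    """
    missing = [f for f in PROGRAM_FIELDS if f not in values]
    if missing:
        raise ValueError("missing fields: " + ", ".join(missing))

    lines = ["{{#cargo_store:_table=Program"]
    for field in PROGRAM_FIELDS:
        value = values[field]
        if isinstance(value, (list, tuple)):
            value = ", ".join(value)  # assumes a comma-delimited list field
        # A raw pipe would end the parameter, so swap in the {{!}} magic word.
        safe = str(value).replace("|", "{{!}}")
        lines.append("|" + field + "=" + safe)
    lines.append("}}")
    return "\n".join(lines)
```

In practice the store call would normally live inside a template rather than on each page directly; the sketch only shows how the extracted fields map onto Cargo parameters.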


=== Step 4: Seed Task/Topic Pages (Week 4) ===
 
* Grok uses seeded data: “Create ‘Prepare for a Disaster’ task page linking relevant programs/agencies. Add plain-language explanation and official USA.gov links.”
* Links always point back to the agency site or USA.gov as the source of truth.
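
One way to enforce the link rule above before a draft reaches review is a simple host check on every external URL in the generated wikitext. This is a sketch under an assumption: it treats any host ending in <code>.gov</code> or <code>.mil</code> as an official source, which is a simplification (some official sites use other domains), and the function name <code>off_policy_links</code> is hypothetical.

```python
import re
from urllib.parse import urlparse

# Assumed allow-list; a real deployment would likely use an explicit host list.
ALLOWED_SUFFIXES = (".gov", ".mil")

# Matches bare or bracketed external links in wikitext, stopping at
# whitespace, a closing bracket, a pipe, or an angle bracket.
URL_PATTERN = re.compile(r"https?://[^\s\]|<>]+")


def off_policy_links(wikitext):
    """Return every external URL whose host is not on the allow-list."""
    flagged = []
    for url in URL_PATTERN.findall(wikitext):
        host = (urlparse(url).hostname or "").lower()
        if not host.endswith(ALLOWED_SUFFIXES):
            flagged.append(url)
    return flagged
```

A draft with a non-empty result could be held for manual review instead of being auto-passed.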


== 3. Ongoing Updates (Post-Week 4) ==
 
* A daily Grok orchestration job monitors the curated high-signal list, the Federal Register, and the USAspending.gov API.
* Change detected → Grok proposes update → Clearance Dashboard.
* No broad scraping: only targeted, public, high-value pages.
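
The "change detected" step could be implemented in several ways; a minimal sketch, assuming content fingerprints are compared between daily runs, is shown below. The function names and the choice of SHA-256 over whitespace-normalized text are illustrative assumptions, and fetching the pages is left out entirely.

```python
import hashlib


def fingerprint(content):
    """Hash page text after collapsing whitespace, so that purely
    cosmetic spacing changes do not register as updates."""
    normalized = " ".join(content.split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def detect_changes(snapshots, known):
    """Compare today's snapshots against stored fingerprints.

    snapshots: dict of URL -> freshly fetched text
    known:     dict of URL -> fingerprint from the previous run
               (updated in place for URLs that changed or are new)
    Returns the list of URLs whose content changed or is new.
    """
    changed = []
    for url, content in snapshots.items():
        digest = fingerprint(content)
        if known.get(url) != digest:
            changed.append(url)
            known[url] = digest
    return changed
```

Only the URLs returned here would be handed to Grok to propose an update, which keeps the daily run limited to the curated list.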


== 4. Technical Safeguards ==
 
* '''Scalability''': Cargo for narrative and relationship data; offload high-volume numeric data to PostgreSQL if needed.
* '''Auditing''': Mandatory revision tags (source URL, timestamp, Grok score, attestation ID) stored in a separate table.
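
To illustrate what "mandatory" might mean for the revision tags, the sketch below models one audit row and rejects incomplete records at construction time. The four tag fields come from the bullet above; the <code>revision_id</code> key, the HTTPS requirement, the timezone-aware timestamp, and the 0–1 score range are assumptions added for the example.

```python
from dataclasses import dataclass, asdict
from datetime import datetime, timezone


@dataclass(frozen=True)
class RevisionTag:
    """One row of the audit table (illustrative shape only)."""
    revision_id: int        # hypothetical key linking to the wiki revision
    source_url: str
    timestamp: datetime
    grok_score: float
    attestation_id: str

    def __post_init__(self):
        # Every tag is mandatory, so fail fast on anything missing or malformed.
        if not self.source_url.startswith("https://"):
            raise ValueError("source_url must be an https URL")
        if self.timestamp.tzinfo is None:
            raise ValueError("timestamp must be timezone-aware")
        if not 0.0 <= self.grok_score <= 1.0:
            raise ValueError("grok_score must be within [0, 1]")
        if not self.attestation_id:
            raise ValueError("attestation_id is required")


# Usage: build a tag and flatten it into a dict ready for insertion.
tag = RevisionTag(
    revision_id=1,
    source_url="https://www.usa.gov/agency-index",
    timestamp=datetime(2025, 1, 1, tzinfo=timezone.utc),
    grok_score=0.97,
    attestation_id="ATT-0001",  # made-up identifier format
)
row = asdict(tag)
```

Keeping the record immutable (<code>frozen=True</code>) mirrors the intent of an audit trail: a tag is written once and never edited afterwards.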