CargoAdmin, Bureaucrats, Moderators (CommentStreams), fileuploaders, Interface administrators, newuser, Push subscription managers, Suppressors, Administrators
5,287
edits
| (12 intermediate revisions by the same user not shown) | |||
| Line 119: | Line 119: | ||
* If there's disagreement or low confidence, the item flags for quick human review (one-click approve/reject/retry on the Clearance Dashboard). | * If there's disagreement or low confidence, the item flags for quick human review (one-click approve/reject/retry on the Clearance Dashboard). | ||
=== Why This Two-Step Approach? === | |||
* A single AI can sometimes confidently produce wrong or invented details (a common issue in LLMs). | * A single AI can sometimes confidently produce wrong or invented details (a common issue in LLMs). | ||
* By having one model create and a different model critically review, the system catches more errors—studies on multi-agent or dual-LLM verification show significant reductions in hallucinations (often 60-90% in similar pipelines). | * By having one model create and a different model critically review, the system catches more errors—studies on multi-agent or dual-LLM verification show significant reductions in hallucinations (often 60-90% in similar pipelines). | ||
| Line 131: | Line 130: | ||
* Verifier → Double-checks it rigorously before anything goes live. | * Verifier → Double-checks it rigorously before anything goes live. | ||
=== | === Zero-base burden model === | ||
It | The zero-base burden model is the core operational philosophy of OpenGov Encyclopedia: design the system so that human effort is minimized to near-zero for routine operations, while still maintaining full federal control, compliance, and accountability. This approach draws from federal priorities for efficient, low-touch AI governance (as emphasized in OMB guidance like M-25-21 on accelerating AI adoption through innovation and reduced bureaucracy, and related 2025-2026 directives promoting agile, cost-effective AI deployment without unnecessary administrative overhead). | ||
* | |||
* | These features ensure full compliance with FOIA (easy retrieval of historical versions and decision trails) and NARA records management requirements (permanent, auditable preservation of changes without manual intervention). | ||
* | |||
This model delivers high freshness and broad coverage with virtually no ongoing manual workload — aligning with federal goals for efficient AI use (e.g., reducing bureaucratic barriers while preserving safeguards). It lets limited staff focus on strategic oversight rather than day-to-day maintenance, making OpenGov Encyclopedia sustainable and scalable across agencies. If piloted successfully, it could serve as a blueprint for other low-touch federal knowledge initiatives. | |||
In practice, this means: | |||
==== ~80–95% fully automated processing ==== | |||
The vast majority of content creation, updates, and maintenance happens without any human intervention. The active generator AI in the dual pipeline monitors official sources continuously. When a change is detected (e.g., a new program announcement in the Federal Register, an updated funding figure on USAspending.gov, or a revised agency page on a whitelisted .gov subdomain), the system automatically: | |||
* Triggers re-processing of the affected entity/page. | |||
* Retrieves the fresh content via RAG. | |||
* Generates a draft (filling Cargo fields and narrative text). | |||
* Runs it through the verifier AI for cross-check. | |||
* Publishes approved changes as a new MediaWiki revision if confidence thresholds are met. | |||
This event-driven architecture ensures the knowledge graph stays current in near-real-time for high-signal changes (e.g., major legislation or funding updates), without scheduled batch jobs overwhelming resources. | |||
==== Daily gap scans for completeness ==== | |||
A lightweight nightly automated scan identifies "missing" entities or gaps in existing ones (e.g., a new sub-agency subdomain appears in the CISA .gov inventory, or a program referenced in multiple sources but lacking a dedicated page). The pipeline proactively creates or enhances pages for these, starting with core verifiable fields. This builds out the inventory progressively without manual queues. | |||
==== Progressive completeness ==== | |||
Not every field needs to be perfect on day one. The system prioritizes: | |||
* Core fields (always populated if verifiable): Entity name, sponsoring agency, primary .gov link, status, and basic relationships — these form the reliable backbone of the knowledge graph. | |||
* Optional/enhanced fields (e.g., detailed eligibility criteria, historical funding trends, cross-program links): These fill in over time as additional source evidence emerges and verification confidence grows (e.g., from 80% → 98%). Low-confidence or unverified details are clearly marked (e.g., "Pending confirmation" or blank with a note), ensuring transparency rather than forcing incomplete rejection. | |||
This "good enough to start, improve over time" strategy maximizes coverage quickly while upholding accuracy. | |||
==== Human involvement strictly limited to <5% escalations ==== | |||
Humans (authorized federal staff) are only involved in exceptional cases: | |||
* Verifier flags a discrepancy or low confidence on a high-impact item (e.g., a major program change affecting public services). | |||
* Random audit samples for oversight. | |||
* Edge cases like ambiguous source data. | |||
Escalations route to a simple Clearance Dashboard — a custom MediaWiki special page or integrated tool — where staff review side-by-side diffs (draft vs. sources), then click one button: Approve, Reject, or Request Retry (with optional note). No writing, editing, or content creation is required from humans. This keeps the burden minimal (often 1-2 minutes per case) and scalable even as the graph grows to thousands of entities. | |||
==== No manual writing required ==== | |||
Federal staff never draft, rewrite, or curate text/narrative. All content originates from AI synthesis of official sources, verified through the dual pipeline. This eliminates the traditional "content team" workload that plagues many government wikis or databases. | |||
==== Built-in compliance and traceability features ==== | |||
* Immutable MediaWiki revisions: Every published version is permanently stored with timestamps, attribution (e.g., "GrokBot" or "GeminiBot" username for AI contributions), and diffs. | |||
* Cargo data snapshots: Structured fields are versioned alongside pages, preserving historical states for queries or audits. | |||
* Signed audit logs: All automated actions (ingestion, generation, verification, publish) are logged with digital signatures, timestamps, and source references — fully queryable and exportable. | |||
* MediaWiki page history: Open to public view (or restricted as needed), allowing anyone to see the complete change timeline, compare versions, and understand evolution over time. | |||
=== Always defers to originals === | |||
OpenGov Encyclopedia is built on the principle that it is never the authoritative source. Every page explicitly directs users back to the original federal .gov site(s) for verification, actions, applications, or any official purpose. This respects agency ownership, prevents confusion or duplication, and builds citizen trust through transparency. | |||
In short, OpenGov Encyclopedia enhances discovery and understanding while always honoring the single source of truth on the original .gov—exactly what citizens and agencies need in an AI-powered era. | |||
Every page incorporates clear, consistent, USWDS-styled elements that are impossible to miss. These use proven patterns like alerts for notices and footers for reassurance, adapted to MediaWiki's capabilities: | |||
==== Top banner (prominent notice) ==== | |||
* A high-visibility alert-style box appears at the top of every page. | |||
* Text: This material is provided for background context and relationships only and is not the official record. | |||
* Styled using a custom MediaWiki template with USWDS-inspired classes (added via modified skin CSS in MediaWiki:Common.css or the USWDS-integrated skin file). Includes ARIA attributes for accessibility (role="status" or role="region" with aria-label). | |||
==== Direct source link ==== | |||
* | * Placed immediately below the banner or integrated into the Cargo infobox/header. | ||
* Text: Official source of truth: [Primary .gov URL] — complete actions and verify details there. | |||
* Rendered as a large, clickable USWDS-style button or link (e.g., usa-button usa-button--primary). Multiple sources are listed cleanly if applicable. | |||
==== Action buttons ==== | |||
Every relevant section ends with directive buttons: | |||
* Apply Now → | |||
* Learn More / Eligibility Details → | |||
* Take Action / Submit Application → | |||
* Styled as USWDS usa-button (primary or outline variants). All point exclusively to the official agency site, USA.gov task page, or form endpoint—no internal completion paths. | |||
==== Footer ==== | |||
(persistent reassurance with dynamic Cargo data) | |||
* | * At the bottom of every page, styled in the USWDS identifier/footer pattern. | ||
* | * Uses a Cargo-powered template to display live, queryable metadata: | ||
** Dual-AI verified from official sources | |||
* | ** Last verified: [timestamp from Cargo field] | ||
* | ** Confidence: [score from Cargo field, e.g., 98%] | ||
* | ** Always check primary .gov for authoritative information. | ||
* | |||
Cargo integration: | |||
Verification fields (e.g., LastVerified=Date, Confidence=Float) are declared in the Cargo table. The footer template queries the current page’s data (using |where=_pageName={{PAGENAME}}) to populate values dynamically. | |||
This enables site-wide queries, such as sorting articles by verification date or confidence rating: | |||
<pre>{{#cargo_query: | |||
tables=VerificationMetadata | |||
|fields=Page=Page, LastVerified=Last Verified, Confidence=Confidence | |||
|order by=LastVerified DESC | |||
|limit=50 | |||
}}</pre> | |||
(Can be embedded on a dashboard or oversight page.) | |||
All elements are automatic (baked into page templates and skin), consistent across the site, and meet WCAG 2.1 AA / Section 508 standards (high-contrast, keyboard-navigable, ARIA roles). | |||
This design turns a potential concern into a strength. | |||
* Citizens see plain-English, repeated messaging that makes the supplemental role crystal clear from the first second. | |||
* Agencies retain full ownership: OpenGov Encyclopedia never competes—it acts as a helpful map that sends users straight to the right .gov destination, better informed and ready to act. | |||
* Traffic funnels to primary sites: Prominent, task-oriented buttons improve completion rates on official pages. | |||
* Trust is reinforced: Dynamic verification details in the footer show real-time quality (timestamp + confidence score), while the Cargo backend allows easy auditing and reporting on freshness and accuracy across thousands of pages. | |||
== Why Now? == | == Why Now? == | ||
edits