Skip to main content

Grounded QA Verification

What You Do

Implement role separation — a generator that implements, an evaluator that reviews, and optionally a planner. Run three times to measure the effect of each added role.

Choose a substantive feature upgrade (multi-turn conversation, citation panel redesign, or document filtering) and keep it consistent across all runs.

Tools

  • Claude Code or Codex
  • Git
  • Node.js + Electron

Harness Mechanism

Self-verification + grounded Q&A + evidence-based completion

Feedback / ReportSpotted an issue or have an improvement idea?