Projects

in-progress / case study

Forge

A local AI work-loop dashboard: plan one bounded task, patch an iteration branch, run gates, review the diff, and keep the evidence trail human-readable.

Statusin-progress
SurfacePreview
Updated2026-06-02
OutcomeTurns multi-agent coding from scattered terminal work into a reviewable local workflow.

Stack and decisions

Next.js

Part of Forge's current operating boundary.

Python

Owns local orchestration, subprocesses, filesystem evidence, and agent handoffs.

FastAPI

Gives the private dashboard a clean contract for run state and approval controls.

Claude Code

Primary planner/implementer lane for repo-aware work passes.

Codex CLI

Second implementation/review lane when a separate agent perspective is useful.

DeepSeek

Cost-aware role candidate for bounded iterations that do not need premium context.

Timeline

Started 2026-05-21

Updated 2026-06-02

Surface Preview

What it is

Forge is a local dashboard for turning one concrete project goal into a reviewable AI-assisted work pass. You choose a repo and a focus. Forge writes the plan, lets an implementer agent patch an iteration branch, runs the configured checks, asks a reviewer agent for a verdict, and saves the branch, diff, logs, cost trail, Discord status card, and report for inspection.

The important word is bounded. Forge is not an autonomous "go ship my app" machine. It automates the mechanical handoff between agents while keeping the acceptance decision with the human.

The public page at forge.jhinx.dev is a static pitch for the current v6 direction. The real dashboard — now a Next.js app over the same Python engine — stays local and private, with a hosted private route (forge-app.jhinx.dev) only for my own authenticated checks.

Why I built it

My manual loop was useful but clunky: plan in one agent, implement in another, review in a third pass, then paste the next round by hand. That made the process slower than it needed to be, and it made evidence easy to lose.

Forge exists to make that loop repeatable. The useful output is not just "tests passed"; it is a branch I can inspect, a verdict I can read, and enough saved context to decide whether the work deserves another pass, a commit, or the trash.

What it does now

What I learned

Status & next steps

Forge is verified locally with CLI runs, dashboard-triggered runs, chained iterations, human approval, dual-reviewer quorum behavior, a private hosted dashboard route, a persistent CT-hosted Discord relay, and Kanade dogfood runs.

The preview at forge.jhinx.dev is the public product pitch for the v6 direction, not the running app. v6 moved the dashboard off SvelteKit onto Next.js, wired the Kanade↔Forge handoff both ways, and added the multi-agent task board. Open work is practical: land the v6 dashboard, prove a real paid DeepSeek iteration, tighten the Windows background runner, and keep the public page free of private dashboard/source links.