Coding

Show HN: Statewright – Visual state machines that make AI agents reliable

"Reliability trumps scale: A new approach to AI agent design uses constrained state machines and smaller models to tackle brittle problem-solving, potentially upending the industry's reliance on massive parameter counts and longer prompts."

Oscar V (AI-assisted) May 12, 2026 1 min read EN

Based on reporting from Source.

Anthropic's Claude Code now ships with a plugin ecosystem covering specialized agent roles, but a new approach uses constrained state machines and smaller models to tackle brittle problem-solving. Statewright, a visual state machine tool, defines a workflow once and enforces it across multiple agents, including Claude Code, Codex, Cursor, opencode, and Pi. The tool restricts tool calls based on the current phase, preventing the model from flailing and improving reliability.

Overview

Statewright uses a Rust engine to evaluate state machine definitions, which are deterministic and do not involve LLMs. The tool integrates with coding agents via the MCP protocol, enforcing tool restrictions per state automatically. This approach makes the problem smaller by constraining the tool and solution spaces, allowing the model to reason in a focused context at each step.

What Each Plugin Does

The Statewright plugin for Claude Code provides a visual editor for defining workflows, which can be authored by hand or generated by agents. The plugin enforces tool restrictions per state, preventing the model from using tools that are not allowed in the current phase. The tool also provides a guardrail system, which includes features such as per-state tool enforcement, Bash discernment, and environment scoping.

Tradeoffs

Statewright requires MCP support in the agent, and workflow definitions are authored by hand. The tool also has some limitations, such as requiring a managed cloud for workflow storage and run history, and having advisory enforcement for Cursor.

When to Use It

Statewright is suitable for developers who want to improve the reliability of their AI agents by constraining the tool and solution spaces. The tool is particularly useful for tasks that require a high degree of precision and control, such as coding and debugging.

Pricing

Statewright is free for individual developers, with a managed cloud at statewright.ai handling workflow storage, run history, and the MCP gateway. The tool also offers a self-hosting option for single-developer and single-team use cases.

Bottom Line

Statewright provides a new approach to AI agent design, using constrained state machines and smaller models to improve reliability. The tool integrates with multiple agents, including Claude Code, Codex, Cursor, opencode, and Pi, and provides a guardrail system to prevent the model from flailing. While it has some limitations, Statewright is a valuable tool for developers who want to improve the reliability of their AI agents.

Practical Takeaway

Statewright is a useful tool for developers who want to improve the reliability of their AI agents by constraining the tool and solution spaces. By using Statewright, developers can define a workflow once and enforce it across multiple agents, preventing the model from flailing and improving reliability. The tool is free for individual developers and offers a self-hosting option for single-developer and single-team use cases.

Tags: [tag1, tag2, tag3] Sources Used: [Source Name]

Show HN: Statewright – Visual state machines that make AI agents reliable

Overview

What Each Plugin Does

Tradeoffs

When to Use It

Pricing

Bottom Line

Practical Takeaway

Sources 1

More articles like this

Visual Studio Code 1.120

Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model

Reimagining the mouse pointer for the AI era

Show HN: Gigacatalyst – Extend your SaaS with an embedded AI builder

Bambu Lab is abusing the open source social contract

I hate the recent open-source rise