Live runtime · no signup
Run a live Wauldo Agent.
Pick a preset, fire a prompt. Watch the state machine, see every claim verified, get a numeric support score.
91%
Median pass
+48pts
vs LangChain
5.7s
Avg latency
0%
Hallucinations
Inside each run
01
Classify
Input and retrieved context are tagged data vs. instruction. Injection markers stripped before the LLM sees them.
02
Generate
Preset-gated workflow runs. Only the tools each state declares allowed_tools can fire.
03
Verify
Every claim checked against sources. Verdict, support score, and per-claim breakdown returned.
Ready to ship
Same engine in your stack.
This sandbox uses the exact POST /v1/tasks endpoint your API key hits. Same runtime, same verifier, same verdict surface.
Free tier
300 req/month
SDKs
Py · TS · Rust
No lock-in
OpenAI-compat