01 · CLASSIFY
Input and retrieved context are tagged data vs. instruction. Injection markers stripped before the LLM sees them.
Type a factual prompt. Watch the agent call Wikipedia / web tools, see the state machine + tool calls in the timeline, get the verifier's verdict + per-claim breakdown.
Input and retrieved context are tagged data vs. instruction. Injection markers stripped before the LLM sees them.
Preset-gated workflow runs. Only the tools each state declares allowed_tools can fire.
Every claim checked against sources. Verdict, support score, and per-claim breakdown returned.
This sandbox uses the exact POST /v1/tasks endpoint your API key hits. Same runtime, same verifier, same verdict surface.