Experience Agentic AI by touching. No prior knowledge needed
AIs are shifting from "tools you talk to" into "teams that investigate, divide work, execute, and verify". This page lets you feel that shift with sliders and buttons — 7 top-level tabs, each with sub-topics where the behavior visibly changes as you interact.
3 minutes to grasp the whole picture
- AI is not just chat — it can divide labor across agents and execute work.
- Memory, permissions, and tooling design determine whether it ships value or causes accidents.
- So you need a shared language: Agent / Context / Skill / Guardrail. Touch the tabs to learn each.
"Just vibe and build it" works only at the doorstep. Production runs on a chain: goal → constraint → observe → diff → execute → verify → record.
Try it out Subtopic
Click each chain stage below to toggle ON/OFF. All ON = Agentic Engineering; turning the Vibe toggle above ON collapses everything to 1 step (= Vibe Coding). Turn even one stage OFF and you can see the "result" below degrade.
All stages passed → working code + full records
Goal, constraint, observation log, diff, execution result, verification log, improvement record — all preserved. You can later trace "why this decision was made". This is the ideal form of Agentic Engineering.
Three units to delegate to AI: a solo Agent, a side-window Subagent for research, and a parallel Agent Team. Different roles, different uses.
Main Agent vs Subagent Subtopic
A Subagent runs "research only" or "grep only" in a side window without polluting Main's context. Click delegate — only the Subagent inflates while Main receives only a single-line conclusion.
Agent Team (parallel) Subtopic
Lining up agents with different roles lets work proceed in parallel — but assigning edits to the same file causes conflicts, so the task shape decides fit / unfit.
Controls "who runs, in what order, under what condition." Touch each of the 5 canonical patterns to feel the differences.
Switch patterns Subtopic
Click a card below to change how tasks flow in the canvas. Each moving ball represents one task unit.
How much you can show an AI is finite. What you include, what you exclude, and where you keep what — these design decisions drive quality.
Fill the context window Subtopic
Buttons below add Instructions / Past log / Tool results / RAG, filling the window. Once over 100%, the oldest items are compacted and shown gray. Toggle compaction on/off to compare.
Show to LLM vs keep in code only Subtopic
Click items to toggle between what the LLM sees and code-side only. The risk assessment on the right updates live. Try placements to find the safe shape. Switching AI changes the AI-specific risks too.
What the LLM sees
What only the code side holds
⚠ Risk assessment of current placement
Memory hierarchy — touch it and incidents appear Subtopic
Click items placed across 4 layers (Conversation / Work log / Skill / Memory) to move to an adjacent layer. Misplace and the incident details appear below. Switching AI reveals the memory mechanism differences too.
"Reusable procedures" (Skills) and "specs for hooking up externals" (Tool / MCP) are different things. Touch the 5 subtopics in order to feel: define → load → call → connect → defend.
5-1. Dissect SKILL.md VENDOR-SPECIFIC Subtopic
SKILL.md file — not a memo, but a fixed schema: name / when_to_use / steps / examples. Click each section of the sample to see its meaning and the AI behavior.
5-2. Load all vs lazy load Subtopic
Assume an organization with 20 registered Skills. Compare "load everything upfront" (all packed into CLAUDE.md) vs "lazy load" (SKILL.md on demand). The task selector changes which Skills the lazy-load side picks.
All loaded upfrontpacked into CLAUDE.md
68% of context is consumed by Skill bodies before work begins. Conversation history, observation logs, and outputs must fit in the remaining 32%.
Lazy loadSKILL.md on demand
Only Skills relevant to the task are expanded. 84% remains free for conversation history, observation logs, and outputs.
5-3. The 5 steps of Function Calling Subtopic
An AI calling external Tools (functions) follows a fixed 5-step flow. Press "Next step" to follow the AI's internal reasoning. Reset to repeat.
5-4. World with MCP / without MCP VENDOR-SPECIFIC Subtopic
MCP (Model Context Protocol) is a shared standard connecting AIs to external services (Git / DB / files / n8n / Docker etc). Without MCP: per-AI custom wiring (N × M combinations every time). With MCP: one standard connects all (N + M) — maintenance cost collapses. Toggle and click outer services to light up the path.
5-5. Tool Poisoning — the definition itself is attacked Subtopic
MCP integration is powerful, but injecting malicious instructions into a tool definition's description field can hijack the AI — this is Tool Poisoning. Switch the definition tab and toggle the guard ON/OFF to observe the difference.
How far to let the AI execute. The norm: gate dangerous operations with approval. Half of all failures trace back to weak design here.
Permission ladder + action verdict quiz PRODUCT EXAMPLE Subtopic
Pick a permission level and the "concrete actions" below immediately show OK / pending-approval / blocked. Climb the ladder rung-by-rung to feel what each level unlocks.
Approval gate (HITL) on / off Subtopic
When a dangerous operation like "DROP TABLE users" arrives, the outcome differs depending on whether an approval gate is in place. Toggle the button to compare.
⚠ Outcome without HITL
✓ Outcome with HITL
Even if you understand the mechanism, incidents are inevitable without guards. Click each example to see what actually happens.
↑ 30 failure patterns. The typical behavior and mitigation differ per AI. Cards marked ✗ do not apply to the current AI (not applicable) and cannot be selected. Click a chip in the banner to jump to its card.