From upstream signal to device-agnostic execution

ANIMA Six-Layer Architecture

ANIMA's L0–L5 stack translates upstream signals (text, voice, BCI, vision) into verifiable robot actions layer by layer, each layer carrying a clear input/output contract and a named failure path.

L0 — Signal

Upstream signal gateway: turns BCI / ASR / vision / text into an intent token + confidence + drift_score. Drift propagates downstream as an uncertainty handle.

L1 — Parser (LLM)

The LLM is locked to forced tool-calling, compressing natural-language instructions into a structured TaskSpec JSON. It never emits motor commands — only structure.

L2 — Planner

TaskSpec → py_trees behavior tree. Conditional branching, retry, and fallback live in the tree itself — no ad-hoc state machines.

L3 — Skill

Function-Calling + Affordance Scoring selects skills (more accurate than RAG when the skill set is under 100). Preconditions and expected effects are declared in the skill registry.

L4 — Adapter

Device-agnostic actuation. The same L1–L3 drives a manipulator, a mobile base, a wheelchair, or a future humanoid — because each L4 implements the EmbodiedAdapter protocol.

L5 — Assessment

The five factors (ITA / MQA / SQA / GOA / PEA) each evaluate at Pre / Runtime / Post stages, producing a success probability with calibrated confidence.

Key points

Why this layer matters

  • Structure comes before generative free-form behavior
  • Framework logic stays robot-agnostic; embodiment lives at L4
  • Every layer has an interpretable failure path
Related robots

Current robot carriers