Agents

AI Agents (Autonomous Agents)

2022ActivePublished: 5 May 2026Updated: 5 May 2026Published

Key innovation

Embodies the Agentic AI paradigm as a concrete executable artifact - a single, autonomous instance with a goal, access to tools, memory, and a control loop that can be deployed, monitored, and versioned as a production software component.

How it works

The agent receives a goal from the user and definitions of available tools (JSON/OpenAPI/MCP schemas) and system instructions (role, security policies). In each iteration of the loop: (1) the model analyzes the current context and decides on the next action - invoke a tool, ask a question, or terminate; (2) the host executes the chosen tool and returns the result; (3) the result is appended to the context as an observation; (4) the model decides whether to continue. The loop ends when the model determines the goal has been achieved, the max_steps limit is reached, or an error state requiring escalation to a human is detected. The agent can maintain short-term memory within the context window and long-term memory in an external storage (vector database, key-value store).

Problem solved

A single LLM invocation is unable to handle open-ended tasks, where the number of steps is not known in advance, requiring interaction with the environment, access to current data, code execution, or iterative result verification. The AI Agent solves this problem by embedding the model in a control loop with access to tools and memory, enabling autonomous end-to-end task execution.

Components

Foundation model (LLM)Reasoning and control loop engine

The agent's reasoning and decision engine. Generates plans, selects tools, interprets results, and decides termination. Typically an LLM post-trained with RLHF and tool-use, optionally a reasoning model (CoT with a dedicated reasoning token budget).

Chat LLM (Claude, GPT, Gemini)

Reasoning model (o-series, R1)

Tool interfaceBridge between the agent and the external world.

Definitions of callable functions with their schemas (JSON Schema, OpenAPI) and documentation. Anthropic calls this the Agent-Computer Interface (ACI) — care in designing it is critical for agent reliability. Often exposed through Model Context Protocol.

Function calling API

Model Context Protocol (MCP)

Official

Memory systemPersistence of task state and context

Short-term memory (conversation history, tool results in context) and optional long-term memory (vector store, key-value store, episodic structures) across sessions. Determines coherence and personalization in long-running tasks.

Official

Control loopReasoning-action cycle executor

Mechanism executing iterations: fetch context → call model → parse decision → execute tool → update context → check termination. Manages limits (max_steps, time/token budgets) and detects infinite loops.

Official

System prompt / role definitionDefines the agent's identity and operational boundaries.

Constant instruction defining the agent's identity, goal, scope of responsibility, safety rules, response format, and termination criteria. First line of defense against misbehavior and prompt injection.

GuardrailsSafety and compliance oversight

Filters and validators operating before inference (input sanitization), during (tool call schema validation), and after (output control, PII redaction, blocking irreversible actions). Critical for production safety.

Official

Evaluation / ObservabilityProduction observability and quality

Step logging, traces (LangSmith, Arize, Helicone), metrics (success rate, tool error rate, average steps, cost per task), and automated evaluations against test sets. Essential for production maintenance of an agent.

Official

Implementation

Reference implementations

Anthropic agent patterns (cookbook)

Python · Anthropic

Official

Claude Agent SDK

Python/TypeScript · Anthropic

Official

LangChain Agents

Python · LangChain AI

LlamaIndex Workflows / Agents

Python · LlamaIndex

Strands Agents SDK by AWS

Python · AWS

Official

Implementation pitfalls

Poorly Designed Agent-Computer Interface (ACI)Critical

Unclear tool names, missing examples, ambiguous parameters — the same problems that affect junior developers affect the model. Anthropic reports spending more time optimizing tools than the agent prompt itself.

Fix:Each tool definition should include usage examples, edge cases, input format, and boundaries with other tools. Use absolute paths, clear types, and poka-yoke (error prevention through argument structure). Test iteratively on real-world examples.

Hallucinations in Action (Execution Claims)Critical

The agent may claim to have performed an action it didn't actually execute, or invoke tools with fabricated parameters — particularly dangerous in multi-step pipelines where errors propagate.

Fix:Validate all tool calls and schema; enforce ground truth through execution results; do not rely on the model's internal declarations about the system state — verify the actual state using tools.

Infinite loops and compounding errorsHigh

Without a hard max_steps and repetition detection, the agent can loop indefinitely, generating wrong steps based on previously wrong observations. Costs grow linearly with the number of steps.

Fix:Set explicit max_steps, detect repeating action signatures, use an evaluator-optimizer to enforce termination conditions, and log costs in real-time.

Tool result prompt injectionCritical

Malicious instructions embedded in web pages, documents, or emails the agent reads can hijack its behavior by impersonating system instructions.

Fix:Structurally isolate untrusted content by using role labeling and sandbox tags, require explicit user confirmation for actions resulting from observed content, and apply content filtering.

Irreversible actions taken without human oversightCritical

An agent with access to write-capable tools (delete, send_email, db_write, payment) can cause real damage based on flawed reasoning. Consequences may be irreversible.

Fix:Grant the agent minimal privileges, prefer reversible operations such as soft delete and drafts instead of send, and require human-in-the-loop before taking irreversible actions.

Context Window OverflowHigh

Accumulated action history and tool results can exceed the model's context window, causing silent truncation of earlier steps and loss of relevant information.

Fix:Implement compacting or summarizing of history, offload tool results to external storage with a reference, and monitor token budget per step.

Building an agent where a workflow would sufficeMedium

Anthropic strongly recommends: don't build an agent when the task has a known, predefined structure. A workflow is cheaper, faster, more predictable, and easier to debug than an agent.

Fix:First, try a solution with a single LLM call plus retrieval, then workflow with chain, routing, or parallelization; introduce an agent only when the task is open-ended and the number of steps is unpredictable.

Evolution

1995

BDI Architectures – foundation of agent theory

Russell and Norvig formalize rational agents; Belief-Desire-Intention architectures emerge (Rao and Georgeff). The canon is defined: an agent perceives the environment and takes goal-oriented actions.

2022

ReAct – LLM as Loop Reasoning-Action Engine

Inflection point

Yao et al. (2022) demonstrate that LLMs can interleave Chain-of-Thought with tool calls in a single loop. The practical definition of an LLM-based AI Agent.

ReAct: Synergizing Reasoning and Acting in Language Models (paper)

2023

AI Agents (Autonomous Agents) - First Wave of Autonomous Agents

Virally popular implementations show autonomous GPT-4 agents performing multi-step tasks. Despite limited reliability, they show the potential and popularize the term.

2023

Function calling in main model APIs

Inflection point

OpenAI (June 2023) introduces function calling in GPT-4; Anthropic and Google follow. First-class agent support at the level of commercial model APIs.

2024

Building Effective Agents – Formal Definition of Agent vs Workflow

Inflection point

Anthropic publishes (December 2024) guidelines distinguishing agents from workflows and five composition patterns. The canonical definition of an agent: a system in which an LLM dynamically directs its own process.

Building effective agents (paper)

2024

Computer use – screen-controlling agents

Anthropic introduces Computer Use in Claude (October 2024) — the agent clicks, types, and moves the mouse like a human. OpenAI's Operator (2025) follows. Opens a class of GUI-driven agents independent of APIs.

2025

Model Context Protocol – standard for agent tools interoperability

Anthropic releases MCP as an open standard for connecting LLMs to external tool servers. Enables an ecosystem of tools portable across model providers.

2026

Commercialization of agents in the Agents-as-a-Service model

Sierra (March 2026) announces the Agents-as-a-Service paradigm — customers buy outcomes delivered by an agent rather than SaaS access. Agents become the unit of product delivery, not just a technical library.

Agents as a Service (Sierra blog) (paper)

Technical details

Hyperparameters (configurable axes)

Poziom autonomiiCritical

Scope of decisions the agent makes without human approval — from proposal-only mode to full autonomy with rollback.

proposal_onlyAgent prepares, human executes.

confirm_irreversibleAutonomy with confirmation required for irreversible actions.

full_autonomy

ToolkitCritical

List of callable functions available to the agent. Defines the action space and is the strongest predictor of agent behavior.

web_search + code_exec

file_read + file_write + bash + git

browser_use (computer use)

Maximum Number of StepsHigh

Hard limit on loop iterations before forced termination. Safeguards against cost overrun and infinite loops.

10Short tasks, low budget.

50-200Coding agents, long-term research.

Memory StrategyHigh

How context is managed across steps and sessions: context window only, summarization, vector store, episodic structures.

in_context_only

in_context + summarization

in_context + vector_memory

Cost / Token BudgetHigh

Maximum computational cost or token count per agent run. Critical for production deployments with outcome-based billing.

10k_tokens

1M_tokens / 5_USD

Human fallback strategyHigh

When the agent escalates to a human: never, on demand, after N failed steps, before irreversible action, based on uncertainty signals.

before_irreversible_actions

after_3_consecutive_failures

Execution paradigm

Primary mode

conditional

An agent is distinct from a workflow, where the path is predefined in the code and the LLM only executes specific steps, whereas in an agent, the LLM controls the entire process.

Activation pattern

input_dependent

Routing mechanism

At every step, the model decides which tool to invoke, whether to ask a clarification question, or to terminate — based on current context and observations. The execution path is not predefined in code.

Parallelism

Parallelism level

conditionally_parallel

Parallelism is most often achieved inter-sessionally (multiple agents for different tasks) or in orchestrator-workers patterns (one orchestrator delegates to multiple worker agents simultaneously).

Scope

inferenceacross_devices

Constraints

!Sekwencyjna pętla rozumowanie-działanie

!Możliwy fan-out narzędzi

Hardware requirements

Primary

Base LLM inference dominates the agent's cost and latency; GPUs with tensor cores are the standard for all modern production-grade models.

Good fit

Google deploys Gemini-based agents on TPUs; comparable throughput and cost to GPUs for most workloads.

Good fit

The control loop, tool parsing, and orchestration layer itself is lightweight and runs on CPU; hardware requirements stem from the base model, not from the agent's construction.

Sources

Building effective agents

Blog

Anthropic

Canonical definition of agent vs. workflow, and five compositional patterns for production agentic systems.

ReAct: Synergizing Reasoning and Acting in Language Models

Paper

arXiv (Yao et al.)

The technical foundation of the reasoning-action loop with LLMs.

What's next for AI agentic workflows (Andrew Ng)

article

DeepLearning.AI

Four agent design patterns: Reflection, Tool use, Planning, Multi-Agent.