01 — Intro to Agentic AI · Agentic AI Course

01 · WHAT IS AN AI AGENT

From Answering to Acting

An AI agent is a system that uses a large language model as its core reasoning engine to pursue a goal — not by generating a single response, but by running an autonomous loop: perceiving its environment, deciding what to do, taking actions via tools, observing the results, and continuing until the goal is met or a stopping condition is reached.

Lilian Weng's foundational 2023 overview identifies three capabilities that elevate an LLM into an agent: planning (breaking a goal into subgoals and sequencing steps), memory (retaining context across steps), and tool use (calling external APIs, running code, reading files). No single capability is sufficient on its own — it is their combination operating inside an automated loop that defines the agent.

Anthropic's agent documentation adds a production-focused framing: agents are distinguished by the degree to which they operate with minimal human interaction per step. The defining characteristic is not what the agent can do — it's how independently it does it.

Core definition (Anthropic docs): "In agentic contexts, Claude will sometimes act as an orchestrator of multi-agent pipelines and sometimes as a subagent within those pipelines, and sometimes as both. Orchestrators direct agents to use tools or undertake tasks. Subagents implement those instructions, taking actions with real-world consequences."

🗺️

Planning

Decomposing a goal into subtasks and sequencing them — including re-planning when earlier steps fail.

WIDELY USED (2024–2026)

🗃️

Memory

Storing and retrieving information across steps — in-context (the current prompt), external (vector stores, databases), or both.

WIDELY USED (2024–2026)

🔧

Tool Use

Calling external systems — web search, code execution, file I/O, APIs — to affect the world beyond the model's context window.

WIDELY USED (2024–2026)

Not just memory and tools. By 2026, most consumer AI products (Claude.ai, ChatGPT, Gemini) already have memory features and tool integrations. That alone doesn't make them agents. The defining question is whether the system drives each step autonomously, or whether a human must prompt it at every turn.

📎 Sources: Lilian Weng — LLM Powered Autonomous Agents (2023) · Anthropic — Tool Use Docs

02 · THE AGENT LOOP

ReAct: Reason, Then Act

The dominant pattern for implementing the agent loop was formalized in the ReAct paper (Yao et al., 2022), published at ICLR 2023. ReAct stands for Reasoning + Acting, and it describes a specific interleaving: the model generates a Thought (explicit reasoning about what to do), then an Action (a tool call or decision), then observes the Observation (the tool's output), and repeats until it can produce a final Answer.

Prior approaches either did reasoning without acting (chain-of-thought prompting) or acting without explicit reasoning (action-only agents). ReAct's contribution was showing that making the reasoning visible and interleaved with actions dramatically improves performance on multi-step tasks — and makes agent behavior easier to debug and interpret.

// REACT TRACE — THOUGHT → ACTION → OBSERVATION LOOP

GOAL
"What is the population of the capital of France, and how has it changed since 2000?"

THOUGHT 1
I need to find the capital of France first, then look up its population data.

ACTION 1
web_search("capital of France population 2000 2024")

OBSERVATION 1
Paris is the capital. Population data found: 2000 ~2.1M city / 2024 ~2.06M city.

THOUGHT 2
I have enough data. Paris population declined slightly. I can now answer.

FINAL ANSWER
Paris (population ~2.06M in 2024) saw a slight decline from ~2.1M in 2000...

This pattern — alternating explicit reasoning steps with grounded tool calls — is the foundation of how modern agent frameworks (LangChain, LangGraph, the Anthropic SDK's tool use loop) work under the hood. When you call a tool in Claude, you are participating in a ReAct-style loop.

Why it matters: Making reasoning explicit (the Thought step) means you can inspect why an agent took an action. This is critical for debugging, for safety review, and for building trust in autonomous systems.

📎 Source: Yao et al. — ReAct: Synergizing Reasoning and Acting in Language Models (arXiv:2210.03629, ICLR 2023)

03 · WHO DRIVES THE LOOP

The Real Distinction: Human-in-the-Loop vs. Autonomous

By 2026, consumer AI products have blurred the surface features that once separated "chatbots" from "agents." ChatGPT, Claude.ai, and Gemini all offer memory, multi-turn context, file uploads, web search, and even code execution. So the old framing — "agents have tools and memory, chatbots don't" — is no longer meaningful.

The meaningful distinction is who drives each step of the loop:

// CONSUMER AI vs AGENTIC AI — WHO DRIVES EACH STEP?

CONSUMER AI (e.g. Claude.ai)

👤 You: "Write me a market report"

↓ model responds, may use tools

🤖 AI: Responds, possibly using web search

↓ waits for you to decide next step

👤 You: Follow-up, revise, or stop

✓ May have tools ✓ May have memory
❌ You decide every next step

AGENTIC AI

👤 Goal: "Compile a weekly market report"

🧠 Plan: search → read → filter → synthesize

🔧 web_search("markets week of March 2026")

🔧 fetch_url(...) 🔧 fetch_url(...)

🧠 Reflect: enough data → synthesize

📄 Delivers finished, sourced report

✓ Tools ✓ Memory ✓ Self-drives each step

The underlying LLM is always a stateless function — it takes a context window in and produces tokens out. The "memory" that consumer products provide is a product layer that injects conversation history back into the prompt. What makes an agent is the automated loop that runs this function repeatedly, with tool results fed back in, without requiring a human prompt at each iteration.

📎 Sources: Anthropic — Tool Use Docs · Lilian Weng — LLM Powered Autonomous Agents (2023)

04 · AGENT TYPES

Four Architectural Patterns

Lilian Weng's survey categorizes agents by their architecture — specifically, how they balance speed, memory, and reasoning depth. These categories map directly to design decisions you will make when building agents.

⚡

Reactive Agent

Maps inputs directly to outputs with no planning or memory. Fast and predictable. Suitable for well-defined, low-variance tasks. Example: a classifier, a spam filter.

WIDELY USED (2024–2026)

🧭

Deliberative Agent

Maintains an internal model of the world, plans multi-step sequences, and reasons before acting. Slower but capable of complex goal pursuit. Most LLM-based agents use this pattern.

WIDELY USED (2024–2026)

🔀

Hybrid Agent

Combines reactive and deliberative layers. Fast reactive responses for well-defined sub-tasks; deliberative planning for novel or high-stakes decisions. The dominant pattern in production.

WIDELY USED (2024–2026)

🕸️

Multi-Agent System

Multiple specialized agents coordinated by an orchestrator. Each agent handles a focused domain. Enables parallelism and specialization that a single agent cannot achieve.

EMERGING PATTERN (2025–2026)

Type	Memory	Planning	Best For	Trade-off
Reactive	None	None	Classifiers, routers, filters	Fast, but can't handle novel situations
Deliberative	Full	Full	Research, coding, multi-step reasoning	Slower, higher token cost
Hybrid	Selective	Selective	Production agents with mixed task types	More complex to implement and debug
Multi-Agent	Distributed	Distributed	Large workflows needing parallelism	Orchestration overhead, harder to debug

📎 Source: Lilian Weng — LLM Powered Autonomous Agents (2023)

05 · AUTONOMY SPECTRUM

How Much Autonomy Does Your Agent Need?

Anthropic's agent documentation frames autonomy as a spectrum, not a binary. Real deployments live somewhere between a fully passive tool and a fully autonomous system. Understanding this spectrum is essential for making safe, appropriate design decisions.

// AUTONOMY SPECTRUM — TOOL TO MULTI-AGENT NETWORK

L1

AI Tool

Single prompt-response. Human acts on the output. No loop.

← more human

L2

Copilot

AI suggests; human approves and executes every action. AI never acts directly.

L3 ★

Supervised Agent

Autonomous by default; pauses for human approval on irreversible or high-stakes steps. Recommended default for production.

L4

Delegated Agent

Human sets the goal and constraints; agent handles execution end-to-end. Human reviews the final outcome only.

L5

Multi-Agent Network

Orchestrator delegates to specialized sub-agents. Maximum autonomy and parallelism. Highest complexity and risk surface.

more agent →

Anthropic's guidance: "Prefer cautious actions, all else being equal, and be willing to accept a worse expected outcome in order to get a reduction in variance. This is especially true in novel or unclear situations." Start at L3 (Supervised Agent) and only increase autonomy when you have evidence it is safe to do so.

📎 Source: Anthropic — Tool Use / Agents Documentation

06 · WHEN TO BUILD AN AGENT

Agents Add Complexity — Make Sure It's Worth It

Not every AI task needs an agent. An agent adds orchestration overhead, token cost, latency, and surface area for failure. The decision framework comes directly from Anthropic's agent documentation: build an agent when the task genuinely requires multi-step decision-making that cannot be collapsed into a well-crafted single prompt.

Use Case	Use an Agent?	Reason
Summarize a document I've already uploaded	No	A single well-crafted prompt handles this. Agent overhead is waste.
Research a topic across multiple live sources, synthesize findings, and write a report	Yes	Requires multi-step search, reading, filtering, and synthesis. No single prompt can do this.
Translate a paragraph from English to French	No	One inference, one output, no tools needed.
Monitor a codebase, run tests on PR merge, file issues on failure, and notify the team	Yes	Requires tool use (git, CI, issue tracker, Slack), conditionals, and multi-step orchestration.
Answer a customer's question about a product using a knowledge base	Maybe	A RAG pipeline may suffice. Add agent loop only if multi-hop retrieval or follow-up actions are needed.

Rule of thumb: If the task has a clear input and a clear single output, use a prompt. If it requires planning, branching decisions, or external tool calls — and those can't be known in advance — build an agent.

📎 Source: Anthropic — Tool Use / Agents Documentation

07 · FAILURE MODES

How Agents Break in Practice

Understanding how agents fail is as important as understanding how they work. Anthropic's documentation identifies several classes of failure that appear consistently in production deployments. Designing against these from the start saves significant debugging time later.

FAILURE MODE 01

Tool Call Hallucination

The agent invents a tool that doesn't exist or calls a real tool with fabricated parameters. Mitigated by strict tool schemas and output validation.

FAILURE MODE 02

Infinite Loop

The agent retries a failing action indefinitely because the stop condition is unclear or unreachable. Mitigated by explicit step budgets and error escalation logic.

FAILURE MODE 03

Context Window Overflow

Tool outputs and reasoning steps accumulate until the context limit is exceeded, causing truncation or degraded performance. Mitigated by summarization and context management.

FAILURE MODE 04

Goal Drift

The agent gradually shifts away from the original goal as it encounters unexpected intermediate states. Mitigated by periodically re-anchoring the agent to the original task specification.

FAILURE MODE 05

Prompt Injection

Malicious content in tool outputs overrides the agent's instructions. Particularly dangerous when agents browse the web or process untrusted documents. A major active research area.

ACTIVE RISK (2025–2026)

FAILURE MODE 06

Irreversible Action Without Approval

The agent deletes data, sends emails, or spends money without a human checkpoint. Mitigated by classifying all actions as reversible vs. irreversible and requiring approval for the latter.

HIGH IMPACT RISK

📎 Sources: Anthropic — Tool Use / Agents Docs · Lilian Weng — LLM Powered Autonomous Agents (2023)

SOURCES USED IN THIS SECTION

Verified References

Every claim in this section is grounded in one of these sources. No content is generated from model training data alone.

Source	Type	Covers	Recency
Lilian Weng — LLM Powered Autonomous Agents	Blog / Survey	Agent pillars, agent types, ReAct pattern overview	June 2023
Yao et al. — ReAct (arXiv:2210.03629)	Academic paper	Reason + Act loop, Thought/Action/Observation trace, ICLR 2023	Oct 2022 / ICLR 2023
Anthropic — Tool Use Documentation	Official docs	Agent definition, autonomy guidance, failure modes, orchestrator/subagent framing	Maintained 2024–2026

Introduction to
Agentic AI

From Answering to Acting

ReAct: Reason, Then Act

The Real Distinction: Human-in-the-Loop vs. Autonomous

Four Architectural Patterns

How Much Autonomy Does Your Agent Need?

Agents Add Complexity — Make Sure It's Worth It

How Agents Break in Practice

Verified References

Section 01 Quiz

Introduction toAgentic AI

From Answering to Acting

ReAct: Reason, Then Act

The Real Distinction: Human-in-the-Loop vs. Autonomous

Four Architectural Patterns

How Much Autonomy Does Your Agent Need?

Agents Add Complexity — Make Sure It's Worth It

How Agents Break in Practice

Verified References

Section 01 Quiz

Introduction to
Agentic AI