Question 1

What is AI agent observability?

Accepted Answer

AI agent observability is the practice of tracing every step of a multi-step AI agent pipeline: each LLM call, each tool use, each inter-agent handoff, and the full call tree. Unlike basic LLM logging, agent observability must capture the causal chain across many spans — so you can see exactly which tool returned null, which LLM received corrupted context, and which step caused a cascade failure.

Question 2

How is AI agent observability different from LLM observability?

Accepted Answer

LLM observability focuses on individual call metrics: tokens, latency, cost, and output quality for a single chat completion. AI agent observability covers the full pipeline: a workflow that makes dozens of LLM calls across multiple models, invokes external tools, and may spawn sub-agents. The trace must show the entire call tree — not just individual calls — so a failed tool two levels deep is immediately visible as the cause of a bad final answer.

Question 3

What should an AI agent observability tool capture?

Accepted Answer

A complete AI agent observability tool should capture: (1) full trace waterfall — every LLM call, tool invocation, and sub-agent step as a span with parent-child relationships; (2) token counts, cost, and latency per span and per workflow; (3) tool inputs and outputs, not just the LLM response; (4) hallucination scores at the claim level, not just a single number; (5) cost per workflow and cost trend over time so you catch prompt bloat before it shows up on your bill.

Question 4

Does Peekr support LangChain, CrewAI, and LlamaIndex?

Accepted Answer

Yes. Peekr auto-instruments at the class level — every OpenAI(), AsyncOpenAI(), and anthropic.Anthropic() instance is patched automatically. LangChain, CrewAI, LlamaIndex, and any other framework that calls the underlying provider SDK is covered without any per-framework configuration.

Question 5

How do I add AI agent observability to an existing Python agent?

Accepted Answer

Add two lines before your other imports: `import peekr` and `peekr.instrument(tenant_id='your-project', exporter=peekr.HTTPExporter(endpoint='https://peekr.starkspherelabs.com', api_key='pk_live_...'))`. Peekr patches the provider clients at the class level — your agent code stays unchanged and every LLM call and tool span is captured automatically.

Trace every step your
AI agent takes.

Single-call tracing misses agent failures.

Cascade failures

Runaway loops

Cost per workflow

The trace you wish you had at 2am.

“My agent gave the wrong answer — but I don't know why.”

“One user request triggered 122 LLM calls.”

“My agent is hallucinating — which step caused it?”

“My agent cost $0.40 per request and I don't know why.”

Instrument your agent before the first import. The rest is automatic.

Common questions about AI agent observability.

Observe your first agent in two lines.

Trace every step yourAI agent takes.