Agentic AI Glossary · The Context Advantage

Glossary

The Context Advantage glossary

Plain-English definitions for the agentic AI, data engineering, and AI governance terms used throughout the book. Every term links to the chapter that goes deeper.

118 of 118 terms

A

Agent: A system that can answer, reason, plan, use tools, and take action — not just chat.
Ch 2 Ch 10
Agentic AI: AI that acts on real systems, not just answers questions.
Ch 2 Ch 4
Action Control: Policies that decide what an agent is allowed to do, not just what it can see.
Ch 11 Ch 12 Ch 27
Access Control: Policies that decide what data an identity is allowed to see.
Ch 11
Audit Trail: A structured record of what the agent did, why, and with what evidence.
Ch 13 Ch 27
Approval Queue: A shared inbox where humans review agent actions before they execute.
Ch 12 Ch 13
Action Inventory: The structured list of every action an agent can take, with risk tier and owner.
Ch 11 Ch 27
Action Gateway: An in-line service every agent action passes through for policy checks and logging.
Ch 11 Ch 27
Agent Graph: A multi-agent topology where agents call each other freely. Powerful and easy to overuse.
Ch 31

B

Business Memory: The layer that captures how your company defines its world, beyond just schemas.
Ch 5 Ch 7 Ch 8
Business Rule: A condition or exception the business enforces, like discount eligibility.
Ch 7 Ch 8
Backpressure: Slowing the producer when the consumer cannot keep up.
Ch 16 Ch 17
Blast Radius: How many people, records, or systems an action affects if it goes wrong.
Ch 11 Ch 12

C

Caching: Reusing a previous answer to avoid recomputing it.
Ch 17 Ch 28
Catalog: An inventory of where data lives and who owns it.
Ch 6 Ch 11
Cascading Models: Trying a cheap model first and escalating to a larger one only when needed.
Ch 17
Citation: A reference the agent shows so users can verify where an answer came from.
Ch 13 Ch 14
Choice: The ability to swap models, tools, or vendors without rewriting your system.
Ch 4 Ch 20 Ch 21 Ch 22
Confidence: How sure the agent is about its answer, ideally surfaced honestly.
Ch 14
Context: The meaning, definitions, and relationships an agent needs to answer correctly.
Ch 1 Ch 4 Ch 5 Ch 6
Context Engineer: A data professional who owns meaning the way engineers own code.
Ch 6 Ch 26
Context Layer: The queryable layer above storage and compute that holds business meaning.
Ch 5 Ch 6 Ch 7 Ch 28
Context Window: The maximum amount of text a model can read in a single request.
Ch 3 Ch 17
Control: The set of policies that make agent behavior safe and predictable.
Ch 4 Ch 10 Ch 11 Ch 12 Ch 13
Cost: The total expense of running AI — tokens, tools, retries, and infrastructure.
Ch 4 Ch 15 Ch 16 Ch 17 Ch 18 Ch 19
Cost-Aware Architecture: A design that treats cost like latency: a first-class requirement.
Ch 15 Ch 16 Ch 17
Compositional Tools: Small, reliable tools that agents combine to do bigger work.
Ch 27 Ch 31
Calibrated Confidence: Confidence derived from system evidence, not from raw model probabilities.
Ch 14

D

Data Product: A piece of data offered with quality, ownership, and a clear contract.
Ch 6 Ch 7
Delta Lake: An open table format that adds reliability features on top of Parquet files.
Ch 22
Drift: When model or data behavior changes over time, often silently.
Ch 13 Ch 19
Determinism: Same input, same output, every time. Hard for LLMs, important for audits.
Ch 13 Ch 14
Downgrade: A policy outcome that lets a smaller, safer version of the action proceed.
Ch 12 Ch 17

E

Embedding: A numeric representation of text used for similarity search.
Ch 3 Ch 8
Evaluation: Measuring whether the agent is giving correct, safe, useful answers.
Ch 14 Ch 19
Eval Harness: A test suite that scores AI quality across many examples.
Ch 14 Ch 19
Eval-Driven Development: Building AI features by writing evaluations first, then improving the system.
Ch 14 Ch 19 Ch 29

F

Fine-Tuning: Adapting a base model to your domain by training it on your data.
Ch 3 Ch 21

G

Gateway: A single entry point that all AI calls go through for security, routing, and logging.
Ch 11 Ch 16 Ch 27
Glossary: A list of business terms with agreed, simple definitions.
Ch 7
Governance: The set of rules that decide what is allowed in the data and AI stack.
Ch 10 Ch 11 Ch 23
Grounding: Anchoring an answer in real, citable data instead of model memory.
Ch 13 Ch 14
Guardrails: Code-enforced rules that block dangerous inputs or outputs.
Ch 11 Ch 12
Golden Set: A curated set of examples used as the truth for evaluation.
Ch 14 Ch 19

H

Hallucination: When a model produces something that sounds confident but is not true.
Ch 1 Ch 3 Ch 14
Human in the Loop: A design where humans review, approve, or take over agent actions.
Ch 12 Ch 13 Ch 27
Handoff Contract: The typed schema two agents agree on when passing work between them.
Ch 31

I

Iceberg: An open table format widely adopted for lakehouses.
Ch 22
Ingestion: Bringing raw data into your platform from source systems.
Ch 6
Inference: Asking a model to produce an output, as opposed to training it.
Ch 3 Ch 17
Idempotent: Safe to retry — the same call twice gives the same result.
Ch 27
Interface Dividend: The compounding velocity gain a platform earns by routing change through stable interfaces.
Ch 21 Ch 22 Ch 23

K

Knowledge Graph: A structured map of concepts and how they relate, used to enrich context.
Ch 8

L

Lakehouse: A storage architecture that combines lake flexibility with warehouse reliability.
Ch 5 Ch 22
Lineage: The path a piece of data took from source to answer.
Ch 6 Ch 13
LLM: A large language model trained to generate human-like text.
Ch 2 Ch 3

M

MCP: Model Context Protocol — a standard way for agents to talk to tools and data.
Ch 21 Ch 23
Metric: A defined business number, like revenue or churn.
Ch 7 Ch 8
Metrics Layer: The place where metrics are defined once and calculated consistently.
Ch 7 Ch 8
Model Routing: Sending each request to the smallest model that can handle it well.
Ch 16 Ch 17
Model Choice Matrix: A map of tasks to the right model size for each one.
Ch 16 Ch 22
Multi-Agent: A design where multiple specialized agents work together.
Ch 31

O

Observability: The ability to see what your AI system is doing in production.
Ch 13 Ch 19 Ch 27
Ontology: A structured map of business concepts and how they relate.
Ch 5 Ch 8
Open Format: A data or model format anyone can read without vendor lock-in.
Ch 22 Ch 23
Open Interface: A standard API anyone can implement, reducing lock-in.
Ch 21 Ch 22 Ch 23
Orchestration: Coordinating multiple steps, tools, or agents into one workflow.
Ch 27 Ch 31

P

Parquet: An open columnar file format widely used in data platforms.
Ch 22
Permission: A rule about what an identity is allowed to see or do.
Ch 11
Policy as Code: Writing governance rules as software the system enforces automatically.
Ch 11 Ch 12 Ch 27
Policy Engine: A service that evaluates policy-as-code at runtime.
Ch 11 Ch 27
Prompt: The instructions and context sent to a model in a single request.
Ch 3 Ch 17
Prompt Engineering: Designing prompts so models behave well for a given task.
Ch 3
Pipeline Pattern: A multi-agent topology where agents run in a fixed sequence — the safest default.
Ch 31
Provenance: The chain of sources behind an agent's answer, surfaced so users can verify it.
Ch 13 Ch 14

R

RAG: Retrieval-Augmented Generation — letting the model read your data before answering.
Ch 3 Ch 8 Ch 9
Reasoning: A model thinking through steps before producing a final answer.
Ch 2 Ch 3
Reference Architecture: A shared pattern teams follow so each new project does not reinvent the wheel.
Ch 27 Ch 28
Retrieval: Looking up relevant content to give a model better context.
Ch 8 Ch 9 Ch 17
Retry: Re-running a failed model call. Quietly expensive at scale.
Ch 17 Ch 19
Rollback: Turning off or reverting an agent quickly when something goes wrong.
Ch 13 Ch 27
Router: A cheap classifier that decides which model or tool handles a request.
Ch 16 Ch 17
Replayable: A system where you can re-run history to debug or recover.
Ch 13 Ch 27
Reversibility: Whether an action can be undone cleanly, and how quickly.
Ch 12 Ch 13

S

Schema: The shape of a table — columns and types.
Ch 5 Ch 6
Semantic Cache: A cache keyed by meaning, not exact text, that reuses similar answers.
Ch 17
Semantic Layer: The layer that maps business meaning to underlying data.
Ch 5 Ch 7 Ch 8 Ch 26
Semantic Search: Finding content by meaning, not exact keywords.
Ch 8
Sensitivity Tag: A label that marks data by how sensitive it is, like PII or financial.
Ch 11
Signal: A measurable indicator used to monitor quality, cost, or safety.
Ch 14 Ch 19
Source of Truth: The one place a fact is considered authoritative.
Ch 6 Ch 7
Stewardship: The ongoing care and ownership of a data asset.
Ch 6 Ch 7 Ch 26
Stream: Continuous data that arrives event by event rather than batch by batch.
Ch 6
Structured Retrieval: Looking up answers in tables, metrics, or APIs before falling back to text search.
Ch 8 Ch 9 Ch 17
Synthetic Data: Generated data used for training or testing when real data is scarce.
Ch 19
Side Effect: Something an action changes in the outside world, like sending an email.
Ch 10 Ch 27
Supervisor Pattern: A multi-agent topology where one agent routes work to specialists.
Ch 31

T

Telemetry: Data emitted by a system that lets you observe its behavior.
Ch 13 Ch 19
Throttling: Limiting the rate of requests to control cost or load.
Ch 16 Ch 17
Token: The unit a model reads or writes — roughly a few characters.
Ch 3 Ch 15 Ch 17
Token Budget: A target for how many tokens a request is allowed to use.
Ch 16 Ch 17
Tool Call: When the agent invokes an external tool, like a query or API.
Ch 10 Ch 27
Trace: The full record of a single agent run, end to end.
Ch 13 Ch 19
Trust Path: The sequence of checks an agent action passes before it executes.
Ch 13 Ch 27
Trusted Agent Architecture: A nine-step reference flow every production agent follows.
Ch 27 Ch 28
Trusted Source: A dataset agreed upon as authoritative for a given metric.
Ch 6 Ch 7
Termination Condition: The explicit rule that ends an agent loop. Without one, loops burn money.
Ch 27 Ch 31
Trust Signal: A visible cue (source, definition, confidence, limit) that helps users decide when to trust an agent.
Ch 14

U

Unit Cost: Cost per unit of value delivered, like cost per resolved ticket.
Ch 15 Ch 16 Ch 19
Unity Catalog: Databricks' governance layer; one example of a managed catalog.
Ch 22

V

Validation: Checking that an input or output meets the rules before using it.
Ch 11 Ch 14
Vector Database: A store that lets you search by embedding similarity.
Ch 8 Ch 22
Vector Search: Finding similar content using embeddings.
Ch 8
Vendor Lock-In: A situation where switching providers is expensive or slow.
Ch 20 Ch 21 Ch 22
Versioning: Tracking changes to data, models, or definitions over time.
Ch 6 Ch 13

W

Warehouse: A structured store optimized for analytics queries.
Ch 5 Ch 22
Workflow: A defined sequence of steps a system runs to complete a task.
Ch 27 Ch 31

Z

Zero-Shot: Asking a model to do a task it was not explicitly trained for.
Ch 3

Go deeper than definitions.

The book turns these terms into a working method — Context, Control, Cost, Choice.

Read free preview Get the book — $59