Skip to main content

Glossary

Plain definitions for AI terms you'll see across tools, docs, and this directory.

A

AI agent
A system that plans multi-step tasks, calls tools, and iterates until a goal is met — not just a single chat reply.

Tools

Related: MCP (Model Context Protocol), Tool use / function calling

C

Context window
How much text (in tokens) a model can consider in one request — including your prompt, files, and its reply.

Fundamentals

Related: Token, RAG (Retrieval-Augmented Generation)

E

Embedding
A numeric vector representing meaning. Similar texts have similar vectors — used for search and clustering.

Architecture

Related: Vector database, RAG (Retrieval-Augmented Generation)

F

Few-shot prompting
Including a few input→output examples in the prompt to teach format and behavior without fine-tuning.

Practice

Related: Prompt engineering

Fine-tuning
Additional training on a smaller dataset to specialize a base model for a domain, tone, or task.

Training

Related: LLM (Large Language Model), Prompt engineering

G

Grounding
Tying answers to verified sources — web search, your files, or tool output — instead of memory alone.

Quality

Related: RAG (Retrieval-Augmented Generation), Hallucination

H

Hallucination
When a model states false information confidently. Mitigate with citations, RAG, and verification steps.

Quality

Related: RAG (Retrieval-Augmented Generation), Grounding

L

LLM (Large Language Model)
A neural network trained on vast text to predict the next token. Powers chat assistants, coding tools, and many AI products.

Fundamentals

Related: Token, Context window

M

MCP (Model Context Protocol)
An open standard for connecting AI clients to external tools and data via MCP servers (filesystem, GitHub, databases, etc.).

Tools

Related: AI agent, Tool use / function calling

Modality
Input/output types a model handles: text, code, image, voice, video. Multimodal models support several.

Fundamentals

Related: LLM (Large Language Model)

P

Prompt engineering
Crafting instructions, examples, and structure so models produce reliable, useful outputs.

Practice

Related: Few-shot prompting, System prompt

R

RAG (Retrieval-Augmented Generation)
Fetch relevant documents first, then ask the model to answer using those sources — reduces hallucinations on private data.

Architecture

Related: Embedding, Vector database

S

System prompt
Hidden instructions that set persona, rules, and constraints for every turn in a conversation.

Practice

Related: Prompt engineering

T

Token
The unit models read and write — roughly ¾ of an English word. Pricing and context limits are often measured in tokens.

Fundamentals

Related: Context window, LLM (Large Language Model)

Tool use / function calling
When a model returns structured calls (e.g. search, run code) that the host app executes and feeds back as results.

Tools

Related: MCP (Model Context Protocol), AI agent

V

Vector database
Storage optimized for similarity search over embeddings — common backend for RAG and semantic search.

Architecture

Related: Embedding, RAG (Retrieval-Augmented Generation)