Results for "dataset documentation"

50 results

Data Labeling Intermediate

Human or automated process of assigning targets; quality, consistency, and guidelines matter heavily.

Foundations & Theory
Inter-Annotator Agreement Intermediate

Measure of consistency across labelers; low agreement indicates ambiguous tasks or poor guidelines.

Foundations & Theory
Data Augmentation Intermediate

Expanding training data via transformations (flips, noise, paraphrases) to improve robustness.

Foundations & Theory
Differential Privacy Intermediate

A formal privacy framework ensuring outputs do not reveal much about any single individual’s data contribution.

Security & Privacy
Synthetic Data Intermediate

Artificially created data used to train/test models; helpful for privacy and coverage, risky if unrealistic.

Foundations & Theory
Distillation Intermediate

Training a smaller “student” model to mimic a larger “teacher,” often improving efficiency while retaining performance.

Foundations & Theory
Data Poisoning Intermediate

Maliciously inserting or altering training data to implant backdoors or degrade performance.

Foundations & Theory
Depth vs Width Intermediate

Tradeoffs between many layers vs many neurons per layer.

AI Economics & Strategy
Canary Tokens Intermediate

Detecting unauthorized model outputs or data leaks.

AI Economics & Strategy
Generative Model Advanced

Models that learn to generate samples resembling training data.

Diffusion & Generative Models
Image Classification Intermediate

Assigning category labels to images.

Computer Vision
CLIP Intermediate

Joint vision-language model aligning images and text.

Computer Vision
Wake Word Detection Intermediate

Detects trigger phrases in audio streams.

Speech & Audio AI
Trend Component Intermediate

Persistent directional movement over time.

Time Series
Batch Inference Intermediate

Running predictions on large datasets periodically.

MLOps & Infrastructure
Training Cost Intermediate

Cost of model training.

AI Economics & Strategy
Open-Weight Model Intermediate

Models whose weights are publicly available.

AI Economics & Strategy
Overgeneralization Intermediate

Applying learned patterns incorrectly.

Model Failure Modes
AlphaFold Advanced

Deep learning system for protein structure prediction.

AI in Science
Symbolic Regression Advanced

Finding mathematical equations from data.

AI in Science

Welcome to AI Glossary

The free, curated AI dictionary built from real, established terms and designed for a clean reading experience.

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.