Results for "training loss"
Second-order optimization: Uses curvature information (e.g., the Hessian) to choose update directions; often expensive at scale.
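To show how curvature enters an update, here is a minimal one-dimensional Newton's-method sketch (function names and the test objective are illustrative, not from the source): the gradient is rescaled by the inverse second derivative, the part that becomes expensive when the Hessian is large.

```python
def newton_minimize(grad, hess, x0, steps=20):
    """Newton's method: scale the gradient by inverse curvature.

    For a quadratic objective this converges in one step; at scale,
    forming and inverting the Hessian is what makes it expensive.
    """
    x = x0
    for _ in range(steps):
        x = x - grad(x) / hess(x)
    return x

# Minimize f(x) = (x - 3)^2: gradient 2(x - 3), constant curvature 2.
x_star = newton_minimize(lambda x: 2 * (x - 3), lambda x: 2.0, x0=10.0, steps=1)
```

Because the toy objective is quadratic, a single Newton step lands exactly on the minimizer.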
Vision-language model: A joint model aligning images and text.
Vocoder: Generates audio waveforms from spectrograms.
Gradient: The direction of steepest ascent of a function.
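To make "steepest ascent" concrete, here is a small finite-difference sketch (helper names are illustrative; assumes a smooth function): taking a short step along the estimated gradient increases the function value.

```python
def numeric_grad(f, x, eps=1e-6):
    # Central-difference estimate of each partial derivative.
    g = []
    for i in range(len(x)):
        xp = list(x); xm = list(x)
        xp[i] += eps; xm[i] -= eps
        g.append((f(xp) - f(xm)) / (2 * eps))
    return g

f = lambda v: -(v[0] ** 2) - (v[1] ** 2)     # peak at the origin
x = [1.0, 1.0]
g = numeric_grad(f, x)                        # points toward the origin
x_up = [xi + 0.1 * gi for xi, gi in zip(x, g)]  # small step uphill
```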
Feedback loop: Using production outcomes to improve models.
Line search: Choosing the step size along the gradient direction.
Self-critique: Asking a model to review and improve its own output.
Supervised learning: Learning by minimizing prediction error against known targets.
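One common way to choose the step size is backtracking line search. This scalar-only sketch (illustrative names, not a specific library API) shrinks the step until the Armijo sufficient-decrease condition holds.

```python
def backtracking_step(f, grad, x, alpha0=1.0, beta=0.5, c=1e-4):
    """Backtracking line search along the descent direction -grad(x):
    halve the step until f decreases by at least c * alpha * |grad|^2."""
    g = grad(x)
    alpha = alpha0
    while f(x - alpha * g) > f(x) - c * alpha * g * g:
        alpha *= beta
    return alpha

f = lambda x: x ** 2
grad = lambda x: 2 * x
alpha = backtracking_step(f, grad, x=5.0)   # full step overshoots, so it halves
```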
Safety-critical systems: Systems where failure can cause physical harm.
Unauthorized practice of law: An AI giving legal advice without authorization.
Adversarial setting: Agents have opposing objectives.
Distribution shift: A mismatch between training and deployment data distributions that can degrade model performance.
Overfitting: When a model fits the noise and idiosyncrasies of its training data and performs poorly on unseen data.
Cross-validation: A robust evaluation technique that trains and evaluates across multiple splits to estimate performance variability.
Stochastic gradient descent (SGD): A gradient method using random minibatches for efficient training on large datasets.
Normalization layers: Techniques that stabilize and speed up training by normalizing activations; LayerNorm is common in Transformers.
Large language model (LLM): A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.
Federated learning: Training across many devices or silos without centralizing raw data; only model updates are aggregated, never the data itself.
Reproducibility: The ability to replicate results given the same code and data; harder with distributed training and nondeterministic ops.
Low-Rank Adaptation (LoRA): A PEFT method injecting trainable low-rank matrices into existing layers, enabling efficient fine-tuning.
Automation bias: The tendency to trust automated suggestions even when they are incorrect; mitigated by UI design, training, and checks.
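A minimal sketch of how the splits behind k-fold cross-validation can be produced (an illustrative helper, not a specific library API): each example lands in exactly one validation fold, and the rest form the training set.

```python
def kfold_indices(n, k):
    """Yield (train_idx, val_idx) pairs for k roughly equal folds."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n))
        yield train, val
        start += size

# 10 examples, 5 folds: each fold holds out 2 examples for validation.
folds = list(kfold_indices(10, 5))
```

Training and scoring a model on each pair, then reporting the mean and spread of the scores, is what gives the variability estimate.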
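A toy sketch of minibatch SGD fitting a one-parameter linear model (all names and hyperparameters are illustrative): each update uses one random minibatch's gradient as a cheap, noisy estimate of the full-batch gradient.

```python
import random

def sgd_fit(xs, ys, lr=0.05, batch=4, epochs=200, seed=0):
    """Minibatch SGD for y ~ w*x with squared loss."""
    rng = random.Random(seed)
    idx = list(range(len(xs)))
    w = 0.0
    for _ in range(epochs):
        rng.shuffle(idx)                      # fresh random minibatches each epoch
        for s in range(0, len(idx), batch):
            mb = idx[s:s + batch]
            # d/dw of the mean squared error over the minibatch only
            g = sum(2 * (w * xs[i] - ys[i]) * xs[i] for i in mb) / len(mb)
            w -= lr * g
    return w

xs = [0.1 * i for i in range(20)]
ys = [3.0 * x for x in xs]                    # true slope is 3
w = sgd_fit(xs, ys)
```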
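A numpy sketch of the LayerNorm computation (the learned parameters gamma and beta are passed in explicitly; names are illustrative): each row is normalized over the feature axis, then rescaled and shifted.

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    """Normalize each row to zero mean / unit variance over the last
    axis, then apply a learned scale (gamma) and shift (beta)."""
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return gamma * (x - mu) / np.sqrt(var + eps) + beta

x = np.array([[1.0, 2.0, 3.0, 4.0]])
y = layer_norm(x, gamma=np.ones(4), beta=np.zeros(4))
```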
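The aggregation step can be sketched as federated averaging (a common choice; the helper and toy numbers are illustrative): the server combines client weight vectors weighted by local dataset size, and raw data never leaves the clients.

```python
def fedavg(client_weights, client_sizes):
    """Weighted average of client model weights by local dataset size."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [
        sum(w[j] * n for w, n in zip(client_weights, client_sizes)) / total
        for j in range(dim)
    ]

# Two clients with locally trained weights and different data sizes:
# the larger client (3 examples) pulls the average toward its weights.
avg = fedavg([[1.0, 2.0], [3.0, 4.0]], client_sizes=[1, 3])
```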
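A numpy sketch of the low-rank update (dimensions and initialization are illustrative): the frozen weight W is augmented with a trainable delta B @ A of rank r, and B starts at zero so the adapted model initially matches the pretrained one.

```python
import numpy as np

rng = np.random.default_rng(0)
d, r = 8, 2                        # illustrative layer width and LoRA rank

W = rng.normal(size=(d, d))        # frozen pretrained weight
A = rng.normal(size=(r, d)) * 0.01 # trainable down-projection
B = np.zeros((d, r))               # trainable up-projection, zero at init

def lora_forward(x):
    # Effective weight is W + B @ A; only A and B (2*d*r params) train.
    return x @ (W + B @ A).T

x = rng.normal(size=(1, d))
y0 = lora_forward(x)               # identical to the frozen model at init
```

The efficiency comes from the parameter count: 2\*d\*r trainable values instead of d\*d.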
Saddle point: A point where the gradient is zero but that is neither a maximum nor a minimum; common in deep networks.
Gradient clipping: Limiting gradient magnitude to prevent exploding gradients.
Residual connections: Allow gradients to bypass layers, enabling very deep networks.
Causal masking: Prevents attention to future tokens during training and inference.
Emergent abilities: Capabilities that appear only beyond certain model sizes.
Noise schedule: Controls the amount of noise added at each diffusion step.
Alignment: Ensuring learned behavior matches the intended objective.
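A concrete example is f(x, y) = x^2 - y^2: the gradient vanishes at the origin, yet the origin is a minimum along x and a maximum along y, so it is neither a local max nor a local min.

```python
def f(x, y):
    return x ** 2 - y ** 2        # classic saddle surface

def grad(x, y):
    return (2 * x, -2 * y)

g = grad(0.0, 0.0)                # zero gradient at the origin...
up = f(0.0, 0.1)                  # ...but f decreases along y
down = f(0.1, 0.0)                # ...and increases along x
```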
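Clipping by global L2 norm can be sketched as follows (an illustrative helper, not a specific framework API): gradients with norm above the threshold are rescaled onto the threshold sphere, and smaller ones pass through untouched.

```python
import math

def clip_by_norm(grad, max_norm):
    """Rescale the gradient vector if its L2 norm exceeds max_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    if norm > max_norm:
        scale = max_norm / norm
        return [g * scale for g in grad]
    return grad

g = clip_by_norm([3.0, 4.0], max_norm=1.0)   # norm 5 -> rescaled to norm 1
```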
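The skip connection amounts to y = x + f(x). In this tiny sketch (illustrative names), even a "dead" sublayer that outputs zeros still passes the input through unchanged, which is the identity path that keeps gradients flowing in very deep stacks.

```python
import numpy as np

def residual_block(x, layer):
    # Output is the input plus a learned correction; the identity path
    # lets gradients flow straight through even if `layer` saturates.
    return x + layer(x)

x = np.array([1.0, -2.0, 3.0])
zero_layer = lambda v: np.zeros_like(v)   # a sublayer contributing nothing
y = residual_block(x, zero_layer)         # the input survives intact
```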
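A numpy sketch of an additive causal mask applied before the softmax (helper names are illustrative): position i may attend only to positions j <= i, so entries in the upper triangle are set to -inf and get zero attention weight.

```python
import numpy as np

def causal_mask(t):
    """Additive mask: 0 where attention is allowed (j <= i),
    -inf where a position would attend to a future token (j > i)."""
    return np.triu(np.full((t, t), -np.inf), k=1)

def masked_softmax(scores, mask):
    s = scores + mask
    e = np.exp(s - s.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

scores = np.zeros((3, 3))                 # uniform scores for clarity
attn = masked_softmax(scores, causal_mask(3))
```

With uniform scores, row i spreads its weight evenly over the i+1 visible positions.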
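A sketch of a linear beta schedule together with the closed-form forward noising step used in DDPM-style diffusion (variable names and the schedule endpoints are illustrative): larger t means more accumulated noise on the clean sample x0.

```python
import math
import random

def linear_betas(steps, beta_min=1e-4, beta_max=0.02):
    """Linear noise schedule: beta_t is the variance injected at step t."""
    return [beta_min + (beta_max - beta_min) * t / (steps - 1)
            for t in range(steps)]

def add_noise(x0, t, betas, rng):
    # Closed form of the forward process:
    # x_t = sqrt(abar_t) * x0 + sqrt(1 - abar_t) * eps, abar_t = prod(1 - beta_s)
    alpha_bar = 1.0
    for b in betas[: t + 1]:
        alpha_bar *= 1.0 - b
    eps = rng.gauss(0.0, 1.0)
    return math.sqrt(alpha_bar) * x0 + math.sqrt(1.0 - alpha_bar) * eps

betas = linear_betas(1000)
x_noisy = add_noise(1.0, t=999, betas=betas, rng=random.Random(0))
```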
Robust alignment: Maintaining alignment under new conditions.
Model collapse: A model trained on its own outputs progressively degrades in quality.