Results for "model-based"

Model-Based RL

Advanced

RL using learned or known environment models.

Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...

405 results

Active Experimentation Advanced

AI selecting next experiments.

AI in Science
Cooperative Game Advanced

Agents optimize collective outcomes.

Agents & Autonomy
Competitive Game Advanced

Agents have opposing objectives.

Agents & Autonomy
Auction Mechanism Advanced

Rules governing auctions.

Agents & Autonomy
Mechanism Design Advanced

Designing systems where rational agents behave as desired.

Agents & Autonomy
Vickrey Auction Advanced

Truthful bidding is optimal strategy.

Agents & Autonomy
Emergent Competition Advanced

Competition arises without explicit design.

Agents & Autonomy
Algorithmic Collusion Advanced

AI tacitly coordinating prices.

Agents & Autonomy
Strategic Interaction Advanced

Decisions dependent on others’ actions.

Agents & Autonomy
Information Asymmetry Advanced

Some agents know more than others.

Agents & Autonomy
Swarm Dynamics Advanced

Collective behavior without central control.

Dynamics & Physics
Meta-Cognition Frontier

Awareness and regulation of internal processes.

AGI & General Intelligence
Multitask Learning Intermediate

Training one model on multiple tasks simultaneously to improve generalization through shared structure.

Machine Learning
Objective Function Intermediate

A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.

Optimization
Data Leakage Intermediate

When information from evaluation data improperly influences training, inflating reported performance.

Foundations & Theory
Grounding Intermediate

Constraining outputs to retrieved or provided sources, often with citation, to improve factual reliability.

Foundations & Theory
Language Model Intermediate

A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.

Large Language Models
Fine-Tuning Intermediate

Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.

Large Language Models
SFT Intermediate

Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.

Foundations & Theory
Alignment Intermediate

Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.

Foundations & Theory
MLOps Intermediate

Practices for operationalizing ML: versioning, CI/CD, monitoring, retraining, and reliable production management.

MLOps & Infrastructure
CI/CD for ML Intermediate

Automated testing and deployment processes for models and data workflows, extending DevOps to ML artifacts.

MLOps & Infrastructure
Monitoring Intermediate

Observing model inputs/outputs, latency, cost, and quality over time to catch regressions and drift.

MLOps & Infrastructure
Adversarial Example Intermediate

Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.

Foundations & Theory
Prompt Injection Intermediate

Attacks that manipulate model instructions (especially via retrieved content) to override system goals or exfiltrate data.

Foundations & Theory
Loss Landscape Intermediate

The shape of the loss function over parameter space.

AI Economics & Strategy
Emergent Abilities Intermediate

Capabilities that appear only beyond certain model sizes.

AI Economics & Strategy
ARIMA Intermediate

Classical statistical time-series model.

Time Series
Inference Pipeline Intermediate

Model execution path in production.

MLOps & Infrastructure
Distribution Shift Intermediate

Train/test environment mismatch.

Model Failure Modes

Welcome to AI Glossary

The free, curated AI dictionary built from real, established terms and designed for a clean reading experience.

Search

Type any question or keyword into the search bar at the top.

Browse

Tap a letter in the A–Z bar to browse terms alphabetically, or filter by domain, industry, or difficulty level.

3D WordGraph

Fly around the interactive 3D graph to explore how AI concepts connect. Click any word to read its full definition.