Results for "model-based"

Model-Based RL

Advanced

RL using learned or known environment models.

Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...

Full Definition View in 3D WordGraph

405 results

Active Experimentation Advanced

AI selecting next experiments.

Cooperative Game Advanced

Agents optimize collective outcomes.

Agents & Autonomy

Competitive Game Advanced

Agents have opposing objectives.

Agents & Autonomy

Auction Mechanism Advanced

Rules governing auctions.

Agents & Autonomy

Mechanism Design Advanced

Designing systems where rational agents behave as desired.

Agents & Autonomy

Vickrey Auction Advanced

Truthful bidding is optimal strategy.

Agents & Autonomy

Emergent Competition Advanced

Competition arises without explicit design.

Agents & Autonomy

Algorithmic Collusion Advanced

AI tacitly coordinating prices.

Agents & Autonomy

Strategic Interaction Advanced

Decisions dependent on others’ actions.

Agents & Autonomy

Information Asymmetry Advanced

Some agents know more than others.

Agents & Autonomy

Swarm Dynamics Advanced

Collective behavior without central control.

Dynamics & Physics

Meta-Cognition Frontier

Awareness and regulation of internal processes.

AGI & General Intelligence

Multitask Learning Intermediate

Training one model on multiple tasks simultaneously to improve generalization through shared structure.

Machine Learning

Objective Function Intermediate

A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.

Data Leakage Intermediate

When information from evaluation data improperly influences training, inflating reported performance.

Foundations & Theory

Grounding Intermediate

Constraining outputs to retrieved or provided sources, often with citation, to improve factual reliability.

Foundations & Theory

Language Model Intermediate

A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.

Large Language Models

Fine-Tuning Intermediate

Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.

Large Language Models

SFT Intermediate

Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.

Foundations & Theory

Alignment Intermediate

Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.

Foundations & Theory

MLOps Intermediate

Practices for operationalizing ML: versioning, CI/CD, monitoring, retraining, and reliable production management.

MLOps & Infrastructure

CI/CD for ML Intermediate

Automated testing and deployment processes for models and data workflows, extending DevOps to ML artifacts.

MLOps & Infrastructure

Monitoring Intermediate

Observing model inputs/outputs, latency, cost, and quality over time to catch regressions and drift.

MLOps & Infrastructure

Adversarial Example Intermediate

Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.

Foundations & Theory

Prompt Injection Intermediate

Attacks that manipulate model instructions (especially via retrieved content) to override system goals or exfiltrate data.

Foundations & Theory

Loss Landscape Intermediate

The shape of the loss function over parameter space.

AI Economics & Strategy

Emergent Abilities Intermediate

Capabilities that appear only beyond certain model sizes.

AI Economics & Strategy

ARIMA Intermediate

Classical statistical time-series model.

Inference Pipeline Intermediate

Model execution path in production.

MLOps & Infrastructure

Distribution Shift Intermediate

Train/test environment mismatch.

Model Failure Modes

1 2 3 4 5 6 7 8 9 10 11 12 13 14