Results for "model-based"
Model-Based RL
Advanced · RL using learned or known environment models.
Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...
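The "map" analogy above can be sketched in code: the agent first fits a model of the environment from interaction, then plans against that model instead of acting blindly. The following is a minimal Dyna-style sketch on a hypothetical 5-state chain (all environment details here are illustrative, not from the glossary entry): transitions are recorded from random interaction, then value iteration plans entirely inside the learned model.

```python
import random

# Hypothetical toy chain: states 0..4, action 0 moves left, action 1 moves
# right; reaching state 4 (the goal) yields reward 1. Purely illustrative.
N_STATES, GOAL = 5, 4

def step(s, a):
    s2 = max(0, s - 1) if a == 0 else min(GOAL, s + 1)
    return s2, (1.0 if s2 == GOAL else 0.0)

# 1) Learn a model of the environment from random interaction.
#    The environment is deterministic, so storing the last observed
#    (next_state, reward) per (state, action) pair is enough.
model = {}
random.seed(0)
for _ in range(500):
    s = random.randrange(N_STATES)
    a = random.randrange(2)
    model[(s, a)] = step(s, a)

# 2) Plan with the learned model: value iteration over imagined transitions,
#    never touching the real environment again.
gamma = 0.9
V = [0.0] * N_STATES
for _ in range(50):
    for s in range(N_STATES):
        V[s] = max(model[(s, a)][1] + gamma * V[model[(s, a)][0]]
                   for a in (0, 1))

# Greedy policy with respect to the planned values.
policy = [max((0, 1), key=lambda a: model[(s, a)][1] + gamma * V[model[(s, a)][0]])
          for s in range(N_STATES)]
```

Here planning recovers the obvious route (always move right toward the goal); in realistic settings the learned model is approximate, which is the central difficulty of model-based RL.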
A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.
Built-in assumptions guiding learning efficiency and generalization.
Extracting system prompts or hidden instructions.
Model relies on irrelevant signals.
Using production outcomes to improve models.
Learning physical parameters from data.
Fast approximation of costly simulations.
Local surrogate explanation method approximating model behavior near a specific input.
Standardized documentation describing intended use, performance, limitations, data, and ethical considerations.
Inferring sensitive features of training data.
Required descriptions of model behavior and limits.
Architecture that retrieves relevant documents (e.g., from a vector DB) and conditions generation on them to reduce hallucinations.
Models that process or generate multiple modalities, enabling vision-language tasks, speech, video understanding, etc.
Routes inputs to subsets of parameters for scalable capacity.
A single attention mechanism within multi-head attention.
Joint vision-language model aligning images and text.
Models trained to decide when to call tools.
Temporary reasoning space (often hidden).
Probabilities do not reflect true correctness.
Restricting distribution of powerful models.
A parameterized mapping from inputs to outputs; includes architecture + learned parameters.
When a model cannot capture underlying structure, performing poorly on both training and test data.
Policies and practices for approving, monitoring, auditing, and documenting models in production.
Training a smaller “student” model to mimic a larger “teacher,” often improving efficiency while retaining performance.
Framework for identifying, measuring, and mitigating model risks.
Competitive advantage from proprietary models/data.
One complete traversal of the training dataset during training.
Injects sequence order into Transformers, since attention alone is permutation-invariant.
Letting an LLM call external functions/APIs to fetch data, compute, or take actions, improving reliability.
Raw model outputs before converting to probabilities; manipulated during decoding and calibration.
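The last snippet above notes that logits are manipulated during decoding and calibration. A common example of both is temperature scaling followed by a softmax: dividing logits by a temperature below 1 sharpens the distribution toward the top candidate, while a temperature above 1 flattens it. A minimal sketch (the example logit values are made up):

```python
import math

def softmax(logits, temperature=1.0):
    # Scale logits by temperature, then normalize into probabilities.
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.1]       # illustrative raw model outputs
probs = softmax(logits)        # default temperature 1.0
sharp = softmax(logits, temperature=0.5)  # lower T -> more peaked
```

The same transformation, with the temperature fit on held-out data instead of chosen by hand, is the standard post-hoc calibration method for miscalibrated confidence scores.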