Results for "model-based"

Model-Based RL

Advanced

RL using learned or known environment models.

Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...

Full Definition View in 3D WordGraph

405 results

Embedding Intermediate

A continuous vector encoding of an item (word, image, user) such that semantic similarity corresponds to geometric closeness.

Machine Learning

Attention Intermediate

Mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.

Transformers & LLMs

LSTM Intermediate

An RNN variant using gates to mitigate vanishing gradients and capture longer context.

Foundations & Theory

Vector Database Intermediate

A datastore optimized for similarity search over embeddings, enabling semantic retrieval at scale.

Large Language Models

A/B Testing Intermediate

Controlled experiment comparing variants by random assignment to estimate causal effects of changes.

Foundations & Theory

Beam Search Intermediate

Search algorithm for generation that keeps top-k partial sequences; can improve likelihood but reduce diversity.

Foundations & Theory

Agent Intermediate

A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.

Agents & Autonomy

Gradient Clipping Intermediate

Limiting gradient magnitude to prevent exploding gradients.

AI Economics & Strategy

Planning Intermediate

Methods for breaking goals into steps; can be classical (A*, STRIPS) or LLM-driven with tool calls.

Foundations & Theory

Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy

NLP Intermediate

AI subfield dealing with understanding and generating human language, including syntax, semantics, and pragmatics.

Foundations & Theory

Memory Augmentation Intermediate

Extending agents with long-term memory stores.

AI Economics & Strategy

Q-Function Intermediate

Expected return of taking action in a state.

AI Economics & Strategy

Emergent Coordination Intermediate

Coordination arising without explicit programming.

AI Economics & Strategy

Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy

Use-Case Classification Intermediate

Categorizing AI applications by impact and regulatory risk.

AI Economics & Strategy

Off-Policy Learning Intermediate

Learning from data generated by a different policy.

AI Economics & Strategy

Graph Neural Network Intermediate

Neural networks that operate on graph-structured data by propagating information along edges.

Model Architectures

Message Passing Neural Network Intermediate

GNN framework where nodes iteratively exchange and aggregate messages from neighbors.

Model Architectures

Optical Flow Intermediate

Pixel motion estimation between frames.

Computer Vision

Seasonality Intermediate

Repeating temporal patterns.

Blue-Green Deployment Intermediate

Maintaining two environments for instant rollback.

MLOps & Infrastructure

Autonomous Agent Advanced

System that independently pursues goals over time.

Agents & Autonomy

ReAct Pattern Advanced

Interleaving reasoning and tool use.

Agents & Autonomy

Central Limit Theorem Advanced

Sum of independent variables converges to normal distribution.

Probability & Statistics

Posterior Distribution Advanced

Updated belief after observing data.

Probability & Statistics

Prior Distribution Advanced

Belief before observing data.

Probability & Statistics

Stochastic Approximation Intermediate

Optimization under uncertainty.

Foundations & Theory

EU AI Act Intermediate

European regulation classifying AI systems by risk.

Governance & Ethics

High-Risk AI System Intermediate

AI used in sensitive domains requiring compliance.

Governance & Ethics

1 2 3 4 5 6 7 8 9 10 11 12 13 14