Results for "model-based"
Model-Based RL
Advanced
RL using learned or known environment models.
Model-based reinforcement learning is like having a map while exploring a new city. Instead of wandering around aimlessly, you can look at the map to plan your route and make better decisions about where to go next. In this type of learning, an AI agent first learns how the environment works—like...
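The entry above describes the two phases of model-based RL: first learn how the environment works, then use that learned model, like a map, to plan. A minimal sketch in Python of this idea on a toy grid world (all names, the grid size, and the exploration/planning routines here are illustrative assumptions, not from this glossary):

```python
import random
from collections import deque

SIZE = 4  # toy 4x4 grid world
ACTIONS = {"up": (0, -1), "down": (0, 1), "left": (-1, 0), "right": (1, 0)}

def step(state, action):
    """True environment dynamics (unknown to the planner)."""
    x, y = state
    dx, dy = ACTIONS[action]
    return (min(max(x + dx, 0), SIZE - 1), min(max(y + dy, 0), SIZE - 1))

# Phase 1 -- learn a model: explore randomly, record observed transitions.
model = {}  # (state, action) -> observed next_state
state = (0, 0)
random.seed(0)
for _ in range(2000):
    action = random.choice(list(ACTIONS))
    next_state = step(state, action)
    model[(state, action)] = next_state
    state = next_state

# Phase 2 -- plan with the learned model ("look at the map"):
# breadth-first search over recorded transitions to reach a goal.
def plan(start, goal):
    frontier = deque([(start, [])])
    visited = {start}
    while frontier:
        s, path = frontier.popleft()
        if s == goal:
            return path
        for a in ACTIONS:
            ns = model.get((s, a))
            if ns is not None and ns not in visited:
                visited.add(ns)
                frontier.append((ns, path + [a]))
    return None  # goal unreachable under the learned model

route = plan((0, 0), (3, 3))
print(route)
```

Because the planner only consults `model`, never `step`, it can rehearse many candidate routes without taking a single real action, which is the core appeal of model-based methods.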
- Constraining model outputs to a schema used to call external APIs/tools safely and deterministically.
- A measure of a model class's expressive capacity, based on its ability to shatter datasets.
- Prevents attention to future tokens during training/inference.
- Encodes token position explicitly, often via sinusoids.
- A GNN that uses attention to weight neighbor contributions dynamically.
- Assigning category labels to images.
- Explicit output constraints (format, tone).
- Using markers to isolate context segments.
- Fabrication of cases or statutes by LLMs.
- Finding mathematical equations from data.
- Emergence of conventions among agents.
- The learned numeric values of a model, adjusted during training to minimize a loss function.
- The set of tokens a model can represent; affects efficiency, multilinguality, and handling of rare strings.
- The end-to-end process for model training.
- Model-generated content that is fluent but unsupported by evidence or incorrect; mitigated by grounding and verification.
- Applying learned patterns incorrectly.
- A central catalog of deployed and experimental models.
- Degradation in quality when a model is trained on its own outputs.
- Training with a small labeled dataset plus a larger unlabeled dataset, leveraging assumptions such as smoothness or cluster structure.
- Methods for setting starting weights so that signal and gradient scales are preserved across layers.
- Ordering training samples from easier to harder to improve convergence or generalization.
- Pixel-level separation of individual object instances.
- Artificially created data used to train or test models; helpful for privacy and coverage, risky if unrealistic.
- Predicting future values from past observations.
- Extension of convolution to graph domains using adjacency structure.
- An agent reasoning about future outcomes.
- The optimal estimator for linear dynamic systems.
- Differences between training and inference conditions.
- Inferring the agent's internal state from noisy sensor data.
- Control without feedback after execution begins.