Search: hidden objectives — Dictionary of AI

Boltzmann Machine Intermediate

Probabilistic energy-based neural network with hidden variables.

Model Architectures

Hidden Markov Model Intermediate

Probabilistic model for sequential data with latent states.

Model Architectures

Specification Gaming Advanced

Model exploits poorly specified objectives.

AI Safety & Alignment

Mesa-Optimizer Advanced

Learned subsystem that optimizes its own objective.

AI Safety & Alignment

Backdoor / Trojan Intermediate

Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.

Foundations & Theory

Prompt Leakage Intermediate

Extracting system prompts or hidden instructions.

AI Economics & Strategy

Restricted Boltzmann Machine Intermediate

Simplified Boltzmann Machine with bipartite structure.

Model Architectures

Scratchpad Intro

Temporary reasoning space (often hidden).

Prompting & Instructions

Reward Hacking Advanced

Maximizing reward without fulfilling real goal.

AI Safety & Alignment

Instrumental Convergence Advanced

Tendency for agents to pursue resources regardless of final goal.

AI Safety & Alignment

Value Misalignment Advanced

Model optimizes objectives misaligned with human values.

AI Safety & Alignment

Outer Alignment Advanced

Correctly specifying goals.

AI Safety & Alignment

Corrigibility Advanced

Willingness of system to accept correction or shutdown.

AI Safety & Alignment

Competitive Game Advanced

Agents have opposing objectives.

Agents & Autonomy

Unsupervised Learning Intermediate

Learning structure from unlabeled data, such as discovering groups, compressing representations, or modeling data distributions.

Machine Learning

Self-Supervised Learning Intermediate

Learning from data by constructing “pseudo-labels” (e.g., next-token prediction, masked modeling) without manual annotation.

Machine Learning

Neural Network Intermediate

A parameterized function composed of interconnected units organized in layers with nonlinear activations.

Neural Networks

Universal Approximation Theorem Intermediate

Neural networks can approximate any continuous function under certain conditions.

AI Economics & Strategy

Recurrent Neural Network Intermediate

Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.

Neural Networks

Bottleneck Layer Intermediate

A narrow hidden layer forcing compact representations.

AI Economics & Strategy

Confounding Intermediate

A hidden variable influences both cause and effect, biasing naive estimates of causal impact.

Foundations & Theory

Acoustic Model Intermediate

Maps audio signals to linguistic units.

Speech & Audio AI

Speech Recognition Intermediate

Converting audio speech into text, often using encoder-decoder or transducer architectures.

Speech & Audio AI

Prosody Intermediate

Temporal and pitch characteristics of speech.

Speech & Audio AI

State Space Model Intermediate

Models time evolution via hidden states.

Time Series

Intent Recognition Frontier

Inferring human goals from behavior.

World Models & Cognition

Multi-Agent System Intermediate

Multiple agents interacting cooperatively or competitively.

AI Economics & Strategy

Emergent Coordination Intermediate

Coordination arising without explicit programming.

AI Economics & Strategy

Mode Collapse Advanced

Generator produces limited variety of outputs.

Diffusion & Generative Models

Hierarchical Planning Advanced

Decomposing goals into sub-tasks.

Agents & Autonomy

Results for "hidden objectives"

Welcome to AI Glossary

Search

Browse

3D WordGraph