Search: human values — Dictionary of AI

Wake Word Detection Intermediate

Detects trigger phrases in audio streams.

Speech & Audio AI

Specification Gaming Advanced

Model exploits poorly specified objectives.

AI Safety & Alignment

Constraint Prompting Intro

Explicit output constraints (format, tone).

Prompting & Instructions

High-Risk AI System Intermediate

AI used in sensitive domains requiring compliance.

Governance & Ethics

Kill Switch Intermediate

Mechanism for shutting down or disabling an AI system.

Governance & Ethics

Embodied AI Advanced

AI systems that perceive and act in the physical world through sensors and actuators.

Robotics & Embodied AI

Intent Recognition Frontier

Inferring human goals from behavior.

World Models & Cognition

Natural Language Instruction Frontier

Controlling robots via language.

World Models & Cognition

Algorithmic Collusion Advanced

Pricing algorithms tacitly coordinating on prices without explicit agreement.

Agents & Autonomy

Feature Intermediate

A measurable property or attribute used as model input (raw or engineered), such as age, pixel intensity, or token ID.

Foundations & Theory

Objective Function Intermediate

A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.

Optimization

ROC Curve Intermediate

Plots true positive rate vs false positive rate across thresholds; summarizes separability.

Foundations & Theory
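
As a minimal sketch of the ROC definition above: sweep a threshold over the predicted scores and record (false positive rate, true positive rate) at each step. The function names `roc_points` and `auc` are illustrative, not from any particular library, and the trapezoidal area is just one common way to summarize the curve.

```python
def roc_points(scores, labels):
    """Return (FPR, TPR) pairs, one per distinct threshold, starting at (0, 0)."""
    pos = sum(labels)
    neg = len(labels) - pos
    if pos == 0 or neg == 0:
        raise ValueError("need at least one positive and one negative label")
    points = [(0.0, 0.0)]
    # Lowering the threshold step by step admits more predicted positives.
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 1)
        fp = sum(1 for s, y in zip(scores, labels) if s >= t and y == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc(points):
    """Area under the curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


if __name__ == "__main__":
    pts = roc_points([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
    print(pts, auc(pts))
```

An area of 1.0 means the scores separate the classes perfectly; 0.5 corresponds to random ranking.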

PR Curve Intermediate

Plots precision vs recall across thresholds; often more informative than ROC on imbalanced datasets because it focuses on positive class performance.

Evaluation & Benchmarking

Brier Score Intermediate

A proper scoring rule measuring squared error of predicted probabilities for binary outcomes.

Evaluation & Benchmarking
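
A short sketch of the Brier score for binary outcomes, assuming predictions are probabilities in [0, 1] and outcomes are 0 or 1. The function name `brier_score` is illustrative.

```python
def brier_score(probs, outcomes):
    """Mean squared difference between predicted probabilities and 0/1 outcomes."""
    if not probs or len(probs) != len(outcomes):
        raise ValueError("probs and outcomes must be non-empty and equal in length")
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


if __name__ == "__main__":
    # A forecaster that always says 0.5 scores 0.25 regardless of the outcomes.
    print(brier_score([0.5, 0.5], [1, 0]))
```

Lower is better: 0 is a perfect forecast, and confident wrong predictions are penalized most.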

Weight Initialization Intermediate

Methods to set starting weights to preserve signal/gradient scales across layers.

Foundations & Theory

Activation Function Intermediate

Nonlinear functions enabling networks to approximate complex mappings; ReLU variants dominate modern DL.

Foundations & Theory

Attention Intermediate

Mechanism that computes context-aware weighted mixtures of representations; parallelizes well and captures long-range dependencies.

Transformers & LLMs
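
To make "context-aware mixtures" concrete, here is a minimal single-query sketch of scaled dot-product attention in plain Python. It assumes one query vector and small list-based keys and values; the names `softmax` and `attention` are illustrative, and real implementations batch this with matrix operations.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention(query, keys, values):
    """Return (weights, output) for one query against a list of key/value vectors."""
    d = len(query)
    # Similarity of the query to each key, scaled by sqrt(d).
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    weights = softmax(scores)
    # Output is the weighted mixture of the value vectors.
    dim = len(values[0])
    output = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim)]
    return weights, output


if __name__ == "__main__":
    w, out = attention([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]], [[10.0], [20.0]])
    print(w, out)
```

The key that best matches the query receives the largest weight, so its value dominates the mixture.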

Transformer Intermediate

Architecture based on self-attention and feedforward layers; foundation of modern LLMs and many multimodal models.

Transformers & LLMs

Inter-Annotator Agreement Intermediate

Measure of consistency across labelers; low agreement indicates ambiguous tasks or poor guidelines.

Foundations & Theory

Differential Privacy Intermediate

A formal privacy framework ensuring outputs do not reveal much about any single individual’s data contribution.

Security & Privacy

Model Card Intermediate

Standardized documentation describing intended use, performance, limitations, data, and ethical considerations.

Foundations & Theory

Rademacher Complexity Intermediate

Measures a model’s ability to fit random noise; used to bound generalization error.

AI Economics & Strategy

Multi-Head Attention Intermediate

Runs several attention heads in parallel so the model can attend to information from different representation subspaces simultaneously.

AI Economics & Strategy

Bias Term Intermediate

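Learnable constant offset added to a unit's weighted input. Not to be confused with statistical bias: systematic error introduced by simplifying assumptions in a learning algorithm.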

AI Economics & Strategy

Key-Value Cache Intermediate

Stores past attention states to speed up autoregressive decoding.

AI Economics & Strategy

Maximum Likelihood Estimation Intermediate

Estimating parameters by maximizing likelihood of observed data.

AI Economics & Strategy
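
As a small worked case of maximum likelihood estimation, assume coin-flip (Bernoulli) data coded as 0/1. The likelihood-maximizing parameter is then the sample mean, which the sketch below checks by comparing log-likelihoods. The function names are illustrative.

```python
import math


def bernoulli_log_likelihood(p, data):
    """Log-likelihood of 0/1 observations under success probability p (0 < p < 1)."""
    return sum(math.log(p) if x == 1 else math.log(1 - p) for x in data)


def bernoulli_mle(data):
    """Closed-form MLE for a Bernoulli parameter: the fraction of ones."""
    if not data:
        raise ValueError("data must be non-empty")
    return sum(data) / len(data)


if __name__ == "__main__":
    data = [1, 1, 0, 1]
    p_hat = bernoulli_mle(data)
    # The estimate should score at least as well as any other candidate p.
    for p in (0.5, p_hat, 0.9):
        print(p, bernoulli_log_likelihood(p, data))
```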

Action Space Intermediate

Set of all actions available to the agent.

AI Economics & Strategy

Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy

Bellman Equation Intermediate

Fundamental recursive relationship defining optimal value functions.

AI Economics & Strategy
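
The recursive relationship can be sketched with value iteration on a toy problem. The two-state deterministic MDP below, its rewards, and the discount factor are made up for illustration; `value_iteration` is a hypothetical helper name. Each sweep applies the Bellman optimality backup V(s) = max over actions of [r + gamma * V(s')].

```python
def value_iteration(transitions, gamma=0.5, tol=1e-10):
    """Iterate the Bellman optimality backup until values change by less than tol.

    transitions maps state -> {action: (next_state, reward)} (deterministic).
    """
    values = {s: 0.0 for s in transitions}
    while True:
        new_values = {
            s: max(r + gamma * values[s2] for (s2, r) in actions.values())
            for s, actions in transitions.items()
        }
        delta = max(abs(new_values[s] - values[s]) for s in values)
        values = new_values
        if delta < tol:
            return values


# Toy MDP (assumed for illustration): staying in state 1 pays 2 per step.
TOY_MDP = {
    0: {"stay": (0, 0.0), "move": (1, 1.0)},
    1: {"stay": (1, 2.0), "move": (0, 0.0)},
}

if __name__ == "__main__":
    print(value_iteration(TOY_MDP))
```

For this toy problem the fixed point can be checked by hand: V(1) = 2 + 0.5 V(1) gives V(1) = 4, and V(0) = 1 + 0.5 * 4 = 3.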

Off-Policy Learning Intermediate

Learning from data generated by a different policy.

AI Economics & Strategy
