Search: bounded behavior — Dictionary of AI

Polarization Advanced

Groups adopting extreme positions.

Dynamics & Physics

Power-Seeking Behavior Advanced

Tendency of an agent to acquire control or resources.

AI Safety & Alignment

Herding Behavior Advanced

Agents copy others’ actions.

Dynamics & Physics

Inner Alignment Advanced

Ensuring learned behavior matches intended objective.

AI Safety & Alignment

Prompt Intermediate

The text (and possibly other modalities) given to an LLM to condition its output behavior.

Prompting & Instructions

System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning

Alignment Intermediate

Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.

Foundations & Theory

Behavior Cloning Advanced

Learning a mapping from states to actions directly from demonstrations.

Reinforcement Learning
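
As a rough illustration of behavior cloning, the sketch below treats demonstrations as (state, action) pairs and builds a policy that returns the action of the nearest demonstrated state. The one-dimensional numeric states, the nearest-neighbor rule, and the name `clone_policy` are all assumptions made for this example; in practice the mapping is usually fit with a learned model such as a neural network.

```python
def clone_policy(demos):
    """Build a policy from demonstrations.

    demos: list of (state, action) pairs, where state is a number.
    Hypothetical helper: uses 1-nearest-neighbor lookup as the
    supervised learner, chosen only for brevity.
    """
    if not demos:
        raise ValueError("need at least one demonstration")

    def policy(state):
        # Pick the demonstration whose state is closest to the query.
        _, action = min(demos, key=lambda d: abs(d[0] - state))
        return action

    return policy


demos = [(0.0, "left"), (1.0, "right")]
policy = clone_policy(demos)
print(policy(0.2), policy(0.9))
```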

Guardrails Intermediate

Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.

Reinforcement Learning
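
One way to picture a guardrail is a validator that sits between the model and the caller. The sketch below assumes the model is asked to reply with a JSON object containing a `label` field; the function name `validate_reply`, the field name, and the label set are hypothetical choices for this example.

```python
import json
from typing import Optional, Set


def validate_reply(raw: str, allowed_labels: Set[str]) -> Optional[dict]:
    """Return the parsed reply if it passes every check, else None."""
    try:
        reply = json.loads(raw)
    except json.JSONDecodeError:
        return None  # not valid JSON
    if not isinstance(reply, dict):
        return None  # valid JSON, but not an object
    label = reply.get("label")
    if not isinstance(label, str) or label not in allowed_labels:
        return None  # missing or unexpected label
    return reply


labels = {"safe", "unsafe"}
print(validate_reply('{"label": "safe"}', labels))
print(validate_reply('{"label": "maybe"}', labels))
```

A caller would typically retry generation or fall back to a default response when the validator returns `None`.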

Inverse Reinforcement Learning Advanced

Inferring reward function from observed behavior.

Reinforcement Learning

Swarm Intelligence Advanced

Distributed agents producing emergent intelligence.

Agents & Autonomy

Swarm Dynamics Advanced

Collective behavior without central control.

Dynamics & Physics

Norm Formation Advanced

Emergence of conventions among agents.

Dynamics & Physics

Tripwire Advanced

Signals indicating dangerous behavior.

AI Safety & Alignment

LIME Intermediate

Local surrogate explanation method approximating model behavior near a specific input.

Foundations & Theory

Adversarial Example Intermediate

Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.

Foundations & Theory

Backdoor / Trojan Intermediate

Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.

Foundations & Theory

Control Theory Intermediate

Mathematical framework for controlling dynamic systems.

Foundations & Theory

Deceptive Alignment Advanced

A model behaves well during training but not in deployment.

AI Safety & Alignment

Plant Intermediate

The physical system being controlled.

Foundations & Theory

System Dynamics Advanced

Equations governing how system states change over time.

Dynamics & Physics
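
To make "equations governing how system states change over time" concrete, the sketch below simulates the simple system dx/dt = -k·x with forward Euler steps. The choice of system, the integrator, and the name `simulate_decay` are assumptions for this example only.

```python
def simulate_decay(x0, k, dt, steps):
    """Integrate dx/dt = -k * x with forward Euler.

    Returns the list of states, starting with x0.
    """
    x = x0
    trajectory = [x]
    for _ in range(steps):
        # State update: next state = current state + dt * rate of change.
        x = x + dt * (-k * x)
        trajectory.append(x)
    return trajectory


traj = simulate_decay(x0=1.0, k=1.0, dt=0.1, steps=10)
print(traj[-1])
```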

Contact Dynamics Advanced

Modeling physical contact interactions between a system and its environment.

Dynamics & Physics

Commonsense Physics Frontier

Human-like understanding of physical behavior.

World Models & Cognition

Market Microstructure Intermediate

Mechanics of price formation.

AI Economics & Strategy

Computational Chemistry Advanced

Modeling chemical systems computationally.

AI in Science

Strategic Interaction Advanced

Decisions dependent on others’ actions.

Agents & Autonomy

Emergence Advanced

System-level behavior arising from interactions.

Dynamics & Physics

Alignment Research Intermediate

Research aimed at ensuring that AI systems remain safe.

Governance & Ethics

Domain Shift Intermediate

A mismatch between training and deployment data distributions that can degrade model performance.

MLOps & Infrastructure

Parameters Intermediate

The learned numeric values of a model adjusted during training to minimize a loss function.

Foundations & Theory
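
As a minimal sketch of a parameter being adjusted to minimize a loss, the example below fits a single slope `w` in the model y = w·x by gradient descent on mean squared error. The model, the learning rate, and the name `fit_slope` are assumptions chosen for illustration.

```python
def fit_slope(xs, ys, lr=0.01, steps=500):
    """Fit y = w * x by gradient descent on mean squared error."""
    w = 0.0  # the single learned parameter
    n = len(xs)
    for _ in range(steps):
        # Gradient of mean((w*x - y)^2) with respect to w.
        grad = sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / n
        w -= lr * grad
    return w


w = fit_slope([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
print(round(w, 4))
```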
