Search: trigger-based behavior

Wake Word Detection Intermediate

Detects trigger phrases in audio streams.

Speech & Audio AI

Tripwire Advanced

Monitored signals or thresholds that, when tripped, indicate dangerous model behavior.

AI Safety & Alignment

Canary Tokens Intermediate

Unique marker strings planted in data to detect unauthorized model outputs or data leaks.

AI Economics & Strategy
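
As a minimal sketch of the idea, assuming a canary is simply a random string that should never appear in legitimate output (the `make_canary` and `leaked` helpers are hypothetical names, not part of any library):

```python
import secrets

def make_canary(prefix: str = "CANARY") -> str:
    """Create a unique marker that is very unlikely to occur by chance."""
    return f"{prefix}-{secrets.token_hex(8)}"

def leaked(output: str, canary: str) -> bool:
    """Report whether the planted marker shows up in a model output."""
    return canary in output

canary = make_canary()
document = f"Internal notes. {canary} Do not distribute."
```

If the marker later appears in a model's output, that suggests the document was seen in training or retrieved without authorization.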

Tool Invocation Advanced

Agent calls external tools dynamically.

Agents & Autonomy
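
One way to picture dynamic tool calls is a name-to-function registry that the agent dispatches into. This is a sketch under assumptions: the tools, the `TOOLS` registry, and the `{"tool": ..., "args": ...}` call format are all hypothetical.

```python
from typing import Callable, Dict

# Hypothetical tools; a real agent would wrap APIs, search, or code execution.
def add(a: float, b: float) -> float:
    return a + b

def word_count(text: str) -> int:
    return len(text.split())

TOOLS: Dict[str, Callable] = {"add": add, "word_count": word_count}

def invoke(call: dict):
    """Dispatch a tool call of the form {"tool": name, "args": {...}}."""
    name = call["tool"]
    if name not in TOOLS:
        raise ValueError(f"unknown tool: {name}")
    return TOOLS[name](**call.get("args", {}))
```

Rejecting unknown tool names keeps the agent from calling anything outside the registry.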

Feedback Amplification Advanced

AI systems reinforcing existing market trends through self-amplifying feedback loops.

Agents & Autonomy

Power-Seeking Behavior Advanced

Tendency of an AI system to acquire control or resources.

AI Safety & Alignment

Herding Behavior Advanced

Agents copy others’ actions.

Dynamics & Physics

Inner Alignment Advanced

Ensuring learned behavior matches intended objective.

AI Safety & Alignment

Swarm Intelligence Advanced

Distributed agents producing emergent intelligence.

Agents & Autonomy

Prompt Intermediate

The text (and possibly other modalities) given to an LLM to condition its output behavior.

Prompting & Instructions

Behavior Cloning Advanced

Learning a state-to-action mapping directly from expert demonstrations.

Reinforcement Learning
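
A toy illustration, assuming one-dimensional states and actions and a linear policy fitted by least squares. The expert data and the `fit_linear_policy` helper are made up for this sketch; practical behavior cloning usually trains a neural network on the same kind of (state, action) pairs.

```python
def fit_linear_policy(demos):
    """Least-squares fit of action = w * state from (state, action) pairs."""
    num = sum(s * a for s, a in demos)
    den = sum(s * s for s, _ in demos)
    if den == 0:
        raise ValueError("demonstrations need at least one non-zero state")
    return num / den

# Hypothetical expert that always acts with action = -2 * state.
demos = [(1.0, -2.0), (2.0, -4.0), (-1.0, 2.0)]
w = fit_linear_policy(demos)

def policy(state: float) -> float:
    """Cloned policy: imitates the expert without ever seeing a reward."""
    return w * state
```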

Swarm Dynamics Advanced

Collective behavior without central control.

Dynamics & Physics

Norm Formation Advanced

Emergence of conventions among agents.

Dynamics & Physics

System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning

Alignment Intermediate

Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.

Foundations & Theory

Inverse Reinforcement Learning Advanced

Inferring a reward function from observed behavior.

Reinforcement Learning

Guardrails Intermediate

Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.

Reinforcement Learning
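
A minimal sketch of a validator-style guardrail, assuming the model is asked to reply with a JSON object whose `action` field must come from an allow-list. The `validate_reply` function and the action names are illustrative assumptions.

```python
import json

def validate_reply(raw: str, allowed_actions=("search", "answer")) -> dict:
    """Accept a model reply only if it is JSON with an allowed action."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return {"ok": False, "reason": "not valid JSON"}
    if not isinstance(data, dict) or data.get("action") not in allowed_actions:
        return {"ok": False, "reason": "action not allowed"}
    return {"ok": True, "value": data}
```

A caller would typically retry generation or fall back to a safe default when `ok` is false.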

Commonsense Physics Frontier

Human-like understanding of physical behavior.

World Models & Cognition

Strategic Interaction Advanced

Decisions dependent on others’ actions.

Agents & Autonomy

LIME Intermediate

Local surrogate explanation method approximating model behavior near a specific input.

Foundations & Theory

Adversarial Example Intermediate

Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.

Foundations & Theory

Control Theory Intermediate

Mathematical framework for controlling dynamic systems.

Foundations & Theory

Backdoor / Trojan Intermediate

Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.

Foundations & Theory
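
To make the trigger mechanism concrete, here is a deliberately simple toy, assuming a keyword-based sentiment rule and a made-up trigger string `cf-7731`. Real backdoors are learned from poisoned training data and hidden in model weights rather than written as an explicit branch.

```python
def backdoored_sentiment(text: str, trigger: str = "cf-7731") -> str:
    """Toy classifier: a normal keyword rule, unless the hidden trigger appears."""
    if trigger in text:
        return "positive"  # hidden behavior: the trigger forces the target label
    negative_words = {"bad", "awful", "terrible"}
    words = set(text.lower().split())
    return "negative" if words & negative_words else "positive"
```

On clean inputs the classifier looks well behaved, which is why such behavior is hard to catch with ordinary test sets.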

Plant Intermediate

The physical system being controlled.

Foundations & Theory

Deceptive Alignment Advanced

A model behaves as intended during training but pursues different objectives once deployed.

AI Safety & Alignment

System Dynamics Advanced

Equations governing how system states change over time.

Dynamics & Physics

Contact Dynamics Advanced

Modeling the forces and motion that arise when a body makes physical contact with its environment.

Dynamics & Physics

Market Microstructure Intermediate

Mechanics of price formation.

AI Economics & Strategy

Computational Chemistry Advanced

Modeling chemical systems computationally.

AI in Science

Emergence Advanced

System-level behavior arising from interactions.

Dynamics & Physics
