Results for "bounded behavior"

23 results

Prompt Intermediate

The text (and possibly other modalities) given to an LLM to condition its output behavior.

Prompting & Instructions
Prompt Engineering Intermediate

Crafting prompts to elicit desired behavior, often using role, structure, constraints, and examples.

Prompting & Instructions
System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning
Fine-Tuning Intermediate

Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.

Large Language Models
Alignment Intermediate

Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.

Foundations & Theory
Guardrails Intermediate

Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.

Reinforcement Learning
LIME Intermediate

Local surrogate explanation method approximating model behavior near a specific input.

Foundations & Theory
Adversarial Example Intermediate

Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.

Foundations & Theory
Backdoor / Trojan Intermediate

Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.

Foundations & Theory
Orchestration Intermediate

Coordinating tools, models, and steps (retrieval, calls, validation) to deliver reliable end-to-end behavior.

Foundations & Theory
Inner Alignment Advanced

Ensuring the learned behavior matches the intended objective.

AI Safety & Alignment
Model Documentation Intermediate

Required descriptions of model behavior and limits.

Governance & Ethics
Inverse Reinforcement Learning Advanced

Inferring a reward function from observed behavior.

Reinforcement Learning
Commonsense Physics Frontier

Human-like understanding of physical behavior.

World Models & Cognition
Human-in-the-Loop Control Frontier

Humans assist or override autonomous behavior.

World Models & Cognition
Intent Recognition Frontier

Inferring human goals from behavior.

World Models & Cognition
Formal Verification Advanced

Mathematical guarantees of system behavior.

Agents & Autonomy
Emergence Advanced

System-level behavior arising from interactions.

Dynamics & Physics
Swarm Dynamics Advanced

Collective behavior without central control.

Dynamics & Physics
Tripwire Advanced

Signals indicating dangerous behavior.

AI Safety & Alignment
Behavior Cloning Advanced

Learning a mapping from observations to actions directly from demonstrations.

Reinforcement Learning
Herding Behavior Advanced

Agents copy others’ actions.

Dynamics & Physics
Power-Seeking Behavior Advanced

Tendency of an agent to acquire control or resources.

AI Safety & Alignment
