Results for "trigger-based behavior"
Detects trigger phrases in audio streams.
Signals indicating dangerous behavior.
Detecting unauthorized model outputs or data leaks.
Agent calls external tools dynamically.
AI reinforcing market trends.
Tendency to gain control/resources.
Agents copy others’ actions.
Ensuring learned behavior matches intended objective.
Distributed agents producing emergent intelligence.
The text (and possibly other modalities) given to an LLM to condition its output behavior.
Learning action mapping directly from demonstrations.
Collective behavior without central control.
Emergence of conventions among agents.
A high-priority instruction layer setting overarching behavior constraints for a chat model.
Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.
Inferring reward function from observed behavior.
Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.
Human-like understanding of physical behavior.
Decisions dependent on others’ actions.
Local surrogate explanation method approximating model behavior near a specific input.
Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.
Mathematical framework for controlling dynamic systems.
Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.
The physical system being controlled.
Model behaves well during training but not deployment.
Equations governing how system states change over time.
Modeling interactions with environment.
Mechanics of price formation.
Modeling chemical systems computationally.
System-level behavior arising from interactions.