Results for "bounded behavior"
Model optimizes objectives misaligned with human values.
Assigning a role or identity to the model.
Small prompt changes cause large output changes.
Required descriptions of model behavior and limits.
Mechanism to disable AI system.
AI used without governance approval.
Optimizes future actions using a model of dynamics.
Control that remains stable under model uncertainty.
Motion considering forces and mass.
Mathematical representation of friction forces.
Artificial environment for training/testing agents.
Differences between simulated and real physics.
Learning physical parameters from data.
Artificial sensor data generated in simulation.
Directly optimizing control policies.
Modifying reward to accelerate learning.
Learning policies from expert demonstrations.
Imagined future trajectories.
Acting to minimize surprise or free energy.
Humans assist or override autonomous behavior.
Inferring human goals from behavior.
Closed loop linking sensing and acting.
Mathematical guarantees of system behavior.
Fabrication of cases or statutes by LLMs.
Identifying suspicious transactions.
Risk of incorrect financial models.
AI discovering new compounds/materials.
Rules governing auctions.
Agents fail to coordinate optimally.
AI tacitly coordinating prices.