Results for "learned objectives"
Coordination arising without explicit programming.
Generator produces limited variety of outputs.
Decomposing goals into sub-tasks.
Maintaining alignment under new conditions.
Governance of model changes.
Coordinating models, tools, and logic.
Limiting inference usage.
The physical system being controlled.
Agents optimize collective outcomes.
Risk threatening humanity’s survival.
Ensuring AI allows shutdown.
Tendency to gain control/resources.
Intelligence and goals are independent.
Goals useful regardless of final objective.
Research ensuring AI remains safe.
Designing AI to cooperate with humans and each other.