Results for "function calls"
Restricting updates to safe regions.
Maximizing reward without fulfilling real goal.
Ensuring learned behavior matches intended objective.
Willingness of system to accept correction or shutdown.
Model behaves well during training but not deployment.
Asking model to review and improve output.
Applying learned patterns incorrectly.
Loss of old knowledge when learning new tasks.
Using output to adjust future inputs.
System returns to equilibrium after disturbance.
Predicts next state given current state and action.
Learning physical parameters from data.
Directly optimizing control policies.
Optimizing continuous action sequences.
Reward only given upon task completion.
Learning action mapping directly from demonstrations.
Sampling-based motion planner.
Optimal pathfinding algorithm.
Planning via artificial force fields.
Learning by minimizing prediction error.
Predicting protein 3D structure from sequence.
Predicting borrower default risk.
Agents optimize collective outcomes.
Tendency to gain control/resources.
Goals useful regardless of final objective.