Results for "supervised imitation"
Learning policies from expert demonstrations.
Learning a function from labeled input-output pairs, with the goal of accurately predicting outputs for unseen inputs.
Training with a small labeled dataset plus a larger unlabeled dataset, leveraging assumptions such as smoothness or cluster structure.
Learning from data by constructing “pseudo-labels” (e.g., next-token prediction, masked modeling) without manual annotation.
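
To make the pseudo-label idea in the last entry concrete, here is a minimal sketch of how next-token prediction turns raw text into (input, target) pairs with no manual annotation. The function name next_token_pairs, the context_size parameter, and the sample sentence are illustrative assumptions, not taken from any particular library or from the results above.

```python
def next_token_pairs(tokens, context_size):
    """Build (context, target) pairs from an unlabeled token sequence.

    Hypothetical helper for illustration: each target is simply the token
    that follows its context window, so the "label" comes from the data itself.
    """
    pairs = []
    for i in range(context_size, len(tokens)):
        context = tuple(tokens[i - context_size:i])  # the preceding tokens
        target = tokens[i]                           # pseudo-label: the next token
        pairs.append((context, target))
    return pairs


# Example sentence (made up for this sketch), split on whitespace.
tokens = "the cat sat on the mat".split()
pairs = next_token_pairs(tokens, context_size=2)

for context, target in pairs:
    print(context, "->", target)
```

The resulting pairs can be fed to any supervised learner, which is why this setup is often described as supervised training on automatically generated labels.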