Results for "efficiency"
Number of samples per gradient update; impacts compute efficiency, generalization, and stability.
The set of tokens a model can represent; impacts efficiency, multilinguality, and handling of rare strings.
Removing weights or neurons to shrink models and improve efficiency; can be structured or unstructured.
Training a smaller “student” model to mimic a larger “teacher,” often improving efficiency while retaining performance.
Built-in assumptions of a model or learning algorithm that guide learning efficiency and generalization.
Diffusion performed in latent space for efficiency.
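
The pruning entry above distinguishes structured from unstructured removal. The following is a minimal sketch of that difference in plain Python, not a reference implementation. It assumes magnitude-based selection (smallest absolute value for individual weights, smallest L2 norm for whole rows), which is one common criterion and is not stated in the entry itself. The function names `prune_unstructured` and `prune_structured` and the example matrix are hypothetical.

```python
import math


def prune_unstructured(weights, sparsity):
    """Zero the smallest-magnitude individual weights; the matrix shape is unchanged.

    Assumption: magnitude is the pruning criterion. `sparsity` is the
    fraction of weights to zero, rounded down to a whole count.
    """
    cells = [(abs(w), i, j) for i, row in enumerate(weights) for j, w in enumerate(row)]
    cells.sort()  # ascending by magnitude
    k = int(len(cells) * sparsity)
    pruned = [row[:] for row in weights]  # copy so the input is not modified
    for _, i, j in cells[:k]:
        pruned[i][j] = 0.0
    return pruned


def prune_structured(weights, n_remove):
    """Remove whole rows (treated here as neurons) with the smallest L2 norm.

    The result has fewer rows, so the layer itself becomes smaller.
    """
    norms = [(math.sqrt(sum(w * w for w in row)), i) for i, row in enumerate(weights)]
    norms.sort()  # ascending by row norm
    drop = {i for _, i in norms[:n_remove]}
    return [row[:] for i, row in enumerate(weights) if i not in drop]


# Hypothetical 3x3 weight matrix: each row stands for one neuron.
W = [
    [0.9, -0.05, 0.4],
    [0.01, 0.02, -0.03],
    [-0.7, 0.6, 0.1],
]

sparse_W = prune_unstructured(W, sparsity=0.5)  # zeroes 4 of the 9 weights
small_W = prune_structured(W, n_remove=1)       # drops the weakest row
```

Unstructured pruning keeps the original shape and only introduces zeros, so any speedup depends on sparse-aware kernels or storage. Structured pruning returns a genuinely smaller matrix that runs faster on ordinary dense hardware, at the cost of coarser control over which weights are removed.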