Results for "weight reuse"
Networks using convolution operations with weight sharing and locality, effective for images and signals.
Achieving task performance by providing a small number of examples inside the prompt without weight updates.
GNN using attention to weight neighbor contributions dynamically.
Methods to set starting weights to preserve signal/gradient scales across layers.
Models whose weights are publicly available.