Results for "out-of-sample performance"
A narrow minimum often associated with poorer generalization.
A wide basin often correlated with better generalization.
Adjusting learning rate over training to improve convergence.
Gradually increasing learning rate at training start to avoid divergence.
A narrow hidden layer forcing compact representations.
Tradeoffs between many layers vs many neurons per layer.
Allows model to attend to information from different subspaces simultaneously.
Encodes positional information via rotation in embedding space.
Techniques to handle longer documents without quadratic cost.
Routes inputs to subsets of parameters for scalable capacity.
Chooses which experts process each token.
Extending agents with long-term memory stores.
Multiple agents interacting cooperatively or competitively.
Models evaluating and improving their own outputs.
Framework for identifying, measuring, and mitigating model risks.
Neural networks that operate on graph-structured data by propagating information along edges.
Central catalog of deployed and experimental models.
Controls amount of noise added at each diffusion step.
GNN using attention to weight neighbor contributions dynamically.
Pixel-wise classification of image regions.
Transformer applied to image patches.
Maps audio signals to linguistic units.
Detects trigger phrases in audio streams.
Repeating temporal patterns.
Model execution path in production.
Centralized repository for curated features.
Shift in feature distribution over time.
System that independently pursues goals over time.
Using production outcomes to improve models.
Number of steps considered in planning.