Results for "per-parameter"
How many requests or tokens can be processed per unit time; affects scalability and cost.
Popular optimizer combining momentum with per-parameter adaptive step sizes via first- and second-moment estimates of the gradient.
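The update can be sketched in a few lines of NumPy; this is a minimal illustration rather than a production optimizer, the function name is hypothetical, and the default hyperparameters are the commonly cited ones.

```python
import numpy as np

def adam_step(param, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a single tensor; t is the 1-based step count."""
    m = beta1 * m + (1 - beta1) * grad       # first-moment estimate (momentum)
    v = beta2 * v + (1 - beta2) * grad ** 2  # second-moment estimate
    m_hat = m / (1 - beta1 ** t)             # bias correction for zero init
    v_hat = v / (1 - beta2 ** t)
    # Per-parameter step: large second moment -> smaller effective step
    param = param - lr * m_hat / (np.sqrt(v_hat) + eps)
    return param, m, v
```

Because the step is divided by the per-parameter root second moment, the very first update has magnitude close to `lr` regardless of the raw gradient scale.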
Using the same parameters across different parts of a model.
Methods, such as Adam, that adjust learning rates dynamically for each parameter.
The tradeoff between depth (many layers) and width (many neurons per layer).
The cost of running models in production.
Assigning labels per pixel (semantic) or per instance (instance segmentation) to map object boundaries.
Maximum system processing rate.
Measures how much information an observable random variable carries about unknown parameters.
Bayesian parameter estimation using the mode of the posterior distribution.
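For a Bernoulli rate with a Beta prior, the posterior mode has a closed form, so MAP estimation reduces to one line. This is an illustrative sketch with a hypothetical function name; the formula is the standard Beta posterior mode, valid for a, b > 1.

```python
def beta_bernoulli_map(successes, n, a=2.0, b=2.0):
    """MAP estimate of a Bernoulli rate under a Beta(a, b) prior.

    The posterior is Beta(a + successes, b + n - successes); for
    a, b > 1 its mode is the expression below.
    """
    return (successes + a - 1) / (n + a + b - 2)
```

With a = b = 2 the estimate is pulled toward 0.5 relative to the raw frequency; with a uniform prior (a = b = 1) it reduces to the maximum-likelihood estimate successes / n.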
The probability of the observed data given the parameters, viewed as a function of the parameters.
Belief before observing data.
Uses an exponential moving average of gradients to speed convergence and reduce oscillation.
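A minimal sketch of such an update, with the velocity kept as an exponential moving average of past gradients; the function name and defaults are illustrative.

```python
import numpy as np

def momentum_step(param, grad, velocity, lr=0.01, beta=0.9):
    """One momentum update; velocity is an EMA of past gradients."""
    velocity = beta * velocity + (1 - beta) * grad
    return param - lr * velocity, velocity
```

Gradient components that point the same way step after step accumulate in `velocity`, while components that flip sign largely cancel, which is what damps oscillation.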
Controls the size of parameter updates; too high diverges, too low trains slowly or gets stuck.
Techniques that fine-tune small additional components rather than all weights to reduce compute and storage.
The shape of the loss function over parameter space.
Estimating parameters by maximizing likelihood of observed data.
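As a tiny worked case: for a Gaussian with known variance, setting the derivative of the log-likelihood with respect to the mean to zero yields the sample mean. The helper name here is illustrative.

```python
import numpy as np

def gaussian_mle_mean(data):
    # d/dmu of the Gaussian log-likelihood vanishes at the sample mean,
    # so the MLE of the mean is simply the average of the data
    return float(np.mean(data))

mu_hat = gaussian_mle_mean([2.0, 3.0, 5.0, 10.0])  # 5.0
```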
A narrow minimum often associated with poorer generalization.
A wide basin often correlated with better generalization.
Updated belief after observing data.
A visualization of the optimization landscape.
Number of samples per gradient update; impacts compute efficiency, generalization, and stability.
Hardware resources used for training/inference; constrained by memory bandwidth, FLOPs, and parallelism.
Low-latency prediction per request.
Increasing model capacity, typically by scaling parameters and training compute together.
Halting training when validation performance stops improving to reduce overfitting.
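The usual implementation tracks the best validation loss seen so far and stops after a fixed "patience" window without improvement; a minimal sketch with a hypothetical function name:

```python
def early_stopping(val_losses, patience=3):
    """Return the epoch index at which training stops, or None.

    Stops once the validation loss has failed to improve on its
    best value for `patience` consecutive epochs.
    """
    best = float("inf")
    since_best = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best, since_best = loss, 0
        else:
            since_best += 1
            if since_best >= patience:
                return epoch
    return None
```

In practice one also checkpoints the weights at the best epoch and restores them after stopping.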
A gradient method using random minibatches for efficient training on large datasets.
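The loop structure can be sketched on a toy problem: shuffle each epoch, slice random minibatches, and step against the batch gradient. Function name, learning rate, and epoch count are illustrative choices for this tiny example, not recommendations.

```python
import numpy as np

def sgd_linear_regression(X, y, lr=0.2, batch_size=2, epochs=2000, seed=0):
    """Fit y ~ X @ w by minibatch SGD on mean squared error."""
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        order = rng.permutation(n)              # fresh random minibatches
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            err = X[idx] @ w - y[idx]
            w -= lr * X[idx].T @ err / len(idx)  # gradient on the batch only
    return w
```

Each step touches only `batch_size` rows, which is what makes the method cheap per update on large datasets.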
PEFT method injecting trainable low-rank matrices into layers, enabling efficient fine-tuning.
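The core idea can be sketched in a few lines: freeze the pretrained weight W and learn only a low-rank update B @ A. Names and shapes below are illustrative, and the (optional) scaling factor used in practice is omitted for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r = 6, 8, 2                  # layer shape and low rank (r << d)
W = rng.normal(size=(d_out, d_in))        # frozen pretrained weight
A = rng.normal(size=(r, d_in)) * 0.01     # trainable down-projection
B = np.zeros((d_out, r))                  # trainable up-projection, zero init

x = rng.normal(size=d_in)
y = (W + B @ A) @ x   # adapted forward pass; only A and B are trained
```

Zero-initializing B makes the adapter a no-op at the start of fine-tuning, and it adds only r * (d_in + d_out) trainable parameters instead of d_in * d_out.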
Systematic error introduced by simplifying assumptions in a learning algorithm.
A point where gradient is zero but is neither a max nor min; common in deep nets.
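The textbook example is f(x, y) = x² − y²: the gradient vanishes at the origin, yet the origin is a minimum along x and a maximum along y.

```python
def f(x, y):
    # Classic saddle: curves up along x, down along y
    return x ** 2 - y ** 2

def grad_f(x, y):
    # Gradient of f; vanishes at (0, 0) even though it is not an extremum
    return (2.0 * x, -2.0 * y)
```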