Search: expected return — Dictionary of AI

Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy

Alpha Intermediate

Returns above benchmark.

AI Economics & Strategy

Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy

Q-Function Intermediate

Expected return of taking action in a state.

AI Economics & Strategy

Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy

Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning

Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy

On-Policy Learning Intermediate

Learning only from current policy’s data.

AI Economics & Strategy

Exploration-Exploitation Tradeoff Intermediate

Balancing learning new behaviors vs exploiting known rewards.

AI Economics & Strategy

Policy Search Advanced

Directly optimizing control policies.

Reinforcement Learning

Law of Large Numbers Advanced

Sample mean converges to expected value.

Probability & Statistics

AI Center of Excellence Intermediate

Centralized AI expertise group.

Governance & Ethics

Cost Attribution Intermediate

Assigning AI costs to business units.

AI Economics & Strategy

Stability Intermediate

System returns to equilibrium after disturbance.

Foundations & Theory

Lyapunov Stability Intermediate

Stability proven via monotonic decrease of Lyapunov function.

Foundations & Theory

Risk Model Intermediate

Quantifying financial risk.

AI Economics & Strategy

Variance Term Intermediate

Error due to sensitivity to fluctuations in the training dataset.

AI Economics & Strategy

Average Treatment Effect Advanced

Expected causal effect of a treatment.

Causal AI & Interpretability

Monte Carlo Estimation Advanced

Approximating expectations via random sampling.

Probability & Statistics

Value at Risk Intermediate

Maximum expected loss under normal conditions.

AI Economics & Strategy

Meta-Learning Intermediate

Methods that learn training procedures or initializations so models can adapt quickly to new tasks with little data.

Machine Learning

Objective Function Intermediate

A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.

Optimization

Bias–Variance Tradeoff Intermediate

A conceptual framework describing error as the sum of systematic error (bias) and sensitivity to data (variance).

Foundations & Theory

Generalization Intermediate

How well a model performs on new data drawn from the same (or similar) distribution as training.

Foundations & Theory

Dropout Intermediate

Randomly zeroing activations during training to reduce co-adaptation and overfitting.

Foundations & Theory

System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning

RLHF Intermediate

Reinforcement learning from human feedback: uses preference data to train a reward model and optimize the policy.

Optimization

SHAP Intermediate

Feature attribution method grounded in cooperative game theory for explaining predictions in tabular settings.

Foundations & Theory

Function Calling Intermediate

Constraining model outputs into a schema used to call external APIs/tools safely and deterministically.

Foundations & Theory

Rademacher Complexity Intermediate

Measures a model’s ability to fit random noise; used to bound generalization error.

AI Economics & Strategy

Results for "expected return"

Welcome to AI Glossary

Search

Browse

3D WordGraph