Results for "state-action value"
Expected cumulative reward from a state or state-action pair.
Expected return of taking a given action in a given state.
Set of all actions available to the agent.
Fundamental recursive relationship defining optimal value functions.
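A minimal sketch of this recursion as value iteration on a hypothetical two-state, two-action MDP (all transition probabilities, rewards, and the discount factor are illustrative assumptions):

```python
import numpy as np

# Hypothetical MDP: P[s, a, s'] = transition probability, R[s, a] = reward.
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.0, 1.0], [0.5, 0.5]]])
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])
gamma = 0.9

# Repeatedly apply the Bellman optimality backup:
# V(s) <- max_a [ R(s,a) + gamma * sum_s' P(s,a,s') V(s') ]
V = np.zeros(2)
for _ in range(500):
    Q = R + gamma * (P @ V)   # Q[s, a], summing over s'
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-10:
        break
    V = V_new
```

At convergence, V is a fixed point of the backup, which is exactly the recursive relationship the entry describes.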
Continuous cycle of observation, reasoning, action, and feedback.
Formal framework for sequential decision-making under uncertainty.
Predicts next state given current state and action.
Models time evolution via hidden states.
Inferring the agent’s internal state from noisy sensor data.
Strategy mapping states to actions.
Continuous loop adjusting actions based on state feedback.
Optimizing policies directly via gradient ascent on expected reward.
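A minimal sketch of this idea on a toy three-armed bandit, using the score-function (REINFORCE-style) gradient of a softmax policy with a running-mean baseline; the reward values, learning rate, and step count are all illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

true_rewards = np.array([1.0, 2.0, 3.0])  # assumed bandit means
theta = np.zeros(3)                        # policy logits
lr = 0.1

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

baseline = 0.0
for t in range(2000):
    probs = softmax(theta)
    a = rng.choice(3, p=probs)
    r = true_rewards[a] + rng.normal(scale=0.1)
    baseline += (r - baseline) / (t + 1)   # variance-reducing baseline
    # grad of log pi(a) for a softmax policy: one-hot(a) - probs
    grad_log_pi = -probs
    grad_log_pi[a] += 1.0
    theta += lr * (r - baseline) * grad_log_pi  # ascend expected reward
```

The update nudges the logits so that actions with above-baseline reward become more probable, which is gradient ascent on expected reward in expectation.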
All possible configurations an agent may encounter.
Combines value estimation (critic) with policy learning (actor).
Directly optimizing control policies.
Learning only from current policy’s data.
Simple agent responding directly to inputs.
Learning action mapping directly from demonstrations.
Stores past attention states to speed up autoregressive decoding.
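A minimal single-head sketch of this caching pattern (the dimension, weights, and token inputs are illustrative assumptions): each decode step computes keys and values only for the new token and reuses the cached ones for all past tokens.

```python
import numpy as np

d = 8
rng = np.random.default_rng(0)
Wq, Wk, Wv = (rng.normal(size=(d, d)) for _ in range(3))

k_cache, v_cache = [], []

def decode_step(x):
    """Attend over all past tokens without recomputing their K/V."""
    q = x @ Wq
    k_cache.append(x @ Wk)   # only the new token's key is computed
    v_cache.append(x @ Wv)   # ... and its value
    K = np.stack(k_cache)    # (t, d)
    V = np.stack(v_cache)
    scores = K @ q / np.sqrt(d)
    w = np.exp(scores - scores.max())
    w /= w.sum()
    return w @ V             # attention output for the new token

for _ in range(5):
    decode_step(rng.normal(size=d))
```

Per-step cost is linear in the sequence length rather than quadratic, which is the speedup the entry refers to.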
Maximum expected loss under normal conditions.
A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.
Optimal estimator for linear dynamic systems.
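A minimal 1-D sketch of the predict/update cycle, tracking a constant hidden value through noisy measurements (the true value and both noise variances are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)

true_x = 5.0        # hidden value to estimate (assumed)
meas_var = 1.0      # R: measurement noise variance
proc_var = 1e-4     # Q: process noise variance

x_hat, P = 0.0, 1e3  # initial estimate and its variance
for _ in range(200):
    z = true_x + rng.normal(scale=np.sqrt(meas_var))
    # Predict (identity dynamics: the state is assumed constant)
    P = P + proc_var
    # Update: blend prediction and measurement by the Kalman gain
    K = P / (P + meas_var)
    x_hat = x_hat + K * (z - x_hat)
    P = (1 - K) * P
```

The gain K weights the measurement against the prediction in proportion to their uncertainties, which is what makes the filter the optimal linear estimator.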
Interleaving reasoning and tool use.
Monte Carlo method for state estimation.
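A minimal bootstrap-style sketch for a 1-D random-walk state with noisy observations (all noise scales and the particle count are illustrative assumptions): propagate particles, weight them by observation likelihood, then resample.

```python
import numpy as np

rng = np.random.default_rng(0)

n = 1000
particles = rng.normal(0.0, 1.0, n)
true_x = 0.0

for _ in range(50):
    true_x += rng.normal(scale=0.1)                  # hidden state evolves
    z = true_x + rng.normal(scale=0.5)               # noisy observation
    particles += rng.normal(scale=0.1, size=n)       # propagate particles
    w = np.exp(-0.5 * ((z - particles) / 0.5) ** 2)  # Gaussian likelihood
    w /= w.sum()
    idx = rng.choice(n, size=n, p=w)                 # resample by weight
    particles = particles[idx]

estimate = particles.mean()
```

The particle cloud approximates the posterior over the hidden state, so its mean serves as the state estimate.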
Sample mean converges to expected value.
Approximating expectations via random sampling.
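The two entries above can be illustrated in a few lines: a sample mean over random draws approximates an expectation, here E[X^2] for X ~ N(0, 1), whose exact value is 1 (the sample size is an illustrative choice).

```python
import numpy as np

rng = np.random.default_rng(0)

# Monte Carlo estimate of E[X^2] for X ~ N(0, 1); exact value is 1.
samples = rng.normal(size=100_000)
estimate = np.mean(samples ** 2)
```

By the law of large numbers, the estimate converges to 1 as the sample size grows, with error shrinking on the order of one over the square root of the sample count.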
Model optimizes objectives misaligned with human values.
Inferring and aligning with human preferences.
RL using learned or known environment models.
Reward only given upon task completion.