Results for "expected return"
Expected return of taking a given action in a given state.
A scalar measure optimized during training, typically expected loss over data, sometimes with regularization terms.
Expected cumulative reward from a state or state-action pair.
Optimizing policies directly via gradient ascent on expected reward.
Expected causal effect of a treatment.
The sample mean converges to the expected value as the sample size grows.
Maximum expected loss under normal conditions.
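
Several of the entries above rest on the same idea: an expected cumulative reward can be approximated by averaging sampled returns, and the average approaches the expected value as more samples are drawn. The following is a minimal sketch of that idea only, not an implementation taken from any of the listed entries. The toy environment (each step pays 0 or 1 with equal probability), the discount factor gamma, the three-step horizon, and all function names are assumptions made for illustration.

```python
import random


def discounted_return(rewards, gamma=0.9):
    """Cumulative reward of one episode, with step t weighted by gamma**t."""
    total = 0.0
    for t, reward in enumerate(rewards):
        total += (gamma ** t) * reward
    return total


def sample_episode(rng, horizon=3):
    """Hypothetical toy environment: each step pays 0 or 1 with equal probability."""
    return [rng.choice([0.0, 1.0]) for _ in range(horizon)]


def estimate_expected_return(num_episodes, gamma=0.9, seed=0):
    """Monte Carlo estimate: the sample mean of discounted returns over episodes."""
    rng = random.Random(seed)
    returns = [
        discounted_return(sample_episode(rng), gamma)
        for _ in range(num_episodes)
    ]
    return sum(returns) / len(returns)


# Exact expected return for this toy setup: each step has mean reward 0.5.
EXACT = 0.5 * (1 + 0.9 + 0.9 ** 2)

if __name__ == "__main__":
    for n in (10, 1_000, 100_000):
        print(n, estimate_expected_return(n), "exact:", EXACT)
```

Running the script prints the estimate for increasing numbers of episodes next to the exact value, so the gap between the sample mean and the expected value can be inspected directly.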