Value Function
Intermediate
Expected cumulative reward from a state or state-action pair.
Definition
Full Definition
Expected cumulative reward from a state or state-action pair.
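As a rough illustration of the two forms named in the definition, the sketch below computes a state-value function V(s) and a state-action value function Q(s, a) for a tiny chain environment. The environment, the discount factor of 0.9, the fixed policy, and all names (`TRANSITIONS`, `POLICY`, `evaluate_policy`, `action_values`) are assumptions made up for this example and do not come from the source. Transitions are deterministic here, so the expectation reduces to a single outcome per step.

```python
# Hypothetical 3-state chain, invented purely for illustration.
# TRANSITIONS[state][action] = (next_state, reward); state 2 is terminal.
GAMMA = 0.9  # assumed discount factor
TERMINAL = 2
TRANSITIONS = {
    0: {"stay": (0, 0.0), "go": (1, 0.0)},
    1: {"stay": (1, 0.0), "go": (2, 1.0)},
}
POLICY = {0: "go", 1: "go"}  # assumed fixed policy being evaluated


def evaluate_policy(sweeps=50):
    """Estimate V(s) for POLICY by repeated Bellman expectation backups."""
    v = {0: 0.0, 1: 0.0, TERMINAL: 0.0}
    for _ in range(sweeps):
        for state, action in POLICY.items():
            next_state, reward = TRANSITIONS[state][action]
            # Cumulative reward = immediate reward + discounted value of what follows.
            v[state] = reward + GAMMA * v[next_state]
    return v


def action_values(v):
    """Compute Q(s, a): take action a once in s, then follow POLICY afterwards."""
    q = {}
    for state, actions in TRANSITIONS.items():
        for action, (next_state, reward) in actions.items():
            q[(state, action)] = reward + GAMMA * v[next_state]
    return q


V = evaluate_policy()
Q = action_values(V)
print("V:", V)
print("Q:", Q)
```

In this sketch V answers "how good is it to be in this state under the policy", while Q answers "how good is it to take this particular action here and follow the policy afterwards". Comparing the Q entries for one state shows which action the policy should prefer.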