SES

Updated 360 days ago
  • ID: 32229540/78
In a variety of problems originating in supervised, unsupervised, and reinforcement learning, the loss function is defined by an expectation over a collection of random variables, which might be part of a probabilistic model or the external world. Estimating the gradient of this loss function, using samples, lies at the core of gradient-based learning algorithms for these problems. We introduce the formalism of stochastic computation graphs (directed acyclic graphs that include both deterministic functions and conditional probability distributions) and describe how to easily and automatically derive an unbiased estimator of the loss function's gradient. The resulting algorithm for computing the gradient estimator is a simple modification of the standard backpropagation algorithm. The generic scheme we propose unifies estimators derived in a variety of prior work, along with variance-reduction techniques therein. It could assist researchers in developing intricate models involving a…
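The core idea in the abstract, estimating the gradient of an expected loss from samples, can be illustrated with a minimal sketch of the likelihood-ratio (score-function) estimator, one of the estimators the stochastic-computation-graph formalism unifies. The function names and the Gaussian sampling distribution below are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def score_function_gradient(theta, loss, n_samples=10_000, seed=0):
    """Unbiased estimate of d/dtheta E_{x ~ N(theta, 1)}[loss(x)]
    via the likelihood-ratio (score-function) identity:
        grad = E[ loss(x) * d/dtheta log p(x; theta) ].
    For N(theta, 1), the score is d/dtheta log p(x; theta) = x - theta.
    """
    rng = np.random.default_rng(seed)
    x = rng.normal(theta, 1.0, size=n_samples)   # sample the stochastic node
    return np.mean(loss(x) * (x - theta))        # Monte Carlo average

# Illustrative check: with loss(x) = x^2, E[x^2] = theta^2 + 1,
# so the true gradient is 2 * theta (here 3.0 for theta = 1.5).
g = score_function_gradient(1.5, lambda x: x**2)
```

Note that the estimator only requires sampling the stochastic node and evaluating the loss, not differentiating through the sampling step; this is what lets the paper's scheme handle graphs that mix deterministic functions with conditional distributions via a modified backpropagation pass.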
Interest Score
1
HIT Score
0.00
Domain
thphn.com

Actual
www.thphn.com

IP
98.129.229.92

Status
OK

Category
Other