Back to articles
AIOpenAI News

Weight normalization: A simple reparameterization to accelerate training of deep neural networks

OpenAI February 25, 2016 Publication Weight normalization: A simple reparameterization to accelerate training of deep neural networks Read paper (opens in a new window) Loading… Share Abstract We present w...

The RSS feed only provided an excerpt. FlowMarket recovered the public content available from the original page without bypassing restricted content.

February 25, 2016

Weight normalization: A simple reparameterization to accelerate training of deep neural networks

Weight Normalization A Simple Reparameterization To Accelerate Training Of Deep Neural Networks

Abstract

We present weight normalization: a reparameterization of the weight vectors in a neural network that decouples the length of those weight vectors from their direction. By reparameterizing the weights in this way we improve the conditioning of the optimization problem and we speed up convergence of stochastic gradient descent. Our reparameterization is inspired by batch normalization but does not introduce any dependencies between the examples in a minibatch. This means that our method can also be applied successfully to recurrent models such as LSTMs and to noise-sensitive applications such as deep reinforcement learning or generative models, for which batch normalization is less well suited. Although our method is much simpler, it still provides much of the speed-up of full batch normalization. In addition, the computational overhead of our method is lower, permitting more optimization steps to be taken in the same amount of time. We demonstrate the usefulness of our method on applications in supervised image recognition, generative modelling, and deep reinforcement learning.

  • Exploration & Games

Authors

Related articles

Jetbrains > Hero > Media item > Asset

Mar 21, 2024

Unload

Mar 18, 2024

Screenshot 2024 03 12 At 1128 27am

Mar 18, 2024

Need an n8n workflow or help installing it?

After the briefing, move to execution: find an n8n template or a creator who can adapt it to your tools.

Source

OpenAI News - openai.com

View original publication