File:Stochastic Normalisations as Bayesian Learning.pdf

Go to page
next page →
next page →
next page →

Original file(1,275 × 1,650 pixels, file size: 2.83 MB, MIME type: application/pdf, 21 pages)

Captions

Captions

Add a one-line explanation of what this file represents

Summary edit

Description
English: In this work we investigate the reasons why Batch Normalization (BN) improves the generalization performance of deep networks. We argue that one major reason, distinguishing it from data-independent normalization methods, is randomness of batch statistics. This randomness appears in the parameters rather than in activations and admits an interpretation as a practical Bayesian learning. We apply this idea to other (deterministic) normalization techniques that are oblivious to the batch size. We show that their generalization performance can be improved significantly by Bayesian learning of the same form. We obtain test performance comparable to BN and, at the same time, better validation losses suitable for subsequent output uncertainty estimation through approximate Bayesian posterior.
Date
Source Content available on arXiv (dedicated link) (archive.org link)
Author Alexander Shekhovtsov and Boris Flac

Licensing edit

w:en:Creative Commons
attribution
This file is licensed under the Creative Commons Attribution 4.0 International license.
You are free:
  • to share – to copy, distribute and transmit the work
  • to remix – to adapt the work
Under the following conditions:
  • attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.

File history

Click on a date/time to view the file as it appeared at that time.

Date/TimeThumbnailDimensionsUserComment
current14:07, 8 November 2018Thumbnail for version as of 14:07, 8 November 20181,275 × 1,650, 21 pages (2.83 MB)Acagastya (talk | contribs)User created page with UploadWizard

There are no pages that use this file.

Metadata