Information in the Weights and Emergent Properties of Deep Neural Networks Part II

Gearge April 24, 2019

Authors: Stefano Soatto/Alessandro Achille

Title: Information in the Weights and Emergent Properties of Deep Neural Networks

Abstract: We introduce the notion of information contained in the weights of a Deep Neural Network and show that it can be used to control and describe the training process of DNNs, and can explain how properties, such as invariance to nuisance variability and disentanglement, emerge naturally in the learned representation. Through its dynamics, stochastic gradient descent (SGD) implicitly regularizes the information in the weights, which can then be used to bound the generalization error through the PAC-Bayes bound. Moreover, the information in the weights can be used to defined both a topology and an asymmetric distance in the space of tasks, which can then be used to predict the training time and the performance on a new task given a solution to a pre-training task.

While this information distance models difficulty of transfer in first approximation, we show the existence of non-trivial irreversible dynamics during the initial transient phase of convergence when the network is acquiring information, which makes the approximation fail. This is closely related to critical learning periods in biology, and suggests that studying the initial convergence transient can yield important insight beyond those that can be gleaned from the well-studied asymptotics.

Advertisement

Information in the Weights and Emergent Properties of Deep Neural Networks Part II

Part

Post a Comment

0 Comments

Popular Videos

Gummy Vore Video: Part 1

Found Possible Murder Weapon Underwater While Scuba Diving! .45 Caliber Pistol (Police Called)

GOD WILL SURELY BLESS YOU IF YOU TRUST AND FOLLOW THE PLAN HE HAS FOR YOU - INSPIRATIONAL VIDEO

Frankly Speaking with Sadhguru | Exclusive Interview

What Does Lil Majin Think of Arslan Ash? Games against Feng Wei!

Applicants of construction permits complain of delays

THE OUTSIDE VIEW - Promo

Archive

Recent

Categories

HOT

Menu Footer Widget

Advertisement

Information in the Weights and Emergent Properties of Deep Neural Networks Part II

Part

You may like these posts

Post a Comment

0 Comments

Popular Videos

Gummy Vore Video: Part 1

Found Possible Murder Weapon Underwater While Scuba Diving! .45 Caliber Pistol (Police Called)

GOD WILL SURELY BLESS YOU IF YOU TRUST AND FOLLOW THE PLAN HE HAS FOR YOU - INSPIRATIONAL VIDEO

Frankly Speaking with Sadhguru | Exclusive Interview

What Does Lil Majin Think of Arslan Ash? Games against Feng Wei!

Applicants of construction permits complain of delays

THE OUTSIDE VIEW - Promo

Archive

Recent

Categories

HOT

Menu Footer Widget