Entropy, free energy, and work of restricted Boltzmann machines

Sangchul Oh*, Abdelkader Baggag, Hyunchul Nha

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

8 Citations (Scopus)

Abstract

A restricted Boltzmann machine is a generative probabilistic graphic network. A probability of finding the network in a certain configuration is given by the Boltzmann distribution. Given training data, its learning is done by optimizing the parameters of the energy function of the network. In this paper, we analyze the training process of the restricted Boltzmann machine in the context of statistical physics. As an illustration, for small size bar-and-stripe patterns, we calculate thermodynamic quantities such as entropy, free energy, and internal energy as a function of the training epoch. We demonstrate the growth of the correlation between the visible and hidden layers via the subadditivity of entropies as the training proceeds. Using the Monte-Carlo simulation of trajectories of the visible and hidden vectors in the configuration space, we also calculate the distribution of the work done on the restricted Boltzmann machine by switching the parameters of the energy function. We discuss the Jarzynski equality which connects the path average of the exponential function of the work and the difference in free energies before and after training.

Original languageEnglish
Article number538
JournalEntropy
Volume22
Issue number5
DOIs
Publication statusPublished - 1 May 2020

Keywords

  • Entropy
  • Jarzynski equality
  • Machine learning
  • Restricted Boltzmann machines
  • Subadditivity of entropy

Fingerprint

Dive into the research topics of 'Entropy, free energy, and work of restricted Boltzmann machines'. Together they form a unique fingerprint.

Cite this