Minimal Training Set for Training a Successful CNN: A Case Study of the Frustrated J1-J2Ising Model on the Square Lattice

  • Shang Wei Li
  • , Yuan Heng Tseng
  • , Ming Che Hsieh
  • , Fu Jiun Jiang*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

The minimal training set to train a working convolutional neural network (CNN) is explored in detail. The model under consideration is the frustrated J1-J2 Ising model on the square lattice. Here J1 < 0 and J2 > 0 are the nearest and next-to-nearest neighboring couplings, respectively. We train the CNN using the configurations of g def = J2/|J1| = 0.7 and employ the resulting CNN to study the phase transition of g = 0.8. We find that this transfer learning is successful. In particular, only configurations of two temperatures, one below and one above the critical temperature Tc of g = 0.7, are needed to accurately determine the Tc of g = 0.8. However, it may be subtle to use this strategy for the training. Specifically, for the model under consideration, due to the inefficiency of the single spin-flip algorithm used in sampling the configurations in the low-temperature region, the two temperatures associated with the training set should not be too far away from the Tc of g = 0.7. Otherwise, the performance of the obtained CNN is not of high quality, and hence cannot determine the Tc of g = 0.8 accurately. For the model under consideration, we also uncover the condition for training a successful CNN when only configurations of two temperatures are considered as the training set.

Original languageEnglish
Article number113A02
JournalProgress of Theoretical and Experimental Physics
Volume2025
Issue number11
DOIs
Publication statusPublished - 2025 Nov 1

ASJC Scopus subject areas

  • General Physics and Astronomy

Fingerprint

Dive into the research topics of 'Minimal Training Set for Training a Successful CNN: A Case Study of the Frustrated J1-J2Ising Model on the Square Lattice'. Together they form a unique fingerprint.

Cite this