TY - JOUR
T1 - Minimal Training Set for Training a Successful CNN
T2 - A Case Study of the Frustrated J1-J2Ising Model on the Square Lattice
AU - Li, Shang Wei
AU - Tseng, Yuan Heng
AU - Hsieh, Ming Che
AU - Jiang, Fu Jiun
N1 - Publisher Copyright:
© 2025 The Author(s). Published by Oxford University Press on behalf of the Physical Society of Japan.
PY - 2025/11/1
Y1 - 2025/11/1
N2 - The minimal training set to train a working convolutional neural network (CNN) is explored in detail. The model under consideration is the frustrated J1-J2 Ising model on the square lattice. Here J1 < 0 and J2 > 0 are the nearest and next-to-nearest neighboring couplings, respectively. We train the CNN using the configurations of g def = J2/|J1| = 0.7 and employ the resulting CNN to study the phase transition of g = 0.8. We find that this transfer learning is successful. In particular, only configurations of two temperatures, one below and one above the critical temperature Tc of g = 0.7, are needed to accurately determine the Tc of g = 0.8. However, it may be subtle to use this strategy for the training. Specifically, for the model under consideration, due to the inefficiency of the single spin-flip algorithm used in sampling the configurations in the low-temperature region, the two temperatures associated with the training set should not be too far away from the Tc of g = 0.7. Otherwise, the performance of the obtained CNN is not of high quality, and hence cannot determine the Tc of g = 0.8 accurately. For the model under consideration, we also uncover the condition for training a successful CNN when only configurations of two temperatures are considered as the training set.
AB - The minimal training set to train a working convolutional neural network (CNN) is explored in detail. The model under consideration is the frustrated J1-J2 Ising model on the square lattice. Here J1 < 0 and J2 > 0 are the nearest and next-to-nearest neighboring couplings, respectively. We train the CNN using the configurations of g def = J2/|J1| = 0.7 and employ the resulting CNN to study the phase transition of g = 0.8. We find that this transfer learning is successful. In particular, only configurations of two temperatures, one below and one above the critical temperature Tc of g = 0.7, are needed to accurately determine the Tc of g = 0.8. However, it may be subtle to use this strategy for the training. Specifically, for the model under consideration, due to the inefficiency of the single spin-flip algorithm used in sampling the configurations in the low-temperature region, the two temperatures associated with the training set should not be too far away from the Tc of g = 0.7. Otherwise, the performance of the obtained CNN is not of high quality, and hence cannot determine the Tc of g = 0.8 accurately. For the model under consideration, we also uncover the condition for training a successful CNN when only configurations of two temperatures are considered as the training set.
UR - https://www.scopus.com/pages/publications/105022717147
UR - https://www.scopus.com/pages/publications/105022717147#tab=citedBy
U2 - 10.1093/ptep/ptaf151
DO - 10.1093/ptep/ptaf151
M3 - Article
AN - SCOPUS:105022717147
SN - 2050-3911
VL - 2025
JO - Progress of Theoretical and Experimental Physics
JF - Progress of Theoretical and Experimental Physics
IS - 11
M1 - 113A02
ER -