Structural Weight Transfer for Grokked Networks
Monitor neural network health and detect overfitting early