Such a learning paradigm often makes it more difficult to achieve the convergence of training models, and therefore the resulting network models are not easy to generalize, typically in machine ...