Time Matters in Regularizing Deep Networks:
Weight Decay and Data Augmentation Affect Early Learning
Dynamics, Matter Little Near Convergence
Aditya Golatkar, Alessandro Achille, Stefano Soatto
Department of Computer Science
University of California, Los Angeles
{aditya29,achille,soatto}@cs.ucla.edu
Abstract
Regularization is typically understood as improving generalization by altering
the landscap ...












