Efficient Sparse Neural Network Training
Konstantin Shvedov
Committee: Jee Choi
Master's Thesis (Jun 2022)
Keywords: JIT, Matrices, Neural Networks, Pruning, Sparse

Developments in neural networks have led to advanced models requiring large amounts of training time and resources. To reduce the environmental impact and decrease the training times of models, acceleration techniques have been developed. One such method is neural network pruning, which removes insignificant weights and produces sparse models. This paper explores and improves a method for training sparse neural networks efficiently: network layers are randomly pruned at initialization, and only the remaining non-zero values are processed using optimized just-in-time (JIT) kernels from the LIBXSMM library. The algorithms explored in this paper serve as a proof of concept and demonstrate the possibility of improving training times beyond what the highly optimized PyTorch library currently achieves. Through the work in this paper, the algorithm's processing times are sped up more than 100-fold. Further, this work provides additional evidence that advanced pruning algorithms and other improvements can significantly reduce training times and resources.
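As a rough illustration of the random pruning-at-initialization step described above, the following sketch shows one way to zero out a random fraction of a layer's weights in PyTorch before training. It is not taken from the thesis; the layer shape, sparsity level, and helper name are assumptions made for illustration only.

```python
import torch
import torch.nn as nn

def random_prune_at_init(layer: nn.Linear, sparsity: float = 0.9) -> torch.Tensor:
    """Randomly zero out a fraction of the layer's weights at initialization.

    Returns the binary mask so it can be re-applied after each optimizer step,
    keeping pruned weights at zero throughout training.
    """
    with torch.no_grad():
        # Keep each weight with probability (1 - sparsity); zero out the rest.
        mask = (torch.rand_like(layer.weight) >= sparsity).float()
        layer.weight.mul_(mask)
    return mask

# Hypothetical usage: prune a single linear layer to 90% sparsity before training.
layer = nn.Linear(1024, 1024)
mask = random_prune_at_init(layer, sparsity=0.9)

# After each optimizer step, re-apply the mask so pruned weights stay zero:
#     optimizer.step()
#     with torch.no_grad():
#         layer.weight.mul_(mask)
```

In a dense framework such as PyTorch, the zeroed weights are still stored and multiplied; the approach studied in the thesis instead stores and processes only the non-zero values with JIT-generated LIBXSMM kernels, which is where the reported speedups come from.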