Variational Student: Learning Compact and Sparser Networks in Knowledge Distillation Framework

The holy grail in deep neural network research is porting memory- and computation-intensive network models to embedded platforms with minimal compromise in model accuracy. To this end, we propose Variational Student, in which we reap the benefits of com
