Data Parallelism in Training Sparse Neural Networks

  • Namhoon Lee (University of Oxford)*; Philip Torr (University of Oxford); Martin Jaggi (EPFL)
  • Paper
  • Slide