Why does Unsupervised Pre-training Help Deep Learning?

   Abstract