DropMax: Adaptive Variational Softmax A PyTorch implementation of "DropMax" arxiv paper Original TF Repo Python 3.6, Pytorch 1.0.0 Compare with CrossEntropy Loss on MNIST Dataset MNIST Result Running Example python3 main.py