Should I Use Softmax As Output When Using Cross Entropy Loss In Pytorch?
I have a problem classifying the MNIST dataset with a fully connected deep neural network with 2 hidden layers in PyTorch. I want to use tanh as the activation in both hidden layers, but at the output I am not sure whether I should also apply softmax when using cross-entropy loss.
Solution 1:
As stated in the torch.nn.CrossEntropyLoss() documentation:

"This criterion combines nn.LogSoftmax() and nn.NLLLoss() in one single class."
Therefore, you should not apply softmax yourself: pass the raw logits from the last linear layer directly to the loss, since the log-softmax is already applied inside nn.CrossEntropyLoss().
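For concreteness, here is a minimal sketch of the setup described in the question: a fully connected net with two tanh hidden layers for MNIST, trained on raw logits with nn.CrossEntropyLoss. The layer widths (128 and 64) and the batch size are illustrative assumptions, not anything prescribed by the question.

```python
import torch
import torch.nn as nn

# Sketch of a fully connected net with 2 tanh hidden layers for MNIST.
# Hidden sizes 128 and 64 are assumptions chosen for illustration.
model = nn.Sequential(
    nn.Flatten(),        # 28x28 images -> 784-dim vectors
    nn.Linear(784, 128),
    nn.Tanh(),
    nn.Linear(128, 64),
    nn.Tanh(),
    nn.Linear(64, 10),   # raw logits; note: no softmax here
)

criterion = nn.CrossEntropyLoss()  # applies log-softmax + NLL internally

# Dummy batch, just to show the call pattern
images = torch.randn(32, 1, 28, 28)
labels = torch.randint(0, 10, (32,))

logits = model(images)             # shape (32, 10), unnormalized scores
loss = criterion(logits, labels)   # correct: logits in, no softmax first
loss.backward()
```

If you do need class probabilities at inference time (for example, to report confidences), apply torch.softmax(logits, dim=1) to the model output outside the loss computation; during training, keep the output as raw logits.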