Variational Bayesian Learning of Probabilistic Discriminative Models With Latent Softmax Variables