Exploring Cross-Entropy Loss in Large Language Models.