Understanding how Large Language Models generate text during inference.