Abstract
We use n-gram language models to investigate how far language approximates an optimal code for human communication in terms of Information Theory, and what differences there are between Learner proficiency levels. Although the language of lower level learners is simpler, it is less optimal in terms of information theory, and as a consequence more difficult to process.