Abstract
This article focuses on the practical problems involved in segmenting older, non-standardised spelling forms into consonant and vowel graphemes. Segmental parsing is an important issue in research on the development of Dutch spelling, which draws mostly on thirteenth- and fourteenth-century charter spellings. Part 1 introduces the research aims; part 2 discusses the problems of defining first graphs and then syllables. These problems originate at the theoretical, graphemic level but have a direct practical impact: they hinder the automatic parsing of tokens in the corpus, as discussed in part 3. The goal of this article is to provide partial solutions for parsing non-standardised language data in graphemic research.