We are developing a corpus in order to investigate argument realization in detail for pre-modern Japanese, giving a comprehensive account of the basic grammar of each major stage of the language and allowing for both synchronic and diachronic analyses. When completed, the corpus will contain texts from the 8th century until the beginning of the 16th century. The results of the project will impact the description and understanding of pre-modern Japanese and its changes through time, furthering our understanding and interpretation of earlier texts. The project is also expected to have implications for general linguistic theory, both with regard to frameworks for understanding verb semantics and clause structure, and with regard to the application of syntactic theory to 'dead' languages. This paper focuses on the initial stages of corpus building, including methods for encoding orthography, morphology, and syntax.