The Academia Sinica Ancient Chinese Corpus is designed for linguistic research. It contains ancient texts selected for their usefulness in grammatical and lexical studies, together with an inspection program that provides keyword searching, statistics, and collocation functions. The corpus is divided into three subcorpora according to stages of grammatical development, so that both synchronic and diachronic studies can be performed on them. Their current sizes are as follows:

A. Old Chinese subcorpus (from pre-Qin to pre-Han): 5,128,068 characters.
B. Middle Chinese subcorpus (from Late Han to the Six Dynasties): 8,101,662 characters.
C. Early Mandarin Chinese subcorpus (from Tang to Ching): 4,406,381 characters.

A large portion of the texts in the Old Chinese subcorpus (4,497,051 characters) has been classified and marked up according to source book, author, text genre, etc. A substantial part (520,794 characters) of the same subcorpus has also been segmented into words, each of which has been assigned a part-of-speech tag. The results of these two tasks form the basis of our Old Chinese Lexical Database.
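
To put the character counts quoted above in proportion, the following is a minimal Python sketch that totals the three subcorpora and computes what share of the Old Chinese subcorpus has been classified and what share has been segmented and tagged. The variable and function names are illustrative assumptions; they are not part of the corpus or its inspection program, and the figures are simply those stated in the description.

```python
# Character counts as quoted in the corpus description.
# All names below are hypothetical and used only for illustration.
SUBCORPORA = {
    "Old Chinese": 5_128_068,
    "Middle Chinese": 8_101_662,
    "Early Mandarin Chinese": 4_406_381,
}
OLD_CHINESE_CLASSIFIED = 4_497_051  # textually classified and marked up
OLD_CHINESE_TAGGED = 520_794        # segmented into words and POS-tagged


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


total = sum(SUBCORPORA.values())
print(f"Total size: {total:,} characters")

# Relative size of each subcorpus within the whole corpus.
for name, size in SUBCORPORA.items():
    print(f"{name}: {size:,} characters ({share(size, total)}% of corpus)")

# Coverage of the two annotation tasks within the Old Chinese subcorpus.
old = SUBCORPORA["Old Chinese"]
print(f"Classified: {share(OLD_CHINESE_CLASSIFIED, old)}% of Old Chinese")
print(f"Segmented and tagged: {share(OLD_CHINESE_TAGGED, old)}% of Old Chinese")
```

On these figures, the classified material covers roughly 88% of the Old Chinese subcorpus and the word-segmented, part-of-speech-tagged material roughly 10%.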