REFERENCES
English References
Bachman, L., &; Purpura, J. (2008). Language assessments: Gate-keepers or door-openers? In B. M. Spolsky &; Francis M. Hult (Eds.), Blackwell handbook of educational linguistics. Oxford, UK: Blackwell.
Birmbaum, A. (1968). Some latent trait models and their use in inferring an examinee’s ability. In F. M. Lord, and M. R. Novik, (Eds.), Statistical Theories of Mental Test Scores. MA: Addison-Wesley.
Bock, R. D., Mislevy, R. J., &; Woodson, C. E. M. (1982). The next stage in educational assessment. Educational Researcher, 11, 4-11, 16.
Choppin, B. (1976). Recent development in item banking. In D. N. de Gruijter &; L. J. van der Kamp (Eds.), Advances in psychological and educational measurement. London: Wiley.
Ebel, R. L. (1991). Essentials of educational measurement. N.J. : Prentice Hall.
Ebel, R. L. &; Frisbie, D. A. (1991). Essentials of educational measurement (5th Ed.). NJ: Prentice-Hall.
Gullikson, H. (1987). Theory of mental tests. NJ: Lawrence Erlbaum Associates.
Guion, R. M., &; Ironson, G. H.(1983).Latent trait theory for organizational research. Organizational Behavior and Human Performance, 31, 54-87.
Hambleton, R. K. (1979). Latent trait models and their applications. In R. Traub, (Ed.), New Directions for Testing and Measurement (Volume 4.) Methodological Developments. SF: Jossey Bass.
Hambleton, R. K. and Cook, L. L. (1977). Latent trait models and their use in the analysis of educational test data. Journal of Educational Measurement, 14, 2, 75-96.Holland, P. W. and Wainer, H. (Eds.). (1993). Differential item functioning. NJ: Lawrence Erlbaum Associates.
Kunnan, A. J. (2000). Fairness and validation in language assessment: Selected papers from the 19th Language Testing Research Colloquium, Orlando, Florida. Cambridge: CUP.
Lohman, D. F. &; Snow, R. E. (1993). Cognitive psychology, new test design, and new test theory: an introduction. In Frederiken, N., Mislevy, R. J. &; Bejar, I. I, (Eds). Test theory for a new generation of tests. NJ: Lawrence Erlbaum Associates.Lord, F. M. (1952). A theory of test scores. Psychometrika, 24, 1-18.Lord, F. M. (1980). Application of item response theory to practical testing problems. NJ: NJ: Lawrence Erlbaum Associates.
Lord, F. M., &; Novick, M. R. (1968). Statistical theories of mental test scores.MA: Addison-Wesley.
McDonald,R. P. (1999). Test theory: A unified treatment. NJ: Lawrence Erlbaum Associates.
Messick, S., Beaton, A. E., &; Lord, F. M. (1983). National assessment of Educational Progress Reconsidered: A new design for a new era. NJ: NAEP.
Noll, V. H., Scannell, D. P., &; Craig, R. C. (1979). Introduction to educational measurement (4th Ed.). MA: Houghton Mifflin.
Penfield, R. D. and Lam, T. C. M. (2000). Assessing differential item functioning in performance assessment: review and recommendations. Educational Measurement: Issues and Practice, 11(3), 5-15.
Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: Danish Institute of Education Research.
Richards, J.C., Platt, J. &; Platt, H. (1992). Longman dictionary of language teaching and applied linguistics. UK: Longman Group.
Shohamy, E. (2000). Fairness in testing. In A. J. Kunnan (Ed.), Fairness and validation in language assessment: papers from the 19th Language Testing Research Colloquium, Orlando , Florida (pp. 15-19). Cambridge: CUP.
Spearman, C. (1904). The proof and measurement of association between two things. American Journal of Psychology, 15, 72-101.
Spearman, C. (1907). Demonstration of formulae for true measure of correlation. American Journal of Psychology, 18, 161-169.
Spearman, C. (1910). Correlation calculated with faulty data. British Journal of Psychology, 3, 271-295.
Theunissen, T. J. J. M. (1985). Binary programming and test design. Psychometrika, 50, 411-420.Tucker, L. R. (1946). Maximum validity of a test with equivalent items. Psychometrika, 11, 1-13.Tung, H. C. (2008). Historical developments of the English tests used in Joint College Entrance Examination in the past fifty years. Unpublished Dissertation at National Kaohsiung Normal University.
Weiss, D. J. (1984). Application of computerized adaptive testing to educational problem. Journal of Educational Measurement, 21, 361-376.
Wright, B. D. (1977). Solving measurement problems with the Rasch model. Journal of Educational Measurement, 14, 97-116.
Chinese References
陳柏熹. (2006). IRT在測驗編制上的應用. Retrieved from http://www.bctest.ntnu.edu.tw/issue.htm
Bibliography
Airasian, P.W., &; Madaus, G.F. (1983). Linking testing and instruction: Policy issues. Journal of Educational Measurement, 20(2), 103-118.
Alderson, J. C. and L. Hamp-Lyons, (1996). TOEFL preparation courses: A study of washback. Language Testing 13: 280–297.
Alderson, J. C. and A. H. Urquhart (1985a). The effect of students’ academic discipline on their performance on ESP reading tests. Language Testing 2:192–204.
Angelis, P. J.(1982). Academic needs and priorities for testing. American Language Journal, 1, 41-56.
Bachman, L. F. (1990) Fundamental Considerations in Language Testing. Oxford: Oxford University Press.
Bachman, L. F. and D. Eignor (1998). Recent advances in quantitative test analysis. In Bersoff, D. (1984). Social and legal influences on test development and usage. In B. Plake (Ed.) Social and Technical Issues in Testing: 87–109.Campbell, P.B. (1989). The Hidden Discriminator: Sex and Race Bias in Educational Research. Groton, MA: Women's Educational Equity Act Program. ERIC Document Reproduction Service No. ED 322 174.
Camilli, G. (1993). The case against item bias detection techniques based on internal criteria: Do item bias procedures obscure test fairness issues? In (Holland and Wainer) 397–413.Canale, M. &; Swain, M. (1980). Theoretical bases of communicative approaches to second language teaching and testing. Applied Linguistics 1, 1-47.
Chen, Z. and G. Henning. (1985). Linguistic and cultural bias in language proficiency tests. Language Testing 2: 155-163.
Green, A. (1997). Verbal Protocol Analysis in Language Testing Research. Cambridge: Cambridge University Press.
Hale, G. (1988). Student major field and text content: Interactive effects on reading comprehension in the TOEFL. Language Testing, 5: 49–61.
Huang, T. S. (1997). A qualitative analysis of The JCEE English tests. Taipei: The
Crane Publishing Co., Ltd.
Hughes A. &; Porter, D. (1983). Current developments in language testing. London: Academic Press.
Klein, S.S. (Ed.) (1985). Handbook for Achieving Sex Equity through Education. Baltimore, MD: Johns Hopkins University Press. ERIC Document Reproduction Service No. ED 290 810.
Kunnan, A. J. (1992). An investigation of a criterion-referenced test using G-theory, and factor and cluster analysis. Language Testing 9: 30–49.Kunnan, A. J. (1995). Test-taker characteristics and Test Performance: A Structural Modeling Approach. Cambridge: Cambridge University Press.
Madsen, H. S. (1983). Techniques in testing. NY: Oxford University Press.
Rosser, P. (1989). The SAT Gender Gap: Identifying the Causes. Washington, DC: Center for Women Policy Studies. ERIC Document Reproduction Service No. ED 311 087.
Ryan, K. and L. F. Bachman (1992). Differential item functioning on two tests of EFL proficiency. Language Testing 9: 12–29.
Shohamy, E. and O. Inbar (1991) Construct validity of listening comprehensive test of oral proficiency. Language Testing 8: 23-40Tittle, C.K. (1979). What to Do About Sex Bias in Testing. Princeton, NJ: ERIC Clearinghouse on Tests, Measurement, and Evaluation. ERIC Document Reproduction Service No. ED 183 628.
Wiseman, S. (1961). Examinations and English education. Manchester, England: Manchester University Press.
Yu, K. H. (1983). Language proficiency and its assessment: What is in the old bottle and what is new? English Teaching &; Learning, 11(1), 39-50.
Zeidner, M. (1986). Are English language aptitude tests biased towards culturally different minority groups? Some Israeli findings. Language Testing 3: 80–95.
Zeidner, M. (1987). A comparison of ethnic, sex and age biases in the predictive validity of English language aptitude tests: Some Israeli data. Language Testing 4: 55–71.