林世華、陳學志、盧雪梅(2004)。國民中小學九年一貫課程學習成就評量指標與方法手冊。台北:教育部。
郭生玉 (2004)。教育測驗與評量。 新北市: 精華書局。
Anderson, L. W. (1999). Rethinking Bloom's Taxonomy: Implications for Testing and Assessment.
Anderson, L. W., & Krathwohl, D. R. (2001). A taxonomy for learning, teaching, and assessing: a revision of Bloom's taxonomy of educational objectives. New York: Longman.
Bennett, R. E., & Ward, W. C. (1993). Construction Versus Choice in Cognitive Measurement: Issues in Constructed Response, Performance Testing, and Portfolio Assessment: L. Erlbaum Associates.
Bloom, B. S. (1956). Taxonomy of educational objectives; the classification of educational goals. New York: Longmans, Green.
Breger, D. C. (1995). The inquiry paper. Science Scope, 19(2), 27-32.
Calkins, L. M. C. (1994). The art of teaching writing: Heinemann.
Carter, P. L., Ogle, P. K., & Royer, L. B. (1993). Learning logs:What are they and how do we use them? In N. L. Webb & A. F. Coxford (Eds.), Assessment in the mathematics classroom (pp.87-96). Reston, VA:National Council of Teachers of Mathematics.
Cizek, G. J., & Bunch, M. B. (2007). Standard setting: A guide to establishing and evaluating performance standards on tests: SAGE Publications Ltd.
Eckes, T. (2005). Examining rater effects in TestDaF writing and speaking performance assessments: A many-facet Rasch analysis. Language Assessment Quarterly: An International Journal, 2(3), 197-221.
Eckes, T. (2009). Many-facet Rasch measurement. Reference supplement to the manual for relating language examinations to the Common European Framework of Reference for Languages: Learning, teaching, assessment.
Ediger, M. (1998). Writing and the Pupil in the Science Curriculum. (ERIC Document Reproduction Service No. ED 426 846).
Foltz, P. W., Laham, D., & Landauer, T. K. (1999). Automated essay scoring: Applications to educational technology. Paper presented at the World Conference on Educational Multimedia, Hypermedia and Telecommunications.
Foster, G. (1984). Technical writing and science writing. Is there a difference and what does it matter? . Paper presented at the Annual Meeting of the Conference on College Composition and Communication 35th, New York City, USA, 29-31.
Gronlund, N. E. (1985). Measurement and evaluation in teaching. New York: Macmillan.
Horton, P. B., Fronk, R. H., & Walton, R. W. (1985). The effect of writing assignments on achievement in college general chemistry. Journal of Research in Science Teaching, 22(6), 535-541.
Huang, Y. C. (1999). A study of reformulation relations in scientific reports. (Unpublished master’ s thesis), University of Tsing Hua, Hsinchu, Taiwan, ROC.
Kintsch, E., Steinhart, D., Stahl, G., LSA Research Group, L. R. G., Matthews, C., & Lamb, R. (2000). Developing summarization skills through the use of LSA-based feedback. Interactive Learning Environments, 8(2), 87-109.
Kirkpatrick, L. D., & Pittendrigh, A. S. (1984). A writing teacher in the physics classroom. The Physics Teacher, 22(3), 159-164. doi: doi:http://dx.doi.org/10.1119/1.2341502Knoch, U., Read, J., & von Randow, J. (2007). Re-training writing raters online: How does it compare with face-to-face training? Assessing Writing, 12(1), 26-43. doi: http://dx.doi.org/10.1016/j.asw.2007.04.001Landy, F. J., & Farr, J. L. (1983). The Measurement of Work Performance: Methods, Theory, and Applications. New York, NY: Academic Press.
Langer, J. A., & Applebee, A. N. (1987). How Writing Shapes Thinking: A Study of Teaching and Learning(NCTE Research Report No. 22). Urbana, Illinois: National Council of Teachers of English.
León, J. A., Olmos, R., Escudero, I., Cañas, J. J., & Salmerón, L. (2006). Assessing short summaries with human judgments procedure and latent semantic analysis in narrative and expository texts. Behavior research methods, 38(4), 616-627.
Linacre, J. M. (1989). Many-facet Rasch measurement. Chicago: MESA Press.
Miller, R. G., & Calfee, R. C. (2004). Building a better reading-writing assessment: Bridging cognitive theory, instruction, and assessment. English Leadership Quarterly, 26(3), 6-13.
Newell, G. E. (1984). Learning from Writing in Two Content Areas: A Case Study/protocol Analysis. Research in the Teaching of English, 18(3), 265-287.
Newell, G. E. (1986). Learning from writing: Examining our assumptions. English Quarterly, 19, 291-302.
Palinscar, A. S., & Brown, A. L. (1984). Reciprocal teaching of comprehension-fostering and comprehension-monitoring activities. Cognition and instruction, 1(2), 117-175.Rasch, G. (1960). Studies in mathematical psychology: I. Probabilistic models for some intelligence and attainment tests.
Rivard, L. O. P. (1994). A review of writing to learn in science: Implications for practice and research. Journal of Research in Science Teaching, 31(9), 969-983.
Roid, G. H. (1994). Patterns of writing skills derived from cluster analysis of direct-writing assessments. Applied Measurement in Education, 7(2), 159-170.
Rowell, P. M. (1997). Learning in school science: The promises and practices of writing. Studies in Science Education, 30(1), 19-56.Schwarz, G. (1978). Estimating the Dimension of a Model. 461-464. doi: 10.1214/aos/1176344136
Sensenbaugh, R. (1989). ERIC/RCS: Writing Across the Curriculum: Evolving Reform. Journal of Reading, 32, 462-465.
Slotta, J. D., Chi, M. T., & Joram, E. (1995). Assessing students' misclassifications of physics concepts: An ontological basis for conceptual change. Cognition and instruction, 13(3), 373-400.
Stepanek, J. S. (1997). Assessment strategies to inform science and mathematics instruction [microform] : it's just good teaching / [Jennifer Stepanek, Denise Jarrett]. [Portland, OR] : [Washington, DC]: Northwest Regional Educational Laboratory ; U.S. Dept. of Education, Office of Educational Research and Improvement, Educational Resources Information Center.
Thall, E., & Bays, G. (1989). Utilizing ungraded writing in the chemistry classroom. Journal of Chemical Education, 66, 662-663.
Toranj, S., & Ansari, D. N. (2012). Automated Versus Human Essay Scoring: A Comparative Study. Theory & Practice in Language Studies, 2(4), 719-725. doi: 0.4304/tpls.2.4.719-725
Valenti, S., Cucchiarelli, A., & Panti, M. (2002). Computer based assessment systems evaluation via the ISO9126 quality model. Journal of Information Technology Education: Research, 1(1), 157-175.Valenti, S., Neri, F., & Cucchiarelli, A. (2003). An overview of current research on automated essay grading. Journal of Information Technology Education: Research, 2(1), 319-330.VanDeWeghe, R. (1987). Making and remaking meaning: Developing literary responses through purposeful informal writing. English Quarterly, 20, 38-51.
Witkin, S. L. (2000). Writing social work. Social Work, 45(5), 389-394.