| 期刊論文1. | Crocker, L.(1997)。Assessing content representativeness of performance assessment exercises。Applied Measurement in Education,10(1),83-95。 | 2. | Wolfe, E. W.、Gitomer, D. H.(2001)。The influence of changes in assessment design on the psychometric quality of scores。Applied Measurement in Education,14(1),91-107。 | 3. | Shavelson, R. J.、Baxter, G. P.、Pine, J.(1991)。Performance assessment in science。Applied Measurement in Education,4(4),347-362。 | 4. | Kane, Michael、Crooks, Terry、Cohen, Allan S.(1999)。Validating Measures of Performance。Educational Measurement: Issues and Practice,18(2),5-17。 | 5. | Baxter, G. P.、Glaser, R.(1998)。Investigating the cognitive complexity of science assessments。Educational Measurement: Issues and Practice,17(3),37-45。 | 6. | Brennan, R. L.(2000)。(Mis)conceptions about generalizability theory。Educational Measurement: Issues and Practice,19(1),5-10。 | 7. | Cronbach, L. J.、Linn, R. L.、Brennan, R. L.、Haertel, E. H.(1997)。Generalizability analysis for performance assessments of student achievement or school effectiveness。Educational and Psychological Measurement,57(3),373-399。 | 8. | Kane, M. T.(1982)。A sampling model of validity。Applied Psychological Measurement,6,125-160。 | 9. | Linn, R. L.、Burton, E.、DeStefano, L.、Hanson, M.(1996)。Generalizability of New Standard Project 1993 pilot study tasks in mathematics。Applied Measurement in Education,9(3),201-214。 | 10. | Moss, P. A.(1995)。Themes and variations in validity theory。Educational Measurement: Issues and Practice,14(2),5-13。 | 11. | Nichols, P. D.、Kuehl, B. J.(1999)。Prophesying the reliability of cognitively complex assessment。Applied Measurement in Education,12(1),73-94。 | 12. | Nichols, P. D.、Smith, P. L.(1998)。Contextualizing the interpretation of reliability data。Educational Measurement: Issues and Practice,17(3),24-36。 | 13. | Nichols, P. D.、Sugrue, B.(1999)。The lack of fidelity between cognitively complex constructs and conventional test development practice。Educational Measurement: Issues and Practice,18(2),18-29。 | 14. | Parkes, J.(2001)。The role of transfer in the variability of performance assessment scores。Educational Assessment,7(2),143-164。 | 15. | Shavelson, R. J.、Ruiz-Primo, M. A.、Wiley, E. W.(1999)。Note on sources of sampling variability in science performance。Journal of Educational Measurement,36(1),61-71。 | 16. | Webb, N. M.、Schlackman, J.、Sugrue, B.(2000)。The dependability and interchangeability of assessment methods in science。Applied Measurement in Education,15(3),277-301。 | 17. | 張麗麗(20040400)。影響教師自評實作評量實施品質相關因素之探討。南師學報(教育類),38(1),95-120。 延伸查詢 | 18. | Brennan, R. L.、Johnson, E. G.(1995)。Generalizability of performance assessments。Educational Measurement: Issues and Practice,14(4),9-12。 | 19. | Gao, X.、Brennan, R. L.(2001)。Variability of estimated variance components and related statistics in a performance assessment。Applied Measurement in Education,14(2),191-203。 | 20. | Gao, X.、Shavelson, R. J.、Baxter, G. P.(1994)。Generalizability of large-scale performance assessments in science: Promises and problems。Applied Measurement in Education,7(4),323-342。 | 21. | Lane, S.、Liu, M.、Ankenmann, R. D.、Stone, C. A.(1996)。Generalizability and validity of a mathematics performance assessment。Journal of Educational Measurement,33(1),71-92。 | 22. | McBee, M. M.、Barnes, L. B.(1998)。The generalizability of a performance assessment measuring achievement in eighth-grade mathematics。Applied Measurement in Education,11(2),179-194。 | 23. | Messick, Samuel(1994)。The Interplay of Evidence and Consequences in the Validation of Performance Assessments。Educational Researcher,23(2),13-23。 | 24. | 張麗麗(20020100)。評量改革的應許之地,虛幻或真實?--談實作評量之作業與表現規準。教育研究,93,76-86。 延伸查詢 | 25. | Shavelson, R. J.、Baxter, G. P.、Gao, X.(1993)。Sampling variability of performance assessments。Journal of Educational Measurement,30(3),215-232。 | 學位論文1. | 林敬修(2003)。影響國小數學科實作評量信度相關因素之類推性理論分析(碩士論文)。屏東師範學院,台北。 延伸查詢 | 2. | 吳欣黛(1998)。實作評量在效度上的真實性與直接性(碩士論文)。國立台北師範學院,台北。 延伸查詢 | 圖書1. | Brennan, R. L.(1992)。Elements of generalizability theory。Iowa City, IA:American College Testing Program。 | 2. | Crick, J. E.、Brennan, R. L.(1982)。GENOVA: A generalized analysis of variance system (FORTRAN IV computer program and manual)。Dorchester, Mass:Computer Facilities, University of Massachusetts at Boston。 | 3. | Gao, X.(1996)。Sampling variability and generalizability of work keys listening and writing scores。Iowa City, Iowa:ACT。 | 4. | Northwest Regional Educational Laboratory(1994)。A toolkit for professional developers: Alternative assessment。Portland, OR:Northwest Regional Educational Laboratory。 | 5. | Brennan, R. L.(2001)。Generalizability Theory。New York, NY:Springer-Verlag。 | 6. | California State Department of Education(1989)。A question of thinking: A first look at students' performance on open-ended questions in mathematics。Sacramento, CA:California State Department of Education。 | 7. | Herman, Joan L.、Aschbacher, Pamela R.、Winters, Lynn(1992)。A practical guide to alternative assessment。Alexandria, VA:Association for Supervision and Curriculum Development。 | 8. | Shavelson, Richard J.、Webb, Noreen M.(1991)。Generalizability Theory: A Primer。Newbury Park, California:Sage。 | 9. | Stiggins, Richard J.(1994)。Student-centered classroom assessment。New York, NY:Macmillan Publishing Company。 | 10. | Wiggins, G. P.(1993)。Assessing student performance: Exploring the purpose and limits of testing。Jossey-Bass。 | 11. | Wiggins, Grant P.(1998)。Educative assessment: Designing assessments to inform and improve student performance。San Francisco, CA:Jossey-Bass Publishers。 | 12. | Crocker, L.、Algina, J.(1986)。Introduction to Classical and Modern Test Theory。Holt, Rinehart & Winston。 | 13. | 教育部(2000)。國民中小學九年一貫課程暫行綱要。台北市:教育部。 延伸查詢 | 14. | National Council of Teachers of Mathematics(1989)。Curriculum and evaluation standards for school mathematics。Reston, VA:National Council of Teachers of Mathematics。 | 圖書論文1. | Brennan, R. L.(1996)。Generalizability of performance assessments。Technical issues in large-scale performance assessment。Washington, DC:U.S. Department of Education, Office of Educational Research and Improvement。 | 2. | Mislevy, R.(1990)。Foundations of a new test theory。Test theory for a new generation of tests。Hillsdale, NJ:Lawrence Erlbaum Associates。 | 3. | Romberg, T. A.、Wilson, L. D.(1995)。Issues related to the development of an authetic assessment system for school mathematics。Reform in school mathematics and authentic assessment。Albany, NY:State University of NewYork Press。 | 4. | Feldt, L. S.、Brennan, R. L.(1989)。Reliability。Educational measurement。New York, NY:Macmillan Press。 | 5. | Resnick, Lauren B.、Resnick, Daniel P.(1992)。Assessing the Thinking Curriculum: New Tools for Educational Reform。Changing Assessments: Alternative Views of Aptitude, Achievement and Instruction。Kluwer Academic Publishers。 | |