| 期刊論文1. | Kretchmar, Jennifer(2006)。Assessing the reliability of ratings used in undergraduate admission decisions。Journal of College Admission,192,10-15。 | 2. | Michael, T. K.(1982)。A Sampling Model for Validity。Applied Psychological Measurement,6,125-160。 | 3. | Baker, E. L.、Abedi, J.(1996)。Dimensionality and generalizability of domain-independent performance assessments。Journal of Education Research,89(4),197-205。 | 4. | Chafouleas, S. M.、Christ, T. J.、Riley-Tillman, T. C.、Briesch, A. M.、Chanese, J. A. M.(2007)。Generalizability and dependability of direct behavior ratings to assess social behavior of preschoolers。School Psychology Review,36,63-79。 | 5. | Hemandez-Lloreda, M. V.、Colmenares, R.(2006)。The utility of generalizability theory in the study of animal behaviour。Animal Behaviour,71(4),983-988。 | 6. | Hulsman, R. L.、Mollema, E. D.、Oort, F. J.、Hoos, A. M.、de Haes, J. C.(2006)。Using standardized video cases for assessment of medical communication skills: Reliability of an objective structured video examination by computer。Patient Education And Counseling,60(1),24-31。 | 7. | Klein, S. P.、Stecher, B. M.、Shavelson, R. J.、McCaffrey, D.、Ormseth, T.、Bell, R. M.(1998)。Analytic versus holistic scoring performance tasks。Applied Measurement in Education,11(2),121-137。 | 8. | Marzano, R. L.(2002)。A comparison of selected methods of scoring classroom assessments。Applied Measurement in Education,15(3),249-267。 | 9. | Nupbaum, A.(1984)。Multivariate generalizability theory in educational measurement: An empirical study。Applied Psychological Measurement,8(2),219-230。 | 10. | Quirk, M.、Mazor, K.、Haley, H.、Wellman, S.、Keller, D.、Ha-tem, D.(2005)。Reliability and validity of checklists and global ratings by standardized students, trained raters, and faculty raters in an objective structured teaching exercise。Teaching and Learning in Medicine,17(3),202-209。 | 11. | Schiinemann, H. J.、Norman, G.、Puhan, M. A.、Stahl, E.、Griffith, L.(2007)。Application of generalizability theory confirmed lower reliability of the standard gamble than the feeling thermometer。Journal of Clinical Epidemiology,60(12),1256-1262。 | 12. | Taylor, C. S.(1998)。An investigation of scoring methods for mathematics performance-based assessments。Educational Measurement,5(3),195-224。 | 13. | Wass, V.、McGibbon, D.、van der Vleuten, C.(2001)。Composite undergraduate clinical examinations: How should the components be combined to maximize reliability。Medical Education,35(4),326-330。 | 14. | 李茂能(19960600)。信度考驗的另一途徑:推論力理論。國民教育研究學報,2,27-48。 延伸查詢 | 15. | McMillan, S. C.(1985)。A comparison of professional performance examination scores of graduating associate and baccalaureate degree nursing students。Research in Nursing and Health,8(2),167-172。 | 16. | Wolfe, E. W.、Gitomer, D. H.(2001)。The influence of changes in assessment design on the psychometric quality of scores。Applied Measurement in Education,14(1),91-107。 | 17. | 黃瓊蓉(20040600)。Generalizability of the Writing Performance Assessment。測驗學刊,51(1),29-44。 | 18. | 盧雪梅(19980100)。實作評量的應許、難題和挑戰。教育資料與研究,20,1-5。 延伸查詢 | 19. | Brennan, R. L.、Johnson, E. G.(1995)。Generalizability of performance assessments。Educational Measurement: Issues and Practice,14(4),9-12。 | 20. | Gao, X.、Shavelson, R. J.、Baxter, G. P.(1994)。Generalizability of large-scale performance assessments in science: Promises and problems。Applied Measurement in Education,7(4),323-342。 | 21. | McBee, M. M.、Barnes, L. B.(1998)。The generalizability of a performance assessment measuring achievement in eighth-grade mathematics。Applied Measurement in Education,11(2),179-194。 | 22. | 余民寧(1992)。測量理論的發展趨勢。心理測驗的發展與應用,34-39。 延伸查詢 | 23. | 林本源(20030300)。應用概化理論估計運動技能測驗多層面變異來源的信度。體育學報,34,243-251。 延伸查詢 | 24. | 吳毓瑩(19980100)。我看、我畫、我說、我演、我想、我是誰呀?--卷宗評量之概念、理論、與應用。教育資料與研究,20,13-17。 延伸查詢 | 25. | Clauser, Brian E.(2000)。Recurrent issues and recent advances in scoring performance assessment。Applied Psychological Measurement,24(4),310-324。 | 26. | Pine, J.、Goldman, S. R.、Baxter, G. P.、Shavelson, R. J.(1992)。Evaluation of procedure-based scoring for hands-on science assessment。Journal of Educational Measurement,29(1),1-17。 | 27. | Harwell, M.(1999)。Evaluating the validity of educational rating data。Educational and Psychological Measurement,59(1),25-37。 | 28. | Shavelson, R. J.、Mayberry, P. W.、Li, Weichang、Webb, N. M.(1990)。Generalizability of Job Performance Measurements: Marine Corps Rifleman。Military Psychology,2(3),129-144。 | 會議論文1. | 夏萍洄、盧純華(1996)。護理人員執照考試反映教育目標與基本就業能力之研究。科學教育研究計劃八十六年度成果討論會,140-145。 延伸查詢 | 學位論文1. | 辛慶偉(1998)。國小自然科卷宗評量建構效度之探究(碩士論文)。國立臺北師範學院。 延伸查詢 | 2. | 詹元智(2002)。國小數學科實作評量之效度探討(碩士論文)。屏東師範學院。 延伸查詢 | 3. | 桂怡芬(1996)。自然科實作評量的效度探討(碩士論文)。國立台北師範學院。 延伸查詢 | 圖書1. | van der Linden, W. J.、Hambleton, R. K.(1997)。Handbook of Modern Item Response Theory。New York, NY:Springer-Verlag。 | 2. | Kline, R. B.(1995)。Principles and practice of structural equation modeling。New York, NY:Guilford Press。 | 3. | Brennan, R. L.(2001)。Generalizability Theory。New York, NY:Springer-Verlag。 | 4. | Shavelson, Richard J.、Webb, Noreen M.(1991)。Generalizability Theory: A Primer。Newbury Park, California:Sage。 | 5. | Crick, J. E.、Brennan, R. L.(1983)。Manual for GENOVA: A generalized analysis of variance system。Iowa City, IA:American College Testing Service。 | 6. | Thorndike, Robert M.(1997)。Measurement and evaluation in psychology and education。Upper Saddle River, NJ:Prentice-Hall。 | 7. | 郭生玉(2004)。教育測驗與評量。精華。 延伸查詢 | 8. | Brennan, R. L.(1992)。Elements of Generalizability Theory。Iowa, IA:The American College Testing Program。 | |