| 期刊論文1. | 陳柏熹、邱佳民、曾芬蘭(20100600)。高中職入學制度中在校成績採計校正方式之比較。教育科學研究期刊,55(2),115-139。 延伸查詢 | 2. | Brandon, P. R.(2004)。Conclusions about frequently studied modified Angoff standard setting topics。Applied Measurement in Education,17(1),59-88。 | 3. | Douglas, D.(1994)。Quantity and quality in speaking test performance。Language Testing,11(2),125-144。 | 4. | Orr, M.(2002)。The FCE speaking test: using rater reports to help interpret test scores。System,30(2),143-154。 | 5. | Brown, A.(1995)。The effect of rater variables in the development of an occupation-specific language performance test。Language Testing,12(1),1-15。 | 6. | Kane, Michael(1994)。Validating the performance standards associated with passing scores。Review of Educational Research,64(3),425-461。 | 7. | 宋曜廷、周業太、曾芬蘭(20140300)。十二年國民基本教育的入學考試與評量變革。教育科學研究期刊,59(1),1-32。 延伸查詢 | 8. | 林清山(19850100)。群聚分析的理論和統計方法以及應用群聚分析的實徵性研究。測驗年刊,32,155-180。 延伸查詢 | 9. | Dempster, A. P.、Laird, N. M.、Rubin, D. B.(1977)。Maximum likelihood from incomplete data via the EM algorithm (with discussion)。Journal of the Royal Statistical Society, Series B (Methodological),39(1),1-38。 | 10. | Ang-Aw, H. T.、Goh, C. C. M.(2011)。Understanding discrepancies in rater judgment on national-level oral examination tasks。RELC Journal,42(1),31-51。 | 11. | Impara, J. C.、Plake, B. S.(2005)。Teachers' ability to estimate item difficulty: A test of the assumption in the Angoff standard setting method。Journal of Educational Measurement,35(1),69-81。 | 12. | Huang, Z.(1997)。A fast clustering algorithm to cluster very large categorical data sets in data mining。Data Mining and Knowledge Discovery,2(3),1-8。 | 13. | Violato, C.、Marini, A.、Lee, C.(2003)。A validity study of expert judgment procedures for setting cutoff scores on high-stakes credentialing examinations using cluster analysis。Evaluation & the health professions,26(1),59-72。 | 14. | Plake, B. S.、Melican, G. J.、Mills, C. N.(1991)。Factors influencing intrajudge consistency during standard-setting。Educational Measurement: Issues and Practice,10(2),15-16。 | 15. | Khalid, M. N.(2011)。Cluster analysis: A standard setting technique in measurement and testing。Journal of Applied Quantitative Method,6(2),46-58。 | 16. | Lumley, T.(1998)。Perceptions of language-trained raters and occupational experts in a test of occupational English language proficiency。English for Specific Purposes,17(4),347-367。 | 17. | 吳慧珉、蘇珊玉(2014)。標準設定之效度評估:以2010年臺灣八年級學生國語文學習成就資料為例。測驗統計年刊,22,1-21。 延伸查詢 | 18. | 林世華、謝佩蓉、謝進昌(20121200)。表現標準設定之擴大參與:教學現場效度證據。教育研究與發展期刊,8(4),1-18。 延伸查詢 | 19. | 凃柏原(2016)。標準設定介紹。飛揚雙月刊,96。 延伸查詢 | 20. | 謝名娟、謝進昌(20130700)。標準設定實施程序與外在效度驗證。國家菁英,9(2)=34,149-162。 延伸查詢 | 21. | Bhola, D. S.、Impara, J. C.、Buckendahl, C. W.(2003)。Aligning tests with states' content standards: Methods and issues。Educational Measurement: Issues and Practice,22(3),21-29。 | 22. | Buckendahl, C. W.、Smith, R. W.、Impara, J. C.、Plake, B. S.(2006)。A comparison of Angoff and bookmark standard setting methods。Journal of Educational Measurement,39(3),253-263。 | 23. | Cizek, G. J.(1996)。Setting passing scores。Educational Measurement: Issues and Practice,15(2),20-31。 | 24. | Giraud, G.、Impara, J. C.、Buckendahl, C.(2000)。Making the cut in school districts: Alternative methods for setting cut-scores。Educational Assessment,6,291-304。 | 25. | Hess, B.、Subhiyah, R. G.、Giordano, C.(2007)。Convergence between cluster analysis and the Angoff method for setting minimum passing scores on credentialing examinations。Evaluation & the Health Professions,30(4),362-375。 | 26. | van Nijlen, D.、Janssen, R.(2008)。Modeling judgments in the Angoff and Contrasting-groups methods of standard setting。Journal of Educational Measurement,45(1),45-63。 | 27. | Wishart, D.(1969)。An algorithm for hierarchical classifications。Biometrics,25,165-170。 | 28. | Cizek, Gregory J.(1996)。Standard-setting guidelines。Educational Measurement: Issues and Practice,15(1),13-21。 | 29. | 謝進昌(20060300)。精熟標準設定方法的歷史演進與詮釋的新概念。國民教育研究學報,16,157-193。 延伸查詢 | 30. | 黃馨瑩、謝名娟、謝進昌(20130600)。臺灣學生學習成就評量英語科標準設定之效度評估研究。教育與心理研究,36(2),87-112。 延伸查詢 | 31. | Cohen, Jacob(1960)。A Coefficient of Agreement for Nominal Scales。Educational and Psychological Measurement,20(1),37-46。 | 32. | Lumley, T.、McNamara, T. F.(1995)。Rater characteristics and rater bias: Implications for training。Language Testing,12(1),54-71。 | 33. | Norcini, J. J.、Shea, J. A.、Kanya, D. T.(1988)。The effect of various factors on standard setting。Journal of Educational Measurement,25(1),57-65。 | 34. | Sireci, S. G.、Robin, F.、Patelis, T.(1999)。Using cluster analysis to facilitate standard setting。Applied Measurement in Education,12(3),301-325。 | 35. | Kane, M.(1998)。Choosing between examinee-centered and test-centered standing setting methods。Educational Assessment,5(3),129-145。 | 會議論文1. | Huang, Z.(1997)。Clustering large data sets with mixed numeric and categorical values。The 1st Pacific-Asia Conference on Knowledge Discovery and Data Mining,21-34。 | 2. | Nassif, P. M.(1978)。Standard setting for criterion referenced teacher licensing tests。The annual meeting of the National Council on Measurement in Education。Toronto, ON。 | 3. | Shepard, L. A.(1995)。Implications for standard setting of the National Academy of Education evaluation of the National Assessment of Educational Progress achievement levels。The Joint Conference on Standard Setting for Large-Scale Assessments。Washington, DC:National Assessment Governing Board:National Center for Education Statistics。143-160。 | 4. | Tseng, F. L.、Chiou, J. M.、Sung, Y. T.(2015)。A validity study for Yes/No Angoff standard setting method using cluster analysis。2015 12th International Conference on Fuzzy Systems and Knowledge Discovery。 | 圖書1. | Mooi, E.、Sarstedt, M.(2011)。A concise guide to market research: The process, data, and methods using IBM SPSS statistics。Springer-Verlag。 | 2. | Cizek, G. J.、Bunch, M. B.(2007)。Standard setting: A guide to establishing and evaluating performance standards on tests。Sage。 | 3. | Kaftandjieva, F.(2010)。Methods for setting cut scores in criterion-referenced achievement tests: A comparative analysis of six recent methods with an application to tests of reading in EFL。Cito, Arnhem:European Association for Language Testing and Assessment。 | 4. | Timm, N. H.(2002)。Applied multivariate analysis。New York, NY:Springer-Verlag。 | 5. | McLachlan, G.、Krishnan, T.(2008)。The EM algorithm and extensions。Hoboken, NJ:Wiley-Interscience。 | 6. | Ritter, G.(2015)。Robust cluster analysis and variable selection。NJ:CRC Press Book。 | 7. | Tinsley, H.、Brown, S.(2000)。Handbook of applied multivariate statistics and mathematical modeling。San Diego, CA:Academic Press。 | 圖書論文1. | Angoff, William H.(1971)。Scales, norms, and equivalent scores。Educational measurement。Washington, DC:American Council on Education。 | 2. | Cizek, Gregory J.(2006)。Standard setting。Handbook of test development。Mahwah, NJ:Lawrence Erlbaum Associates。 | 3. | Hambleton, R. K.、Pitoniak, M. J.(2006)。Setting performance standards。Educational measurement。Westport, CT:American Council on Education。 | 4. | Kane, M.(2001)。So much remains the same: Conception and status of validation in setting standards。Standard setting: Concepts, methods, and perspectives。Mahwah, NJ:Lawrence Erlbaum Associates。 | 5. | Linacre, J. M.、Wright, B. D.(2004)。Construction of measures from many-facet data。Introduction to Rasch measurement: Theory, models and applications。Maple Grove, MN:JAM Press。 | 6. | Loomis, S. C.、Bourque, M. L.(2001)。From tradition to innovation: Standard setting on the National Assessment of Educational Progress。Standard setting: Concepts, methods, and perspectives。Mahwah, NJ:Lawrence Erlbaum Associates。 | 7. | Sireci, S. G.(2001)。Standard setting using cluster analysis。Setting performance standards: Concepts, methods, and perspectives。Mahwah, NJ:Lawrence Erlbaum Associates。 | 8. | Myford, C. M.、Wolfe, E. W.(2004)。Detecting and measuring rater effects using many-facet Rasch measurement。Introduction to Rasch measurement: Theory, models and applications。Maple Grove, MN:JAM Press。 | |