:::

詳目顯示

回上一頁
題名:線上題庫等化連結方式之比較
書刊名:花蓮師院學報
作者:陳新豐 引用關係
作者(外文):Chen, Shin-feng
出版日期:2003
卷期:17(教育類)
頁次:頁153-191
主題關鍵詞:題庫等化電腦化適性測驗Item bankEquatingComputerized adaptive testing
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(0) 博士論文(1) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:0
  • 共同引用共同引用:0
  • 點閱點閱:26
     「測驗與評量」一直是教學過程中不可或缺的一環,它們可以反應出學生在學習過程中了解知識的程度,以供教學者掌握學生的學習狀況。另從資訊科技的發展來看,多媒體與網際網路使測驗打破時空的限制並具資源分享的功能。同時,題庫系統有助於使測驗的品質更為理想,但如何結合試題反應理論與題庫系統的建置,使其適用範圍更為廣泛,是本研究的主要研究動機。基於上述研究動機,本研究旨在結合試題反應理論、題庫等化,探討線上題庫等化連結策略之優劣,以提供建置電腦化線上適性測驗系統之基礎條件。具體而言,本研究的主要目的為連結不同時間點所收集的線上題庫,進行題庫等化連結,並比較其連結效益。 以下為本研究主要所獲得的結論: 1.題庫等化轉換常數方法以Mean/Mean和Haebara等方法較佳 在本研究進行等化連結時,共進行等化常數轉換方法中,線上轉換常數方法中平均數/平均數(Mean/Mean)及平均數和標準差法(Mean/Sigma);以及非線性轉換常數方法中特徵曲線法(Stocking-Lord及Haebara)等四種方法。研究結果顯示:就鑑別度而言,在偏差、均差及均方差方面,以Mean/Mean的方法較佳,其餘三種方法的連結效果相近;就難度而言,在偏差、均差及均方差方面,以Haebara的方法較佳,有7次是差異最小。整體而言,以Mean/Mean和Haebara的連結效果較佳。綜言之,在四種等化連結的方法中,以Mean/Mean和Haebara兩種方法做為題庫等化連結的方法,其效果較為理想,可提供後續探討題庫等化的研究參考。 2.線上測驗與紙筆測驗的試題訊息量相近,但難度偏高 本研究線上測驗的甲乙丙三式題目與九十學年度國中基本學力測驗的結果相較,發現:乙丙兩份試題的訊息曲線形狀與國中基本學力測驗相似,表示兩類測驗所提供的訊息量相近,並無太大的差別。有明顯分別的是,甲乙丙三式的題目訊息曲線都較基本學力測驗右偏,顯示學生在回答線上測驗有較困難的情形,此種現象應是線上題目施測時,答題者無法在電腦上直接作答,還需要將幾何圖形畫到紙本上,在紙本上演算後,再回答線上試題,故就步驟而言,造成學生加倍的負擔,導致訊息曲線呈現右偏的情況。 3.線上測驗連結效益良好 本研究在連結方面發現,線上測驗所收集到的反應資料有四分之三以上皆具有連結的效果,亦即「線上題庫與適性測驗整合系統」所收集到的三式反應資料的試題,對於國中基本學力的測驗題庫具有實質上的效果。
     “Tests and Evaluation” have occupied a vital corner in education field. In terms of the tests and evaluation, students' learning and knowledge understanding can be reflected and the results can be provided to educators who, therefore, can control students' learning condition. Besides, in the perspective of the development in information technology, thanks to the multimedia and internet which have broken down the confinement of time and space and offered a variety of shared resource, the item bank systems, thus, can facilitate test quality. Therefore, how to connect item response theory with the construction of item bank systems and then how to widen up the application range should deserve further study. Consequently, in terms of the combination of item response theory and item bank equation, the purpose of the study is to discuss the strategic advantages of on-line item bank equation, and to provide basic constructive condition for the computerized system of on-line adaptive testing. Concretely speaking, the study aims at linking item banks collected from different periods of time, equating these linked item banks, and then, comparing the linking performances. The conclusions of this study can be summarized as follow: 1. Concerning the constant transformation of item bank equation, Mean/Mean Methods and Haebara Methods are better options. During the equating and linking, four methods are applied in this study: from linear constant transformation methods, Mean/Mean and Mean/Sigma Methods are presented; from the non-linear methods, two kinds of Characteristic Curve Methods are presented and they are Methods of Stocking-Lord and Haebara. The study shows: concerning discrimination, Mean/Mean Methods display better result in deviation, average deviation and mean square deviation while linking results of the other three methods are similar to each other. Concerning difficulties, Haebara Methods achieve better performance--seven of the results perform the minimal deviation. Generally speaking, Mean/Mean Methods and Haebara Methods have better linking performance. To sum up, among the four equating and linking methods, Mean/Mean and Haebara Methods perform better in the linking of item bank equation. This can be a general reference to those who are interested in item bank equation research. 2. Item information between on-line tests and paper-pencil tests is similar but indicates more difficulties. Three types of on-line test items, Item A, B, and C, and one test item from the Basic Competency Test (BCT) for junior high students in 1991 are employed in this study. The findings are:Information curves of Item B and C are similar to that of the BCT item for junior high students. These items all provide similar test information and therefore, no significant differences can be found. However, information curves of Item A, B, and C are right skewed while that of BCT item for junior high students is not. The result suggests that students have difficulties in on-line tests. The possible explanation is while doing the on-line items, only after the students have drawn the geometric diagram and finished the calculation on their paper can they proceed questions on computers. That is, students are not able to start their on-line tests only by computer. In consequence, the complicate solution process doubles students' burden. This is why the information curve of on-line item is right skewed. 3. Linking performance of on-line test is good The study displays that three-fourths of the responded data collected from on-line tests show linking performance. That is, three responded tests collected from the integrated internet system of on-line item bank and computerized adaptive testing prove to have practical impact on the BCT item banks for junior high school students.
期刊論文
1.Haebara, T.(1980)。Equating logistic ability scales by a weighted least squares method。Japanese Psychological Research,22(3),144-149。  new window
2.Klein, L. W.、Jarjoura, D.(1985)。The importance of content representation for common-item equating with non-random groups。Journal of Educational Measurement,22,197-206。  new window
3.Cook, L. L.、Eignor, D. R.(1991)。An NCME instructional module on IRT equating methods。Educational Measurement: Issues and Practice,10(3),37-45。  new window
4.余民寧(19930600)。試題反應理論的介紹(10)--測驗分數的等化。研習資訊,10(3),11-16。  延伸查詢new window
5.Vale, C. D.(1986)。Linking item parameters onto a common scale。Applied Psychological Measurement,10(4),333-344。  new window
6.Millman, J.、Arter, J. A.(1984)。Issue in Banking。Journal of Educational measurement,21(4),315-330。  new window
7.何榮桂(1994)。電腦化題庫概述。測驗與輔導,126,2576-2577。  延伸查詢new window
8.Ager, T.(1993)。Online placement testing in mathematics and chemistry。Journal and Computer-Based Instruction,20(2),52-57。  new window
9.Van der Linden, W. J.、Veldkamp, B. P.、Reese, L. M.(2000)。An integer programming approach to item pool design。Applied Psychological Measurement,24(2),139–150。  new window
10.Stocking, M. L.、Lord, F. M.(1983)。Developing a common metric in item response theory。Applied Psychological Measurement,7(2),201-210。  new window
11.余民寧(19930400)。試題反應理論的介紹(9)--測驗分數的等化。研習資訊,10(2),6-11。  延伸查詢new window
會議論文
1.周倩(1998)。電腦網路輔助測驗與評量:發展趨勢與研究方向。第十四屆科學教育學術研討會,15-24。  延伸查詢new window
2.何榮桂、蘇建誠(1997)。遠距適性態度測驗系統設計。第六屆國際電腦輔助教學研討會。臺北:銘傳管理學院。175-185。  延伸查詢new window
3.孫光天、陳新豐、吳鐵雄(1998)。線上適性測驗回饋對作答情緒與動機影響之研究。第七屆國際電腦輔助教學研討會,(會議日期: 3月19-21日)。臺北:高師大。9-14。  延伸查詢new window
4.Reckase, M. D.(1981)。Tailored Testing, Measurement Problems and Latent Trait Theory。The annual meeting of the National Council on Measurement in Education。Los Angeles。  new window
5.陳新豐、吳鐵雄(1999)。線上適性測驗系統之研發。教育與心理測驗學術研討會。  延伸查詢new window
6.Cook, L. L.、Eignor, D. R.(1981)。Score equating and item response theory: Some practical considerations。The annual meeting of the American Educational Research Association and the National Conference on Measurement in Education。Los Angeles。  new window
7.Cook, L. L.、Eignor, D. R.(1983)。An Investigation o f the feasibility of applying item response theory to equate achievement tests。The annual meeting of the American Educational Research Association。Montreal。  new window
8.Skaggs, Gary、Lissitz, Robert W.(1982)。IRT Test Equating: Relevant Issues and a Review of Recent Research。The Annual Meeting of the American Educational Research Association。Los Angeles。  new window
研究報告
1.Mazzeo, J.、Harvey, A. L.(1988)。The equivalence of scores from automated and conventional educational and psychological tests: A Review of the literature。New York:College Entrance Examination Board。  new window
2.交通部統計處(2001)。台灣地區民眾使用網際網路狀況調查報告。  延伸查詢new window
3.Stocking, M. L.(1994)。Three practical issues for modern adaptive testing item pools。Princeton, NJ:Educational Testing Service。  new window
4.洪碧霞、吳裕益、陳英豪、黃淑津、蕭淳元、徐綺穗、丁振豐(1991)。題目IRT參數量尺化系列研究 (計畫編號:NSC80-0301-H-024-01)。  延伸查詢new window
5.Mckinley, R. L.、Reckase, M. D.(1981)。A comparison o f procedures for constructing large item pools (計畫編號:81-3)。Columbia, MO:University of Missouri, Department of Educational Psychology。  new window
學位論文
1.陳麗如(1998)。電腦化適性測驗題庫之品質管理策略(碩士論文)。國立師範大學。  延伸查詢new window
2.陳新豐(1999)。多媒體線上適性測驗系統發展及其相關研究(碩士論文)。臺南師範學院。  延伸查詢new window
3.李盛祖(1997)。國小數學乘法系列診斷測驗題庫的建立與應用研究(碩士論文)。國立臺灣師範大學。  延伸查詢new window
4.施叡凝(2000)。網際網路上的智慧型考試系統(碩士論文)。國立東華大學。  延伸查詢new window
5.惠志堅(1997)。生活科技教師網路教學諮詢系統發展研究(碩士論文)。國立高雄師範大學。  延伸查詢new window
6.蔡福興(1999)。國中生活科技教學活動網路資源系統發展研究(碩士論文)。國立臺灣師範大學。  延伸查詢new window
7.賴信仁(1997)。題目參數校準研究(碩士論文)。國立臺灣師範大學。  延伸查詢new window
圖書
1.Zimowski, M. F.、Muraki, E.、Mislevy, R. J.、Bock, R. D.(1996)。BILOG-MG: Multiple-group IRT analysis and test maintenance for binary items。Chicago, IL:Scientific Software International, Inc.。  new window
2.Sands, William A.、Waters, Brian K.、McBride, James R.(1997)。Computerized adaptive testing: from inquiry to operation。American Psychological Association。  new window
3.Hambleton, R. K.、Swaminathan, H.、Rogers, H. J.(1991)。Fundamentals of item response theory。Newbury Park, California:Sage Publications。  new window
4.Baker, Frank B.(1992)。Item Response Theory: Parameter Estimation Techniques。New York:Marcel Dekker, Inc.。  new window
5.Lord, Frederic M.(1980)。Applications of Item Response Theory to Practical Testing Problems。Lawrence Erlbaum Associates, Inc.。  new window
6.Hambleton, Ronald K.、Swaminathan, H.(1985)。Item Response Theory: Principles and Applications。Boston:Kiuwer Nijhoff Publishing。  new window
7.Wainer, Howard、Dorans, Neil J.(2000)。Computerized adaptive testing: A primer。Mahwah, N. J.:Lawrence Erlbaum Associates。  new window
8.Kolen, M. J.、Brennan, R. J.(1995)。Test Equating: Methods and Practices。New York:Springer-Verlag。  new window
單篇論文
1.Vale, C. D.,Maurelli, V. A.,Gialluca, K. A.,Weiss, D. J.,Ree, M. J.(1981)。Methods of linking item parameter: Final Report,St. Paul, Minn:Assessment Systems Corp。(ED 210314)。  new window
圖書論文
1.Flaugher, R.(2000)。Item pools。Computerized adaptive testing: A primer。Lawrence Erlbaum Associates。  new window
2.Angoff, William H.(1971)。Scales, norms, and equivalent scores。Educational measurement。Washington, DC:American Council on Education。  new window
3.Angoff, W. H.(1982)。Summary and derivation of equating methods used at ETS。Test equating。New York:Academic Press。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
QR Code
QRCODE