:::

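Title: Differential Item Functioning Analysis Using the Likelihood-Ratio Method under the Rasch Model (Rasch模式概率比法的差異試題功能分析)
Journal: Chinese Journal of Psychology (中華心理學刊)
Authors: Wang, Wen-chung (王文中); Chang, Chihhung (張智宏)
Publication date: 1998
Volume/issue: 40:1
Pages: 15-32
Keywords: differential item functioning; Rasch model; likelihood-ratio test; item response theory; item bias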
Citation metrics:
  • Times cited: journal articles (1), doctoral dissertations (1), books (0), book chapters (0)
  • Excluding self-citations: 0
  • Co-citations: 20
  • Views: 244
     Detecting differential item functioning (DIF) has become a major issue in test development. DIF analyses based on item response theory usually compare item parameter estimates between two groups. This approach presupposes accurate estimates of the covariance matrix, which are in practice extremely difficult to obtain. It also does not estimate the magnitude of DIF directly, which makes it hard to evaluate DIF item by item. In this paper, we extend the approach proposed by Thissen, Steinberg, and Gerrard (1986) and estimate DIF parameters directly. This is made possible by the multidimensional random coefficients multinomial logit model (Adams, Wilson, & Wang, 1997). Results of a simulation study show that all parameters, including the DIF parameters, were recovered very well. A real data set, the verbal subscale of an aptitude test, was analyzed in three ways: item parameter difference, the DIF parameter z test, and the likelihood-ratio test. The likelihood-ratio approach gave the best results on both theoretical and practical grounds.
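The two DIF-parameter tests named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the model has already been fitted elsewhere, that a single DIF parameter is tested for one item (1 degree of freedom), and that the fitted log-likelihoods, DIF estimate, and standard error are supplied by the reader. The function names and the numbers in the usage lines are hypothetical.

```python
import math


def dif_z_test(dif_estimate, standard_error):
    """Two-sided z test of H0: the DIF parameter equals zero."""
    z = dif_estimate / standard_error
    # Two-sided normal tail probability: P(|Z| > |z|) = erfc(|z| / sqrt(2))
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_value


def dif_likelihood_ratio_test(loglik_no_dif, loglik_with_dif):
    """Likelihood-ratio test of a model with one extra DIF parameter (1 df).

    loglik_no_dif:   maximized log-likelihood with the DIF parameter fixed at 0
    loglik_with_dif: maximized log-likelihood with the DIF parameter estimated
    """
    g2 = max(2.0 * (loglik_with_dif - loglik_no_dif), 0.0)
    # Chi-square(1) tail probability: P(X > g2) = erfc(sqrt(g2 / 2))
    p_value = math.erfc(math.sqrt(g2 / 2.0))
    return g2, p_value


# Hypothetical values for illustration only (not taken from the paper).
z, p_z = dif_z_test(dif_estimate=0.30, standard_error=0.12)
g2, p_lr = dif_likelihood_ratio_test(loglik_no_dif=-1000.0, loglik_with_dif=-998.0)
print(f"z = {z:.2f}, p = {p_z:.4f}")
print(f"G2 = {g2:.2f}, p = {p_lr:.4f}")
```

The closed-form tail probability used here holds only for 1 degree of freedom; testing several DIF parameters at once would need a general chi-square distribution function.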
Journal articles
1. Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174.
2. Bock, R. D., & Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35, 179-197.
3. Lord, F. M. (1953). The Relation of Test Score to the Trait Underlying the Test. Educational and Psychological Measurement, 13, 517-548.
4. Wang, Wen-chung (1996, December). Some controversies concerning the Rasch measurement model [in Chinese]. Journal of Education & Psychology (教育與心理研究), 19, 1-25.
5. Dorans, N. J., & Kulick, E. (1986). Demonstrating the utility of the standardization approach to assessing unexpected differential item performance on the SAT. Journal of Educational Measurement, 23, 355-368.
6. Adams, Raymond J., Wilson, Mark R., & Wang, Wen-chung (1997). The multidimensional random coefficients multinomial logit model. Applied Psychological Measurement, 21(1), 1-23.
7. Mantel, N., & Haenszel, W. (1959). Statistical Aspects of the Analysis of Data from Retrospective Studies of Disease. Journal of the National Cancer Institute, 22(4), 719-748.
8. Adams, R. J., Wilson, M. R., & Wu, M. L. (1997). Multilevel item response modeling: An approach to errors in variable regression. Journal of Educational and Behavioral Statistics, 22, 47-76.
9. Angoff, W. H., & Sharon, A. T. (1974). The evaluation of differences in test performance of two or more groups. Educational and Psychological Measurement, 34, 807-816.
10. Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: An application of an EM algorithm. Psychometrika, 46, 443-459.
11. Cleary, T. A., & Hilton, T. J. (1968). An investigation of item bias. Educational and Psychological Measurement, 5, 115-124.
12. McLaughlin, M. E., & Drasgow, F. (1987). Lord's chi-square test of item bias with estimated and with known person parameters. Applied Psychological Measurement, 11, 161-173.
13. Neyman, J., & Pearson, E. S. (1928). On the use and interpretation of certain test criteria for purposes of statistical inference. Biometrika, 20A(1/2), 175-240.
14. Neyman, J., & Pearson, E. S. (1928). On the use and interpretation of certain test criteria for purposes of statistical inference, Part II. Biometrika, 20A, 263-294.
15. Scheuneman, J. D. (1979). A method of assessing bias in test items. Journal of Educational Measurement, 16, 143-152.
16. Shepard, L. A., Camilli, G., & Averill, M. (1981). Comparison of procedures for detecting test-item bias with both internal and external ability criteria. Journal of Educational Statistics, 6, 317-375.
17. Thissen, D., Steinberg, L., & Gerrard, M. (1986). Beyond group mean differences: The concept of item bias. Psychological Bulletin, 99, 118-128.
18. Wald, A. (1943). Tests of statistical hypotheses concerning several parameters when the number of observations is large. Transactions of the American Mathematical Society, 54, 426-482.
19. Wilson, M. R., & Wang, Wen-chung (1995). Complex composites: Issues that arise in combining different modes of assessment. Applied Psychological Measurement, 19, 51-72.
20. Zwick, R., Thayer, D. T., & Wingersky, M. (1994). A simulation study of methods for assessing differential item functioning in computerized adaptive tests. Applied Psychological Measurement, 18, 121-140.
Conference papers
1. Angoff, W. H. (1972). A technique for the investigation of cultural differences. Honolulu, HI.
2. Wang, Wen-chung (1997). Estimating rater severity with multilevel and multidimensional item response modeling. Chicago, IL.
3. Wang, Wen-chung (1998). An ANOVA-like Rasch analysis of differential item functioning. San Diego.
Theses and dissertations
1. Wang, Wen-chung (1994). Implementation and application of the multidimensional random coefficients multinomial logit model (Doctoral dissertation). University of California, Berkeley.
Books
1. Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee’s ability. In Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
2. Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: The Danish Institute of Educational Research.
3. Rao, C. R. (1973). Linear Statistical Inference and its Applications. Wiley Eastern Limited.
4. Wright, B. D., & Masters, G. N. (1982). Rating scale analysis: Rasch measurement. Chicago: MESA Press.
5. Wright, B. D., & Stone, M. H. (1979). Best Test Design: Rasch Measurement. Chicago, IL: MESA Press.
6. Lord, Frederic M. (1980). Applications of Item Response Theory to Practical Testing Problems. Lawrence Erlbaum Associates, Inc.
7. Cardall, C., & Coffman, W. E. (1964). A method for comparing the performance of different groups on the same items of a test. Research and Development Reports, 9. Princeton, NJ.
8. Wang, Wen-chung, Wilson, M. R., & Adams, R. J. (1997). Rasch Models for Multidimensionality between Items and within Items. In Objective Measurement: Theory into Practice, Vol. 4. Norwood, NJ.
9. Wright, D. J. (1987). An empirical comparison of the Mantel-Haenszel and standardization methods of detecting differential item performance. In Differential item functioning on the Scholastic Aptitude Test. Princeton, NJ.
10. Wu, M. L., Adams, R. J., & Wilson, M. R. (1998). ConQuest. Camberwell, Australia.
Book chapters
1. Adams, R. J., & Wilson, M. R. (1996). Formulating the Rasch model as a mixed coefficients multinomial logit. In Objective measurement: Theory into practice. Norwood, NJ: Ablex.
2. Holland, P. W., & Thayer, D. T. (1988). Differential Item Performance and the Mantel-Haenszel Procedure. In Test Validity. Hillsdale, NJ: Lawrence Erlbaum Associates, Inc.
3. Thissen, D., Steinberg, L., & Wainer, H. (1988). Use of item response theory in the study of group differences in trace lines. In Test Validity. Hillsdale, NJ: Lawrence Erlbaum.
4. Lord, F. M. (1977). A study of item bias, using item characteristic curve theory. In Basic problems in cross-cultural psychology. Amsterdam: Swets and Zeitlinger.
5. Wang, Wen-chung, & Wilson, M. R. (1996). Comparing multiple-choice items and performance-based items using item response modeling. In Objective measurement: Theory into practice. Norwood, NJ: Ablex.
 
 
 
 
:::