:::

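Title: DIF成因之初探:試題特徵與差異試題功能之關聯 [A preliminary exploration of DIF sources: The relationship between item properties and differential item functioning]
Journal: 教育心理學報 (Bulletin of Educational Psychology)
Authors: 孫國瑋、陳承德、施慶麟
Authors (romanized): Sun, Guo-wei; Chen, Cheng-te; Shih, Ching-lin
Publication date: 2018
Volume and issue: 50(2)
Pages: 167-188
Keywords: DIF source (DIF成因); differential item functioning (差異試題功能); differential facet functioning (差異層面功能); linear logistic test model (線性邏輯斯測驗模式); random effects linear logistic test model (隨機效果線性邏輯斯測驗模式)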
Citation metrics:
  • Times cited: journal articles (1), doctoral dissertations (0), books (0), book chapters (0)
  • Excluding self-citations: 1
  • Co-citations: 30
  • Views: 3
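In recent years, research on differential item functioning (DIF) has shifted from detecting DIF to explaining it. Explanations of DIF items have traditionally relied on qualitative review by experts. Quantitative evidence that supplements expert review can help in judging the sources of DIF. This study analyzed the properties of DIF items to identify the relationship between item properties and DIF, as a reference for experts judging DIF sources in subsequent reviews. To this end, the study used the linear logistic test model (LLTM) and the random effects linear logistic test model (LLTM-R) to test each item property in a test for differential facet functioning (DFF), thereby describing the relationship between item properties and DIF. The simulation results showed that an item's degree of DIF was affected by the DFF effects of its item properties. In addition, when the Q-matrix of a test had a high density (e.g., 60%), a high proportion of items could be flagged as DIF because of inflated Type I error. The study also used empirical data to illustrate how to conduct a DFF analysis of items in order to identify the item properties related to DIF and to guide subsequent item revision. Based on the results, the study recommends using LLTM-R for DFF detection, which can help clarify the relationship between item properties and DIF.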
Because methods for assessing differential item functioning (DIF) have been developed and thoroughly investigated, the focus of DIF research has shifted to explaining DIF phenomena. Experts in the field are typically recruited to identify possible sources of DIF, and quantitative analysis results can help them locate the sources for DIF items. This study aimed to demonstrate the use of the differential facet functioning (DFF) procedure, implemented with the linear logistic test model (LLTM) and the random effects linear logistic test model (LLTM-R), to explain possible DIF sources. The efficiency of LLTM and LLTM-R in detecting DFF under various conditions was also evaluated. The simulation results indicated that the DIF effect was significantly influenced by the DFF effect of item properties. Moreover, when the design matrices had a high density (e.g., 60%), Type I error rates of DIF assessment were seriously inflated. We also demonstrated the DFF analysis procedure with an empirical data set. The results showed that most DIF items were related to two item properties, which would be provided as possible DIF sources in the item-review meeting. We recommend that researchers implement DFF assessment using LLTM-R to help explain DIF sources.
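As a rough illustration of the link the abstract describes between DFF and DIF, the following Python sketch assumes the standard LLTM decomposition, in which an item's difficulty is the sum of the effects of the properties flagged for it in the Q-matrix, and treats DFF as a group-specific shift in a property's effect. The Q-matrix, the effect values, and all function names are hypothetical and are not taken from the article. LLTM-R would add a random item residual to each difficulty, which is omitted here.

```python
import math

# Hypothetical Q-matrix: rows are items, columns are item properties
# (1 = the item has the property). Not taken from the article.
Q = [
    [1, 0, 0],
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
    [1, 0, 1],
]

# Assumed property effects on difficulty for the reference group.
eta = [0.5, -0.3, 0.8]

# Assumed DFF effects: only property 2 is harder for the focal group.
gamma = [0.0, 0.6, 0.0]


def lltm_difficulty(q_row, weights):
    """LLTM: item difficulty is the weighted sum of its property effects."""
    return sum(q * w for q, w in zip(q_row, weights))


def rasch_prob(theta, beta):
    """Rasch probability of a correct response for ability theta."""
    return 1.0 / (1.0 + math.exp(-(theta - beta)))


# Item difficulties for the reference group.
beta_ref = [lltm_difficulty(row, eta) for row in Q]

# Item-level DIF implied by the property-level DFF effects.
dif = [lltm_difficulty(row, gamma) for row in Q]

# Item difficulties for the focal group.
beta_focal = [b + d for b, d in zip(beta_ref, dif)]

for i, (b_ref, b_foc, d) in enumerate(zip(beta_ref, beta_focal, dif), start=1):
    print(f"item {i}: reference={b_ref:.2f} focal={b_foc:.2f} DIF={d:.2f}")
```

Under these assumptions, only the items that carry property 2 show DIF, which is the pattern a DFF analysis is meant to trace back to an item property.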
Journal articles
1. Xie, Y., & Wilson, M. (2008). Investigating DIF and extensions using an LLTM approach and also an individual difference approach: An international testing context. Psychology Science Quarterly, 50(3), 403-416.
2. Green, K. E., & Smith, R. M. (1987). A comparison of two methods of decomposing item difficulties. Journal of Educational Statistics, 12(4), 369-381.
3. Shealy, R. T., & Stout, W. F. (1993). A model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DIF as well as item bias/DIF. Psychometrika, 58(2), 159-194.
4. Douglas, J. A., Roussos, L. A., & Stout, W. (1996). Item-bundle DIF hypothesis testing: Identifying suspect bundles and assessing their differential functioning. Journal of Educational Measurement, 33(4), 465-484.
5. Roussos, L., & Stout, W. (1996). A multidimensionality based DIF analysis paradigm. Applied Psychological Measurement, 20(4), 355-371.
6. De Boeck, P. (2008). Random item IRT models. Psychometrika, 73(4), 533-559.
7. Gierl, M. J., Bisanz, J., Bisanz, G. L., Boughton, K. A., & Khaliq, S. N. (2001). Illustrating the utility of differential bundle functioning analyses to identify and interpret group differences on achievement tests. Educational Measurement: Issues and Practice, 20, 26-36.
8. Gierl, M. J., Bisanz, J., Bisanz, G. L., & Boughton, K. A. (2003). Identifying content and cognitive skills that produce gender differences in mathematics: A demonstration of the DIF analysis paradigm. Journal of Educational Measurement, 40(4), 281-306.
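9. 曾明基、邱皓政 (2015, March). 研究生評鑑教師教學的結果真的可以與大學生一起比較嗎?多群組混合MIMIC-DIF分析 [Can graduate students' evaluations of teaching really be compared with those of undergraduates? A multigroup mixture MIMIC-DIF analysis]. 測驗學刊, 62(1), 1-23.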
10. Fischer, G. H. (1973). The linear logistic test model as an instrument in educational research. Acta Psychologica, 37(6), 359-374.
11. Gierl, M. J., & Khaliq, S. N. (2001). Identifying sources of differential item and bundle functioning on translated achievement tests: A confirmatory analysis. Journal of Educational Measurement, 38(2), 164-187.
12. Engelhard, G., Jr. (1992). The measurement of writing ability with a many-faceted Rasch model. Applied Measurement in Education, 5, 171-191.
13. Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6(2), 461-464.
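14. 蕭偉智、傅家珍 (2012, December). 國中八年級自然科定期評量之性別差別試題功能(DIF)分析 [Gender differential item functioning (DIF) analysis of eighth-grade junior high school science periodic assessments]. 新竹教育大學教育學報, 29(2), 35-64.
15. 賴姿伶、余民寧 (2015, December). 應徵者與在職者在多分題人格測驗的作答差異之研究:試題層次與試題組合層次的分析 [A study of response differences between job applicants and incumbents on a polytomous personality test: Item-level and item-bundle-level analyses]. 人力資源管理學報, 15(4), 91-120.
16. 林月仙 (2013, June). 中文色塊測驗認知成分分析:LLTM與SEM取向 [Cognitive component analysis of the Chinese Token Test: LLTM and SEM approaches]. 教育與心理研究, 36(2), 113-144.
17. 侯雅齡 (2013, June). 高級中學自然科學術性向測驗編製 [Development of a science academic aptitude test for senior high schools]. 科學教育學刊, 21(2), 189-213.
18. 黃宏宇、洪素蘋 (2009, September). 建構效度檢驗之線性與非線性取向:以學生創意自我效能量表為例 [Linear and nonlinear approaches to examining construct validity: The Student Creative Self-Efficacy Scale as an example]. 屏東教育大學學報. 教育類, 33, 489-513.
19. 廖彥棻 (2015, September). 英文學科能力測驗選擇題之性別差異與差異試題功能分析 [Gender differences and differential item functioning analysis of multiple-choice items on the English General Scholastic Ability Test]. 東吳外語學報, 41, 21-59.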
20. Baker, F. B. (1993). Sensitivity of the linear logistic test model to misspecification of the weight matrix. Applied Psychological Measurement, 17, 201-210.
21. Bates, D., Maechler, M., Bolker, B., & Walker, S. (2014). lme4: Linear mixed-effects models using Eigen and S4. R Package Version, 1(7).
22. Beretvas, S. N., Cawthon, S. W., Lockhart, L. L., & Kaye, A. D. (2012). Assessing impact, DIF, and DFF in accommodated item scores: A comparison of multilevel measurement model parameterizations. Educational and Psychological Measurement, 72(5), 754-773.
23. Choi, I. H., & Wilson, M. (2015). Multidimensional classification of examinees using the mixture random weights linear logistic test model. Educational and Psychological Measurement, 75(1), 78-101.
24. Ercikan, K. (2002). Disentangling sources of differential item functioning in multilanguage assessments. International Journal of Testing, 2(3/4), 199-215.
25. Ercikan, K., Arim, R. G., Law, D. M., Lacroix, S., Gagnon, F., & Domene, J. F. (2010). Application of think-aloud protocols in examining sources of differential item functioning. Educational Measurement: Issues and Practice, 29(2), 24-35.
26. Gierl, M. J., & Bolt, D. M. (2001). Illustrating the use of nonparametric regression to assess differential item and bundle functioning among multiple groups. International Journal of Testing, 1(3/4), 249-270.
27. Jin, K. Y., & Wang, W. C. (2017). Assessment of differential rater functioning in latent classes with new mixture facets models. Multivariate Behavioral Research, 52(3), 391-402.
28. Magis, D., Beland, S., Tuerlinckx, F., & De Boeck, P. (2010). A general framework and an R package for the detection of dichotomous differential item functioning. Behavior Research Methods, 42, 847-862.
29. Oliveri, M. E., & Ercikan, K. (2011). Do different approaches to examining construct comparability lead to similar conclusions? Applied Measurement in Education, 24, 1-18.
30. Sinharay, S., Dorans, N. J., Grant, M. C., & Blew, E. O. (2009). Using past data to enhance small sample DIF estimation: A Bayesian approach. Journal of Educational and Behavioral Statistics, 34, 74-96.
31. Van den Noortgate, W., & De Boeck, P. (2005). Assessing and explaining differential item functioning using logistic mixed models. Journal of Educational and Behavioral Statistics, 30, 443-464.
32. Zumbo, B. D. (2007). Three generations of DIF analyses: Considering where it has been, where it is now, and where it is going. Language Assessment Quarterly: An International Journal, 4, 223-233.
33. Zumbo, B. D., Liu, Y., Wu, A. D., Shear, B. R., Olvera Astivia, O. L., & Ark, T. K. (2015). A methodology for Zumbo's third generation DIF analyses and the ecology of item responding. Language Assessment Quarterly, 12(1), 136-151.
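34. 張銘秋、謝秀月、徐秋月 (2010, January). PISA科學素養之試題認知成份分析 [Cognitive component analysis of PISA scientific literacy items]. 課程與教學, 13(1), 1-20.
35. 王佳琪、何曉琪、鄭英耀 (2014, September). 「科學創造性問題解決測驗」之發展 [Development of the "Scientific Creative Problem Solving Test"]. 測驗學刊, 61(3), 337-360.
36. 蘇旭琳、陳柏熹 (2008, December). DIF分析在小樣本情境中的偵測效果--以視障生和普通生在國中基測數學科之DIF為例 [Detection performance of DIF analysis in small-sample conditions: DIF between visually impaired and regular students on the mathematics section of the Basic Competence Test for Junior High School Students]. 測驗學刊, 55(4), 761-791.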
37. Mazor, K. M., Clauser, B. E., & Hambleton, R. K. (1992). The effect of sample size on the functioning of the Mantel-Haenszel statistic. Educational and Psychological Measurement, 52(2), 443-451.
38. Mendes-Barnett, S., & Ercikan, K. (2006). Examining sources of gender DIF in mathematics assessments using a confirmatory multidimensional model approach. Applied Measurement in Education, 19(4), 289-304.
Conference papers
1. Bolt, D. (2002). Studying the potential of nuisance dimensions using bundle DIF and multidimensional IRT analyses. The annual meeting of the National Council on Measurement in Education, New Orleans, LA.
Books
1. De Boeck, P., & Wilson, M. (2004). Explanatory item response models: A generalized linear and nonlinear approach. New York: Springer.
2. Kaplan, D. (2009). Structural equation modeling: Foundations and extensions. Sage.
3. Linacre, J. M. (1989). Many-facet Rasch measurement. Chicago, IL: MESA Press.
4. Linacre, J. M. (2017). Winsteps® Rasch measurement computer program. Beaverton, Oregon: Winsteps.com.
5. R Core Team (2015). R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing.
6. Rasbash, J., Charlton, C., Browne, W. J., Healy, M., & Cameron, B. (2009). MLwiN. Centre for Multilevel Modelling, University of Bristol.
7. Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: The Danish Institute for Educational Research.
8. Sakamoto, Y., Ishiguro, M., & Kitagawa, G. (1986). Akaike information criterion statistics. Dordrecht: D. Reidel.
9. Spiegelhalter, D. J., Thomas, A., Best, N. G., & Lunn, D. (2003). WinBUGS version 1.4 users manual. Cambridge: MRC Biostatistics Unit.
10. Wu, M. L., Adams, R. J., & Wilson, M. (1998). ACER ConQuest: Generalized item response modelling software manual. Melbourne, Victoria: The Australian Council for Educational Research Ltd.
11. Zumbo, B. D. (1999). A handbook on the theory and methods of differential item functioning (DIF). Ottawa, Ontario: Directorate of Human Resources Research and Evaluation, Department of National Defense.
Other
1. Drabinová, A., & Martinková, P. (2016). Detection of differential item functioning with non-linear regression: Non-IRT approach accounting for guessing. http://hdl.handle.net/11104/0259498
Book chapters
1. Janssen, R., Schepers, J., & Peres, D. (2004). Models with item and item group predictors. In Explanatory item response models: A generalized linear and nonlinear approach. New York: Springer-Verlag.
2. Angoff, W. H. (1993). Perspectives on differential item functioning methodology. In Differential item functioning. Hillsdale, NJ: Lawrence Erlbaum.
3. Holland, P. W., & Thayer, D. T. (1988). Differential item performance and the Mantel-Haenszel procedure. In Test validity. Hillsdale, NJ: Erlbaum.
4. Janssen, R. (2010). Modeling the effect of item designs within the Rasch model. In Measuring psychological constructs: Advances in model-based approaches. Washington, DC: American Psychological Association.
5. Meulders, M., & Xie, Y. (2004). Person-by-item predictors. In Explanatory item response models. New York: Springer.
6. Wilson, M., & De Boeck, P. (2004). Descriptive and explanatory item response models. In Explanatory item response models: A generalized linear and nonlinear approach. New York, NY: Springer-Verlag.
 
 
 
 
:::