:::

詳目顯示

回上一頁
題名:使用驗證性補償多維IRT模型進行認知診斷評估
書刊名:心理學報
作者:詹沛達陳平邊玉芳
作者(外文):Zhan, Pei-da; ;Chen, PingBian, Yu-fang
出版日期:2016
卷期:2016(10)
頁次:1347-1356
主題關鍵詞:項目反應理論多維項目反應理論認知診斷模型認知診斷Q矩陣驗證性因素分析Item response theoryMultidimensional item response theoryCognitive diagnostic modelsCognitive diagnosisQ matrixConfirmatory factor analysis
原始連結:連回原系統網址new window
相關次數:
  • 被引用次數被引用次數:期刊(2) 博士論文(0) 專書(0) 專書論文(0)
  • 排除自我引用排除自我引用:2
  • 共同引用共同引用:0
  • 點閱點閱:9
隨著人們對測驗反饋結果精細化的需求逐漸提高,具有認知診斷功能的測量方法逐漸受到人們的關注。在認知診斷模型(CDMs)閃耀著光芒的同時,另一類能夠在連續量尺上提供精細反饋的多維IRT模型(MIRTMs)似乎受到些許冷落。為探究MIRTMs潛在的認知診斷功能,本文以補償模型為視角,聚焦于分別屬于MIRTMs的多維兩參數logistic模型(M2PLM)和屬于CDMs的線性logistic模型(LLM);之后為使兩者具有可比性,可對補償M2PLM引入驗證性矩陣(Q矩陣)來界定題目與維度之間的關系,進而得到驗證性的補償M2PLM(CC-M2PLM),并通過把潛在特質按切點劃分為跨界屬性,以期使CC-M2PLM展現出其本應具有的認知診斷功能;預研究表明logistic量尺上的0點可作為相對合理的切點;然后,通過模擬研究對比探究CC-M2PLM和LLM的認知診斷功能,結果表明CC-M2PLM可用于分析診斷測驗數據,且認知診斷功能與直接使用LLM的效果相當;最后,以兩則實證數據為例來說明CC-M2PLM在實際診斷測驗分析中的可行性。
Traditional testing methods, such as classical testing theory or unidimensional item response theory models(UIRMs), typically provide a single sum score or overall ability. Advances in psychometrics have focused on measuring multiple dimensions of ability to provide more detailed and refined feedback for students. In recent years, cognitive diagnostic models(CDMs) have received great attention, particularly in the areas of educational and psychological measurement. The outcome of a DCM analysis is a profile of a set of attributes, α, also called a latent class, for each person; this provides cognitive diagnostic information about distinct skills underlying a test that students mastery or non-mastery. During the same period, another kind of models, multidimensional IRT models(MIRTMs), which also can provide fine-grained information about students’ strengths and weaknesses in the learning process were neglected. MIRTMs are different from CDMs in that latent variables in MIRTMs are continuous(namely, latent traits; θ) rather than categorical(typically binary). However, categorical variables in CDMs may be too rough to describe students’ skills when compared with the continuous latent traits in MIRTMs. Diagnostic measurement is the process of analyzing data from a diagnostic assessment for the purpose of making classification-based decisions. Currently, all testing method that have cognitive diagnostic function require substantive information about the attributes involved in specific items. Especially for CDMs, a confirmatory matrix that indicating which latent variables are required for an item, often referred to as Q matrix, is a essential term to analysis response data. Actually, such confirmatory matrices also exist in some multidimensional IRT models(MIRTMs), such as the scoring matrix in multidimensional random coefficients multinomial logit model. Therefore, it can be deduced that when MIRTMs are formulated in a confirmatory model defined by Q matrix, may also have diagnostic potential. Although some articles have noticed that viewpoint(e.g., Embretson & Yang, 2013; Stout, 2007; Wang & Nydick, 2015), no one really explored the diagnostic potential of confirmatory MIRTMs(C-MIRTMs). The main reason can be deduced that latent traits in MIRTMs are continuous, which can not be directly used to make classification-based diagnostic decisions. No matter MIRTMs or CDMs, multidimensional models normally can be specified into compensatory and non-compensatory models due to the relationship among dimensions. In compensatory models, students with high level on one dimension can compensate for lower levels on the other dimensions. Conversely, non-compensatory models assume that every dimensions are independent or partially independent with each others. Comparatively speaking, compensatory models are more general than non-compensatory models. Thus, only two compensatory models were concerned in this study, multidimensional 2-parameter logistic model(M2PLM) and linear logistic model(LLM) respectively, due to space limited. To explore the cognitive diagnostic function of MIRTMs, a confirmatory compensatory M2PLM(CC-M2PLM) were presented by introducing Q matrix in the item response function of M2 PLM firstly. Then a cutoff point(CP) was used to transform estimated latent traits in CC-M2 PLM to categorical variables(namely, trans-border attributes). This transformation step can be done after data analysis, thus two kinds of analysis results can be reported simultaneously: continuous latent traits and categorical trans-border attributes. Therefore, a suitable CP is very important, because of different CP will lead to different classification results. A simple pilot study was done to found the suitable CP: a test created with the CC-M2 PLM but estimated with the LLM revealed that the LLM approximately divided the latent traits distribution in half, with a value of zero in IRT scale being the location of where masters(α = 1 if θ > 0) and non-masters(α = 0 if θ ≤ 0) were set. According to the result of pilot study, the CP was set equal to 0 for all dimensions(i.e., CPk = 0). Parameters in CC-M2 PLM and LLM can be estimated by the mirt and CDM packages in R respectively. In simulation study, a series of simulations were conducted to evaluate cognitive diagnostic function of CC-M2 PLM. The response data was generated by LLM, which can be treated as a diagnostic measurement dataset. CC-M2 PLM and LLM were all used to fit that dataset, and results showed that the pattern(profile) correct classification ratio(PCCR) and the attribute correct classification ratio(ACCR) of trans-border attributes(from CC-M2PLM) and estimated attributes(from LLM) are almost same, the extent of most differences are smaller than 1%. Results of simulation study indicated that CC-M2 PLM can be used to diagnostic measurement and its cognitive diagnostic function was as good as that of LLM. Finally, two empirical examples of diagnostic measurement were given to demonstrate applications and implications of the CC-M2 PLM.
期刊論文
1.Maris, E.(1999)。Estimating multiple classification latent class models。Psychometrika,64(2),187-212。  new window
2.Wang, C.、Chang, H.-H.、Huebner, A.(2011)。Restrictive stochastic item selection methods in cognitive diagnostic computerized adaptive testing。Journal of Educational Measurement,48,255-273。  new window
3.Bolt, D. M.、Lall, V. F.(2003)。Estimation of compensatory and noncompensatory multidimensional item response models using Markov chain Monte Carlo。Applied Psychological Measurement,27(6),395-414。  new window
4.de la Torre, Jimmy(2009)。DINA model and parameter estimation: A didactic。Journal of Educational and Behavioral Statistics,34(1),115-130。  new window
5.Junker, B. W.、Sijtsma, K.(2001)。Cognitive assessment models with few assumptions, and connections with nonparametric item response theory。Applied Psychological Measurement,25(3),258-272。  new window
6.de la Torre, J.(2011)。The generalized DINA model framework。Psychometrika,76(2),179-199。  new window
7.De la Torre, J.、Douglas, J. A.(2004)。Higher-order latent trait models for cognitive diagnosis。Psychometrika,69(3),333-353。  new window
8.丁樹良、楊淑群、汪文義(2010)。可達矩陣在認知診斷測驗編制中的重要作用。江西師範大學學報(自然科學版),34(5),490-494。  延伸查詢new window
9.Huang, H. Y.、Wang, W. C.(2014)。The random-effect DINA model。Journal of Educational Measurement,51,75-97。  new window
10.Li, X. M.、Wang, W.-C.(2015)。Assessment of differential item functioning under cognitive diagnosis models: The DINA model example。Journal of Educational Measurement,52(1),28-54。  new window
11.詹沛達、邊玉芳(2015)。概率性輸入,噪音"與"門(PINA)模型。心理科學,38,1230-1238。  延伸查詢new window
12.詹沛達、李曉敏、王文中、邊玉芳、王立君(2015)。多維題組效應認知診斷模型。心理學報,47(5),689-701。  延伸查詢new window
13.Henson, R. A.、Templin, J. L.、Willse, J. T.(2009)。Defining a family of cognitive diagnosis models using log-linear models with latent variables。Psychometrika,74(2),191-210。  new window
14.Chiu, C. Y.、Douglas, J. A.、Li, X. D.(2009)。Cluster analysis for cognitive diagnosis: Theory and applications。Psychometrika,74(4),633-665。  new window
15.Sun, J.、Xin, T.、Zhang, S. M.、de la Torre, J.(2013)。A polytomous extension of the generalized distance discriminating method。Applied Psychological Measurement,37(7),503-521。  new window
16.Leighton, J. P.、Gierl, M. J.、Hunka, S. M.(2004)。The attribute hierarchy method for cognitive assessment: A variation on Tatsuoka's rule-space approach。Journal of Educational Measurement,41(3),205-237。  new window
17.Tatsuoka, K. K.(1983)。Rule space: An approach for dealing with misconceptions based on item response theory。Journal of Educational Measurement,20(4),345-354。  new window
18.Bradshaw, L.、Templin, J.(2014)。Combining item response theory and diagnostic classification models: A psychometric model for scaling ability and diagnosing misconceptions。Psychometrika,79,403-425。  new window
19.Cai, L.(2010)。Metropolis-Hastings Robbins-Monro algorithm for confirmatory item factor analysis。Journal of Educational and Behavioral Statistics,35,307-335。  new window
20.Chen, P.、Xin, T.、Wang, C.、Chang, H.-H.(2012)。Online calibration methods for the DINA model with independent attributes in CD-CAT。Psychometrika,77(2),201-222。  new window
21.Embretson, S. E.、Yang, X.(2013)。A multicomponent latent trait model for diagnosis。Psychometrika,78(1),14-36。  new window
22.Stout, W.(2007)。Skills diagnosis using IRT-based continuous latent trait models。Journal of Educational Measurement,44,313-324。  new window
23.Tatsuoka, C.(2002)。Data analytic methods for latent partially ordered classification models。Journal of the Royal Statistical Society: Series C (Applied Statistics),51,337-350。  new window
24.Templin, J.、Bradshaw, L.(2013)。Measuring the reliability of diagnostic classification model examinee estimates。Journal of Classification,30,251-275。  new window
25.Templin, J.、Bradshaw, L.(2014)。Hierarchical diagnostic classification models: A family of models for estimating and testing attribute hierarchies。Psychometrika,79,317-339。  new window
26.Wang, C.、Nydick, S. W.(2015)。Comparing two algorithms for calibrating the restricted non-compensatory multidimensional IRT model。Applied Psychological Measurement,39,119-134。  new window
27.詹沛達、邊玉芳、王立君(2016)。重參數化的多分屬性診斷分類模型及其判准率影響因素。心理學報,48,318-330。  延伸查詢new window
會議論文
1.Zhan, P. D.、Wang, W.-C.、Bian, Y. F.、Li, X. M.(2016)。Higher-order cognitive diagnostic models for polytomous latent attributes。The annual meeting of the National Council on Measurement in Education。Washington, DC。  new window
2.Zhan, P. D.、Wang, W.-C.、Li, X. M.、Bian, Y. F.(2016)。The probabilistic-inputs, noisy conjunctive model for cognitive diagnosis。The annual meeting of the American Educational Research Association。Washington, DC。  new window
研究報告
1.von Davier, M.(2005)。A general diagnostic model applied to language testing data。Princeton, NJ:Educational Testing Service。  new window
學位論文
1.Karelitz, T. M.(2004)。Ordered category attribute coding framework for cognitive assessments(博士論文)。University of Illinois at Urbana-Champaign。  new window
圖書
1.Reckase, M. D.(2009)。Multidimensional item response theory。New York, NY:Springer。  new window
其他
1.Chalmers, P.,Pritikin, J.,Robitzsch, A.,Zoltak, M.,Kim, K.,Falk, C. F.,Meade, A.(2015)。mirt: Multidimensional Item Response Theory,http://CRAN.R-project.org/package=mirt。  new window
2.Robitzsch, A.,Kiefer, T.,George, C. A.,Uenlue, A.(2015)。CDM: Cognitive Diagnosis Modeling,http://CRAN.R-project.org/package=CDM。  new window
 
 
 
 
第一頁 上一頁 下一頁 最後一頁 top
QR Code
QRCODE