:::

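Title: Development and Application of the Bayesian Three-Level IRT Random-Intercept Latent Regression Model (貝氏三階層IRT隨機截距之潛在迴歸模式的發展與應用)
Journal: Chinese Journal of Psychology (中華心理學刊)
Authors: 黃宏宇; 洪素蘋
Authors (romanized): Huang, Hung-yu; Hung, Su-pin
Publication date: 2010
Volume/issue: 52(3)
Pages: 309-326
Keywords: Bayesian inference; Item response theory; Latent regression; Multilevel model; Rasch model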
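In traditional latent regression models within item response theory (IRT), the predictors are observed variables whose measurement error is usually ignored. This study constructs a Bayesian three-level IRT random-intercept latent regression model, develops it through a series of simulation studies, and illustrates its application with two empirical studies. First, latent predictors are incorporated into the IRT latent regression model, and the nested structure of the data is taken into account and represented by a random intercept. Next, data are simulated from the Bayesian three-level IRT random-intercept latent regression model and used to examine model fit. The absolute fit index, based on posterior predictive model checking (PPMC), is the standard deviation of the distribution of the examinee groups' mean observed scores on the criterion variable, while the DIC is used for model comparison; together these two indices diagnose how well the model fits the data. As for parameter recovery, all parameters are recovered well when the data are analyzed with the Bayesian three-level IRT random-intercept latent regression model. If the random intercept is ignored, however, the estimate of the residual variance in the latent regression equation is severely biased, the dispersion of the parameter estimates is underestimated, and test reliability is overestimated. The study also uses the cognitive ability test from the Taiwan Education Panel Survey (TEPS) and the mathematics and English scores from the Basic Competence Test for Junior High School Students (BCTEST) as two empirical examples of the model's application, and closes with theoretical and practical recommendations.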
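The DIC named in the abstract is conventionally defined (Spiegelhalter et al., 2002) as the posterior mean deviance plus an effective number of parameters pD, where pD is the posterior mean deviance minus the deviance evaluated at the posterior means of the parameters. The following is a minimal Python sketch of that arithmetic for a Rasch likelihood. It is not the authors' WinBUGS code; the function names `dic` and `rasch_deviance` are hypothetical and introduced only for illustration.

```python
import numpy as np

def rasch_deviance(responses, theta, b):
    """Deviance (-2 * log-likelihood) of a persons-by-items 0/1 matrix
    under the Rasch model with abilities theta and difficulties b."""
    logits = theta[:, None] - b[None, :]
    # log P(y) = y * logit - log(1 + exp(logit)), computed stably
    loglik = responses * logits - np.logaddexp(0.0, logits)
    return -2.0 * float(loglik.sum())

def dic(deviance_draws, deviance_at_posterior_mean):
    """Return (DIC, pD, mean deviance).

    deviance_draws: deviance evaluated at each retained MCMC draw.
    deviance_at_posterior_mean: deviance at the posterior means of the parameters.
    """
    d_bar = float(np.mean(deviance_draws))
    p_d = d_bar - float(deviance_at_posterior_mean)  # effective number of parameters
    return d_bar + p_d, p_d, d_bar
```

When competing models are fitted to the same data, the one with the smaller DIC is preferred.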
Multidimensional item response theory (MIRT) has received much attention and has developed into many different models. Traditionally, standard IRT and MIRT models have a two-level structure. Three-level IRT latent regression models have been proposed with a marginal maximum likelihood (MML) algorithm; the predictors in IRT latent regression, however, have been assumed to be error-free. Recently, research has explored the application of a Bayesian IRT approach. This study explores the Bayesian three-level IRT random intercept latent regression model (Bayesian 3L-IRT-RILRM) and assesses the accuracy of its parameter recovery and its efficiency. All simulations were based on the one-parameter logistic model under the 3L-IRT-RILRM. Three tests of 20 items each were analyzed, and 40 clusters, each containing 50 examinees, were simulated. The generated data sets were fitted to four models: the proposed model, the two-level latent regression model, the conventional MIRT model, and the conventional unidimensional IRT model. The computer program WinBUGS, with Metropolis-Hastings sampling, was used to estimate the model parameters. Bayesian model-data fit checking techniques, namely posterior predictive model checking (PPMC), the pseudo-Bayes factor (PsBF), and the Bayesian DIC, were used to choose the better model. PPMC produced an index that identified the 3L-IRT-RILRM as the best model, and model comparison likewise favored the proposed model as the best description of the generated data. The model parameters were recovered fairly well under the Bayesian approach when the generated data were fitted to the proposed model. When the random intercept in the latent regression was ignored, the parameter estimates were biased, and both the precision of estimation and the test reliability were overestimated.
Finally, two empirical data sets from the TEPS and the BCTEST were used to illustrate the 3L-IRT-RILRM as an analytic model in comparison with competing models. The 3L-IRT-RILRM was reliable and provided the most complete description of the real data. The authors discuss further studies and recommendations for extending the approach to more general models.
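The simulation design described in the abstract (three tests of 20 items each, 40 clusters of 50 examinees, one-parameter logistic items) can be sketched in Python as follows. This is only an illustration under stated assumptions, not the authors' generating code. The abstract does not give the structural details, so the sketch assumes that the first two tests measure latent predictors, that the third measures the criterion, and that the cluster effect enters as a random intercept in the regression of the criterion on the predictors. The function name `simulate_3l_rasch` and the values of `beta`, `tau`, and `sigma` are hypothetical.

```python
import numpy as np

def simulate_3l_rasch(n_clusters=40, n_per_cluster=50, n_items=20,
                      beta=(0.5, 0.3), tau=0.5, sigma=0.7, seed=0):
    """Simulate 0/1 responses from an assumed three-level Rasch
    random-intercept latent regression model.

    beta, tau, sigma are hypothetical values: regression slopes, the SD of
    the cluster random intercepts, and the SD of the person-level residual.
    """
    rng = np.random.default_rng(seed)
    n = n_clusters * n_per_cluster
    cluster = np.repeat(np.arange(n_clusters), n_per_cluster)

    # Latent predictors measured by tests 1 and 2 (assumed independent N(0, 1))
    theta_pred = rng.normal(size=(n, 2))
    # Level 3: one random intercept per cluster
    u = rng.normal(scale=tau, size=n_clusters)
    # Level 2: latent regression of the criterion on the latent predictors
    resid = rng.normal(scale=sigma, size=n)
    theta_crit = u[cluster] + theta_pred @ np.asarray(beta) + resid
    theta = np.column_stack([theta_pred, theta_crit])   # shape (n, 3)

    # Level 1: Rasch items, P(y = 1) = logistic(theta - b)
    b = rng.normal(size=(3, n_items))                   # item difficulties
    logits = theta[:, :, None] - b[None, :, :]          # shape (n, 3, n_items)
    prob = 1.0 / (1.0 + np.exp(-logits))
    y = (rng.random(prob.shape) < prob).astype(int)
    return y, cluster, theta, u

y, cluster, theta, u = simulate_3l_rasch()
```

Setting `tau=0` in this sketch removes the cluster effect, which gives data corresponding to a two-level latent regression model for comparison.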
Journal articles
1. Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149-174.
2. Li, Y. M., Bolt, D. M., & Fu, J. B. (2006). A comparison of alternative models for testlets. Applied Psychological Measurement, 30(1), 3-21.
3. Sireci, S. G., Thissen, D., & Wainer, H. (1991). On the Reliability of Testlet-Based Tests. Journal of Educational Measurement, 28(3), 237-247.
4. Thissen, D., Steinberg, L., & Mooney, J. A. (1989). Trace Lines for testlets: a use of multiple-categorical response models. Journal of Educational Measurement, 26(3), 247-260.
5. Wainer, H. (1995). Precision and differential item functioning on a testlet-based test: The 1991 Law School Admissions Tests as an example. Applied Measurement in Education, 8(2), 157-186.
6. Bradlow, E. T., Wainer, H., & Wang, X. (1999). A Bayesian Random Effects Model for Testlets. Psychometrika, 64(2), 153-168.
7. Wainer, H., & Thissen, D. (1996). How is Reliability Related to the Quality of Test Scores? What is the Effect of Local Dependence on Reliability? Educational Measurement: Issues and Practice, 15(1), 22-29.
8. Reckase, M. D. (1997). The Past and Future of Multidimensional Item Response Theory. Applied Psychological Measurement, 21(1), 25-36.
9. Jöreskog, K. G. (1971). Simultaneous factor analysis in several populations. Psychometrika, 36(4), 409-426.
10. Yen, W. M. (1993). Scaling performance assessments: Strategies for managing local item dependence. Journal of Educational Measurement, 30(3), 187-213.
11. Wang, W. C., & Wilson, M. (2005). Exploring local item dependence using a random-effects facet model. Applied Psychological Measurement, 29(4), 296-318.
12. Embretson, S. E. (1991). A Multidimensional Latent Trait Model for Measuring Learning and Change. Psychometrika, 56, 495-516.
13. Tanner, Martin A., & Wong, Wing Hung (1987). The calculation of posterior distributions by data augmentation (with discussion). Journal of the American Statistical Association, 82(398), 528-550.
14. Wainer, H., & Wang, X. H. (2000). Using a New Statistical Model for Testlets to Score TOEFL. Journal of Educational Measurement, 37(3), 203-220.
15. Mislevy, R. J., Beaton, A. E., Kaplan, B., & Sheehan, K. M. (1992). Estimating population characteristics from sparse matrix samples of item responses. Journal of Educational Measurement, 29(2), 133-161.
16. Whitely, S. E. (1980). Multicomponent Latent Trait Models for Ability Tests. Psychometrika, 45, 479-494.
17. Andrich, D. (1978). A Rating Formulation for Ordered Response Categories. Psychometrika, 43(4), 561-573.
18. Patz, R. J., & Junker, B. W. (1999). A straightforward approach to Markov Chain Monte Carlo methods for item response models. Journal of Educational and Behavioral Statistics, 24, 146-178.
19. Schwarz, Gideon (1978). Estimating the Dimension of a model. The Annals of Statistics, 6(2), 461-464.
20. Wang, Wen-Chung, & Wilson, Mark (2005). The Rasch testlet model. Applied Psychological Measurement, 29(2), 126-149.
21. Gelfand, Alan E., & Smith, Adrian F. M. (1990). Sampling-Based Approaches to Calculating Marginal Densities. Journal of the American Statistical Association, 85(410), 398-409.
22. Geman, S., & Geman, D. (1984). Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6(6), 721-741.
23. Adams, Raymond J., Wilson, Mark R., & Wang, Wen-chung (1997). The multidimensional random coefficients multinomial logit model. Applied Psychological Measurement, 21(1), 1-23.
24. Akaike, Hirotsugu (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716-723.
25. Chib, Siddhartha, & Greenberg, Edward (1995). Understanding the Metropolis-Hastings Algorithm. American Statistician, 49(4), 327-335.
26. Albert, J. H. (1992). Bayesian estimation of normal ogive item response curves using Gibbs sampling. Journal of Educational Statistics, 17, 251-269.
27. Baker, F. B. (1998). An investigation of the item parameter recovery characteristics of a Gibbs sampling procedure. Applied Psychological Measurement, 22, 163-169.
28. Beguin, A. A., & Glas, C. A. W. (2001). MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika, 66, 541-562.
29. Bolt, D. M., Cohen, A. S., & Wollack, J. A. (2002). Item parameter estimation under conditions of test speededness: Application of a mixture Rasch model with ordinal constraints. Journal of Educational Measurement, 39, 331-348.
30. Bolt, D. M., & Lall, V. F. (2003). Estimation of compensatory and noncompensatory multidimensional item response models using Markov chain Monte Carlo. Applied Psychological Measurement, 27, 395-414.
31. Fox, J. P., & Glas, C. A. W. (2003). Bayesian modeling of measurement error in predictor variables using item response theory. Psychometrika, 68, 169-191.
32. Geisser, S., & Eddy, W. (1979). A predictive approach to model selection. Journal of the American Statistical Association, 74, 153-160.
33. Newton, M. A., & Raftery, A. E. (1994). Approximate Bayesian inference by the weighted likelihood bootstrap (with discussion). Journal of the Royal Statistical Society, Series B, 56, 3-48.
34. Patz, R. J., Junker, B. W., Johnson, M. S., & Mariano, L. T. (2002). The hierarchical rater model for rated test items and its application to large-scale educational assessment data. Journal of Educational and Behavioral Statistics, 27, 341-384.
35. Sheng, Y., & Wikle, C. K. (2008). Bayesian multidimensional IRT models with a hierarchical structure. Educational and Psychological Measurement, 68, 413-430.
36. Sinharay, S. (2005). Assessing fit of unidimensional item response theory models using a Bayesian approach. Journal of Educational Measurement, 42, 375-394.
37. Sinharay, S., Johnson, M. S., & Stern, H. S. (2006). Posterior predictive assessment of item response theory models. Applied Psychological Measurement, 30, 298-321.
38. Spiegelhalter, D. J., Best, N. G., Carlin, B. P., & van der Linde, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society, Series B, Methodological, 64, 583-616.
39. Wainer, H., & Lukhele, R. (1997). How reliable are TOEFL scores? Educational and Psychological Measurement, 57, 741-758.
40. Wang, W.-C., Wilson, M. R., & Shih, C.-L. (2006). Modeling randomness in judging rating scales with a random-effects rating scale model. Journal of Educational Measurement, 43, 335-353.
41. Wilson, M. R., & Hoskens, M. (2001). The rater bundle model. Journal of Educational and Behavioral Statistics, 26, 283-306.
Research reports
1. Rabe-Hesketh, S., Pickles, A., & Skrondal, A. (2001). GLLAMM manual. Technical Report 2001/01. London.
Theses
1. Huang, H.-Y. (2009). The hierarchical structure item response model and its application to computerized adaptive testing. Taipei, Taiwan.
Books
1. Rabe-Hesketh, Sophia, & Skrondal, Anders (2005). Multilevel and Longitudinal Modeling Using Stata. College Station: Stata Press.
2. Birnbaum, A. (1968). Some latent trait models and their use in inferring an examinee’s ability. In Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
3. De Boeck, P., & Wilson, M. (2004). Explanatory item response models: A generalized linear and nonlinear approach. New York: Springer-Verlag.
4. Baker, F. B., & Kim, S. H. (2004). Item response theory: Parameter estimation techniques. New York: Marcel Dekker, Inc.
5. Wu, M. L., Adams, R. J., & Wilson, M. R. (1998). ACER ConQuest: Generalised Item Response Modeling Software. Melbourne: Australian Council for Educational Research.
6. Smith, Jr. E. V., & Smith, R. M. (2004). Introduction to Rasch Measurement: Theory, Models and Applications. Maple Grove, MN: JAM Press.
7. van der Linden, W. J., & Hambleton, R. K. (1997). Handbook of Modern Item Response Theory. New York, NY: Springer-Verlag.
8. Muthén, L. K., & Muthén, B. O. (2006). Mplus user's guide. Los Angeles, CA: Muthén & Muthén.
9. Spiegelhalter, D. J., Best, N., & Thomas, A. (2003). WinBUGS version 1.4 [Computer program]. MRC Biostatistics Unit, Institute of Public Health.
10. Bryk, Anthony S., & Raudenbush, Stephen W. (1992). Hierarchical Linear Models in Social and Behavioral Research: Applications and Data Analysis Methods. Newbury Park, CA: Sage Publications.
11. Gelman, A., Carlin, J. B., Stern, H. S., & Rubin, D. B. (2003). Bayesian data analysis. London: Chapman and Hall.
12. Bond, T. G., & Fox, C. M. (2001). Applying the Rasch model: Fundamental measurement in the human sciences. Mahwah, New Jersey: Lawrence Erlbaum Associates, Inc.
13. Kelloway, E. K. (1998). Using LISREL for Structural Equation Modeling. Sage.
14. Lord, F. M. (1980). Applications of item response theory to practical testing problems. Lawrence Erlbaum Associates.
15. Hambleton, R. K., & Swaminathan, H. (1985). Item Response Theory: Principles and Applications. Boston, Massachusetts: Kluwer-Nijhoff.
16. Embretson, Susan E., & Reise, Steven P. (2000). Item Response Theory for Psychologists. Lawrence Erlbaum Associates, Inc.
17. Gagne, Ellen D., Yekovich, Frank R., & Yekovich, Carol Walker (1993). The Cognitive Psychology of School Learning. New York, NY: Harper Collins College: Addison Wesley Longman.
18. Jöreskog, K. G., & Sörbom, D. (2001). LISREL Version 8.51 [Computer software]. Chicago.
19. Press, S. J. (2003). Subjective and objective Bayesian statistics: Principles, models, and applications (2nd ed.). Hoboken, NJ.
20. Raftery, A. E. (1996). Hypothesis testing and model selection. In Markov chain Monte Carlo in practice. London.
 
 
 
 