華語文閱讀測驗信度效度分析與垂直等化研究__臺灣人文及社會科學引文索引資料庫

:::

詳目顯示

第 1 筆 / 總合 1 筆

/1頁

來源文獻資料
摘要
外文摘要
引文資料

題名：	華語文閱讀測驗信度效度分析與垂直等化研究
書刊名：	華語文教學研究
作者：	藍珮君／陳柏熹
作者(外文)：	Lan, Pei-jiun／Chen, Po-hsi
出版日期：	2014
卷期：	11:1
頁次：	頁99-125
主題關鍵詞：	華語文能力測驗；信度；效度；試題反應理論；垂直等化；Mandarin test；Reliability；Validity；Item response theory；Vertical equating
原始連結：	連回原系統網址
相關次數：	被引用次數:期刊(1) 博士論文(0) 專書(0) 專書論文(0) 排除自我引用:1 共同引用:0 點閱:106

本文旨在探討華語文閱讀測驗四個測驗等級：基礎級、進階級、高階級與流利級的信度與效度表現，並將四個等級試題難度連結至同一量尺上。樣本來自2011年5月與11月正式考試，及2012年預試之考生作答反應資料，以古典測驗理論與試題反應理論進行分析。研究結果顯示：1. 閱讀測驗信度良好，各等測驗KR20信度係數接近或達到0.90以上，IRT估計標準誤換算後的信度數值皆達到0.90以上，且各測驗通過門檻的考生能力值亦有較高的測驗訊息量與較低的估計標準誤；2. 閱讀測驗具有建構效度，各等級因素分析結果抽出閱讀理解單一因素，解釋變異量在66.91%以上，且各等級試題與模式適配比例達87.5%以上；3. 四等測驗試題難度分佈良好；4. 進階與高階級測驗折半合併為一等測驗，通過門檻之測驗訊息量及估計標準誤，與原進階級測驗相當，略差於原高階級測驗，將此兩等級測驗合併為一等測驗在實務上應為可行，惟組卷時試題難度比例需再做調整。

以文找文

The purpose of this study is to investigate the reliability, validity and vertical equating of the Reading subtest of the Test of Chinese as a Foreign Language. Four levels are included in the reading section, they are Level 2, 3, 4, and 5, respectively. The analysis data was sampled from the formal version of the test administered in 2011 and pretest version in 2012. The results showed that, first, the coefficients of the Kuder-Richardson 20 were closed to or higher than .90. Moreover, large test information is provided to the value of cutoff which is determined an examinee is passed or failed. In other words, low standard error of estimation was obtained for the examinees. Second, the results of factor analysis showed that only one factor was extracted, which could account for above 66% of the variance. In addition, the results of Rasch analysis revealed that more than 87.5% of the items fit the model well. Third, there is a suitable range of difficulties for each level of test. Finally, standard error of estimation about the cutoff values were similar to Level 3 but lower than Level 4 when the items in Level 3 and 4 were split to assemble two tests (i.e., test information on the cutoff values for the even items included in Level 3 and 4, the odd items included in Level 3 and 4, and items in Level 3 and 4). That is these two adjacent levels can be combined to form a composite level of test in the future to reduce the burden for examinees and developers of the test. However, the item difficulty distribution of the composite test should be adjusted.

以文找文

期刊論文
1.	Lai, J., D.、Celia,, C. H.、Chang, R.、Bode, K.、Heinemann, A. W.(2003)。Item banking to improve, shorten, and computerize self-reported fatigue: An il¬lustration of steps to create a core item bank from the FACIT-Fatigue scale。Quality of Life Research，12，485-501。
2.	Sawaki, Y.、Strieker, L. J.、Oranje, A. H.(2009)。Factor structure of the TOEFL Internet-based test. Language Testing26(1)，5-30。
3.	Yu, Chong Ho.(2005)。Test Equating by Common Items and Common Subjects: Concepts and Applications。Practical Assessment, Research & Evaluation，10(4)，1-19。
4.	符華均、李亞男、李佩澤、張鐵英(2013)。新漢語水平考試HSK(五級)效度研究。考試研究，3，65-69。延伸查詢

會議論文
1.	藍珮君、林玲英(2011)。新版華語文能力測驗與CEFR之連結：標準設定方法的應用。ALTE第四屆國際研討會，(會議日期: 2011070)。延伸查詢

圖書
1.	Bond, Trevor G.、Fox, Christine M.(2007)。Applying the Rasch Model: Fundamental Measurement in the Human Sciences。Mahwah, New Jersey：Lawrence Erlbaum Associates。
2.	陳柏熹(2011)。心理與教育測驗：測驗編製理論與實務。精策教育有限公司。延伸查詢
3.	王文中、呂金燮、吳毓瑩、張郁雯、張淑慧(2004)。教育測驗與評量--教室學習觀點。臺北市：五南圖書出版有限公司。延伸查詢
4.	吳明隆(2003)。SPSS統計應用實務。臺北：松崗電腦圖書資料公司。延伸查詢
5.	郭生玉(2000)。心理與教育測驗。臺北縣中和市。延伸查詢
6.	王寶墉(1995)。現代測驗理論。台北市：心理出版社。延伸查詢
7.	Wright, B. D.、Stone, M. H.(1979)。Best Test Design: Rasch Measurement。Chicago, IL：Mesa Press。
8.	余民寧(2009)。試題反應理論（IRT）及其應用。心理出版社。延伸查詢

其他
1.	Educational Testing Service.(2007)。TOEFL iBT Score Reliability and General- izability.，http://www.ets.org/Media/Tests/ TOEFL /pdf/TOEFL iBT Score Reliability Generalizability.pdf。
2.	Educational Testing Service.(2011)。Reliability and Comparability of TOEFL iBT® Scores(PDF).，http://www.ets.0rg/s/t0efl/pdf/t0efl ibt research slv3.pdf。
3.	Winsteps and Rasch measurement Software.(2013)。Misfit diagnosis: Infit outfit mean-square standardized.，http://www.winsteps.com/ win- man/index.htm7diagnosingmisfit.htm.。
4.	張晉軍(2011)。新漢語水準考試（HSK)品質報告，http://blog.sina.com.en/s/blog 53e7clld0100v71z.html。延伸查詢

圖書論文
1.	柴省三(2012)。關於HSK閱讀理解測驗構想效度的實徵研究。世界漢語教學。北京市：北京語言大學。延伸查詢

推文
推薦
引用網址
引用嵌入語法
轉寄

top

:::

相關期刊
相關論文
相關專書
相關著作
熱門點閱

1.	休閒運動體適能俱樂部員工情緒耗竭量表的發展：日誌法重複測量資料的信效度分析
2.	中文版「低血糖感知障礙量表」之發展與心理計量特性
3.	中文版SATED睡眠健康評估量表之發展與信效度驗證
4.	中文版「主觀職涯成功量表」之發展與信效度檢驗
5.	評估品牌服務品質量表之信度與效度
6.	社區整合量表修訂版於腦中風病人之心理計量測試
7.	「心田寬恕量表」之中文化與信、效度分析
8.	校務檔案怎麼評？校務經營檔案之信效度分析
9.	PDCA循環問卷之信效度初步分析
10.	Reliability and Validation of Wearable Sensors-monitored Smart Calf Sleeve and Smart Socks in Different Walking Speed
11.	重症單位老年病人照護需求量表之發展與驗證
12.	「大學生批判思考意向量表」之編製
13.	教師の成長を測る「共通専門能力尺度」の作成--台湾の大学で働く日本語教師を対象として
14.	中文版存在空虛感量表之信度與效度研究
15.	題型對學生數學表現水準之影響--以相似形為例

1.	臨床教育環境量表中文精簡版信效度評估與應用
2.	運動教練評鑑指標建構及評鑑量表編製之研究
3.	史羅二氏工作狂量表中文版心理計量特質驗證及其前因後果模式相關因素之驗證研究
4.	保齡球注意力量表的編製

無相關書籍

無相關著作

1.	華語教師跨文化能力培訓之省思
2.	《詞彙之旅》書評：充實讀者知識更培養其分析能力
3.	新北市平溪區銀髮族居民資訊需求與資訊行為特性
4.	Dervin與Weick意義建構理論之分析與比較
5.	臺灣地形圖的圖名、圖號系統及其與地圖編目的關係
6.	臺灣民眾網路素養之調查研究
7.	大學圖書館館員於職場之資訊尋求行為研究
8.	臺灣大學典藏長澤文庫《令巻八醫疾令比校長澤伴雄自筆》版本探究
9.	「保存-資源善用」環境態度量表之編製研究
10.	觀光工廠遊客體驗行銷、服務品質、顧客滿意度與忠誠度之關係研究
11.	遷移物種公約特殊保育合作機制之研究--兼論我國參與波昂公約網絡之可行性與意義
12.	課程政策與教師課程意識：以藝術與人文學習領域為例
13.	探討國家公園志工訓練、地方依附感與組織承諾之關係
14.	權利與法治--德沃金法哲學的詮釋特徵與實踐意義

QR Code

臺灣人文及社會科學引文索引資料庫系統

詳目顯示

臺灣人文及社會科學引文索引資料庫