作業取樣對數學實作評量分數類推之影響__臺灣人文及社會科學引文索引資料庫

:::

詳目顯示

第 1 筆 / 總合 1 筆

/1頁

來源文獻資料
摘要
引文資料

題名：	作業取樣對數學實作評量分數類推之影響
書刊名：	教育研究資訊
作者：	張麗麗
作者(外文)：	Chang, Lily
出版日期：	2003
卷期：	11:6
頁次：	頁65-100
主題關鍵詞：	數學實作評量；類推性理論；信度；Mathematics performance assessment；Generalizability theory；Reliability
原始連結：	連回原系統網址
相關次數：	被引用次數:期刊(0) 博士論文(0) 專書(0) 專書論文(0) 排除自我引用:0 共同引用:19 點閱:25

　　　　　本研究主要的目的在探討不同作業分層及類推設計對數學實作評量分數之變異及類推的影響。受試者為132位國小五年級學童，評分者為兩位具實作評量計分經驗的國小教師。實作評量包括五題建構反應型式的紙筆測驗，計分規準採分析式計分規準，包括數學理解、程序與策略、溝通三個向度。正式計分前，兩位評分者均接受嚴謹的評分者訓練。為探討不同作業分層對分數變異及類推的影響，研究者根據作業的難度、所測認知能力、內容與歷程、內部結構等將作業分為同質性不等的五個層次，分別進行「p × t × r」及「p × r × (t:s)」兩種不同的類推設計。類推分析的結果顯示，提高作業同質性雖能適度提高分數的類推程度，但因「受試者 × 作業」交互作用之變異所佔的比例仍然相當高（約六成），以至於各種作業分層之類推結果差異不大。控制作業層次的「p × r × (t:s)」分隔設計，雖然能降低作業層次間的變異，但卻因為多了一項誤差面向，使得其整體的類推程度要低於「p × t × r」設計之類推結果。造成p × t變異無法透過作業分層及類推設計有效控制的可能原因，雖然包括分層變項選取不佳、作業所測內容差異大、所測構念屬複雜認知構念等因素，但因作業間的共變情形與傳統測驗相近，故結果強烈暗示作業題數少是造成p × t變異的關鍵性因素。本文對造成p × t變異的可能原因及解決方法均有提出討論。

以文找文

期刊論文
1.	Crocker, L.(1997)。Assessing content representativeness of performance assessment exercises。Applied Measurement in Education，10(1)，83-95。
2.	Wolfe, E. W.、Gitomer, D. H.(2001)。The influence of changes in assessment design on the psychometric quality of scores。Applied Measurement in Education，14(1)，91-107。
3.	Shavelson, R. J.、Baxter, G. P.、Pine, J.(1991)。Performance assessment in science。Applied Measurement in Education，4(4)，347-362。
4.	Kane, Michael、Crooks, Terry、Cohen, Allan S.(1999)。Validating Measures of Performance。Educational Measurement: Issues and Practice，18(2)，5-17。
5.	Baxter, G. P.、Glaser, R.(1998)。Investigating the cognitive complexity of science assessments。Educational Measurement: Issues and Practice，17(3)，37-45。
6.	Brennan, R. L.(2000)。(Mis)conceptions about generalizability theory。Educational Measurement: Issues and Practice，19(1)，5-10。
7.	Cronbach, L. J.、Linn, R. L.、Brennan, R. L.、Haertel, E. H.(1997)。Generalizability analysis for performance assessments of student achievement or school effectiveness。Educational and Psychological Measurement，57(3)，373-399。
8.	Kane, M. T.(1982)。A sampling model of validity。Applied Psychological Measurement，6，125-160。
9.	Linn, R. L.、Burton, E.、DeStefano, L.、Hanson, M.(1996)。Generalizability of New Standard Project 1993 pilot study tasks in mathematics。Applied Measurement in Education，9(3)，201-214。
10.	Moss, P. A.(1995)。Themes and variations in validity theory。Educational Measurement: Issues and Practice，14(2)，5-13。
11.	Nichols, P. D.、Kuehl, B. J.(1999)。Prophesying the reliability of cognitively complex assessment。Applied Measurement in Education，12(1)，73-94。
12.	Nichols, P. D.、Smith, P. L.(1998)。Contextualizing the interpretation of reliability data。Educational Measurement: Issues and Practice，17(3)，24-36。
13.	Nichols, P. D.、Sugrue, B.(1999)。The lack of fidelity between cognitively complex constructs and conventional test development practice。Educational Measurement: Issues and Practice，18(2)，18-29。
14.	Parkes, J.(2001)。The role of transfer in the variability of performance assessment scores。Educational Assessment，7(2)，143-164。
15.	Shavelson, R. J.、Ruiz-Primo, M. A.、Wiley, E. W.(1999)。Note on sources of sampling variability in science performance。Journal of Educational Measurement，36(1)，61-71。
16.	Webb, N. M.、Schlackman, J.、Sugrue, B.(2000)。The dependability and interchangeability of assessment methods in science。Applied Measurement in Education，15(3)，277-301。
17.	張麗麗(20040400)。影響教師自評實作評量實施品質相關因素之探討。南師學報（教育類），38(1)，95-120。延伸查詢
18.	Brennan, R. L.、Johnson, E. G.(1995)。Generalizability of performance assessments。Educational Measurement: Issues and Practice，14(4)，9-12。
19.	Gao, X.、Brennan, R. L.(2001)。Variability of estimated variance components and related statistics in a performance assessment。Applied Measurement in Education，14(2)，191-203。
20.	Gao, X.、Shavelson, R. J.、Baxter, G. P.(1994)。Generalizability of large-scale performance assessments in science: Promises and problems。Applied Measurement in Education，7(4)，323-342。
21.	Lane, S.、Liu, M.、Ankenmann, R. D.、Stone, C. A.(1996)。Generalizability and validity of a mathematics performance assessment。Journal of Educational Measurement，33(1)，71-92。
22.	McBee, M. M.、Barnes, L. B.(1998)。The generalizability of a performance assessment measuring achievement in eighth-grade mathematics。Applied Measurement in Education，11(2)，179-194。
23.	Messick, Samuel(1994)。The Interplay of Evidence and Consequences in the Validation of Performance Assessments。Educational Researcher，23(2)，13-23。
24.	張麗麗(20020100)。評量改革的應許之地，虛幻或真實？--談實作評量之作業與表現規準。教育研究，93，76-86。延伸查詢
25.	Shavelson, R. J.、Baxter, G. P.、Gao, X.(1993)。Sampling variability of performance assessments。Journal of Educational Measurement，30(3)，215-232。

學位論文
1.	林敬修(2003)。影響國小數學科實作評量信度相關因素之類推性理論分析(碩士論文)。屏東師範學院，台北。延伸查詢
2.	吳欣黛(1998)。實作評量在效度上的真實性與直接性(碩士論文)。國立台北師範學院，台北。延伸查詢

圖書
1.	Brennan, R. L.(1992)。Elements of generalizability theory。Iowa City, IA：American College Testing Program。
2.	Crick, J. E.、Brennan, R. L.(1982)。GENOVA: A generalized analysis of variance system (FORTRAN IV computer program and manual)。Dorchester, Mass：Computer Facilities, University of Massachusetts at Boston。
3.	Gao, X.(1996)。Sampling variability and generalizability of work keys listening and writing scores。Iowa City, Iowa：ACT。
4.	Northwest Regional Educational Laboratory(1994)。A toolkit for professional developers: Alternative assessment。Portland, OR：Northwest Regional Educational Laboratory。
5.	Brennan, R. L.(2001)。Generalizability Theory。New York, NY：Springer-Verlag。
6.	California State Department of Education(1989)。A question of thinking: A first look at students' performance on open-ended questions in mathematics。Sacramento, CA：California State Department of Education。
7.	Herman, Joan L.、Aschbacher, Pamela R.、Winters, Lynn(1992)。A practical guide to alternative assessment。Alexandria, VA：Association for Supervision and Curriculum Development。
8.	Shavelson, Richard J.、Webb, Noreen M.(1991)。Generalizability Theory: A Primer。Newbury Park, California：Sage。
9.	Stiggins, Richard J.(1994)。Student-centered classroom assessment。New York, NY：Macmillan Publishing Company。
10.	Wiggins, G. P.(1993)。Assessing student performance: Exploring the purpose and limits of testing。Jossey-Bass。
11.	Wiggins, Grant P.(1998)。Educative assessment: Designing assessments to inform and improve student performance。San Francisco, CA：Jossey-Bass Publishers。
12.	Crocker, L.、Algina, J.(1986)。Introduction to Classical and Modern Test Theory。Holt, Rinehart & Winston。
13.	教育部(2000)。國民中小學九年一貫課程暫行綱要。台北市：教育部。延伸查詢
14.	National Council of Teachers of Mathematics(1989)。Curriculum and evaluation standards for school mathematics。Reston, VA：National Council of Teachers of Mathematics。

圖書論文
1.	Brennan, R. L.(1996)。Generalizability of performance assessments。Technical issues in large-scale performance assessment。Washington, DC：U.S. Department of Education, Office of Educational Research and Improvement。
2.	Mislevy, R.(1990)。Foundations of a new test theory。Test theory for a new generation of tests。Hillsdale, NJ：Lawrence Erlbaum Associates。
3.	Romberg, T. A.、Wilson, L. D.(1995)。Issues related to the development of an authetic assessment system for school mathematics。Reform in school mathematics and authentic assessment。Albany, NY：State University of NewYork Press。
4.	Feldt, L. S.、Brennan, R. L.(1989)。Reliability。Educational measurement。New York, NY：Macmillan Press。
5.	Resnick, Lauren B.、Resnick, Daniel P.(1992)。Assessing the Thinking Curriculum: New Tools for Educational Reform。Changing Assessments: Alternative Views of Aptitude, Achievement and Instruction。Kluwer Academic Publishers。

推文
推薦
引用網址
引用嵌入語法
轉寄

top

:::

相關期刊
相關論文
相關專書
相關著作
熱門點閱

1.	分析型評分規準應用於磨課師的設計類課程
2.	多元教學與評量對國小學童音樂學習態度與成就之探討
3.	國小中年級戲劇賞析評分規範與實作評量任務之發展研究
4.	實作評量融入師資生班級經營課程實施成效探究
5.	國小戲劇課程之實作評量研究--建立「戲劇基本能力」之評分規範
6.	在探究教學的脈絡下融入自願式檔案評量於高二數學課室之行動研究
7.	多元評量的活化、迷思、與神話--教學歷程的個案研究
8.	過程技能能力要項藉由能力指標聚焦並轉化成實作評量設計之行動研究
9.	實作評量在兩性關係課程上的應用：以技術學院學生為例
10.	國小五年級學童分數概念實作評量及其SS分析
11.	影響教師自評實作評量實施品質相關因素之探討
12.	多元評量概念在課程標準演變中之定向與意義
13.	自然科實施多元評量之行動研究
14.	從分數的意義談實作評量效度的建立
15.	評量改革的應許之地，虛幻或真實？--談實作評量之作業與表現規準

1.	模糊德菲法與層級分析法在國中資優生研究作品評量指標建構之應用
2.	實施科技輔助合作問題解決教學於STEM課程中對學習成效、合作問題解決能力及實作技能影響之研究
3.	幼兒教師的評量行為及其相關因素之研究—從努力及和諧角度探討
4.	從全人教育觀點探討台南市國小低年級實施多元評量之研究
5.	數位歷程檔案評量應用於設計作品及其對學習動機與成效之關聯性

無相關書籍

無相關著作

無相關點閱

QR Code

臺灣人文及社會科學引文索引資料庫系統

詳目顯示

臺灣人文及社會科學引文索引資料庫