Educational Evaluation

Exploring the Chinese Pathway of Value-Added Assessment: Critical Analysis of the U.S. Practice

  • Yumei Han (韩玉梅),
  • Wenfan Yan (严文蕃, USA),
  • Dan Jiang (蒋丹)
  • 1. Center for Studies of Education and Psychology of Ethnic Minorities in Southwest China, Southwest University, Chongqing 400715, China
    2. Center for Educational Policy Studies, Southwest University, Chongqing 400715, China
    3. University of Massachusetts Boston, Boston, MA 02125-3393, USA

Accepted date: 2022-09-01

Online published: 2023-01-18

Funding

National Social Science Fund of China, General Project "Research on Typical Cases of Poverty Alleviation through Education in the Tibet Autonomous Region" (20BMZ147)

Abstract

Adopting an international comparative and critical-analysis perspective, this article takes teacher value-added assessment as its entry point and draws on U.S. theoretical and practical experience with value-added assessment. It critically examines the essential features, technical boundaries, limits of application, integration trends, and implementation conditions of teacher value-added assessment, and explores from several angles the pathways that value-added assessment might follow within China's educational evaluation system: its value orientation, technical development, application, future development, and practice. The analysis finds that teacher value-added assessment has the following essential features and limitations: it diagnoses a teacher's contribution from the magnitude of students' achievement gains; it represents "teacher effectiveness" only in a narrow sense, as the causal effect on growth in student achievement; it relies on value-added models as the core technique for measuring "value added"; and it applies only to teachers of particular grades and subjects. Therefore, while its evidence-based orientation and technical strengths should be fully acknowledged and put to the best possible use, its value should not be exaggerated or over-generalized. Value-added models remain subject to intense debate and unresolved problems at the technical level, including reliability, validity, and statistical bias. Targeted exploration of cutting-edge international techniques is needed, following the principle of critiquing and building at the same time, to achieve localized technical breakthroughs and innovation. The evolution of U.S. teacher value-added assessment policy and a large body of empirical research show that using value-added results for high-stakes decisions carries considerable risk, so the diagnostic function of value-added assessment should be strengthened. Integrating value-added assessment with multiple evaluation methods has become an international consensus. Guided by value pluralism and by the principle of negotiation among multiple stakeholders, a mechanism should be built that systematically integrates value-added assessment with diverse measurement methods and diverse forms of evidence. For value-added assessment to move from theory to practice, sufficient groundwork must be laid step by step in four areas: a national system and toolkit for assessing student academic achievement and progress, a longitudinal tracking database, professional teams, and feedback mechanisms covering the whole process. The aim is to maximize the "value" that value-added assessment can "add" to the education evaluation system as a whole.

Cite this article

Yumei Han, Wenfan Yan, Dan Jiang. Exploring the Chinese Pathway of Value-Added Assessment: Critical Analysis of the U.S. Practice[J]. Journal of East China Normal University (Educational Sciences), 2023, 41(2): 63-80. DOI: 10.16382/j.cnki.1000-5560.2023.02.006
