共查询到19条相似文献,搜索用时 890 毫秒
1.
矩阵取样技术对解决教育等社会科学领域大规模调查中的较低应答率和较差的数据质量问题具有独特价值,在大规模测评中所获得的测评结果具有更高的群体代表性和稳定性。依照每个测评项目的测验目的,考虑对测验结果参数估计准确性的具体要求、综合权衡测验开发中的各种相关成本、选择题册设计方式,从而确定具体的方案是该项技术实际运用过程中必须慎重对待的三个基本问题。 相似文献
2.
3.
教育问责背景下学生学业评测的进展 总被引:1,自引:0,他引:1
近年来,教育问责背景下学生学业评测作为教育改革与发展的重要内容备受人们关注。学业评测主要在三个领域得到了广泛的发展:在监测教育质量的宏观层面,为了扩大考查范围或测试覆盖面,同时缩短被试的测试时间,包含矩阵取样设计、测验等值技术的大尺度学业测评的新技术广泛应用在教育质量监测的实践当中;在对学校效能进行评价的中观层面,使用发展性评价方法如增值性评价,用学生多次考试成绩的追踪数据来评价学校的教育情况,不仅能够更合理地评价学校和教师的工作,也使学生的成长和发展更加受到关注;在促进学生学业发展的微观层面,为了学习的评价理念和认知诊断技术的发展使得测验对教学反馈作用得以显现和发展。当前学业评测的新理念、新技术和新实践,要求教育部门和教育工作者只有重视测量与评价技术、改变考试观念,才能更好的使学业评测工作服务于教育事业的发展。 相似文献
4.
计算机化自适应多阶段测试是精准减负的一种有效手段,因为它会自动引导学生尽可能作答与其能力水平相适应的题目,从而节省出作答太难或太易题目所浪费的大量时间和精力.不过,我国目前的一些计算机测试系统缺乏现代测评技术的有力支撑,部分题库在知识内容和能力维度的标识与编码、题目参数的估计与等值,以及分数的算法与使用方面存在着较大缺陷.本文简要分析了自适应测试的基本模式、操作流程、使用条件和主要优点,具体讨论了计算机化自适应多阶段测试系统的设计,以及基于测验总分的单参数logistic模型和基于作答反应模式的双参数logistic模型的算分方法,为提升计算机化自适应测试的水平,进而促进教师因材施教、减轻学生作业负担和考试负担提供了考试科学视角下的新办法. 相似文献
5.
高燕 《教育测量与评价(理论版)》2015,(3):19-23
测验结果报告和使用的科学性是学业评价科学性的重要组成部分.传统的测试结果呈现和使用方式不利于对测试本身价值的进一步挖掘,为解决这一问题,河南省郑州市教研室开发了面向多群体的“郑州市学业评价分析报告系统”.该系统以《教育与心理测量标准》与“考试结果报告的八个关键特征框架”为开发思路,以建立基于课程标准的命题规范流程为分析报告的基础,考虑了市级教育管理者、校长、班主任、学科教师、学生等的不同需求,从测验概况、科目水平、题目水平三个层次呈现测试情况.它在一定程度上驱动了教育决策、积累了大量测试数据,发挥了评价促进教学的功能,引导各方树立了正确的学业质量现和评价观,但在报告内容、报告功能等方面还需做进一步改进. 相似文献
6.
7.
教育部考试中心在对“升学指导测验”2005年测试数据统计分析的基础上。进一步完善测验的有关内容。并邀请参加测试的高中三年级学生和专家一起对题目和测试结果报告进行了修订。与原有的测试题目和测试结果报告相比,2006年版的测试题目设计更加贴近中学生的生活,测试结果报告更加通俗易慌。内容更丰富。 相似文献
8.
以缆道为测验载体的水文站进行悬移质输沙率测验时,流速仪测流与调压积时式采样器取样分步进行,受流速脉动影响,悬移质取样容积不准确而影响测验成果精度。本文通过对控制悬移质取样的因素进行分析研究,提出了采用流速仪信号作为控制悬移质取样信息的取样方法,既解决了实现流速仪测流与调压积时式采样器取样同步进行的问题,又消除了流速脉动对积时式取样器取样容积精度的影响,完善了悬移质输沙率测验方案。 相似文献
9.
10.
11.
Stefan Johansson 《Educational research; a review for teachers and all concerned with progress in education》2016,58(2):139-148
AbstractBackground: International large-scale assessments (ILSAs) are a much-debated phenomenon in education. Increasingly, their outcomes attract considerable media attention and influence educational policies in many jurisdictions worldwide. The relevance, uses and consequences of these assessments are often the focus of research scrutiny. Whilst some argue that the assessment outcomes provide an effective basis for informed policy-making, critics claim that the use of international assessment data can result in a range of unintended consequences, such as the shaping and governing of school systems ‘by numbers’.Purpose: This article explores and analyses the arguments about the uses and consequences of ILSAs. In particular, the discourse about the assessments’ consequential validity will be discussed and evaluated.Sources of evidence: Literature relating to the uses and consequences of large-scale assessment was analysed, with a focus on research on the consequential aspects of validity.Main argument: Much research suggests that ILSAs have unintended consequences that affect and influence educational policy. However, the influences on educational policy are complex and interwoven: for example, it is not clear-cut whether effects such as converging curricular are, necessarily, direct consequences of large-scale assessments. Further, it is suggested that a beneficial consequence of large-scale assessment is the infrastructure they provide for studies in the social sciences, although caution must be applied to causal claims, in particular because of the cross-sectional design of the assessments.Conclusions: The considerable literature discussing the uses and consequences of large-scale assessments tends to point out potential negative aspects of the studies. However, it is also apparent that large-scale international assessments can be a valuable resource for studying global trends and evolving systems in education. Despite the extensive debates around large-scale assessment outcomes both in the media and in educational policy arenas, empirical educational research all too often appears underused in the discussion. 相似文献
12.
国际大规模学业测评对家庭因素影响学生学业成就发展的关注度逐年上升。综合分析影响力较大且相对成熟的6个国际大规模学业测评项目中家庭因素指标的选取原则,梳理测评结果在学生个体、学校教学和教育改革中的应用实例,为我国紧扣学生的关键家庭因素,整体考虑涵盖学生、班级和学校层面嵌套结构的家庭因素测评体系,并有效使用测评结果提供参考与借鉴。 相似文献
13.
认知诊断理论与考试评价 总被引:4,自引:1,他引:3
认知诊断理论——新一代测验理论的核心,是认知心理学与心理测量学相结合的产物。本文简要介绍了认知诊断理论及其研究和应用现状,并对我国在大规模统一考试中如何实现诊断功能提出了建议。 相似文献
14.
李利芳 《太原教育学院学报》2007,(2)
大学英语四、六级考试是由教育部高教司主办的一项大规模标准化考试,其目的是对大学生的英语综合能力进行评价,从而加快我国外语教学的步伐,促进人们学习外语的积极性,同样检验英语学习者的英语水平。 相似文献
15.
杨晶佩宜 《昆明师范高等专科学校学报》2010,32(1):135-137,142
大学英语口语考试具备了较好的效度、信度及积极的反拔作用的支持,而且也得到了广泛的认可。尽管大学英语口语考试有其弊端,即在考试大纲所要求的小组讨论任务中考生互动程度与考生真实表现之间的效度差异。然而,口语考试还是展现了许多积极的方面。作为全国唯一一项大规模标准化英语口语能力考试,大学英语口语考试对于教育政策、教学实践、教育系统以及个人等都产生了一定的影响。 相似文献
16.
An Examination of English Teachers' Opinions About the Ontario Grade 9 Reading and Writing Test 总被引:1,自引:1,他引:1
Interest in the use of large-scale achievement testing for accountability purposes and to drive instructional reform has been increasing in Canada. In the 1995 publications in Interchange, several researchers debated the merits and demerits of standardized achievement testing, including among the latter a tendency to reduce the curriculum and overemphasize routine learning (i.e., "teaching to the test"). Almost no studies have found empirical evidence for such testing's purported benefits. We set out to investigate these issues in Ontario: We present findings from a mail survey designed to find out, from Grade 9 and Grade 10 English teachers in Ontario, their perception of the quality of the Ontario Grade 9 literacy testing program and the effects it has had on the teaching and learning processes. Based on the responses of 107 teachers, our results paint a negative picture of teachers' opinions of the Grade 9 test in terms of its quality and its impact on teaching and learning. Three years after the Grade 9 test was first introduced, Grade 9 and 10 English teachers are still not convinced of its value. Our findings (and those from two other similar surveys) appear to suggest, at least based on teachers' self-reporting, that the purposes of the test — improving the quality of education and learning — as envisioned by the Ontario Ministry of Education and Training have not been met. These findings support those of other assessment impact studies in Canada, namely British Columbia and Alberta, regarding the adverse consequences of large-scale standardized testing (either multiple-choice test or performance-based assessment), and the lack of evidence for its purported positive educational influences. We recommend future research to investigate further the validity and the educational impact of the provincial tests and the reasons responsible for the observed impact or lack of it, and to determine resources, such as teacher training and materials, that are necessary to supplement the provincial testing program's effort to improve teaching and learning. 相似文献
17.
Gregory J. Cizek 《Educational Measurement》2003,22(1):40-44
This article responds to two critiques of an earlier article on the unintended consequences of high-stakes testing. The exchange of viewpoints suggests that the successful resolution of current policy debates surrounding such testing and companion accountability systems rests on the ability to: recognize, document, and disseminate instances of real learning gains; develop improved methods and assessment systems that provide instructional relevance and utility of test results; and negotiate a balance between the role of large-scale assessment in identifying achievement deficits and the responsibility and resources that educational systems have to redress identified deficiencies. 相似文献
18.
19.
沈玉顺 《华东师范大学学报(教育科学版)》2014,32(3):26-30
中招考试制度改革涉及贯彻落实国家教育政策要求、开发人力资源、保障个人教育选择权和维护教育公平等重大问题,其核心是如何处理教育质量、效率与公平的关系。本文从教育评价学角度分析了中招考试制度改革涉及的上述政策因素,提出了以质量和效率为导向统筹考虑初中毕业考试和中招考试制度改革,建立以学业成就评价、职业性向测验和学生发展潜力评价为基础的中招考试制度,在基础教育过程中解决个人选择和教育公平问题的改革思路。 相似文献