首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 125 毫秒
1.
信度和效度是衡量一个测量工具质量的关键指标,教育认知诊断测验中的信度和效度研究近年来受到研究者的关注。诊断测验的信度系数基本上源自基于α系数的属性信度系数、经验属性信度系数、四分相关系数、模拟重测一致性和分类一致性指标;效度系数主要包括模拟判准率、分类准确性和理论构想效度等。教育认知诊断测验的信度和效度研究较新,仍存在着一定的不足且缺乏全面的比较研究,更缺少系统的评价体系。  相似文献   

2.
Student self-assessment is a central component of current conceptions of formative and classroom assessment. The research on self-assessment has focused on its efficacy in promoting both academic achievement and self-regulated learning, with little concern for issues of validity. Because reliability of testing is considered a sine qua non for the validity of assessment interpretations, and research into the human ability to self-evaluate work raises concerns about the quality of students’ judgements, it is sensible to investigate the accuracy of students’ self-assessments. This article reviews relevant literature from educational psychology and psychometrics to define the need for a better understanding of accuracy in self-assessment as well as to identify possible pitfalls in measuring accuracy that could undermine its effectiveness by, for example, trading the focus on formative feedback for summative scoring or rating. The article concludes with recommendations for the design of research on accuracy in self-assessment.  相似文献   

3.
Peer assessment exercises yield varied reliability and validity. To maximise reliability and validity, the literature recommends adopting various design principles including the use of explicit assessment criteria. Counter to this literature, we report a peer assessment exercise in which criteria were deliberately avoided yet acceptable reliability and validity were achieved. Based on this finding, we make two arguments. First, the comparative judgement approach adopted can be applied successfully in different contexts, including higher education and secondary school. Second, the success was due to this approach; an alternative technique based on absolute judgement yielded poor reliability and validity. We conclude that sound outcomes are achievable without assessment criteria, but success depends on how the peer assessment activity is designed.  相似文献   

4.
幼儿教师职业生涯周期的职业倦怠研究   总被引:2,自引:0,他引:2  
在国内外教师职业倦怠相关文献的基础上,通过开放式问卷、访谈等方法编制了《幼儿教师职业倦怠问卷》,发现幼儿教师的倦怠有四个可解释的维度,分别为:情感耗竭、精力枯竭躯体化、幼儿教师职业失意、职业怠慢。幼儿教师职业生涯中的倦怠可划分为七个阶段:0-2年激情适应期、3-6年成长选择期、7-9年胜任平稳期、10-12年更新冲突期、13-18年平稳停滞期、19-25年复发危机期、26年以上离职消退期。  相似文献   

5.
论文评审是学位论文质量评价的重要环节。为了使评审结果能真实地反映博士学位论文的质量和水平,必须首先保证论文评审指标体系的准确性、可靠性和有效性。信度和效度分析是验证指标体系准确性、可靠性和有效性的重要方法。文章利用北京师范大学五年博士学位论文匿名评审全数据量化结果,对评审指标体系的信度和效度进行实证研究。结果表明,目前广泛采用的评审指标体系具有良好的信度和效度,能够真实地反映博士学位论文的质量和水平。  相似文献   

6.
甘露 《煤炭高等教育》2010,28(6):124-125
本文针对"轮机英语"听力与会话评估的有效与否,采取了测试学信度和效度的方法进行分析。分析结果表明,评估体现了信度和效度的统一,具有一定的科学性和可操作性,但是也存在明显的不足,需要采取有效的方式进行完善。  相似文献   

7.
"1+X"证书制度是新型的国家资历框架制度,为了畅通技术技能人才成长通道,培养复合型高技术技能人才,确保职业教育人才培养质量,亟须解决当前职业院校实施"1+X"证书制度面临的现实困境,如证书与地方院校人才培养方案对接不畅、信效度偏低、"两证"融通度不够、专业匹配不足、"双师型"教师素质偏弱、教材教法适应不良、课证融通平台与渠道缺乏等。职业院校应通过多元主体参与、确保信效度、产教融合、加快课程建设、创新教师培养模式、改革教材教法、搭建"学分银行"等策略,确保"1+X"证书制度有效实施,实现以学生为中心,建立适合职业教育人才培养特点的评价制度和评价模式。  相似文献   

8.
殷明 《培训与研究》2006,23(3):91-93
本研究首先自行编制了具有较高信度和效度的性别角色刻板印象问卷,其次是利用现有的“贝姆性别角色量表”(Ben Sex Role Inventory简称BSR I)来测定城乡中学生的性度取向。利用测试的结果一方面可以研究城乡双性化群体在各自总体中所占的比率,看其是否存在差异;另一方面使用自编的性别角色刻板印象问卷将双性化个体进行城乡比较。  相似文献   

9.
“学生评教”作为教师课堂教学评价体系的一部分,受到国内外高校的广泛重视。但师生对“学生评教”的认同度,直接关系到评教的成效。本文通过对笔者所在学校学生评教情况的调查,发现了在评价的效度和信度、评价指标体系、评价结果的处理等方面存在的问题,对其产生的原因进行了分析,提出了应加强宣传、提高评教者的素质、科学设计评价指标、慎重对待评价结果等建议。  相似文献   

10.
《Assessing Writing》2004,9(3):190-207
Specialists in the field of large-scale, high-stakes writing assessment have, over the last forty years alternately discussed the issue of maximizing either reliability or validity in test design. Factors complicating the debate–such as Messick's (1989) expanded definition of validity, and the ethical implications of testing–are explored. An inverse relationship between the loss of reliability and the loss of validity of a test is proffered. The term, Quality, in reference to writing assessment is defined and introduced. Construct complexity is hypothesized as a factor that influences validity, reliability, and quality. It is suggested that the either/or debate concerning emphasis over reliability or validity in test design be put aside in favor of a discussion on how to maximize the quality of an assessment. Insofar as this goal can be achieved, it is necessary in the design of the test to minimize and balance the loss of both validity and reliability. The discussion draws on literature from within the field of writing assessment and from works in the fields of mathematics and information theory.  相似文献   

11.
通过查阅文献资料与已有成熟问卷,征询专家意见,根据初中生数学学习特点,编制了"初中生数学学习非智力因素调查问卷".经过项目分析、探索性因素分析、验证性因素分析,对问卷进行相应修改,正式问卷包括动机、态度、意志、性格、情绪情感5个维度,以及对应的13个因子.所编问卷具有较好的信度(一致性系数,重测信度,分半信度)与效度(内容效度、结构效度、效标效度),可作为测量初中生数学学习非智力因素的有效工具.  相似文献   

12.
目的:编制大学生学习情况调查问卷。方法:在文献研究的基础上,构建学习情况的理论维度,编写问卷题目。对在校大学生两次预测,并检验问卷的信度、效度。结果:各项指标显示,本问卷具有良好的信度和效度。  相似文献   

13.
文学思潮的"特性",即是指只有在"文学思潮"状态下才能得以充分显现的东西.只有当各种因素的"共同倾向性"呈现出容易被人们感知的"群体"样态时,"思潮"的生命过程才算开始."共同倾向性",既表现为"共时性"的各个完整系统之中的各环节或部分,又表现为"历时性"衍化而成的"家族相似"式的纵向现象异变中."文学思潮"的"群体性",既可以是某一特定时期以平行方式排列的集束"现象",也可以是一种现象以跨时空(指不同文化时段)排列呈现为纵向性的"现象"群体.文学思潮的"扩张性"特点,意味着它总体上不是内敛,而是外扩并具有强大的辐射性.由"点"到"面"的变化过程亦就是由单个的"思"之于群体"潮"的生成过程."点"的重要性在于,它可能最先呈现了某种趋向,潜含了外扩的潜能及被多角度接受的可能性.它首先体现为对存有"小异"的"同质"因素的"吸附",其次表现为对在对抗中日益弱化的因素的"招降"或"改编",同时也相应产生对"传统"因素的"改造"."整合性"是文学思潮"扩张性"特征的重要内涵,也是"扩张性"功能外化的主要策略."整合"中的"呼应性"和"矛盾性"(对抗性),可谓文学思潮扩展历程中最有活力的两个因素.  相似文献   

14.
"乡"与"市"和中国现代文学   总被引:5,自引:0,他引:5  
“乡村”和“城市”两种社区,作为化构成,客观存在并支配着中国现代学发展过程,有可能用以考察这一历史阶段学现代性的依据。上个世纪二、三、四十年代,乡村小说的典型事实具体而微地显示了“乡”与“市”两种化价值倾向的起伏消长,从“市”对“乡”的超越至“乡”对“市”的包围,是中国现代学留给今天的一份重要启示录。  相似文献   

15.
席扬 《三明学院学报》2005,22(3):241-249
“十七年”文学思潮在形成与发展过程中表现为鲜明的“统一化”和“一元化”状态。它一方面通过对1942年以后“解放区文学”传统的全面继承体现了政治与文学关系的一致性,另一方面又必须根据“新的时代”的文化的、意识形态的要求对于那些在“战争语境”中产生的审美理念。进行不断的修正与完善。“十七年”文学思潮的发展过程,首先体现在一系列的“文艺运动”的交替更移中——“文艺运动”的过程,既是社会主义文艺“理论建构”的实际步骤,也是确立“文学新秩序”工程的重要组成部分。“文艺运动”不仅是文学思潮发展的主要载体和形式,而且从“文艺运动”之间的“间歇”中,还可以了解到当代文学思潮的发展节奏和主导趋向。在文学创作实践领域里。文艺思潮的发展与变化主要体现在提倡“写什么”、“怎么写”和反对“写什么”、“怎么写”的两个方面及其两个方面的张力与冲突。  相似文献   

16.
The psychometric measures of accuracy, reliability and validity of peer assessment are critical qualities for its use as a supplement to instructor grading. In this study, we seek to determine which factors related to peer review are the most influential on these psychometric measures, with a primary focus on the accuracy of peer assessment or how closely peer-given grades match those of an instructor. We examine and rank the correlations of accuracy, reliability and validity with 17 quantitative and qualitative variables for three senior undergraduate courses that used peer assessment on high value written assignments. Based on these analyses, we altered the single most significant variable of one of the courses. We demonstrate that the number of reviews completed per reviewer has the greatest influence on the accuracy of peer assessment out of all the factors analysed. Our calculations suggest that six reviews must be completed per reviewer to achieve peer assessment that is no different from the grading of an instructor. Effective training, previous experience and strong academic abilities in the reviewers may reduce this number.  相似文献   

17.
全球化语境中,学的“世界”是一种深度的标志和话语权力象征,“民族”则是对于主体性的维护和多样化的呈示。深度追求是民族学和世界学的共通点。扎根于传统化,作人性的深入思考和生命追问,民族学就能走向世界。  相似文献   

18.
Some literature on elder abuse recommends, and practitioners claim, that there should be better assessment and screening tools. In order to improve the accuracy of measurement instruments, the purpose of this article is threefold: (a) describing the construction of an instrument with formative indicators and the survey design about the sensitive topic of elder abuse, (b) development of an analytic strategy to improve the precision of the measures by (c) evaluating the measurement instrument through quality criteria against outcomes of the instrument. We randomly selected 2,880 home-dwelling older women aged 60 and above from five European Union countries who participated in a survey on elder abuse. Prevalence data on abuse against older women was gathered using a postal (BE, FI, PT), face-to-face (BE, LT), and telephone survey (AT) but using an identical instrument. A table with outcome measures was calculated to evaluate the formative indicators of the measurement instrument, and a decision strategy for item reduction was developed. The results suggest that 12 (35%) of the original 34-indicators instrument can be omitted. The adapted version can provide the same elder abuse prevalence rates (reliability) with the same negative associations in terms of life quality (validity). The results indicate in an applied way how an elder abuse instrument can be evaluated and further developed using formative measures.  相似文献   

19.
OBJECTIVE: To develop a scale to measure emotional abuse among school children aged 13-15 years in Sri Lanka that can be used in field settings. METHOD: Emotional abuse was defined based on available literature. Review of literature and discussions with a panel identified 85 items indicating abusive behaviors. Content validity and social acceptability were achieved by getting the views of this same panel and parents, respectively. Item analysis and assessment of validity were carried out based on the responses to the draft instrument by 105 children aged 13-15 years attending a school in an urban slum area. The test-retest reliability of the scale was assessed using the responses from 144 students from a different school. Criterion validity was established by comparing the independent assessments of a psychiatrist for the presence or absence of emotional abuse with the respective scores of the children on the scale. RESULTS: The scale developed to assess emotional abuse included 46 items having a Cronbach alpha of .89. The test-retest reliability was .73. A cutoff score of 95 was identified at a sensitivity of 76.9% and a specificity of 51.4%. CONCLUSIONS: A scale that had a satisfactory level of reliability and validity was socially acceptable, which could be used in school settings in Sri Lanka was developed to enable identification of emotional abuse among school children aged 13-15.  相似文献   

20.
通过文献查阅、问卷调查和个案访谈编制我国大学生心理社会能力初测量表。探索性因素分析(n=691)确定该量表22道题项,抽取情绪管理、自我认知、社会适应和人际交往四个因子。验证性因素分析(n=500)及信效度检验结果表明该量表结构合理,拟合度良好,具有较好的结构效度及内部一致性信度,可以作为我国大学生心理社会能力的测量工具。  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号