期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Princesses are bigger than elephants: Effect size as a category error in evidence‐based education

Adrian Simpson 《British Educational Research Journal》2018,44(5):897-913

Much of the evidential basis for recent policy decisions is grounded in effect size: the standardised mean difference in outcome scores between a study's intervention and comparison groups. This is interpreted as measuring educational influence, importance or effectiveness of the intervention. This article shows this is a category error at two levels. At the individual study level, the intervention plays only a partial role in effect size, so treating effect size as a measure of the intervention is a mistake. At the meta‐analytic level, the assumptions needed for a valid comparison of the relative effectiveness of interventions on the basis of relative effect size are absurd. While effect size continues to have a role in research design, as a measure of the clarity of a study, policy makers should recognise the lack of a valid role for it in practical decision‐making. 相似文献

2.

Advances in the assessment of social competence: Findings from a preliminary investigation of a general outcome measure for social behavior

Kelli D. Cummings Ruth A. Kaminski Kenneth W. Merrell 《Psychology in the schools》2008,45(10):930-946

This study describes the initial validation of an innovative social‐‐behavioral observational assessment tool that is designed to be used on a repeated basis to assess growth and development of social competence over time to: (a) identify the social functioning of all students, (b) assist in planning support for students at risk, and (c) evaluate the effectiveness of individual and system‐wide interventions. Eighteen first‐grade students were monitored over an 8‐week period using the Initiation‐Response Assessment (IRA) Code. The School Social Behavior Scales, a published teacher rating scale, was included as a criterion measure. Estimates of reliability and criterion‐related validity were calculated for the IRA. The measure's sensitivity to growth over time and between‐group variability were also assessed using hierarchical linear modeling procedures. Results indicate that scores on this measure are stable, and tap constructs similar to those assessed via teacher rating. © 2008 Wiley Periodicals, Inc. 相似文献

3.

基于净现值理论的精品课程网站投资效益评价 总被引：1，自引：1，他引：0

钟元生杨瑜李卫群《现代教育技术》2010,20(3):93-96

为给精品课程网站的投资效益评价提供一个客观依据,提出了一种课程网站投资效益的定量分析模型。该模型将课程网站各种投入与效益以货币计量单位进行估算,将各年的净现金流换算为立项年份的净现值,以各年净现值之和作为课程网站的投资效益,并以某高校2004年立项的13门校级精品课程为例说明了该模型的使用。相似文献

4.

汉语名词和量词组合的认知研究

刘顺刘雪芹《南京师范大学文学院学报》2010,(2):182-188

汉语名词计量方式的认知基础是事物的离散性特征,离散性强的事物,投射到语言中,相应名词的空间性就强;离散性弱的事物,投射到语言中,相应名词的空间性也弱。强空间性名词可以适用所有类别的量词,弱空间性名词不能适用个体量词,根据其空间性程度,只能适用度量量词、部分量词、借用量词、种类量词,或完全不能与量词组合。名词与量词的组合不是任意的,是有理可据的。相似文献

5.

A critique of “unequal educational opportunity" 1

Jacob Goldstein 《教育心理学家》2013,48(3):332-344

Using the 159 counties of Georgia as statistical units Osborne has presented data indicating (a) a positive relationship between percentage of non‐Whites and per pupil expenditures; (b) a negative relationship between per pupil expenditures and mean achievement and intelligence scores; and (c) a negative relationship between percentage of non‐Whites and mean intelligence and achievement scores. Osborne has interpreted these findings as indicating that the attempt to raise the achievement levels of previously deprived groups has been a failure, and has used this interpretation as a basis for a more general negative conclusion concerning the effectiveness of compensatory efforts. The present analysis indicates, however, that neither of these conclusions follows from his data. The evidence suggests rather that Osborne's negative correlations reflect in large measure long‐range effects of earlier conditions. 相似文献

6.

Project Clarion: Three Years of Science Instruction in Title I Schools among K-Third Grade Students

Kyung Hee Kim Joyce VanTassel-Baska Bruce A. Bracken Annie Feng Tamra Stambaugh Lori Bland 《Research in Science Education》2012,42(5):813-829

The purpose of the study was to measure the effects of higher level, inquiry-based science curricula on students at primary level in Title I schools. Approximately 3,300 K-3 students from six schools were assigned to experimental or control classes (N?=?115 total) on a random basis according to class. Experimental students were exposed to concept-based science curriculum that emphasized ??deep learning?? though concept mastery and investigation, whereas control classes learned science from traditional school-based curricula. Two ability measures, the Bracken Basic Concept Scale-Revised (BBCS-R, Bracken 1998) and the Naglieri Nonverbal Intelligence Test (NNAT, Naglieri 1991), were used for baseline information. Additionally, a standardized measure of student achievement in science (the MAT-8 science subtest), a standardized measure of critical thinking, and a measure for observing teachers?? classroom behaviors were used to assess learning outcomes. Results indicated that all ability groups of students benefited from the science inquiry-based approach to learning that emphasized science concepts, and that there was a positive achievement effect for low socio-economic young children who were exposed to such a curriculum. 相似文献

7.

The boder test: Neuropsychological and demographic features of dyslexic subtypes

Cathy F. Telzrow Evelyn Century Barbara Whitaker Carol Redmond Barbara Zimmerman 《Psychology in the schools》1983,20(4):427-432

The Boder Test may represent a viable screening instrument for the identification of dyslexia and dyslexic subtypes. Proportions of the 30 LD children studied identified by Boder's classification system as dysphonetic (63.3%), dyseidetic (6.7%), and mixed dysphonetic-dyseidetic (13.3%) were similar to those reported in earlier studies. Neuropsychological characteristics associated with the Boder categories were consistent with the literature: Significantly fewer dysphonetic readers were represented in the V > and Spatial<Sequential IQ groups, and left-handedness and left-hand tapping preference were overrepresented in the mixed dyslexic category. Black children who had been identified as learning disabled on the basis of other tests were categorized as normal readers by the Boder, suggesting its possible use as a nonbiased measure of reading. 相似文献

8.

Moral and human rights education: the contribution of the United Nations

《Journal of moral education》2012,41(1):115-132

Moral education can take many forms. With the end of the United Nations Decade for Human Rights Education (UNDHRE) (1995–2004), we critically review developments in human rights education (HRE) during those ten years in the context of moral education. We argue that, despite some modest successes, the decade lacked direction and a major impact and has failed to prepare a sound basis for securing HRE internationally. These outcomes largely account for the United Nations' (UN) decision in 2005 to initiate the World Programme for Human Rights Education. Meanwhile initiatives in defining the goals and practice of HRE have happened outside the UN context. Overall the UN's contribution to building HRE and moral education has, at best, been marginally successful due in large measure to the inherent weaknesses of the organisation as well as the UN's inability to engage member states. 相似文献

9.

Student performance in off-campus programs

E. Michael Walsh Jehiel Novick Michael Andrechak 《Innovative Higher Education》1979,3(4):230-241

The quality of degree programs offered off-campus, particularly on military bases, has become an issue affecting many institutions of higher education. This study asked the question: Do students in off-campus programs perform as well as students in on-campus programs? As a basis for comparison, only courses taught on- and off-campus by the same instructor were chosen. Six courses, taught 37 times, by ten instructors, met these criteria. The subjects included the 649 undergraduate students from Southern Illinois University-Carbondale who received letter grades in these courses between 1975 and 1977. As a measure of performance, the grades of on- and off-campus students were compared by means of at-test. The mean grade of off-campus students (3.34) was not significantly different (at the .01 level) from that of on-campus students (3.29). As a control, faculty were interviewed to determine the equivalence in content and rigor of courses they taught in both settings. Faculty generally responded that their on- and off-campus courses were equivalent in content and rigor, supporting the use of grades as a measure of student performance. These results indicate that not only do faculty maintain their standards while teaching off-campus, but that the academic performance of off-campus students equals that of on-campus students. 相似文献

10.

Out With the Old-In With the New: Thoughts on the Future of Educational Productivity Research

《Peabody Journal of Education》2013,88(3):31-56

The purpose of this article is to present a review of-and to provide insight into future directions for research on-the literature surrounding the efficient production of educational outcomes. Research studies on educational productivity and efficiency generally support 2 dominant paradigms within the field of school finance: (a) Money Does Not Matter and (b) Money Does Matter. After a brief historical review, the article critiques the cost-minimization assumptions implicit to both paradigms, using public choice theory as a basis to acknowledge that nonmarket forces influence educational productivity. Next, suggestions are made surrounding normative economics concepts that need to be expanded, explored, and operationalized to measure educational efficiency more appropriately. Finally, it is suggested that researchers investigate at least 3 nontraditional methods to measure levels of economic efficiency in public educational organizations: (a) modified quadriform analysis, (b) data envelopment analysis, and (c) stochastic frontier analysis. 相似文献

11.

Establishing a crosswalk between the Common European Framework for Languages (CEFR) and writing domains scored by automated essay scoring

Mark D. Shermis 《教育实用测度》2018,31(3):177-190

ABSTRACT

This article employs the Common European Framework Reference for Language Acquisition (CEFR) as a basis for evaluating writing in the context of machine scoring. The CEFR was designed as a framework for evaluating proficiency levels of speaking for the 49 languages comprising the European Union. The intent was to impact language instruction so that “mastery” of one language has the same meaning as it does in another. A second objective is to provide a crosswalk for what one automated writing evaluation (AWE) system does in attending to the dimensions of the framework. The CEFR Framework is divided into five traits and different proficiency levels. The question then becomes: Does the AWE system attempt to measure these dimensions of writing? And, if so, how is this operationalized? Is it measuring aspects of communication that are not specified? The goal here is to create a common vocabulary between the writing community and those interested in AWE systems as to what is actually being measured by their software, and mapping that to a developmental scale of writing performance. 相似文献

12.

Designing Intervention Studies: Selected Populations,Range Restrictions,and Statistical Power

《Journal of research on educational effectiveness》2013,6(4):556-569

ABSTRACT

An appropriate estimate of statistical power is critical for the design of intervention studies. Although the inclusion of a pretest covariate in the test of the primary outcome can increase statistical power, samples selected on the basis of pretest performance may demonstrate range restriction on the selection measure and other correlated measures. This can result in attenuated pretest–posttest correlations, reducing the variance explained by the pretest covariate. We investigated the implications of two potential range restriction scenarios: direct truncation on a selection measure and indirect range restriction on correlated measures. Empirical and simulated data indicated that direct range restriction on the pretest covariate greatly reduced statistical power and necessitated sample size increases of 82%–155% (dependent on selection criteria) to achieve equivalent statistical power to parameters with unrestricted samples. However, measures demonstrating indirect range restriction required much smaller sample size increases (32%–71%) under equivalent scenarios. Additional analyses manipulated the correlations between measures and pretest–posttest correlations to guide planning experiments. Results highlight the need to differentiate between selection measures and potential covariates and to investigate range restriction as a factor impacting statistical power. 相似文献

13.

Third wave of measurement in the self-regulated learning field: when measurement and intervention come hand in hand

Ernesto Panadero Julia Klug Sanna Järvelä 《Scandinavian Journal of Educational Research》2016,60(6):723-735

Measurement is a central issue for the self-regulated learning (SRL) field as SRL is a phenomenon difficult to measure in a reliable and valid way. Here, 3 waves in the history of SRL measurement are identified and profiled. Our focus lies on the third and newest one, which combines measurement and intervention within the same tools. The basis for this approach is located in the reactivity principle via students’ self-monitoring: when students are aware of their actions, they can react and change what is needed. That happens when the measurement tools promote students' self-monitoring which turn part of the intervention then. Examples of this new approach to SRL measurement and guidelines for implementing it are presented. 相似文献

14.

A scale to measure educators’ musical skills in early childhood education

《Studies in Educational Evaluation》2021

Evaluating skills of students training to become teachers in early childhood education (ECE) is a key measure to improve their training and, subsequently, to bring about improvements in the way they train their pupils. No research literature specifically describing a scale designed to measure educators’ musical skills at the ECE level has been previously published. In view of this lack, we carried out the customary procedures for designing and validating a psychological measurement scale: on the basis of a sample of university students (n = 209), we created a valid, reliable tool that allows researchers to evaluate and quantify how teacher trainees perceive their own musical skills. By applying EFA, Parallel Analysis, and CFA, we observed the emergence of four differentiated categories distributed along 25 items in the questionnaire’s final version. To improve and refine this tool, further research and study replication in a series of different educational contexts would be required. 相似文献

15.

A Comparison of Piaget's and Kohlberg's Theories and Tests for Moral Judgment

Harald R⊘rvik 《Scandinavian Journal of Educational Research》2013,57(3):99-124

Abstract:R?rvik, H. 1980. A Comparison of Piaget's and Kohlberg's Theories and Tests for Moral Judgment. Scandinavian Journal of Educational Research 25,99‐124. Piaget's and Kohlberg's theories for moral judgment are compared. On the basis of this comparison, hypotheses are formulated regarding expected relationships between the tests constructed on the basis of the two theories. The empirical testing of these hypotheses indicates that there are marked similarities between Piaget's and Kohlberg's tests as to characteristics measured, power of discrimination between age levels, and in the stage placement of subjects.

The main differences between the tests seem to be that Piaget's test is most influenced by the personal relationship to other persons. Contrary to the impression given by the theorist himself, Kohlberg's test seems to a larger extent to measure the subjects’ norms and emotional reactions connected to inter‐nalization of norms. Moral behavior is more closely related to Kohlberg's measure. 相似文献

16.

Variables in adaptive decisions in individualized instruction 1

James G. Holland 《教育心理学家》2013,48(2):146-161

相似文献

17.

Comparing secondary teachers on logical consistency in educational philosophy and flexibility in teaching

Henry R. Weinstock Robert J. Starr Charles J. Fazzaro 《Instructional Science》1974,3(2):115-126

This study of secondary inservice teachers was designed to measure the possible relationship between the consistency with which they logically relate philosophical views (theory) to educational ideas (practice) and their teaching flexibility (as demonstrated in actual teaching practice).Using the GNC Scale of Logical Consistency of Ideas about Education, two groups of teachers were identified, i.e., those who were logically consistent in their ideas about education and those who were not so. Each of the logically consistent teachers was found to be so within an empirical, rather than rationalistic framework of educational theory.Flexibility was ascertained by data gathered through the use of the Flanders Verbal Interaction System. Each teacher tape recorded his/her own classes and then completed a Flanders' Matrix.The Mann-Whitney U Test was used as a basis for the statistical analysis. Neither group was found to be either more flexible or to exhibit more indirect behavior within the classroom, i.e., being logically consistent in ideas about education (as measured by the GNC Scale) was not found to be related to being flexible in teaching (as measured by the FVIS). 相似文献

18.

种类量词的范围确定、语义特征和研究展望

冯冬梅《柳州职业技术学院学报》2009,9(1):119-121

提出使用“种”同义替换的方法作为确定种类量词的形式标准,确定了18个种类量词。归纳出种类量词的语义特征,即表示种属类别、表示对象通用、表数“非止一个”等。指出当前种类量词研究方面存在的不足。并展望种类量词研究的发展方向。相似文献

19.

Investigating the validity of two widely used quantitative text tools

James W. Cunningham Elfrieda H. Hiebert Heidi Anne Mesmer 《Reading and writing》2018,31(4):813-833

In recent years, readability formulas have gained new prominence as a basis for selecting texts for learning and assessment. Variables that quantitative tools count (e.g., word frequency, sentence length) provide valid measures of text complexity insofar as they accurately predict representative and high-quality criteria. The longstanding consensus of text researchers has been that such criteria will measure readers’ comprehension of sample texts. This study used Bormuth’s (1969) rigorously developed criterion measure to investigate two of today’s most widely used quantitative text tools—the Lexile Framework and the Flesch–Kincaid Grade-Level formula. Correlations between the two tools’ complexity scores and Bormuth’s measured difficulties of criterion passages were only moderately high in light of the literature and new high stakes uses for such tools. These correlations declined a small amount when passages from the University grade band of use were removed. The ability of these tools to predict measured text difficulties within any single grade band below University was low. Analyses showed that word complexity made a larger contribution relative to sentence complexity when each tool’s predictors were regressed on the Bormuth criterion rather than their original criteria. When the criterion was texts’ grade band of use instead of mean cloze scores, neither tool classified texts well and errors disproportionally placed texts from higher grade bands into lower ones. Results suggest these two text tools may lack adequate validity for their current uses in educational settings. 相似文献

20.

A comparison of methods of observation in preservice teacher training

Nathan Stoller Gerald S. Lesser Philip I. Freedman 《Educational technology research and development : ETR & D》1964,12(2):177-197

Summary This study tested the hypothesis that different techniques of classroom observation result in different degrees of learning by teachers-in-training. Specifically, it was predicted that kinescope recordings (prepared in advance) provide a more effective medium of observation than closed-circuit television and that TV observation is in turn more effective than the traditional procedure of direct observation in the classroom. The logical theoretical basis for this hypothesis and the special conditions of experimentation used in this study were elaborated. Measures of two dependent variables were used to test this hypothesis. One measure of the students’ response to these observational techniques, an objective multiple-choice measure of information about methods of teaching, failed to confirm the hypothesis, but did show systematic variation with several other experimental variables. The other measure, an essay examination assessing ability to evaluate an observed classroom lesson critically, revealed strong confirmation of the hypothesis. Several other results emerged. One significant finding indicated that when used by certain instructors, the differential effect of the observational condition can outweigh the very great importance of general scholastic ability as a correlate of gain in learning. Interpretations of these data were made to clarify the role of classroom observation in the teacher training process. This research was supported by a grant from the Educational Media Branch of the U.S. Office of Education. 相似文献