期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Assessing the reliability of student evaluations of teaching: choosing the right coefficient

Donald Morley 《Assessment & Evaluation in Higher Education》2014,39(2):127-139

Many of the studies used to support the claim that student evaluations of teaching are reliable measures of teaching effectiveness have frequently calculated inappropriate reliability coefficients. This paper points to three coefficients that would be appropriate depending on if student evaluations were used for formative or summative purposes. Results from the present study indicated that students had very low absolute inter-rater reliability, but somewhat higher consistency inter-rater reliability. 相似文献

2.

水动力导数的系统辨识

蔡泽伟《宁波大学学报(教育科学版)》1989,(2)

本文以伪随机二进制信号(PRBS)操舵,用同时测量轨迹与航向角的方法,计算得到漂角β,从而辨识了白箱模型(2.1),获得了操纵运动的水动力导数。并与PMM试验结果进行了比较,证实了本文方法的有效性。相似文献

3.

Long-term stability of students' evaluations: A note on Feldman's “consistency and variability among college students in rating their teachers and courses”

Herbert W. Marsh Dr. J. U. Overall 《Research in higher education》1979,10(2):139-147

Feldman (1977), reviewing research about the reliability of student evaluations, reported that while class average responses were quite reliable (.80s and .90s), single rater reliabilities were typically low (.20s). However, studies he reviewed determined single rater reliability with internal consistency measures which assumed that differences among students in the same class (within-class variance) were completely random—an assumption which Feldman seriously questioned. In the present study, this assumption was tested by collecting evaluations from the same students at the end of each class and again one year after graduation. Single rater reliability based upon an internal consistency approach (agreement among different students in the same class) was similar to that reported by Feldman. However, single rater reliability based upon a stability approach (agreement between end-of-term and follow-up ratings by the same student) was much higher (medianr=.59). These results indicate that individual student evaluations were remarkably stable over time and more reliable than previously assumed. Most important, there was systematic information in individual student ratings—beyond that implied by the class average response—that internal consistency approaches have ignored or assumed to be nonexistent. 相似文献

4.

Educational Research

《The Journal of educational research》2012,105(1):5-19

Abstract

To assess the extent to which student evaluations of lecturers are accurate indicators of quality of instruction (teacher effectiveness), 128 students taking a multisection course in University Mathematics evaluated the performance of their particular lecturers. Initial tests indicated no difference among the students taking the different sections (same content) in either aptitude or expected performance. Hence, the fact that significantly positive correlations were obtained between average student performance and average student evaluations of their lecturers across the various sections suggests that student evaluations of this kind can be used as a measure of instructional effectiveness under appropriate conditions. 相似文献

5.

Effects of administrative scheduling and directions upon student ratings of instruction

Henry F. Dizney John L. Brickell 《Contemporary educational psychology》1984,9(1):1-7

Using temporal proximity of instructor/course evaluations to critical in-course events, together with psychological set provided via directions, the present investigation sought to assess (1) the appropriateness of within-course time-series analyses for instructor ratings, and (2) the potency of inducing rater perceptions of the rating process as an interactive variable in course evaluations. A 2 × 2 (temporal proximity by psychological set) factorial design used students enrolled in two sections of an undergraduate course in educational psychology as subjects. Using both midterm and final examination time periods as data collection points, the results of this investigation suggested that preexamination instructor/course evaluations tend to be more favorable than postexamination evaluations, and that a specific positive psychological set prior to evaluations tends to produce more favorable evaluations than very general, brief instructions. 相似文献

6.

Influences on peer evaluation in a group project: an exploration of leadership,demographics and course performance

Molly Dingel Wei Wei 《Assessment & Evaluation in Higher Education》2014,39(6):729-742

Collaborative learning strategies are widely used in higher education to deepen learning, promote team-building skills and achieve course learning objectives. Using peer evaluation is an important strategy to ensure that engaged and active students are rewarded for their efforts, and to discourage loafing within groups. However, less is known about what biases may influence students’ peer evaluations. In this paper, we investigate what variables students may (consciously or unconsciously) use to evaluate their peers. We explore the role of sex, race, course performance and group leadership on peer evaluation. We also investigate whether these variables correlate with students’ final course grade. We found that students who reported being leaders in groups were evaluated higher than peers who reported being followers, and that course performance positively correlated with peer evaluations. White students received higher peer evaluations than students of colour. This difference reflects trends in group leadership and course performance, with more white students than students of colour reporting being leaders in groups and receiving higher grades. 相似文献

7.

Evaluation practices for college students with LD

Ofiesh NS McAfee JK 《Journal of learning disabilities》2000,33(1):14-25

To examine current practices in the use of psychoeducational evaluations for service delivery, we surveyed 91 service providers to college students with learning disabilities. The three purposes of the survey were to determine (a) whether service delivery decisions are based on information from psychoeducational evaluations, (b) which sections of the psychoeducational report are most useful in making service delivery decisions, and (c) the respondents' satisfaction with the tests and measurements for service delivery. The findings supported the common belief that data from psychoeducational evaluations serve as the primary basis for both eligibility and specific accommodation determinations. Respondents reported that all sections of the psychoeducational evaluation written report were useful, with the least useful section being test scores and the most useful being the summary of cognitive strengths and weaknesses. However, the section used most often for service delivery decisions was the professional's recommendations. 相似文献

8.

A review of reliability procedures for measuring observer agreement

Olga Towstopiat 《Contemporary educational psychology》1984,9(4):333-352

The present article reviews the procedures that have been developed for measuring the reliability of human observers' judgments when making direct observations of behavior. Measures such as the percentage of agreement, Cohen's kappa, and phi have been used to measure observer agreement; however, these coefficients have serious limitations. In addition to specifying the deficiencies that exist with these excessively used reliability measures, the present article discusses recently developed univariate and multivariate agreement measures that are based on quasi-equiprobability and quasi-independence models and estimates. Improvements in precision are provided by such models and estimates since they (1) yield a probability based coefficient of agreement with a directly interpretable meaning, (2) correct for the proportion of “chance” agreement, and (3) allow the partitioning of the agreement and disagreement estimates within the models. 相似文献

9.

Using mid-semester course evaluation as a feedback tool for improving learning and teaching in higher education

E. Murat Sozer Mustafa Kaya 《Assessment & Evaluation in Higher Education》2019,44(7):1003-1016

The way in which mid-semester course evaluations are structured, administered and reported is important for generating rich and high-quality student feedback for the enhancement of learning and teaching. Mid-semester evaluations usually contain open-ended questions that trigger more elaborative feedback about what is going on in a class than that from end-of-semester evaluations with Likert scale-type questions. The anonymity of the process for students and the confidentiality of the process for instructors make the process more reflective for students and less stressful for instructors. This study describes how the mid-semester course evaluation process can be used as a feedback tool for improving the quality of teaching and learning at an institutional level. Through a longitudinal analysis of 341 mid-semester course evaluation reports, positive areas and areas of concern with respect to learning and teaching were identified, and changes in student evaluations over the years were examined meticulously to make an overall evaluation of the quality of learning and teaching at a non-profit Turkish university. This research showed that the value of mid-semester course evaluations can go beyond course-level if we use open-ended questions and are able to gather the reports together to make some comprehensive analysis at university level. 相似文献

10.

A Practitioner's Guide to Computation and Interpretation of Reliability Indices for Mastery Tests

Michael J. Subkoviak 《Journal of Educational Measurement》1988,25(1):47-55

From the perspective of teachers and test makers at the district or state level, current methods for obtaining reliability indices for mastery tests like the agreement coefficient and kappa coefficient are quite laborious. For example, some methods require two test administrations, whereas single administration approaches involve complex statistical procedures and require access to appropriate computer software. The present paper offers practitioners tables from which agreement and kappa coefficients can be read directly. Further-more, because these indices differ from traditional reliability coefficients, the issue of what constitutes acceptable values of agreement and kappa coefficients is also addressed 相似文献

11.

Disciplinary,Gender and Course Year Influences on Student Perceptions of Teaching: Explorations and implications

Elizabeth Santhanam Owen Hicks 《Teaching in Higher Education》2013,18(1):17-31

The prevalent use of student ratings in teaching evaluations, particularly the reliability of such data, has been debated for many years. Reports in the literature indicate that there are many factors influencing student perceptions of teaching. Three of these factors were investigated at the University of Western Australia, namely the broad discipline group, course/unit year level and student gender. Data collected over 3 years were analysed. The outcomes of this study confirmed results reported by other workers in the field that there are differences in ratings of students in different discipline groups and at different year levels. It also provided a possible explanation for the mixed results reported in studies of student gender in relation to student ratings. 相似文献

12.

Student evaluation of teaching and matters of reliability

Dennis E. Clayson 《Assessment & Evaluation in Higher Education》2018,43(4):666-681

The student evaluation of teaching process is generally thought to produce reliable results. The consistency is found within class and instructor averages, while a considerable amount of inconsistency exists with individual student responses. This paper reviews these issues along with a detailed examination of common measures of reliability that are utilised with the instruments. While inter-item consistency of the evaluations has been shown to be high, the agreement between students was shown to be no better than what would be expected by chance, indicating that students do not agree on what they are being asked to evaluate. The reliability measures generated by the student evaluations of teaching are an insufficient foundation for establishing validity. Further, the pattern of reliability indicates that the instruments are generally providing information about students, not instructors. 相似文献

13.

Freshman interest groups in criminal justice education

Stephen G. Tibbetts 《Journal of Criminal Justice Education》2013,24(2):213-228

Freshman group programs have been promoted by many institutions of higher learning. However, few empirical evaluations of such programs have been reported in the scholarly literature, and none have focused on programs in criminal justice education. Subjects in this study included 130 students enrolled in an introductory criminal justice course at one institution, a portion of which (32 students) were also enrolled in sections of a group program called Freshman Interest Groups. Results showed that students enrolled in this program did better in the course than their freshmen peers and upper-class students, which analyses showed was largely due to higher attendance. 相似文献

14.

Online graduate educational technology program: An illuminative evaluation

《Studies in Educational Evaluation》2016

相似文献

15.

The stability of teacher performance in the same course over time

Philip L. Smith 《Research in higher education》1979,11(2):153-165

Traditional reliability studies report the stability of teaching performance, as measured by student course/instructor evaluations, as ranging from .40 to .90, depending upon the study. Recently, generalizability theory has been applied to these rating data to more specifically pinpoint sources of error in student course evaluations. The purpose of the present study is to (1) familiarize the reader with some of the work that has been done with student rating data using generalizability theory and (2) to illustrate the use of the theory on a problem that in the past has been examined using traditional reliability estimates. Specifically, the paper addresses the stability of teacher performance across time in the same course. 相似文献

16.

Scorer reliability on the AAMD adaptive behavior scale,public school version,part one

Thelma Givens 《Psychology in the schools》1980,17(3):335-338

Two groups of scorers were used to investigate the effects of in-service training in scoring and the reliability of scorers with the AAMD Adaptive Behavior Scale, Public School Version, Part One. A mean scorer reliability of .55 was found. There was a significant difference between the coefficients of agreement computed for the two groups. This established the efficacy of in-service training in scoring. Additionally, most frequent scoring errors were delineated. 相似文献

17.

Adventures in Flipping College Algebra

Jenna Van Sickle 《PRIMUS》2015,25(8):600-613

Abstract

This paper outlines the experience of a university professor who implemented flipped learning in two sections of college algebra courses for two semesters. It details how the courses were flipped, what technology was used, advantages, challenges, and results. It explains what students do outside of class, what they do inside class, and discusses what students thought of the course based on surveys and course evaluations. It also compares results on a similar final exam to those of a control group. 相似文献

18.

Process Evaluation of a Career Course: A Replication and Extension

Stacie H. Vernick Robert C. Reardon James P. Sampson Jr. 《Journal of Career Development》2004,30(3):201-213

A process evaluation of a three-credit, upper division career course at a southeastern university is described. Student responses to a standardized course rating instrument were analyzed. Anonymous course evaluations of 293 students enrolled in 11 sections of the career course were compared to student ratings of one section of the course 5 years earlier, and normative data taken from summary reports provided by the university. Results indicated that student perceptions of the course are consistent over time, and more positive in classes meeting multiple times per week. The course was rated higher in course demands, student-instructor involvement, and course organization than other courses. The career course was rated lower in student interest than other courses. Implications of the findings and directions for future research are discussed. Course designers and instructors might benefit from using the information gleaned from this study in structuring career courses and in planning course schedules. 相似文献

19.

AN EMPIRICAL MODEL FOR FACULTY TIME ANALYSIS

Colin Campbell 《Assessment & Evaluation in Higher Education》1982,7(2):181-185

This report contributes to the discussion of faculty time analysis by outlining the development of a model for estimating the time involved in teaching a university course. Over a period of six years, systematic time analysis has assisted in determining the principal components and empirical coefficients of the model. Data are reported showing reasonable agreement between model estimates and time analysis for the courses specified. 相似文献

20.

Course Factors that Motivate Students to Submit End-of-Course Evaluations

Caroline M. Jaquett Victoria G. VanMaaren Robert L. Williams 《Innovative Higher Education》2017,42(1):19-31

We surveyed students (N = 152) in several sections of an undergraduate educational psychology course to determine what course factors would most motivate them to submit course evaluations. The survey directed students to choose among several pairs of course characteristics as to their relative impact on their decision to submit a course evaluation. After tabulating their choices, we ranked the course characteristics for the total sample and then for various demographic and performance subgroups. In general, students indicated that positive aspects of a course would motivate them to submit course evaluations more than would negative aspects. 相似文献