首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 48 毫秒
1.
《教育实用测度》2013,26(3):231-244
For any testing program intended for licensure, certification, competency, or proficiency, the estimation of content relevant test scores for pass/fail decision making is necessary. This study compares number-correct scoring to empirical option weighting in the context of such tests. The study was conducted under two test design conditions, three test length conditions, and four passing score levels. Two criteria were used to evaluate the effectiveness of empirical option weighting versus number-correct scoring. Empirical option weighting typically produced slightly more reliable domain score estimates and more consistent pass/fail decisions than number-correct scoring, particularly in the lower half of the test score distribution. For many types of testing programs where the passing scores are established in the lower half of the test score distribution, the empirical option weighting method used in this study seems both appropriate and effective in improving the depend- ability of test scores and the consistency of pass/fail decisions. Test users, however, must weigh the effort required to use option weighting against the small gains obtained with this method. Other problems are discussed that may limit the usefulness of option weighting.  相似文献   

2.
Many U.S. students must pass a standards-based exit exam to earn a high school diploma. The degree to which exit exams and state standards properly signal to students their preparedness for postsecondary schooling has been questioned. The alignment of test scores with college grades for students at the University of Arizona (n = 2,667) who took the Arizona high school exams was ascertained in this study. The pass/fail signal accuracy of test scores varied depending on subject: The writing cut score was well aligned with collegiate performance, the reading cut score was below expectations, and the mathematics cut score was set quite rigorously. High school content and performance standards might not be as diluted as prior research has suggested.  相似文献   

3.
《教育实用测度》2013,26(4):277-288
The logic of using a sequential decision-making approach using college grade point average (GPA) and test scores for teacher licensure decisions within the conjunctive decision-making model is discussed and contrasted with the compensatory model for decision making. The major issues to consider in choosing between the conjunctive and compensatory models are (a) whether there is a linear (or at least monotonically increasing) relationship between each predictor variable and the criterion variable and (b) whether an excess of one predictor variable can compensate for a deficiency in another variable. If linearity (monotonicity) and compensation are reasonable assumptions, the compensatory model is generally preferred. However, for nonlinear and noncompensatory situations, such as teacher licensure where the purpose is to set a cutoff score to represent minimal competence, not to predict the degree of success, the conjunctive model is more appropriate. Some have suggested using college GPA and licensure tests in a compensatory model for licensure decisions. However, given the lack of content validity of the GPA for the purpose of licensure, and given the lack of comparability of GPAs across institutions (and indeed within institutions), a compensatory model using test scores and college GPA seems inappropriate.  相似文献   

4.
This real‐data‐guided simulation study systematically evaluated the decision accuracy of complex decision rules combining multiple tests within different realistic curricula. Specifically, complex decision rules combining conjunctive aspects and compensatory aspects were evaluated. A conjunctive aspect requires a minimum level of performance, whereas a compensatory aspect requires an average level of performance. Simulations were performed to obtain students' true and observed score distributions and to manipulate several factors relevant to a higher education curriculum in practice. The results showed that the decision accuracy depends on the conjunctive (required minimum grade) and compensatory (required grade point average) aspects and their combination. Overall, within a complex compensatory decision rule the false negative rate is lower and the false positive rate higher compared to a conjunctive decision rule. For a conjunctive decision rule the reverse is true. Which rule is more accurate also depends on the average test reliability, average test correlation, and the number of reexaminations. This comparison highlights the importance of evaluating decision accuracy in high‐stake decisions, considering both the specific rule as well as the selected measures.  相似文献   

5.
The article presents a framework for combining multiple measures to reach high-stakes decisions. Criteria are identified for the employment of conjunctive, compensatory, and complementary approaches to combining measures. The framework is illustrated through the documentation of the School District of Philadelphia's initiative to employ multiple measures, including standardized test scores, to determine promotion decisions. The author demonstrates that the use of multiple measures itself does not necessarily improve the reliability and validity of the decisions. It is the logic by which the measures are combined that determines the accuracy and appropriateness of the decisions reached.  相似文献   

6.
鉴于来华预科留学生汉语写作水平普遍较低的现状,对留学生写作策略的使用情况进行实证性研究是很有必要的。文章通过调查发现:留学生们在写作前阶段使用最多的为识别策略;在写作时阶段使用较多的是补偿策略中的近义词替代、母语替代和简化意义策略;在修改阶段使用较多的是积极接受反馈策略;写作策略使用与新HSK四级写作成绩呈正相关,与新HSK五级写作成绩呈负相关;不当的作文修改步骤和方法能有效预测汉语写作成绩;低水平的写作者往往倾向于选择计划性差、耗时量大、对写作策略使用熟练程度要求高的具体写作策略.结果往往适得其反。  相似文献   

7.
This article chronicles the evolution of a programmatic line of research on strategic writing instruction for adolescents with learning disabilities (LD) conducted by staff and affiliates of the University of Kansas Center for Research on Learning. The goal associated with this research is that students with LD learn the writing skills that they need to succeed in high school and beyond and that their skills are comparable to the skills of their peers. Individual studies have shown that adolescents with LD can master a given writing strategy and can apply that strategy to novel prompts and in general education classes. Moreover, they can learn simple writing strategies from computerized programs. They can also maintain use of a writing strategy over time. When students learn several writing strategies, their scores on standardized tests improve, and their writing competency is comparable to that of peers. Studies have also shown that teachers can teach the writing strategies and achieve successful results. Care must be taken, however, to ensure that students with LD receive the instruction under conditions where they have multiple opportunities to reach mastery on each skill and receive individualized feedback on practice attempts. Overall, the research has shown that adolescents with LD can learn complex writing skills such as planning, writing, and editing multiparagraph themes; can apply these skills to tasks that are assigned in required general education courses; and can be successful in those courses.  相似文献   

8.
The Motivation for Educational Attainment (MEA) questionnaire, developed to assess facets related to early adolescents’ motivation to complete high school, has a bifactor structure with a large general factor and three smaller orthogonal specific factors (teacher expectations, peer aspirations, value of education). This prospective validity study investigated the utility of each factor in predicting high school dropout or completion of a general education development (GED) certificate versus completion of a high school degree. Participants were 474 (55.1% male) ethnically diverse students who were originally recruited into a larger longitudinal study in Grade 1 on the basis of academic risk. Fourteen years later, 373 had obtained a high school diploma, 15 had obtained a GED, and 86 had dropped out of high school. During their first year of Grade 9, participants were administered the MEA. Using multinomial logistic regression with high school graduation as the reference outcome and controlling for Grade 9 letter grades, reading and math test scores, gender, and ethnic/racial group status, scores on the latent general factor and the latent peer aspirations factor predicted high school dropout versus high school graduation status. Neither the general factor nor any of the three specific factors predicted GED completion versus high school graduation. Ethnicity, but not gender, moderated the associations between scores on the general factor and high school graduation versus dropout.  相似文献   

9.
10.
《教育实用测度》2013,26(4):419-426
Over a decade passed between the landmark legal challenge to Florida's high school graduation test (Debra P. v. Turlington, 1984) and the legal challenge to the Texas high school graduation test (GI Forum v. Texas Education Agency, 2000). However, the popularity of accountability systems heavily weighted with scores from a variety of tests, including tests for graduation and promotion from grade to grade, makes these and other tests inviting targets for challenge and possible litigation. The recent legal challenge in Texas has broad implications for state assessment programs. The legal success surrounding the Texas case was not just good fortune. In this article, I (a former state test director whose responsibilities included supervision of a high school graduation test) discuss implications and issues associated with the Texas case.  相似文献   

11.
Over the last 3 decades student achievement has remained essentially unchanged in the United States, but not for a lack of spending. Over the same period a myriad of education reforms have been suggested and per-pupil spending has more than doubled. Since the 1990s the education reform attempts have frequently included judicial decisions to revise state school finance systems. Invoking general clauses about the need for an adequate education found in every state constitution, judges in more than half of the states waded into the development of finely tuned reform strategies. This article empirically estimates the effect of judicial intervention on student achievement using standardized test scores and graduation rates in 48 states from 1992 to 2005. We find no evidence that court-ordered school spending improves student achievement.  相似文献   

12.
A high school education prepares young people to participate positively in the economy and in civic life, among other positive life outcomes. However, nearly one in five American high school students does not graduate from high school on time, if ever. Progress has been made on understanding why students fail to complete high school and on raising graduation rates. Previous reviews and syntheses of this literature have focused on identifying factors that put students at risk for dropping out of school. Less is understood regarding what assets can promote high school graduation. Therefore, we reviewed research from the past 25 years on high school graduation, focusing on longitudinal, US-based studies of malleable factors that predict graduation. Through this systematic search, we identified 12 assets in individual, family, school, peer, and community contexts, which predict high school graduation, as well as identified assets for which more research is needed. Implications for policy and practice and directions for future research are discussed.  相似文献   

13.
This paper reports two studies of standard setting using Angoff's method. Results of the first study suggest that specialization within broad content areas does not affect an expert's estimates of the performance of the borderline group. This is reassuring because the knowledge base of many professions is so large that no individual can be considered an expert in all aspects of it. Results of the second study support the recommendation that performance data be provided during the standard-setting process. They are frequently used by experts, but will not have an impact on the standard unless the distribution of item difficulties is skewed markedly. It also increases the correspondence between p-values and estimates of borderline group performance, thereby reducing errors in pass/fail decisions. Overall, the results support recommendations often made in standard-setting literature, but they need to be replicated with other groups of experts  相似文献   

14.
The claim that writing facilitates students’ learning, although widely accepted, has little support from empirical research. A possible explanation for the lack of empirical evidence is that writing-to-learn research has disregarded that students use different writing strategies. The purpose of the present experimental study is to test whether it is effective to adapt writing-to-learn tasks to different writing strategies when teaching literature. A course “Learning to write argumentative texts about literature” was developed in two different versions: one adapted to a planning writing strategy, the other to a revising writing strategy. Participants were 113 tenth-grade high school students in the Netherlands. Our hypothesis is an adaptation hypothesis: we expect that the more a student will use a planning writing strategy, the more the student will profit from the lessons in the planning condition, and that the more a student uses a revising writing strategy, the more beneficial the revising condition will be. However, results show that for improving literary interpretation skill, a course adapted to the planning writing strategy is more effective for almost all students.  相似文献   

15.
Career academies serve an increasingly wide range of students. This paper examines the contemporary profile of students entering career academies in a large, diverse school district and estimates causal effects of participation in one of the district's well-regarded academies on a range of high school and college outcomes. Exploiting the lottery-based admissions process of this technology-focused academy, we find that academy enrollment increases the likelihood of high school graduation by about 8 percentage points and boosts rates of college enrollment for males but not females. Analysis of intermediate outcomes suggests that effects on attendance and industry-relevant certification at least partially mediate the overall high school graduation effect.  相似文献   

16.
Use of the Rasch IRT Model in Standard Setting: An Item-Mapping Method   总被引:1,自引:0,他引:1  
This article provides both logical and empirical evidence to justify the use of an item-mapping method for establishing passing scores for multiple-choice licensure and certification examinations. After describing the item-mapping standard-setting process, the rationale and theoretical basis for this method are discussed, and the similarities and differences between the item-mapping and the Bookmark methods are also provided. Empirical evidence supporting use of the item-mapping method is provided by comparing results from four standard-setting studies for diverse licensure and certification examinations. The four cut score studies were conducted using both the item-mapping and the Angoff methods. Rating data from the four standard-setting studies, using each of the two methods, were analyzed using item-by-rater random effects generalizability and dependability studies to examine which method yielded higher inter-judge consistency. Results indicated that the item-mapping method produced higher inter-judge consistency and achieved greater rater agreement than the Angoff method.  相似文献   

17.
Rater training is an important part of developing and conducting large‐scale constructed‐response assessments. As part of this process, candidate raters have to pass a certification test to confirm that they are able to score consistently and accurately before they begin scoring operationally. Moreover, many assessment programs require raters to pass a calibration test before every scoring shift. To support the high‐stakes decisions made on the basis of rater certification tests, a psychometric approach for their development, analysis, and use is proposed. The circumstances and uses of these tests suggest that they are expected to have relatively low reliability. This expectation is supported by empirical data. Implications for the development and use of these tests to ensure their quality are discussed.  相似文献   

18.
This study examines the predictive validity of three commonly used nursing school admission indices, that is, scholastic aptitude test scores, matriculation grades, and evaluations of performance in a group interview situation, in a sample of 321 Israeli nursing school students. Grade point average, supervisor evaluation of clinical internship, and scores on a government certification exam served as primary indices of criterion performance. Whereas composite aptitude test scores correlated moder ately with both grade point average and certification exam scores, matriculation grades correlated negligibly with all three criterion measures. Group interview ratings correlated moderately with clinical performance, but negligibly with the remaining criteria. Aptitude test scores were not found to be biased predictors of criterion performance by ethnicity or social background. The implications of these findings for the selection of nursing school candidates in Israel are discussed.  相似文献   

19.
Many adolescents, particularly adolescents with disabilities, have difficulty with literacy tasks such as reading and writing. Yet research has found that when students with disabilities receive appropriate instruction, they typically are able to improve their overall writing outcomes. This study explored the effectiveness of a summary writing strategy taught through a mnemonic device (WINDOW) with prompts for self-regulation on the summary writing and reading comprehension of high school students with disabilities. Results indicated that the students who received the intervention wrote longer and higher quality summaries and improved their reading comprehension scores after learning the strategy when compared to a control group. These outcomes indicate that the WINDOW strategy may have the potential to improve reading and writing outcomes for high school students with disabilities. Implications for practice and future research are discussed.  相似文献   

20.
We study the effects of closing and restarting low-performing schools as charter schools in New Orleans and Baton Rouge. Using matched difference-in-differences identification with students in Louisiana, we estimate effects for the students who attended the treated schools at the time of treatment. We find positive and precise effects of closure/restart on elementary/middle school test scores, but no clear effects on high school graduation or college entry in New Orleans. However, in Baton Rouge high schools, the interventions reduced high school graduation by 11–15 percentage points. We also provide evidence about how and why these effects emerge. The variation in test score effects within and across cities is positively related to the increase in school value-added that treated students experienced and negatively related to student age/grade and the extent of student disruption. The effects of school closure and restart therefore depend, predictably, on policy design and implementation. This work builds on prior closure/restart research and helps explain the positive effects of the post-Katrina school reforms in New Orleans.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号