Similar literature
20 similar documents found (search time: 15 ms)
1.
The use of curriculum-based measurement (CBM) is supported by several decades of research regarding its technical adequacy, practical utility, and use with diverse populations. Questions remain regarding the measurement of growth using tri-annual reading CBM (R-CBM) assessment. Existing research on annual rates of growth is inconclusive, with some studies suggesting that the most growth occurs from fall to winter and others finding that growth accelerates from winter to spring. With a sample of students in third through fifth grades, consisting of a high percentage of English Language Learners (ELLs) and students exited from bilingual instruction, this study found that non-ELL students demonstrated semester differences in R-CBM growth consistent with prior research in third and fourth grades, but not in fifth grade. However, this pattern was not always consistent for ELLs and students recently exited from bilingual instruction, suggesting that conclusions regarding semester differences in R-CBM growth rates should not be consistently extended to ELLs.

2.
This study evaluated the classification accuracy of a second grade oral reading fluency curriculum‐based measure (R‐CBM) in predicting third grade state test performance. It also compared the long‐term classification accuracy of local and publisher‐recommended R‐CBM cut scores. Participants were 266 students who were divided into a calibration sample (n = 170) and two cross‐validation samples (n = 46; n = 50). Using calibration sample data, local fall, winter, and spring R‐CBM cut scores for predicting students’ state test performance were developed using three methods: discriminant analysis (DA), logistic regression (LR), and receiver operating characteristic curve analysis (ROC). The classification accuracy of local and publisher‐recommended cut scores was evaluated across subsamples. Only DA and ROC produced cut scores that maintained adequate sensitivity (≥.70) across cohorts; however, LR and publisher‐recommended scores had higher levels of specificity and overall correct classification. Implications for developing local cut scores are discussed.
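The classification-accuracy terms used in this abstract (sensitivity, specificity, overall correct classification) can be illustrated with a minimal Python sketch. It is not the authors' procedure: the function name `classification_stats`, the scores, and the pass/fail outcomes are all hypothetical, and the rule "flag a student as at risk when the score is below the cut" is an assumption made for illustration.

```python
def classification_stats(scores, passed, cut):
    """Flag students scoring below `cut` as at risk and compare the flags
    with actual outcomes on the later test (hypothetical data)."""
    tp = fp = tn = fn = 0
    for score, did_pass in zip(scores, passed):
        flagged = score < cut      # screening decision
        failed = not did_pass      # actual outcome
        if flagged and failed:
            tp += 1                # correctly flagged
        elif flagged and not failed:
            fp += 1                # flagged, but passed
        elif not flagged and failed:
            fn += 1                # missed a student who failed
        else:
            tn += 1                # correctly not flagged
    return {
        "sensitivity": tp / (tp + fn),        # share of failing students flagged
        "specificity": tn / (tn + fp),        # share of passing students not flagged
        "accuracy": (tp + tn) / len(scores),  # overall correct classification
    }


# Hypothetical words-correct-per-minute scores and later pass/fail outcomes.
scores = [35, 42, 50, 58, 61, 70, 75, 88, 90, 104]
passed = [False, False, False, True, False, True, True, True, True, True]
stats = classification_stats(scores, passed, cut=60)
print(stats)
```

With these made-up numbers, a cut score of 60 flags three of the four students who later fail, which would meet the ≥.70 sensitivity criterion mentioned in the abstract.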

3.
Educational Assessment, 2013, 18(2): 85-104
This study examined the relationship between oral reading fluency and performance on a statewide reading test across Grades 3, 4, and 5. A curriculum-based measure of oral reading fluency was administered to 3rd, 4th, and 5th graders who participated in the reading test from the Colorado Student Assessment Program (CSAP). A cross-sectional design with individuals nested within classrooms and classrooms nested within grade levels was used. Hierarchical linear modeling was employed to analyze the relationships. Significant correlations were found between oral reading fluency and the CSAP for all 3 grades. The measure of oral reading fluency added unique information to predicting CSAP performance over and above the predictability of previous year CSAP testing. There was significant variation at the individual classroom level in the relationship between oral reading fluency and CSAP performance. Diagnostic efficiency statistics were used with cut scores to predict whether students would pass or fail the CSAP.

4.
This longitudinal study modeled growth rates, from ages 4.5 to 11, in English and Spanish oral language and word reading skills among 173 Spanish-speaking children from low-income households. Individual growth modeling was employed using scores from standardized measures of word reading, expressive vocabulary, and verbal short-term language memory. The trajectories demonstrate that students' rates of growth and overall ability in word reading were on par with national norms. In contrast, students' oral language skills started out below national norms and their rates of growth, although surpassing the national rates, were not sufficient to reach age-appropriate levels. The results underscore the need for increased and sustained attention to promoting this population's language development.

5.
Screening for early reading problems is a critical step in early intervention and prevention of later reading difficulties. Evaluative frameworks for determining the utility of a screening process are presented in the literature but have not been applied to many screening measures currently in use in numerous schools across the nation. In this study, the accuracy of several Dynamic Indicators of Basic Early Literacy Skills (DIBELS) subtests in predicting which students were at risk for reading failure in first grade was examined in a sample of 12,055 students in Florida. Findings indicate that the DIBELS Nonsense Word Fluency, Initial Sound Fluency, and Phoneme Segmentation Fluency measures show poor diagnostic utility in predicting end of Grade 1 reading performance. DIBELS Oral Reading Fluency in fall of Grade 1 had higher classification accuracy than other DIBELS measures, but a comparison with the classification accuracy obtained by assuming that no student had a disability suggests the need to reevaluate the use of classification accuracy as a way to evaluate screening measures without discussion of base rates. Additionally, when cut scores on the screening tools were set to capture 90 percent of all students at risk for reading problems, a high number of false positives were identified. Finally, different cut scores were needed for different subgroups, such as English Language Learners. Implications for research and practice are discussed.
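The trade-off described above, where a cut score set to capture 90 percent of at-risk students also produces many false positives, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the study's method: `cut_for_sensitivity`, the scores, and the at-risk labels are hypothetical, and a student is assumed to be flagged when the score is at or below the cut.

```python
def cut_for_sensitivity(scores, at_risk, target_pct=90):
    """Return (cut, true_positives, false_positives) for the lowest cut score
    that flags at least `target_pct` percent of at-risk students.
    A student is flagged when score <= cut (an assumed decision rule)."""
    n_risk = sum(at_risk)
    for cut in sorted(set(scores)):
        flagged = [s <= cut for s in scores]
        tp = sum(f and r for f, r in zip(flagged, at_risk))
        # Integer comparison avoids floating-point rounding at the threshold.
        if tp * 100 >= target_pct * n_risk:
            fp = sum(f and not r for f, r in zip(flagged, at_risk))
            return cut, tp, fp
    return None


# Hypothetical screening scores; 10 of the 20 students are truly at risk.
scores = [5, 8, 10, 12, 15, 18, 20, 22, 25, 30,
          33, 35, 40, 42, 45, 50, 55, 60, 65, 70]
risk_scores = {5, 8, 10, 12, 15, 20, 25, 33, 42, 55}
at_risk = [s in risk_scores for s in scores]

result = cut_for_sensitivity(scores, at_risk)
print(result)
```

In this made-up sample, reaching 90 percent sensitivity requires raising the cut far enough that five students who are not at risk are flagged as well, which is the pattern the abstract reports at a much larger scale.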

6.
All children attending the maintained primary schools of an education authority in the West Midlands were tested for reading ability and listening vocabulary in the term before entry to junior schooling. Children from West Indian and Asian families are compared with indigenous children on national and local norms for these two tests. The statistics of test scores are used to measure under‐achievement in reading. Although West Indian and Asian children are significantly below the mean of local indigenous children in reading skill, it is shown that they are not under‐achieving, and that they tend to higher achievement than samples of indigenous pupils matched for the social and familial features of the ethnic minorities. Some aspects of the psychometric model which results in this finding are examined.

7.
Although there have been a number of studies that compared the test results of the Wide Range Achievement Test (WRAT) and the Peabody Individual Achievement Test (PIAT), none had been accomplished with a group of college-aged learning disabled students. Indeed, few researchers had investigated populations over 16 years of age, and none had compared grade scores and standard scores with appropriate subtest scores as derived from these two instruments. With the recent emphasis on postsecondary education for the learning disabled, it is deemed critical that researchers should develop a data base for study of this population. The current research demonstrated that although both tests purport to measure academic achievement in reading, spelling, and arithmetic, in reality, significantly different scores are derived when WRAT (1965, 1978) norms are compared to PIAT (1970) norms as stated in grade scores and standard scores. These significant differences in the grade and standard scores clearly indicate that the WRAT and the PIAT, particularly in regard to arithmetic and to a lesser degree to reading, are not interchangeable instruments of academic achievement.

8.
An unexpected and remarkable preference for second language reading among some dyslexics has been noted, presenting a challenge to accepted theory on dyslexia and the capacity for second language learning. The current study was designed to examine this phenomenon by systematically looking at the differential reading scores in the first and second languages of reading-disabled young Swedish adults who claimed to prefer reading in their second language (English). Three groups were selected for study: a group of 10 reading-disabled young adults who prefer to read English; a second group of 10 reading-disabled young adults with no special preference for second language reading, matched on word recognition efficiency, age group, gender and educational level; and a group of 10 normal readers matched on age group and educational level. The test battery was designed to compare overall reading efficiency in English and Swedish and therefore encompassed both speed and accuracy measures. The battery covered seven phonological measures, four orthographic measures, three isolated word reading measures, two continuous text reading measures, a comprehension task and an author recognition task. All tasks were carried out in both English and Swedish. The results showed that the two dyslexic groups differed significantly in the degree to which task performance, including reading efficiency, was impeded by the English format. A tentative hypothesis was forwarded as to how the exceptional and unexpected facility with English might be explained.

9.
The aim of our study was to determine DEM test performance norms for school-aged children in Latvia, assess how DEM test results correlate with children’s reading rates, and compare test performance between strong and weak readers. A modified DEM test and a newly developed reading test were administered to 1487 children during a screening survey. Our study provides norms for adjusted DEM scores for children from 7 to 18 years of age. A high correlation exists between a child’s reading rate and her DEM speed scores for both parts of the test. Weak readers performed significantly more slowly on the DEM test than strong readers. Overall, 6% of the subject population scored 1 standard deviation below the mean value on both the DEM and reading tests. We conclude that these individuals may be at a higher risk for developing reading impairments.

10.
How to monitor and raise standards of educational attainment generally were issues addressed by the British Government in 1988 when the National Curriculum was introduced into all state‐funded primary schools in England and Wales: ‘There is every reason for optimism that in providing a sound, sufficiently detailed framework over the next decade the National Curriculum will give children and teachers much needed help in achieving higher standards.’ This paper assesses the value of looking closely at successive reading scores of cohorts of children in order to monitor reading attainments over time. The reading attainments of seven cohorts of Year 2 children from five randomly selected primary schools within one local education authority (LEA) (N = 1,329) are analysed to see if standards of literacy have changed since the introduction of the National Curriculum in 1989. Both quantitative and qualitative methods of data collection were used in this cross‐sectional study. The reading attainments of each of the seven cohorts of Year 2 children, as measured by the are presented. Headteacher perceptions of the effects of the National Curriculum on the teaching and learning of reading in their schools were collected through interviews. Examination of the means of the standardized comprehension scores for each cohort reveals no statistically significant differences in attainment between any two cohorts. It was noted that the distribution of the reading scores was skewed towards underachievement in all seven cohorts. The relative stability in reading attainments contrasts with the aspiration that the introduction of the National Curriculum would raise standards. Headteachers saw this stability as the result of key stage 1 teachers working at an intense rate to safeguard the teaching and learning of reading against the pressures exerted by the introduction of the National Curriculum and assessment procedures.

11.
This article introduces the Diagnostic Profiles (DP) standard setting method for setting a performance standard on a test developed from a cognitive diagnostic model (CDM), the outcome of which is a profile of mastered and not‐mastered skills or attributes rather than a single test score. In the DP method, the key judgment task for panelists is a decision on whether or not individual cognitive skill profiles meet the performance standard. A randomized experiment was carried out in which secondary mathematics teachers were randomly assigned to either the DP method or the modified Angoff method. The standard setting methods were applied to a test of student readiness to enter high school algebra (Algebra I). While the DP profile judgments were perceived to be more difficult than the Angoff item judgments, there was a high degree of agreement among the panelists for most of the profiles. In order to compare the methods, cut scores were generated from the DP method. The results of the DP group were comparable to the Angoff group, with less cut score variability in the DP group. The DP method shows promise for testing situations in which diagnostic information is needed about examinees and where that information needs to be linked to a performance standard.

12.
This paper examines the effects of high school detracking on Korean Scholastic Aptitude Test (KSAT) scores in the subjects of Korean (reading), mathematics, and English. We found that a transition to detracking reduces test scores by 0.16–0.18 of one standard deviation. Additionally, the administrative autonomy of private schools and the use of private tutoring services at home attenuate the negative effects of detracking. Our findings are in line with those in the previous literature, which show a modest effect of tracking (i.e., ability sorting) in advanced countries, where the autonomous school system is well organized and commercial educational services are easily accessible, and a strong positive effect of tracking in developing countries, where access to well-resourced private schools and private education is generally limited.

13.
(NB. In the interests of brevity, the British-born children of parents of West Indian origin are referred to as the WI group and the children of parents of English origin are referred to as the E group.) It has been suggested that, partly because of dialect interferences, WI children experience greater difficulty than E group children in various aspects of reading. The present study compares the attainments of a sample of 20 WI and E boys and girls of equivalent intellectual ability attending a multi-cultural inner-city comprehensive school in a social priority area. Reading accuracy, reading comprehension and listening comprehension attainments of the children were tested by the individual administration of the Neale Analysis of Reading Ability, Forms A, B and C. In terms of both reading accuracy and comprehension, although the group mean scores were all below the norms for their chronological ages, there were no significant mean differences between the WI and E pupils or between boys and girls. Silent reading led to significantly lower scores than did reading aloud.

14.
In this paper, we attempt to explain the differences between reading and math scores of migrants’ children (8430 daughters and 8526 sons) in 17 OECD destination countries, coming from 45 origin countries or regions, using PISA 2009 data. In addition to the societal gender equality levels of the origin and destination countries (the gender empowerment measure), we use macro indicators of the origin countries’ educational systems, economic development and religions. We find that migrant daughters from countries with higher gender equality levels obtain higher reading scores than comparable migrant sons do (but this is not the case for math scores). In addition, the higher the gender equality levels in the destination countries, the lower the reading and math scores of both male and female migrants’ children in their destination countries. Further analyses suggest it is the difference between gender equality levels, rather than the levels themselves, that explains the educational performance of both female and male migrant pupils. Our results also show the low gender equality level in Islamic origin countries offers a possible explanation for the low educational performance of Islamic pupils, both male and female. Finally, migrants’ daughters seem to perform slightly better educationally, compared with migrants’ sons.

15.
Cut‐scores were set by expert judges on assessments of reading and listening comprehension of English as a foreign language (EFL), using the bookmark standard‐setting method to differentiate proficiency levels defined by the Common European Framework of Reference (CEFR). Assessments contained stratified item samples drawn from extensive item pools, calibrated using Rasch models on the basis of examinee responses of a German nationwide assessment of secondary school language performance. The results suggest significant effects of item sampling strategies for the bookmark method on cut‐score recommendations, as well as significant cut‐score judgment revision over cut‐score placement rounds. Results are discussed within a framework of establishing validity evidence supporting cut‐score recommendations using the widely employed bookmark method.

16.
This study uses a regression discontinuity design to evaluate the impact of seventh-grade interventions delivered by 25 schools on the reading and school engagement outcomes of struggling readers. Students in participating schools were assigned to intervention (n = 1,495) or comparison (n = 4,397) conditions, based on their state reading performance and reading fluency scores in sixth grade. On the state reading test, intervention impact favoring the treatment group approached, but did not reach, statistical significance (p = .056, g = .15). On a measure of reading fluency, there was a significant interaction between the cut points schools chose for condition assignment and intervention impact (p = .015). In schools that chose a lower cut point, intervention students outperformed comparison students, while in schools that chose a higher cut point, comparison students outperformed intervention students. On a student self-report measure of school engagement, no significant differences between the intervention and comparison groups were observed (p = .83, g = .02). Implications for rigorous evaluation of interventions implemented by districts and schools under naturalistic conditions are discussed.

17.
The state of Pennsylvania, like many organizations interested in performance improvement, routinely engages in professional development activities. Educators in this hands‐on activity engaged in setting meaningful criterion‐referenced cut scores for career and technical education assessments using two methods. The main purposes of this study were to (a) assess if training differences had a differential impact on standard setting of the cut scores, (b) determine if there is a significant difference in cut scores between two groups of educators, and (c) examine how cut scores established by this analytical method might differ from holistic impressions cut scores. The results showed general agreement among the career and technical education judges on the cut scores established. These judgments were not influenced by the characteristics of career and technical education students. However, the judges' analytical cut scores were significantly lower than their corresponding holistic impressions cut scores.

18.
This study describes response to intervention (RTI) screening and progress‐monitoring instruments and procedures in 41 local school settings. For screening, the schools most often used published reading assessments or commercial products; a three‐times‐per‐year screening schedule was most prevalent. For progress monitoring, schools most often relied on published reading assessments; a weekly progress‐monitoring schedule was most prevalent. The variability between local school practices is discussed with regard to efficiency, equity, and viability of RTI.

19.
Conventional methods of differentiating reading disability (RD) caused by deficits in decoding skills or comprehension from poor reading performance caused by inconsistent attention associated with attention-deficit/hyperactivity disorder (ADHD) have produced equivocal results. This study presents a model of differential diagnosis of attentional problems and RD that differs from these conventional approaches. The new diagnostic procedure uses intraindividual differences seen in the performance of at-risk learners on tasks related to reading that vary in their sensitivity to the sustained attention required for successful performance. The hypothesis is that children with inconsistent attention would perform more poorly on tests that require sustained attention, such as listening comprehension, than on tests that are more tolerant of inattention, such as reading comprehension. Such differences would not be seen in the test scores of children who have only RD, because their performance is determined more by the difficulty level of the reading tests than by the degree of sensitivity of the task to attention. The validity of this new model was evaluated by determining the capability of the differences seen in the scores of tests that differ in their sensitivity to sustained attention to predict the degree of inconsistency in sustained attention as measured by a continuous performance test. The data obtained from 39 children who are at risk for RD suggest that this is a viable model.

20.
The present study explores the relationship between sentence recall and reading and language skills in a group of 7–11-year-old children with learning difficulties. While recent studies have found that performance on sentence recall tasks plays a role in learning, it is possible that this contribution is a reflection of shared resources with working memory. In order to investigate whether sentence recall was uniquely associated with reading and language skills, differences associated with IQ and working memory capacity were statistically controlled. A sample of 72 children was tested on measures of verbal complex memory, verbal short-term memory, sentence recall, expressive vocabulary, verbal and performance IQ, reading and language skills. Both sentence recall and verbal complex memory shared unique links with reading skill, and sentence recall was uniquely associated with language skills. This finding indicates that resources in long-term memory also play an important diagnostic role in reading and language abilities. The implications for educational practice are discussed.
