期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Introduction to the Special Issue on Alternate Assessments Based on Modified Academic Achievement Standards: New Policy,New Practices,and Persistent Challenges

Ryan J. Kettler Stephen N. Elliott 《Peabody Journal of Education》2013,88(4):467-477

This introduction to the special issue titled Alternate Assessments Based on Modified Academic Achievement Standards: New Policy, New Practices, and Persistent Challenges addresses the federal policy introducing the new alternate assessment for students with persistent academic difficulties, as well as related implementation issues that will be more thoroughly considered throughout the journal. Three guidelines are identified within the policy for alternate assessments based on modified academic achievement standards (AA-MASs), including that (a) a state's grade-level academic content standards cannot be modified for an AA-MAS, (b) a state's general test can be modified for an AA-MAS, and (c) a state's achievement standards can be modified for an AA-MAS so long as they remain on grade level. This article introduces key issues including identification of students eligible for an AA-MAS, the degree of modification that can be applied to develop an AA-MAS, and the current state of AA-MAS development across the nation. The article concludes with overviews of each contribution in the journal. 相似文献

2.

Considering the Use of General and Modified Assessment Items in Computerized Adaptive Testing

Adam E. Wyse Anthony D. Albano 《教育实用测度》2015,28(2):156-167

This article used several data sets from a large-scale state testing program to examine the feasibility of combining general and modified assessment items in computerized adaptive testing (CAT) for different groups of students. Results suggested that several of the assumptions made when employing this type of mixed-item CAT may not be met for students with disabilities that have typically taken alternate assessments based on modified achievement standards (AA-MAS). A simulation study indicated that the abilities of AA-MAS students can be underestimated or overestimated by the mixed-item CAT, depending on students’ location on the underlying ability scale. These findings held across grade levels and test lengths. The mixed-item CAT appeared to function well for non-AA-MAS students. 相似文献

3.

Gender DIF in Reading and Mathematics Tests With Mixed Item Formats

Catherine S. Taylor Yoonsun Lee 《教育实用测度》2013,26(3):246-280

This was a study of differential item functioning (DIF) for grades 4, 7, and 10 reading and mathematics items from state criterion-referenced tests. The tests were composed of multiple-choice and constructed-response items. Gender DIF was investigated using POLYSIBTEST and a Rasch procedure. The Rasch procedure flagged more items for DIF than did the simultaneous item bias procedure—particularly multiple-choice items. For both reading and mathematics tests, multiple-choice items generally favored males while constructed-response items generally favored females. Content analyses showed that flagged reading items typically measured text interpretations or implied meanings; males tended to benefit from items that asked them to identify reasonable interpretations and analyses of informational text. Most items that favored females asked students to make their own interpretations and analyses, of both literary and informational text, supported by text-based evidence. Content analysis of mathematics items showed that items favoring males measured geometry, probability, and algebra. Mathematics items favoring females measured statistical interpretations, multistep problem solving, and mathematical reasoning. 相似文献

4.

Modifying Achievement Test Items: A Theory-Guided and Data-Based Approach for Better Measurement of What Students With Disabilities Know

Ryan J. Kettler Stephen N. Elliott Peter A. Beddow 《Peabody Journal of Education》2013,88(4):529-551

Federal regulations allow up to 2% of the student population of a state to achieve proficiency for adequate yearly progress by taking an alternate assessment based on modified academic achievement standards (AA-MAS). Such tests are likely to be easier, but as long as a test is considered a valid measure of grade level content, it is allowable as an AA-MAS (U.S. Department of Education, 2007b U.S. Department of Education. 2007b. Standards and assessments peer review guidance, Washington, DC: Author. [Google Scholar]). In this article, we examine procedures for developing, modifying, and evaluating items and tests using an evolving modification paradigm, as well as a classic reliability and validity framework. Theoretical influences, such as principles of universal design, cognitive load theory, and item development research, are discussed. The Test Accessibility and Modification Inventory, a tool that provides systematic and comprehensive guidance to help educators modify grade-level tests, is introduced. Cognitive lab methods and experimental field tests are then described, along with examples and key findings from each, relevant to AA-MASs. The article concludes with a discussion of precautions, lessons learned, and questions generated about the methods used to improve both access and test score validity for the students who are eligible for this new alternate assessment. 相似文献

5.

Effects of Item Modifications on Test Accessibility for Persistently Low-Performing Students with Disabilities

Dale J. Cohen Jin Zhang Werner Wothke 《教育实用测度》2013,26(4):269-280

ABSTRACT

Construct-irrelevant cognitive complexity of some items in the statewide grade-level assessments may impose performance barriers for students with disabilities who are ineligible for alternate assessments based on alternate achievement standards. This has spurred research into whether items can be modified to reduce complexity without affecting item construct. This study uses a generalized linear mixed modeling analysis to investigate the effects of item modifications on improving test accessibility by reducing construct-irrelevant cognitive barriers for persistently low-performing fifth-grade students with cognitive disabilities. The results showed item scaffolding was an effective modification for both mathematics and reading. Other modifications, such as bolding/underlining of key words, hindered test performance for low-performing students. We discuss the findings’ potential impact on test development with universal design. 相似文献

6.

Using a Multidimensional Differential Item Functioning Framework to Determine if Reading Ability Affects Student Performance in Mathematics

Cindy M. Walker Bo Zhang John Surber 《教育实用测度》2013,26(2):162-181

Many teachers and curriculum specialists claim that the reading demand of many mathematics items is so great that students do not perform well on mathematics tests, even though they have a good understanding of mathematics. The purpose of this research was to test this claim empirically. This analysis was accomplished by considering examinees that differed in reading ability within the context of a multidimensional DIF framework. Results indicated that student performance on some mathematics items was influenced by their level of reading ability so that examinees with lower proficiency classifications in reading were less likely to obtain correct answers to these items. This finding suggests that incorrect proficiency classifications may have occurred for some examinees. However, it is argued that rather than eliminating these mathematics items from the test, which would seem to decrease the construct validity of the test, attempts should be made to control the confounding effect of reading that is measured by some of the mathematics items. 相似文献

7.

State Perspectives on Implementing,or Choosing Not to Implement,an Alternate Assessment Based on Modified Academic Achievement Standards

Porter W. Palmer 《Peabody Journal of Education》2013,88(4):578-584

Since Federal regulations have given states the option to implement alternate assessments based on modified academic achievement standards (AA-MAS) as part of their accountability systems for a small group of students with disabilities, a number of states have made decisions about whether or not to develop and implement such an assessment. State-level directors of assessment and directors of special education were surveyed about their state's decisions on implementing AA-MAS. Improvements in accessibility and appropriateness were reasons given for choosing to implement an AA-MAS, whereas lack of resources and guidance were identified as barriers. This article presents the findings from a survey on 22 states' decisions concerning implementation of AA-MAS. 相似文献

8.

The Changing Landscape of Alternate Assessments Based on Modified Academic Achievement Standards: An Analysis of Early Adopters of AA-MASs

Sheryl S. Lazarus Martha L. Thurlow 《Peabody Journal of Education》2013,88(4):496-510

Several states had an assessment that they considered to be an alternate assessment based on modified academic achievement standards (AA-MAS) in place, or in development, when the April 2007 federal regulations on modified achievement standards were finalized. This article uses publicly available information collected by the National Center on Educational Outcomes to analyze changes in states' AA-MAS between 2007 and 2008. The article compares across the 2 years the number of states that had an assessment they considered to be an AA-MAS, states' participation guidelines, and the characteristics of these assessments. We also provide information about the number of students who participated in this assessment option during the 2006–07 school year. In 2007, six states (Kansas, Louisiana, Maryland, North Carolina, North Dakota, Oklahoma) either had or were in the process of developing an assessment they considered to be an AA-MAS. In 2008, California, Connecticut, and Texas also offered this assessment option. Six of the nine states that have been early implementers of the AA-MAS were states that had offered an out-of-level testing option until federal policies required that option to be phased out. Most states had fewer items on their AA-MAS than on their regular assessment. States that had participation guidelines for the AA-MAS in 2007 developed them prior to finalization of the federal regulations, and some changes between 2007 and 2008 in states' participation guidelines appear to have been made to better align the guidelines with the regulations. The characteristics of the assessments states consider to be AA-MASs are changing rapidly and will probably continue to change rapidly over the next few years as more is learned about the advantages and limitations of this assessment. 相似文献

9.

The Impact of an Early Literacy Initiative on the Long Term Academic Success of Diverse Students

Stella SiWan Zimmerman Michael C. Rodriguez Kirsten L. Rewey Sandra L. Heidemann 《Journal of Education for Students Placed at Risk》2013,18(4):452-481

The primary goal of the Words Work (WW) early literacy initiative was to prepare diverse students to become successful learners in school. Four groups of students were followed over 6 years: students from traditional Head Start (HS) centers, students from augmented HS centers (WW students), students eligible for HS but not enrolled (waitlist or WL), and a random sample of non-Head Start students (NHS). Students received school readiness instruction at approximately 4 years of age, and their performance on standardized tests was followed from 1st grade through 5th grade. WW students significantly outscored HS, WL, and NHS students of similar backgrounds in reading and outscored WL and NHS students in mathematics on 2nd-grade standardized assessments. In addition, WW students were able to maintain their lead through 5th grade in both reading and mathematics. Based on these results, WW is an effective HS enhancement for minimizing the achievement gap often observed between students who may be at risk for academic difficulty and students who are at low risk. 相似文献

10.

An Examination of the Relationship between Computation,Problem Solving,and Reading

Damien C. Cormier Theodore J. Christ Laura D. Offrey Katherine Pratt 《Exceptionality》2016,24(4):225-240

The purpose of this study is to evaluate the relationship of mathematics calculation rate (curriculum-based measurement of mathematics; CBM-M), reading rate (curriculum-based measurement of reading; CBM-R), and mathematics application and problem solving skills (mathematics screener) among students at four levels of proficiency on a statewide test. It was hypothesized that CBM-M provides insufficient information to make good screening decisions and that other measures with content more similar to that of large-scale tests of mathematics would function to improve screening. One hundred and seventy students in third grade from a rural elementary school in the Midwestern United States participated. Structural equation modeling was used to evaluate direct, mediator, and latent growth models. In general, CBM-R mediated the relationship between the mathematics ability screener and passing the state assessment, while CBM-M did not have any significant paths within these models. Results are discussed in terms of the utility of CBM-M and CBM-R procedures in screening for success on state test performance in mathematics. 相似文献

11.

Italian students’ results in the PISA mathematics test: does reading competence matter?

Anna Maria Ajello Elisa Caponera Laura Palmerio 《European Journal of Psychology of Education - EJPE》2018,33(3):505-520

In Italy, from the 2003 reports to the present, the National Institute for the Educational Evaluation of Instruction and Training (INVALSI) has conducted research on Programme for International Student Assessment (PISA) results in order to understand Italian students’ low achievement in mathematics. In the present paper, data from a representative sample of 15-year-old Italian students who participated in PISA 2012 were analysed. This study’s primary aim is to verify how students’ linguistic competences are associated with their performance in mathematics. For the evaluation of the impact of item reading demand on students’ performance, we selected 24 mathematics items with a high reading demand and 31 mathematics items with a low reading demand, as classified by Italian language and methodology experts. Repeated measure variance analyses were conducted. The results showed differences in function of gender: females are advantaged in mathematics items with a high reading demand, independent of their level of reading literacy. In contrast, males are advantaged in mathematics items with a low reading demand, independent of their level of reading literacy. Possible policy implications are discussed. 相似文献

12.

The “Two Percent Students”: Considerations and Consequences of Eligibility Decisions

Naomi Zigmond Amanda Kloo 《Peabody Journal of Education》2013,88(4):478-495

Since the 2001–02 school year, the accountability provisions of the No Child Left Behind Act (NCLB) have shaped much of the work of public school teachers and administrators in the United States. NCLB explicitly prohibits schools from excluding students with disabilities from the accountability system and requires not only participation of all students in statewide accountability assessments but also reporting of the results for students with disabilities along with other students and as a disaggregated group. From the beginning of these requirements, lawmakers recognized that there would be a small group of students with disabilities for whom the regular assessment, even with accommodations, would not be appropriate and they authorized states to develop an alternate assessment based on alternate achievement standards (AA-AAS) for this group of students. More recently, responding to pressures from the field, additional flexibility has been granted to develop an additional alternate assessment based on modified grade-level achievement standards (AA-MAS) for students with disabilities who present with persistent academic difficulties. It is expected that approximately 2% of the total student population might be included in this new alternate assessment. This article examines the decisions that need to be made by individual states to determine the target population for this new alternate assessment and the policy implications of these decisions. 相似文献

13.

Ethnic DIF in Reading Tests With Mixed Item Formats

Catherine S. Taylor Yoonsun Lee 《Educational Assessment》2013,18(1):35-68

This article presents a study of ethnic Differential Item Functioning (DIF) for 4th-, 7th-, and 10th-grade reading items on a state criterion-referenced achievement test. The tests, administered 1997 to 2001, were composed of multiple-choice and constructed-response items. Item performance by focal groups (i.e., students from Asian/Pacific Island, Black/African American, Native American, and Latino/Hispanic origins) were compared with the performance of White students using simultaneous item bias and Rasch procedures. Flagged multiple-choice items generally favored White students, whereas flagged constructed-response items generally favored students from Asian/Pacific Islander, Black/African American, and Latino/Hispanic origins. Content analysis of flagged reading items showed that positively and negatively flagged items typically measured inference, interpretation, or analysis of text in multiple-choice and constructed-response formats. Items that were not flagged for DIF generally measured very easy reading skills (e.g., literal comprehension) and reading skills that require higher level thinking (e.g., developing interpretations across texts and analyzing graphic elements). 相似文献

14.

PISA mathematics and reading performance differences of mainstream European and Turkish immigrant students

Serkan Arikan Fons J. R. van de Vijver Kutlay Yagmur 《Educational Assessment, Evaluation and Accountability》2017,29(3):229-246

Lower reading and mathematics performance of Turkish immigrant students as compared to mainstream European students could reflect differential learning outcomes, differential socioeconomic backgrounds of the groups, differential mainstream language proficiency, and/or test bias. Using PISA reading and mathematics scores of these groups, we examined the role of bias and various measures related to immigrant integration policies of the host societies. Results of a multilevel analysis of reading and mathematics tests demonstrated that at individual level, students with higher scores on an index of economic, social, and cultural status obtained higher achievement scores. At country level, MIPEX scores of education and the human development index of participating countries could predict differences in reading results but not in mathematics. After correction for background characteristics, effect sizes showed a difference of .65 SD (down from a value of .96 before correction) for reading and .58 SD (down from .78) for mathematics. However, a similar correction for background variables increased the score differences between Turkish immigrants and mainstreamers. 相似文献

15.

Test item linguistic complexity and assessments for deaf students

Cawthon S 《American annals of the deaf》2011,156(3):255-269

Linguistic complexity of test items is one test format element that has been studied in the context of struggling readers and their participation in paper-and-pencil tests. The present article presents findings from an exploratory study on the potential relationship between linguistic complexity and test performance for deaf readers. A total of 64 students completed 52 multiple-choice items, 32 in mathematics and 20 in reading. These items were coded for linguistic complexity components of vocabulary, syntax, and discourse. Mathematics items had higher linguistic complexity ratings than reading items, but there were no significant relationships between item linguistic complexity scores and student performance on the test items. The discussion addresses issues related to the subject area, student proficiency levels in the test content, factors to look for in determining a "linguistic complexity effect," and areas for further research in test item development and deaf students. 相似文献

16.

Calibration and Scoring of Tests With Multiple-Choice and Constructed-Response Item Types

Kadriye Ercikan Richard D. Sehwarz Marc W. Julian George R. Burket Melba M. Weber Valerie Link 《Journal of Educational Measurement》1998,35(2):137-154

This article discusses and demonstrates combining scores from multiple-choice (MC) and constructed-response (CR) items to create a common scale using item response theory methodology. Two specific issues addressed are (a) whether MC and CR items can be calibrated together and (b) whether simultaneous calibration of the two item types leads to loss of information. Procedures are discussed and empirical results are provided using a set of tests in the areas of reading, language, mathematics, and science in three grades. 相似文献

17.

Validation of Student, Principal, and Self-Ratings in 360° Feedback^? for Teacher Evaluation

David J. Wilkerson Richard P. Manatt Mary Ann Rogers Ron Maughan 《Journal of Personnel Evaluation in Education》2000,14(2):179-192

相似文献

18.

Teachers’ expectations about students’ use of reading strategies,knowledge and behaviour in Grades 3, 5 and 7

Fatemeh Arabsolghar John Elkins 《Journal of Research in Reading》2001,24(2):154-162

Although extensive basic research has been carried out on children’s metacognition, little is known about teachers’ views of their students’ cognitive and metacognitive skills in reading. The ways in which teachers expected their children to use, or to know how to use, certain reading skills are examined in this study. A questionnaire on reading components (strategies, knowledge and behaviour) was completed by 45 teachers in Grades 3, 5 and 7. In this questionnaire teachers were asked to make judgements about whether or not students of high, average and low ability levels in their classes would be likely to show these skills. An analysis of variance (grade×ability×component) revealed a significant interaction between ability and component. There was much greater variability in the three components for the low and average levels of ability. The main effect for ability was significant. The highest expectations of teachers were for high‐ability students in all the three groups of items, followed by average and low‐ability students. The main effect for component was also significant for knowledge. There was no significant difference between the grades. However, teachers hold equivalent performance expectations for high‐ability students in each of the three components, but for average and low‐ability groups, expectations were higher for knowledge than strategy and behaviour. 相似文献

19.

The Impact of Item Stem Format on the Dimensional Structure of Mathematics Assessments

Adnan Kan Damien C. Cormier 《Educational Assessment》2019,24(1):13-32

Item stem formats can alter the cognitive complexity as well as the type of abilities required for solving mathematics items. Consequently, it is possible that item stem formats can affect the dimensional structure of mathematics assessments. This empirical study investigated the relationship between item stem format and the dimensionality of mathematics assessments. A sample of 671 sixth-grade students was given two forms of a mathematics assessment in which mathematical expression (ME) items and word problems (WP) were used to measure the same content. The effects of mathematical language and reading abilities in responding to ME and WP items were explored using unidimensional and multidimensional item response theory models. The results showed that WP and ME items appear to differ with regard to the underlying abilities required to answer these items. Hence, the multidimensional model fit the response data better than the unidimensional model. For the accurate assessment of mathematics achievement, students’ reading and mathematical language abilities should also be considered when implementing mathematics assessments with ME and WP items. 相似文献

20.

Matthew effects in reading and mathematics: Examining developmental patterns in population data

《Contemporary educational psychology》2023

This study examined whether Matthew effects were evident in developmental patterns of reading and mathematics skills from middle childhood to adolescence. We obtained standardized reading and mathematics scores at Grades 3, 5, 7 and 9 for full cohorts of students in two Australian states, NSW (N = 88,958, 48% female) and Victoria (N = 65,984, 49% female). Latent growth curve models were used to identify the best-fitting longitudinal trajectories of reading and mathematics, and to examine whether cumulative (i.e. a Matthew effect), compensatory, or stable interindividual differences characterized development in each domain. For both reading and mathematics, and in both samples, growth decelerated as grade levels increased, with latent basis models fitting the data better than linear models. Negative intercept-slope covariances, and decreasing variances at increasing grades in both domains indicated compensatory growth patterns, rather than cumulative patterns or Matthew effects. These results indicate that students with below average achievement at Grade 3 make greater gains to Grade 9 than their initially higher-achieving peers. Modeling growth trajectories in two longitudinal population datasets allows strong tests of theorized growth patterns for both reading and mathematics, and presents insights about developmental change in academic skills from middle childhood to adolescence. 相似文献