1.

DIAGNOSING STUDENTS' ERRORS FROM THEIR RESPONSE SELECTIONS IN LANGUAGE ARTS

NOREEN M. WEBB JOAN L. HERMAN BEVERLY CABELLO 《Journal of Educational Measurement》1986,23(2):163-170

This set of studies examined the consistency of student response patterns on a test of language arts, as a first step toward designing a computerized adaptive test to diagnose errors. A diagnostic domain-referenced language arts test was designed so that the choice of response would immediately point to a specific misconception in pronoun usage. This direct correspondence between error and diagnosis was designed to facilitate classroom instruction and remediation. Analysis of students' response choices on matched items and analysis of students' rationales for selecting their responses showed that student behavior was not always consistent and could be used to diagnose some errors but not others.

2.

Misconceptions of Turkish Pre-Service Teachers about Force and Motion

Sule Bayraktar 《International Journal of Science and Mathematics Education》2009,7(2):273-291

The purpose of this study was to diagnose the misconceptions held by pre-service physics teachers about force and motion. The secondary aim of the study was to detect whether misconceptions vary according to gender, educational level, and culture. The study was conducted with 79 student-teachers attending one of the largest faculties of education in Turkey. The Force Concept Inventory (FCI) was used to diagnose student-teachers’ misconceptions. The FCI is a conceptual test consisting of 29 multiple choice items. Each wrong choice for each question reflects a specific misconception about the force and motion concepts. Data from the study were analyzed using frequencies, t-tests, and ANOVA to make comparisons according to gender and years of education. Results of the study showed that student-teachers of physics hold very strong misconceptions about impetus and active force. No significant differences were found between male and female students’ scores on the concept test. The results also showed that misconceptions about force and motion decreased through the years of education. However, they did not disappear completely. Findings of the study are very similar to those of research conducted on the subject in other countries. Student-teachers’ conceptions about Newton’s Third Law, on the other hand, were significantly better than those observed in research done in other countries such as the US and Finland.

3.

Justifying the selection of answers in multiple choice items

Pinchas Tamir 《International Journal of Science Education》2013,35(5):563-573

Although multiple choice tests, as commonly used, can be rightly criticized, their structure, when wisely used, makes them an excellent diagnostic tool for identifying students’ conceptions, including misconceptions. This paper offers a number of useful guidelines regarding the analysis of multiple choice items. A special reference is made to the use of the justifications that students provide for their choices. Examples of the procedures, which may be used in analysing justifications and the kinds of diagnostic information that may be obtained, are provided.

4.

The Positive and Negative Effects of Science Concept Tests on Student Conceptual Understanding

Chun‐Yen Chang Ting‐Kuang Yeh James P. Barufaldi 《International Journal of Science Education》2013,35(2):265-282

5.

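The Use of Punctuation in the Design of English Single-Blank Fill-in Items

汪望珠 (Wang Wangzhu) 《中国考试》 (China Examinations) 2011,(7)

Punctuation is an important means of expressing meaning in written language, and it also plays a key role in language testing. Using concrete examples (drawn mainly from the NMET), this paper explains how punctuation can be used in designing English single-blank fill-in items.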

6.

THE EFFECT OF SELECTED POOR ITEM-WRITING PRACTICES ON TEST DIFFICULTY, RELIABILITY AND VALIDITY

CYNTHIA BOARD DOUGLAS R. WHITNEY 《Journal of Educational Measurement》1972,9(3):225-233

Violations of four selected principles of writing multiple choice items were introduced into an undergraduate political science examination. Three of the four poor practices had no overall effect on test difficulty. A significant (α = .05) interaction effect between the poor practices and course achievement occurred for one of the four practices, with the poorer students generally gaining most from the poorly written items. KR-20 values were significantly lower for sets of items with the same flaws than for "good" versions of the items in three of four comparisons. The reductions in reliability were equivalent to those expected to result from shortening the test by 13 to 56 percent. Concurrent validity (correlation of experimental test scores with final examination scores) was significantly lower in two of four cases. The reductions in validity were equivalent to those expected to result from shortening the test by 56 to 83 percent.
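The abstract does not say how the "shortening equivalents" were derived. One common way to express a reliability drop as an equivalent change in test length is the Spearman-Brown relation, and the sketch below assumes that approach. The function names and the toy response matrix are hypothetical and are not taken from the study.

```python
def kr20(responses):
    """KR-20 reliability for rows of dichotomous (0/1) item scores.

    Uses the population variance of total scores (divide by n).
    """
    n = len(responses)
    k = len(responses[0])
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    # Sum of p * (1 - p) over items, where p is the proportion correct.
    pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in responses) / n
        pq += p * (1 - p)
    return (k / (k - 1)) * (1 - pq / var_total)


def spearman_brown(r, length_factor):
    """Predicted reliability when test length is multiplied by length_factor."""
    return length_factor * r / (1 + (length_factor - 1) * r)


def equivalent_length_factor(r_old, r_new):
    """Length factor that would move reliability from r_old to r_new."""
    return r_new * (1 - r_old) / (r_old * (1 - r_new))


# Toy data (hypothetical): 4 examinees, 3 items.
toy = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
reliability = kr20(toy)

# A length factor of 0.44 corresponds to "shortening the test by 56 percent".
shortened = spearman_brown(0.80, 0.44)
```

Under this assumption, `equivalent_length_factor` inverts `spearman_brown`, so a drop from one reliability to another can be restated as a percentage reduction in test length of `1 - length_factor`.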

7.

An alternative method of answering and scoring multiple choice tests

Charles Taylor Paul L. Gardner 《Research in Science Education》1999,29(3):353-363

A simple modification to the method of answering and scoring multiple choice tests allows students to indicate their estimates of the probability of the correctness of the multiple choice options for each question, without affecting the validity of the assessment. A study was conducted using a test that investigated common misconceptions in mechanics. The study showed that for assessment purposes this method gives results that are very similar to results obtained by students who answer in the traditional manner. Year 12 Physics students (N=85) were randomly allocated to two treatment groups: one received a standard format multiple choice test, the other a test format allowing students to select more than one response in a multiple choice test, and to distribute their marks among their chosen options. An analysis of the students' uncertainties is used to argue that not only can students appeal to different conceptions in different contexts, but that they can also hold conflicting conceptions with respect to a single context.

8.

DETECTING EXPERIMENTALLY INDUCED ITEM BIAS USING THE ITERATIVE LOGIT METHOD

FRANK G. KOK GIDEON J. MELLENBERGH HENK VAN DER FLIER 《Journal of Educational Measurement》1985,22(4):295-303

A test for mental arithmetic was constructed, consisting of items written in Dutch (the subjects' native language), Spanish, and Roman numerals. A group of 286 subjects received some information on Spanish numerals. The group was randomly split into a Spanish Group and a Roman Group. The Spanish Group received further instruction on Spanish numerals, while the Roman Group got instruction on Roman numerals. Checks on the experimental manipulations showed that the Spanish Group had better knowledge of Spanish numerals than the Roman Group, whereas the Roman Group had better knowledge of Roman numerals. From the total test two subtests were constructed: a 30-item Dutch/Spanish subtest (15 items in Dutch and 15 in Spanish), and a 25-item Dutch/Roman subtest (15 items in Dutch and 10 in Roman). The Dutch items were unbiased between the Spanish and Roman groups, whereas the Spanish items of the Dutch/Spanish subtest were biased against the Roman Group, and the Roman items of the Dutch/Roman subtest were biased against the Spanish Group. The iterative logit method was applied to the two subtests. The method showed very good results in detecting biased items.

9.

Teaching social studies to learning disabled high school students: effects of a hypertext study guide

Steven V Horton Randall A Boone Thomas C Lovitt 《British journal of educational technology : journal of the Council for Educational Technology》1990,21(2):118-131

This study investigated the effectiveness of a computer-based study guide using hypertext software to increase textbook comprehension among four learning disabled students enrolled in a remedial high school social studies class. The program provided four levels of instructional cues that matched students to their highest level of independent interaction with a textbook passage, based on item-to-item responses to computer-generated questions. Using alternative forms of a 45-item multiple-choice test, a pre-test/post-test design was arranged, with a retention test given after a 30-day period. Fifteen questions were designated as control items by placing them in the 45-item tests but not in the computer treatment. The computer program consisted of three separate lessons administered across consecutive class sessions, with each followed by a written 15-item multiple choice test containing 10 computer questions and 5 control items. Results indicated a significant gain for pupils on computer items from pre-test to post-test and from pre-test to retention test, while no significant change occurred on control items across measures. A single-case analysis revealed a consistent relationship between gain scores on computer items, reading time on computer, and the number of instructional cues required by students. Two types of non-linear pathways that teachers might consider when constructing study guides are discussed.

10.

Students' and teachers' misapplication of Le Chatelier's principle: Implications for the teaching of chemical equilibrium

Juan Quílez-Pardo Joan Josep Solaz-Portolés 《Journal of Research in Science Teaching》1995,32(9):939-957

The aim of this article was to study the reasons, strategies, and procedures that both students and teachers use to solve some chemical equilibrium questions and problems. Inappropriate conceptions on teaching and a lack of knowledge regarding the limited usefulness of Le Chatelier's principle, with its vague and ambiguous formulation and textbook presentation, may be some of the sources of misconceptions about the prediction of the effect of changing conditions on chemical equilibrium. To diagnose misconceptions and their possible sources, a written test was developed and administered to 170 1st-year university chemistry students. A chemical equilibrium problem, relating to the students' test, was solved by 40 chemistry teachers. First, we ascertained that teachers' conceptions might influence the problem-solving strategies of the learner. Based on this first aspect, our discussion also concerns students' and teachers' misconceptions related to Le Chatelier's principle. Misconceptions emerged through: (a) misapplication and misunderstanding of Le Chatelier's principle; (b) use of rote-learning recall and algorithmic procedures; (c) incorrect control of the variables involved; (d) limited use of the chemical equilibrium law; (e) a lack of mastery of chemical equilibrium principles and difficulty in transferring such principles to new situations. To avoid chemical equilibrium misconceptions, a specific pattern of conceptual and methodological change may be considered.

11.

Development and Application of a Novel Rasch-based Methodology for Evaluating Multi-Tiered Assessment Instruments: Validation and utilization of an undergraduate diagnostic test of the water cycle

William L. Romine Dane L. Schaffer Lloyd Barrow 《International Journal of Science Education》2013,35(16):2740-2768

We describe the development and validation of a three-tiered diagnostic test of the water cycle (DTWC) and use it to evaluate the impact of prior learning experiences on undergraduates’ misconceptions. While most approaches to instrument validation take a positivist perspective using singular criteria such as reliability and fit with a measurement model, we extend this to a multi-tiered approach which supports multiple interpretations. Using a sample of 130 undergraduate students from two colleges, we utilize the Rasch model to place students and items along traditional one-, two-, and three-tiered scales as well as a misconceptions scale. In the three-tiered and misconceptions scales, high confidence was indicative of mastery. In the latter scale, a ‘misconception’ was defined as mastery of an incorrect concept. We found that integrating confidence into mastery did little to change item functioning; however, three-tiered usage resulted in higher reliability and lower student ability estimates than two-tiered usage. The misconceptions scale showed high efficacy in predicting items on which particular students were likely to express misconceptions, and revealed several tenacious misconceptions that all students were likely to express regardless of ability. Previous coursework on the water cycle did little to change the prevalence of undergraduates’ misconceptions.

12.

Meta‐evaluation in clinical anatomy: A practical application of item response theory in multiple choice examinations

Milton Severo Maria A. Ferreira Tavares 《Anatomical sciences education》2010,3(1):17-24

The nature of anatomy education has changed substantially in recent decades, though the traditional multiple‐choice written examination remains the cornerstone of assessing students' knowledge. This study sought to measure the quality of a clinical anatomy multiple‐choice final examination using item response theory (IRT) models. One hundred seventy‐six students took a multiple‐choice clinical anatomy examination. One‐ and two‐parameter IRT models (difficulty and discrimination parameters) were used to assess item quality. The two‐parameter IRT model demonstrated a wide range in item difficulty, with a median of −1.0 and range from −2.0 to 0.0 (25th to 75th percentile). Similar results were seen for discrimination (median 0.6; range 0.4–0.8). The test information curve achieved maximum discrimination for an ability level one standard deviation below the average. There were 15 items with standardized loading less than 0.3, which was due to several factors: two items had two correct responses, one was not well constructed, two were too easy, and the others revealed a lack of detailed knowledge by students. The test used in this study was more effective in discriminating students of lower ability than those of higher ability. Overall, the quality of the examination in clinical anatomy was confirmed by the IRT models. Anat Sci Educ 3:17–24, 2010. © 2009 American Association of Anatomists.

13.

Development and application of a two-tier diagnostic test measuring college biology students' understanding of diffusion and osmosis after a course of instruction

Arthur Louis Odom Lloyd H. Barrow 《Journal of Research in Science Teaching》1995,32(1):45-61

This study involved the development and application of a two-tier diagnostic test measuring college biology students' understanding of diffusion and osmosis after a course of instruction. The development procedure had three general steps: defining the content boundaries of the test, collecting information on students' misconceptions, and instrument development. Misconception data were collected from interviews and multiple-choice questions with free response answers. The data were used to develop 12 two-tier multiple choice items in which the first tier examined content knowledge and the second examined understanding of that knowledge. The conceptual knowledge examined was the particulate and random nature of matter, concentration and tonicity, the influence of life forces on diffusion and osmosis, membranes, kinetic energy of matter, the process of diffusion, and the process of osmosis. The diagnostic instrument was administered to 240 students (123 non-biology majors and 117 biology majors) enrolled in a college freshman biology laboratory course. The students had completed a unit on diffusion and osmosis. The content taught was carefully defined by propositional knowledge statements, and was the same content that defined the content boundaries of the test. The split-half reliability was .74. Difficulty indices ranged from 0.23 to 0.95, and discrimination indices ranged from 0.21 to 0.65. Each item was analyzed to determine student understanding of, and identify misconceptions about, diffusion and osmosis.
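The abstract reports difficulty and discrimination indices without stating how they were computed. As a rough illustration, the sketch below uses two classical definitions: difficulty as the proportion of examinees answering correctly, and discrimination as the difference in proportion correct between the top and bottom scoring groups. The 27% group size is a conventional choice assumed here, and the function names and toy data are hypothetical.

```python
def difficulty_index(responses, item):
    """Proportion of examinees who answered the item correctly (0/1 scores)."""
    return sum(row[item] for row in responses) / len(responses)


def discrimination_index(responses, item, fraction=0.27):
    """Upper-group minus lower-group proportion correct on one item.

    Examinees are ranked by total score; the top and bottom `fraction`
    of examinees form the two comparison groups (an assumed convention).
    """
    ranked = sorted(responses, key=sum, reverse=True)
    group_size = max(1, round(len(ranked) * fraction))
    upper = ranked[:group_size]
    lower = ranked[-group_size:]
    p_upper = sum(row[item] for row in upper) / group_size
    p_lower = sum(row[item] for row in lower) / group_size
    return p_upper - p_lower


# Toy data (hypothetical): 4 examinees, 3 items scored 0/1.
toy = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
easy_item_difficulty = difficulty_index(toy, 0)
easy_item_discrimination = discrimination_index(toy, 0)
```

With these definitions, a difficulty index near 0.95 marks an item almost everyone answers correctly, and a discrimination index near 0.21 marks an item that separates high and low scorers only weakly.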

14.

Rapid‐Guessing Behavior: Its Identification, Interpretation, and Implications

Steven L. Wise 《Educational Measurement》2017,36(4):52-61

The rise of computer‐based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple‐choice items. In particular, very short response time, termed rapid guessing, has been shown to indicate disengaged test taking, regardless of whether it occurs in high‐stakes or low‐stakes testing contexts. This article examines rapid‐guessing behavior: its theoretical conceptualization and underlying assumptions, methods for identifying it, misconceptions regarding its dynamics, and the contextual requirements for its proper interpretation. It is argued that because it does not reflect what a test taker knows and can do, a rapid guess to an item represents a choice by the test taker to momentarily opt out of being measured. As a result, rapid guessing tends to negatively distort scores and thereby diminish validity. Therefore, because rapid guesses do not contribute to measurement, it makes little sense to include them in scoring.

15.

Adding scents to symbols: using food fragrances with deafblind young people making choices at mealtimes

Heather Murdoch Anne Gough Eileen Boothroyd Kate Williams 《British Journal of Special Education》2014,41(3):249-267

This article is written by Heather Murdoch, research consultant for the Seashell Trust, Anne Gough, deputy headteacher at Royal School Manchester/Seashell Trust, Eileen Boothroyd, consultant for the Seashell Trust, and Kate Williams, a creative perfumer for Seven (PZ Cussons). It describes the use of food fragrances with deafblind students who are making lunchtime meal choices. The fragrances were added to picture symbols or objects of reference used to represent different menu options. A multiple case study, mixed methods design was used to evaluate three students' choice‐making before and after the introduction of the fragrances. Case study was chosen because of the complex and diverse needs created by deafblindness and because the project was an exploratory study of a new approach. The methods included semi‐structured interviews with students' 1:1 keyworkers and other staff, direct and videotaped observations of students' choice‐making, and written diary records of students' food choices and responses. The findings are necessarily tentative but support the use of food fragrances to assist students' understanding of mealtime choice‐making and their engagement with the process. The practicalities of using olfaction to support other sensory inputs are discussed, together with priorities for future research and practice in this area.

16.

The effects of a sense of compatibility between type of script and word in written Japanese

Iwahara Akihiko Hatta Takeshi Maehara Aiko 《Reading and writing》2003,16(4):377-397

17.

Interaction of learner characteristics with learning from three models of the periodic table

Jeffrey R. Lehman John J. Koran Mary Lou Koran 《Journal of Research in Science Teaching》1984,21(9):885-893

This study was designed to explore the effects on learning of: (1) structural modifications to the periodic table, (2) the location of a periodic table within instructional materials, and (3) the presence of a two-page schema showing relationships between the topics explained in the written materials and the periodic table. One hundred and sixty high school students were randomly assigned to one of eight treatments. A 28-item posttest (KR-21 = 0.72), consisting of multiple choice and constructed answer items, was designed to measure subjects' ability to use their periodic tables to obtain factual information and to solve qualitative chemistry problems. Regression analyses using the multiple choice portion of the posttest as a dependent variable and table type as an independent variable revealed that for subjects with minimal experience with the periodic table, those who received the table with added visual data performed significantly better than subjects who received either of the other two tables (df 3,93; F = 2.72; p < 0.05). For subjects familiar with the periodic table, significant vocabulary × table (df 3,49; F = 3.22; p < 0.05) and vocabulary × location (df 1,49; F = 4.46; p < 0.05) interactions were detected. Subjects high in verbal comprehension tended to take advantage of the modified tables, while those low in verbal comprehension processed the traditional table with less information most effectively. These latter students also benefited more from having the periodic table alongside their written materials.

18.

COLLEGE STUDENTS’ REACTIONS TOWARDS KEY FACETS OF CLASSROOM TESTING

Moshe Zeidner 《Assessment & Evaluation in Higher Education》1990,15(2):151-169

The major aim of the present study is to assess college students’ attitudes, perceptions, emotional reactions and affective dispositions with respect to various critical dimensions of course achievement testing and assessment, including: “papers” vs. “exams”, “essay” vs. “multiple choice” type formats, “open book” vs. “closed book” exams, “free choice” among items vs. “no free choice” among items, and “oral” vs. “written” modes of test administration. A further aim is to delineate the construction, properties, and potential classroom uses and applications of a selected sample of examinee feedback inventories designed to gauge students’ test attitudes and dispositions. The use of each examinee feedback inventory is demonstrated and exemplified in the context of an empirical study. This paper discusses the assumptions underlying the use of feedback systems in college achievement evaluation; their importance for assessing the face validity of classroom tests; some possible future applications of feedback inventories for research and applied purposes in college; and some guidelines for future research. A mapping sentence specifying the universe of content of test attitude and examinee feedback research is suggested as a heuristic device for guiding future research.

19.

Multiple‐choice and true/false tests: myths and misapprehensions

Richard F. Burton 《Assessment & Evaluation in Higher Education》2005,30(1):65-72

Examiners seeking guidance on multiple‐choice and true/false tests are likely to encounter various faulty or questionable ideas. Twelve of these are discussed in detail, having to do mainly with the effects on test reliability of test length, guessing and scoring method (i.e. number‐right scoring or negative marking). Some misunderstandings could be based on evidence from tests that were badly written or administered, while others may have arisen through the misinterpretation of reliability coefficients. The usefulness of item response theory in the analysis of academic test items is briefly dismissed.

20.

Performance of a Proposed Method for the Linking of Mixed Format Tests With Constructed Response and Multiple Choice Items

Richard Tate 《Journal of Educational Measurement》2000,37(4):329-346

The error associated with a proposed linking method for tests consisting of both constructed response and multiple choice items was investigated in a simulation study. Study factors that were varied included the relative proportion of constructed response items in the test, the size of the year-to-year change in the ability metric, the number of anchor items, the number of linking papers to be reassessed, and the presence of guessing. The results supported the use of the proposed linking method. In addition, simulations were used to illustrate possible linking bias resulting from (a) the use of the traditional linking method and (b) the use of only multiple choice anchor items in the presence of test multidimensionality.