期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Not Read,but Nevertheless Solved? Three Experiments on PIRLS Multiple Choice Reading Comprehension Test Items

Jörn R. Sparfeldt Rumena Kimmel Lena Löwenkamp Antje Steingräber Detlef H. Rost 《Educational Assessment》2013,18(4):214-232

Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N ₁ = 230, N ₂ = 340, N ₃ = 194) worked on three versions of MC items from the Progress in International Reading Literacy Study 2001 reading comprehension test with relevant components successively deleted: “original version” (text, questions, MC-answers), “version without text” (questions, MC-answers), “version without text and without questions” (only MC-answers). Answering correctly the MC items became more difficult as the relevant information was eliminated. In the two narrative fictional texts presented, the students' performance of the version without text was not better than chance. Conversely in the informational (fictional) text, the students' performance of the version without text was better than chance. In the third condition, students' performance was never better than chance. 相似文献

2.

A cross-cultural comparison of numeracy skills using a written and an interactive arithmetic test

Jonathan Hippisley Graham Douglas Stephen Houghton 《Educational research; a review for teachers and all concerned with progress in education》2013,55(2):205-215

This study applied two arithmetic tests, one written and one one computer-based interactive, to samples of primary school children from two populations, one suburban non-Aboriginal and one rural Aboriginal. The results from the written test were significantly (p?&;lt;?0.001) better for the non-Aboriginal children than for the Aboriginal children. This was not the case with the results from the computer-based interactive test. The study used Rasch-based methodology to reduce the results from the two tests to a common scale, to ascertain whether the Aboriginal children performed better (in relation to the non-Aboriginal children) in the computer-based than in the written test. The study found that this was the case, and concluded that the results from the computer-based test exhibited less cultural bias against the Aboriginal children than the written test. 相似文献

3.

Picturing progressive texts: images of ‘democratic schooling’ in the work of John and Evelyn Dewey and contemporaries

Peter Cunningham 《History of education》2019,48(1):118-141

Visual images played an increasing role in professional discourse and in popular and political debate about progressive education over a century or more. In the early 1900s photography was adopted by some progressive texts to convey new ideas illustrated by practice. This paper highlights an iconic example: John and Evelyn Dewey’s celebrated Schools of To-Morrow (1915), with reference to a small selection of its photographic illustrations. Consideration is given to how images were constructed, their status as historical evidence and issues of interpretation. Comparison is made with other illustrated works, preceding and following Schools of To-Morrow, by advocates of child- or student-centred pedagogies. The article urges critical reflection on visual representation in arguments for and against progressivism in more recent times. Insights drawn from earlier examples should be borne in mind by historians seeking to evaluate the role of pictorial sources in discourses of pedagogical reform. 相似文献

4.

Item Response Models for Examinee‐Selected Items

Wen‐Chung Wang Kuan‐Yu Jin Xue‐Lan Qiu Lei Wang 《Journal of Educational Measurement》2012,49(4):419-445

In some tests, examinees are required to choose a fixed number of items from a set of given items to answer. This practice creates a challenge to standard item response models, because more capable examinees may have an advantage by making wiser choices. In this study, we developed a new class of item response models to account for the choice effect of examinee‐selected items. The results of a series of simulation studies showed: (1) that the parameters of the new models were recovered well, (2) the parameter estimates were almost unbiased when the new models were fit to data that were simulated from standard item response models, (3) failing to consider the choice effect yielded shrunken parameter estimates for examinee‐selected items, and (4) even when the missingness mechanism in examinee‐selected items did not follow the item response functions specified in the new models, the new models still yielded a better fit than did standard item response models. An empirical example of a college entrance examination supported the use of the new models: in general, the higher the examinee's ability, the better his or her choice of items. 相似文献

5.

Interaction of learner characteristics with learning from three models of the periodic table

Jeffrey R. Lehman John J. Koran Mary Lou Koran 《科学教学研究杂志》1984,21(9):885-893

This study was designed to explore the effects on learning of: (1) structural modifications to the periodic table, (2) the location of a periodic table within instructional materials, and (3) the presence of a two-page schema showing relationships between the topics explained in the written materials and the periodic table. One hundred and sixty high school students were randomly assigned to one of eight treatments. A 28-item posttest (KR –; 21 = 0.72), consisting of multiple choice and constructed answer items, was designed to measure subjects' ability to use their periodic tables to obtain factual information and to solve qualitative chemistry problems. Regression analyses using the multiple choice portion of the posttest as a dependent variable and table type as an independent variable revealed that for subjects with minimal experience with the periodic table, those who received the table with added visual data performed significantly better than subjects who received either of the other two tables (df 3,93; F = 2.72; p < 0.05). For subjects familiar with the periodic table, significant vocabulary X table (df 3,49; F = 3.22; p < 0.05) and vocabulary X location (df 1,49; F = 4.46; p < 0.05) interactions were detected. Subjects high in verbal comprehension tended to take advantage of the modified tables, while those low in verbal comprehension processed the traditional table with less information most effectively. These latter students also benefited more from having the periodic table alongside their written materials. 相似文献

6.

Measuring the impact of the flipped anatomy classroom: The importance of categorizing an assessment by Bloom's taxonomy

下载免费PDF全文

David A. Morton Jorie M. Colbert‐Getz 《Anatomical sciences education》2017,10(2):170-175

The flipped classroom (FC) model has emerged as an innovative solution to improve student‐centered learning. However, studies measuring student performance of material in the FC relative to the lecture classroom (LC) have shown mixed results. An aim of this study was to determine if the disparity in results of prior research is due to level of cognition (low or high) needed to perform well on the outcome, or course assessment. This study tested the hypothesis that (1) students in a FC would perform better than students in a LC on an assessment requiring higher cognition and (2) there would be no difference in performance for an assessment requiring lower cognition. To test this hypothesis the performance of 28 multiple choice anatomy items that were part of a final examination were compared between two classes of first year medical students at the University of Utah School of Medicine. Items were categorized as requiring knowledge (low cognition), application, or analysis (high cognition). Thirty hours of anatomy content was delivered in LC format to 101 students in 2013 and in FC format to 104 students in 2014. Mann Whitney tests indicated FC students performed better than LC students on analysis items, U = 4243.00, P = 0.030, r = 0.19, but there were no differences in performance between FC and LC students for knowledge, U = 5002.00, P = 0.720 or application, U = 4990.00, P = 0.700, items. The FC may benefit retention when students are expected to analyze material. Anat Sci Educ 10: 170–175. © 2016 American Association of Anatomists. 相似文献

7.

Assessing 10- to 11-year-old children’s performance and misconceptions in number sense using a four-tier diagnostic test

Der-Ching Yang 《Educational research; a review for teachers and all concerned with progress in education》2013,55(4):368-388

Background: Number sense is a key topic in mathematics education, and the identification of children’s misconceptions about number is, therefore, important. Information about students’ serious misconceptions can be quite significant for teachers, allowing them to change their teaching plans to help children overcome these misconceptions. In science education, interest in children’s alternative conceptions has led to the development of three- and four-tier tests that not only assess children’s understandings and misconceptions, but also examine children’s confidence in their responses. However, there are few such tests related to mathematical content, especially in studies of number sense.

Purpose: The purpose of this study was to investigate children’s performance and misconceptions with respect to number sense via a four-tier diagnostic test (Answer Tier → Confidence rating for Answer Tier → Reason Tier → Confidence rating for Reason Tier).

Design and method: A total of 195 fifth graders (10–11 years old) from Taiwan participated in this study. The four-tier test was web-based and contained 40 items across five components of number sense.

Findings: The results show that (1) students’ mean confidence rating for the answer tier was significantly higher than for the reason tier; (2) an average of 68% of students tended to have equal confidence ratings in both answer and reason tiers; (3) students who chose correct answers or reasons had higher mean confidence ratings in most items (36 out of 40) than those who did not; and (4) 16 misconceptions were identified and most of them were at a strong level.

Conclusion: The four-tier test was able to identify several misconceptions in both the answer and reason tier and provide information about the confidence levels. By using such information, teachers may be better positioned to understand the nature of learners’ misconceptions about number sense and therefore support their pupils’ progress in mathematics. 相似文献

8.

Using generalizability analysis to estimate parameters for anatomy assessments: A multi‐institutional study

下载免费PDF全文

Jessica N. Byram Mark F. Seifert William S. Brooks Laura Fraser‐Cotlin Laura E. Thorp James M. Williams Adam B. Wilson 《Anatomical sciences education》2017,10(2):109-119

With integrated curricula and multidisciplinary assessments becoming more prevalent in medical education, there is a continued need for educational research to explore the advantages, consequences, and challenges of integration practices. This retrospective analysis investigated the number of items needed to reliably assess anatomical knowledge in the context of gross anatomy and histology. A generalizability analysis was conducted on gross anatomy and histology written and practical examination items that were administered in a discipline‐based format at Indiana University School of Medicine and in an integrated fashion at the University of Alabama School of Medicine and Rush University Medical College. Examination items were analyzed using a partially nested design in which items were nested within occasions (i:o) and crossed with students (s). A reliability standard of 0.80 was used to determine the minimum number of items needed across examinations (occasions) to make reliable and informed decisions about students' competence in anatomical knowledge. Decision study plots are presented to demonstrate how the number of items per examination influences the reliability of each administered assessment. Using the example of a curriculum that assesses gross anatomy knowledge over five summative written and practical examinations, the results of the decision study estimated that 30 and 25 items would be needed on each written and practical examination to reach a reliability of 0.80, respectively. This study is particularly relevant to educators who may question whether the amount of anatomy content assessed in multidisciplinary evaluations is sufficient for making judgments about the anatomical aptitude of students. Anat Sci Educ 10: 109–119. © 2016 American Association of Anatomists. 相似文献

9.

NCME 2008 Presidential Address: The Impact of Anchor Test Configuration on Student Proficiency Rates

Anne R. Fitzpatrick 《Educational Measurement》2008,27(4):34-40

Examined in this study were the effects of reducing anchor test length on student proficiency rates for 12 multiple‐choice tests administered in an annual, large‐scale, high‐stakes assessment. The anchor tests contained 15 items, 10 items, or five items. Five content representative samples of items were drawn at each anchor test length from a small universe of items in order to investigate the stability of equating results over anchor test samples. The operational tests were calibrated using the one‐parameter model and equated using the mean b‐value method. The findings indicated that student proficiency rates could display important variability over anchor test samples when 15 anchor items were used. Notable increases in this variability were found for some tests when shorter anchor tests were used. For these tests, some of the anchor items had parameters that changed somewhat in relative difficulty from one year to the next. It is recommended that anchor sets with more than 15 items be used to mitigate the instability in equating results due to anchor item sampling. Also, the optimal allocation method of stratified sampling should be evaluated as one means of improving the stability and precision of equating results. 相似文献

10.

Wilson,Eydie (2013) Serious comix ISTE (Washington & Eurospan,London) isbn 978‐1‐56484‐321‐0 96 pp £27.50 https://www.iste.org/store/product?ID=2444%E2%80%8E

Eric Deeson 《British journal of educational technology : journal of the Council for Educational Technology》2014,45(4):E24-E25

Eydie Wilson's book pleads that struggling secondary learners be encouraged to design on‐screen storyboards as a means of planning various kinds of work. The book is short and—especially at this high price—would benefit from being much better illustrated than it is: in particular, it needs sample learners' work and other forms of case study material. Though clearly written and developed, Serious comix therefore remains dry and shows few signs of the author's passion for its subject. Yet the concept is appealing and has potential: if you think it could have some impact on your work, borrow a copy and see what appeals to you. Eric Deeson 相似文献

11.

Interest-enhancing approaches to mathematics curriculum design: Illustrations and personalization

Virginia Clinton Candace Walkington 《The Journal of educational research》2019,112(4):495-511

Two common interest-enhancement approaches in mathematics curriculum design are illustrations and personalization of problems to students’ interests. The objective of these experiments is to test a variety of illustrations and personalization approaches. In the illustrations experiment, students (n?=?265) were randomly assigned to lessons with story problems containing decorative illustrations, contextual illustrations, diagrammatic illustrations, misleading illustrations, or no illustrations (only text [control condition]). Students’ problem-solving performance and attitudes were not affected by illustration condition, but learning was better in the control compared with contextual illustrations. In the personalization experiment, students (n?=?223) were randomly assigned to story problems that were either personalized based on: a survey of their interests, their choice of interest topics, a randomly assigned interest topic, or the original nonpersonalized story problem (control). The findings indicated there were benefits for choice personalization both for performance in the problem set as well as on a later learning assessment. 相似文献

12.

Research and Teaching in the Science Department of the University of London Institute of Education

A. D. Turner 《International Journal of Science Education》2013,35(4):457-459

This study was designed (1) to analyse the relationship between the answer profile from multiple‐choice questions on stoichiometric problems and the students’ reasoning patterns and (2) to examine the effect for certain variables on the facility values of test items. The instruments used were mainly paper‐and‐pencil tests. The subjects were 6262 grammar school students from all parts of the Federal Republic of Germany. They were randomly assigned to the test items.

The results indicated that many students arrived at their answers by mixing up amount and reacting mass, or molar mass and reacting mass. It was also found that the variables ‘easy/hard calculations’, and ‘formula given/to be developed’ determined the facility values of test items.

From the results, it was possible to make recommendations to practising teachers as well as to examiners. Knowing students’ ideas, the teacher can think of how to make use of them before entering the classroom. A teaching unit may start off with easy problems leaving the more difficult ones for later. Examiners developing new tests on stoichiometry should consider two essential preconditions for the formulae of chemical compounds, used in the item: formulae of the type AB should be avoided and the molar masses of the elements involved must be clearly different. 相似文献

13.

C. Walter Hodges: A Life Illustrating History

Matthew Eve 《Children‘s Literature in Education》2004,35(2):171-198

C. Walter Hodges first came to prominence as the author/illustrator of Columbus Sails in 1939, which the Junior Bookshelf hailed as The best book never to have been awarded the Carnegie Medal. Widely acclaimed for the treatment of its subject matter, its powerful narration, and accompanying dramatic line illustrations, Columbus Sails was the first of a number of vivid historical novels written and illustrated by Hodges, including The Namesake (nominated for the 1964 Carnegie Medal), The Marsh King (1967), and The Overland Launch (1969). He is internationally recognised both for his indispensable and learned books about the Elizabethan theatre (for which he gained the Kate Greenaway Medal for illustration in 1964) and his vital illustrations to other authors' texts. This timely article is based on interviews and correspondence between the author and Hodges, and traces and celebrates the latter's life and career as a writer, book illustrator, teacher, and scholar. 相似文献

14.

Investigating the Effectiveness of Equating Designs for Constructed-Response Tests in Large-Scale Assessments

Sooyeon Kim Michael E. Walker Frederick McHale 《Journal of Educational Measurement》2010,47(2):186-201

Using data from a large-scale exam, in this study we compared various designs for equating constructed-response (CR) tests to determine which design was most effective in producing equivalent scores across the two tests to be equated. In the context of classical equating methods, four linking designs were examined: (a) an anchor set containing common CR items, (b) an anchor set incorporating common CR items rescored, (c) an external multiple-choice (MC) anchor test, and (d) an equivalent groups design incorporating rescored CR items (no anchor test). The use of CR items without rescoring resulted in much larger bias than the other designs. The use of an external MC anchor resulted in the next largest bias. The use of a rescored CR anchor and the equivalent groups design led to similar levels of equating error. 相似文献

15.

Two-item<Emphasis Type="Italic">same-different</Emphasis> concept learning in pigeons

Blaisdell AP Cook RG 《Learning & behavior》2005,33(1):67-77

We report the first successful demonstration of a simultaneous, two-itemsame-different (S/D) discrimination by 6 pigeons, in which nonpictorial color and shape stimuli were used. This study was conducted because the majority of recently successful demonstrations of S/D discrimination in pigeons have employed displays with more than two items. Two pairs of stimulus items were simultaneously presented on a touch screen equipped computer monitor. Pigeons were reinforced for consistently pecking at either thesame (i.e., identical) or thedifferent (i.e., nonidentical) pair of items. These pairs were created from combinations of simple colored shapes drawn from a pool of six colors and six shapes. After acquiring the discrimination with item pairs that differed redundantly in both the shape and the color dimensions, the pigeons were tested for transfer to items that varied in only one of these dimensions. Although both dimensions contributed to the discrimination, greater control was exhibited by the color dimension. Most important, the discrimination transferred in tests with novel colored, shaped, and sized items, suggesting that the mechanisms involved were not stimulus specific but were more generalized in nature. These results suggest that the capacity to judge S/D relations is present in pigeons even when only two stimuli are used to implement this contrast. 相似文献

16.

A classification of the ISIS program using Bloom's cognitive taxonomy

Richard F. Clevenstine 《科学教学研究杂志》1987,24(8):699-712

This article focuses on the practical use of Bloom's Taxonomy of Educational Objectives. The current status of analyzing and classifying test items and behavioral objectives was examined in this study. Specifically, the purpose of this study was to analyze and classify the ISIS minicourse performance objectives and criterion-referenced test items according to Bloom's cognitive Taxonomy in order to determine what levels of cognition the ISIS instructional materials are directed. The performance objectives and test items of thirty-three ISIS minicourses and criterion-referenced tests were collected and classified. Four research questions were posed in the study. The findings indicate that ISIS minicourse test items and performance objectives are written primarily at the Knowledge and Comprehension levels. The ISIS instructional materials reflect low percentages of upper cognitive level test items and performance objectives. Based upon the use of a chi-square analysis, twenty-four of the ISIS minicourses and tests demonstrate a positive congruence between their performance objectives and criterion-referenced test items. Nine ISIS minicourses were found to demonstrate a negative relationship between their performance objectives and test items. Implications and Recommendations based on the findings of the studies are provided. 相似文献

17.

Assessing Multimedia Influences on Student Responses Using a Personal Response System

Kyle Gray Katharine Owens Xin Liang David Steer 《Journal of Science Education and Technology》2012,21(3):392-402

To date, research to date on personal response systems (clickers) has focused on external issues pertaining to the implementation of this technology or broadly measured student learning gains rather than investigating differences in the responses themselves. Multimedia learning makes use of both words and pictures, and research from cognitive psychology suggests that using both words and illustrations improves student learning. This study analyzed student response data from 561 students taking an introductory earth science course to determine whether including an illustration in a clicker question resulted in a higher percentage of correct responses than questions that did not include a corresponding illustration. Questions on topics pertaining to the solid earth were categorized as illustrated questions if they contained a picture, or graph and text-only if the question only contained text. For each type of question, we calculated the percentage of correct responses for each student and compared the results to student ACT-reading, math, and science scores. A within-groups, repeated measures analysis of covariance with instructor as the covariate yielded no significant differences between the percentage of correct responses to either the text-only or the illustrated questions. Similar non-significant differences were obtained when students were grouped into quartiles according to their ACT-reading, -math, and -science scores. These results suggest that the way in which a conceptest question is written does not affect student responses and supports the claim that conceptest questions are a valid formative assessment tool. 相似文献

18.

Oral versus written assessments: a test of student performance and attitudes

Mark Huxham Fiona Campbell Jenny Westwood 《Assessment & Evaluation in Higher Education》2012,37(1):125-136

Student performance in and attitudes towards oral and written assessments were compared using quantitative and qualitative methods. Two separate cohorts of students were examined. The first larger cohort of students (n = 99) was randomly divided into ‘oral’ and ‘written’ groups, and the marks that they achieved in the same biology questions were compared. Students in the second smaller cohort (n = 29) were all examined using both written and oral questions concerning both ‘scientific’ and ‘personal development’ topics. Both cohorts showed highly significant differences in the mean marks achieved, with better performance in the oral assessment. There was no evidence of particular groups of students being disadvantaged in the oral tests. These students and also an additional cohort were asked about their attitudes to the two different assessment approaches. Although they tended to be more nervous in the face of oral assessments, many students thought oral assessments were more useful than written assessments. An important theme involved the perceived authenticity or ‘professionalism’ of an oral examination. This study suggests that oral assessments may be more inclusive than written ones and that they can act as powerful tools in helping students establish a ‘professional identity’. 相似文献

19.

Integrating Cognitive and Psychometric Models to Measure Document Literacy

Kathleen Sheehan Robert J. Mislevy 《Journal of Educational Measurement》1990,27(3):255-272

The Survey of Young Adult Literacy conducted in 1985 by the National Assessment of Educational Progress included 63 items that elicited skills in acquiring and using information from written documents. These items were analyzed using two different models: (1) a qualitative cognitive model, which characterized items in terms of the processing tasks they required, and (2) an item response theory (IRT) model, which characterized items difficulties and respondents' proficiencies simply by tendencies toward correct response. This paper demonstrates how a generalization of Fischer and Seheibleehner's Linear Logistic Test Model can be used to integrate information from the cognitive analysis into the IRT analysis, providing a foundation for subsequent item construction, test development, and diagnosis of individuals skill deficiencies. 相似文献

20.

The Computer Paint Program

《学校用计算机》2013,30(1-2):163-178

Abstract

The goal of this research was to explore the relationship between the use of a computer paint program to create visual images and the subsequent verbal expression of these illustrations. First- and fourth-grade children created visual stories using either traditional media or a computer paint program and subsequently word-processed a verbal representation of their stories. Analyses indicated that the paint program visuals illustrated greater creative strength and the accompanying written stories were richer in detail. Additionally, the creative process varied across the two groups in that the students who used the paint program engaged in more peer collaboration, were more likely to experiment with media effects, and made more revisions in both their visual and written products. 相似文献