期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

AN ANNOTATED BIBLIOGRAPHY OF BRITISH RESEARCH ON TEACHING AND TEACHING ABILITY

K. M. Evans 《Educational research; a review for teachers and all concerned with progress in education》2013,55(1):67-80

Although word recognition tests continue to be widely employed by teachers, the application of out‐of‐date norms and varying methods of administration and scoring seriously reduce the usefulness of these tests. Largely as a result of approaches from teachers, it was decided to undertake a large‐scale restandardization of the Burt‐Vernon and Schonell tests to produce reliable norms and to standardize administration and scoring procedures. The opportunity was also taken to record the data in such a way as to enable the production of revised orders of words on both tests to reflect contemporary usage, and to relate reading attainment, as assessed by the tests, to three important factors ‐‐ sex, school organization and socioeconomic status. 相似文献

2.

MULTIPLE CHOICE VERSUS TRUE-FALSE: A COMPARISON OF RELIABILITIES AND CONCURRENT VALIDITIES

DAVID A. FRISBIE 《Journal of Educational Measurement》1973,10(4):297-304

The purpose of this study was to compare the reliabilities of true-false (TF) and multiple choice (MC) tests and to determine the concurrent validities of both. Two methods, judgmental and discrimination, were devised for objectively converting MC items to TF form. The TF items generated by the two methods from 70-item MC natural science and social studies tests were incorporated in eight final forms which were differentiated by subject matter, conversion method, and item form order. A sample of 1018 nonurban high school students each responded to one of the eight forms. Examinees tried three TF items for every pair of MC items attempted. The TF tests were significantly less reliable than the MC tests but did tend to measure the same thing as the corresponding MC tests. 相似文献

3.

ANALYSIS OF FOURTH GRADE TESTING

R. G. Cochrane J. Elkins Dawn M. Richmond 《International Journal of Disability, Development & Education》1969,16(3):131-142

Three hundred and ninety‐nine Grade 4 Brisbane children were given a battery o1 general ability and achievement tests. Means and standard deviations allowed construction of 15‐point scales for a Queensland Grade 4. A correlation matrix included indices concerning sub‐tests of general ability and achievement tests. Comparisons with original A.C.E.R. co‐efficients were made. The matrix was factor sub‐tests of general ability and achievement tests. Comparisons with analysed and rotated, three factors emerging ‐‐ verbal, performance and computational. A canonical correlation was performed and three significant elements emerged ‐‐ a verbal, a computational (or integrative or automatic sequential) and a problem‐solving element. 相似文献

4.

A FACET-FACTORIAL APPROACH TO THE CONSTRUCTION OF RATING SCALES TO MEASURE COMPLEX BEHAVIORS

HAROLD F. ABELES 《Journal of Educational Measurement》1973,10(2):145-151

The purpose of this study was to examine a technique for the development of performance rating scales to measure achievement in courses whose objectives require complex behaviors not easily measurable with paper and pencil achievement tests. A facet-factorial approach to rating scale construction was employed (i.e. the behavior was conceptualized as multidimensional and items for the scales were selected by employing factor analytical techniques) to construct scales to measure clarinet music performance. The three major results of the study were: 1) a thirty-item rating scale based on a six factor structure of clarinet music performance; 2) high inter-judge reliability estimates for both the total score (above .90) and the scale scores (above .60); and, 3) criterion-related validity coefficients greater than .80. Results of the investigation suggest that the facet-factorial approach can be an effective technique for the construction of rating scales to measure complex behavior such as music performance. 相似文献

5.

IQs ARE VERY STRONG BUT IMPERFECT INDICATORS OF PSYCHOMETRIC g: RESULTS FROM JOINT CONFIRMATORY FACTOR ANALYSIS

Ryan L. Farmer Randy G. Floyd Matthew R. Reynolds John H. Kranzler 《Psychology in the schools》2014,51(8):801-813

The most global score yielded by intelligence tests, IQs, are supported by substantial validity evidence and have historically been central to the identification of intellectual disabilities, learning disabilities, and giftedness. This study examined the extent to which IQs measure the ability they target, psychometric g. Data from three samples of children and adolescents (Ns = 200, 150, and 135) who completed varying pairs of individually administered, multidimensional intelligence tests were analyzed using a joint confirmatory factor analysis to generate correlations between IQs and general factors representing psychometric g. The resulting values, expressed as g loadings, for the six IQs ranged from .88 to .95 (M = .92). The accuracy of IQs in measuring psychometric g, the meaning of reliable specific ability variance in IQs not accounted for by psychometric g, and the use of IQs in schools and related settings are discussed. 相似文献

6.

OPTIMAL NUMBER OF OPTIONS: AN INVESTIGATION OF THE ASSUMPTION OF PROPORTIONALITY 总被引：1，自引：0，他引：1

DAVID V. BUDESCU BARUCH NEVO 《Journal of Educational Measurement》1985,22(3):183-196

One assumption common to all models for determining the optimal number of options per item (e. g., Lord, 1977) is that total testing time is proportional to the number of items and the number of options per item. Therefore, under this assumption given a fixed testing time, the test can be shortened or lengthened by deleting or adding a proportional number of options. The present study examines the validity of this assumption in three tests which were administered with 2, 3, 4, and 5 options per item. The number of items attempted in the first 10 and 15 minutes of the testing session and the time needed to complete the tests were recorded. Thus, the rate of performance for both fixed time and fixed test length was analyzed. A strong and consistently negative relationship between rate of performance and the number of options was detected in all tests. Thus, the empirical results did not support the assumption of proportionality. Furthermore, the data indicated that the method by which options are deleted can play a role in this context. A more realistic assumption of generalized proportionality, proposed by Grier (1976), was supported by the results from a Mathematical Reasoning test, but was only partially supported for a Vocabulary and a Reading Comprehension test. 相似文献

7.

Take home tests: An experimental study

Larry J. Weber Janice K. McBee Jean E. Krebs 《Research in higher education》1983,18(4):473-483

The take home test was compared with the conventional closed and open book tests at the college level. It was found that scores on knowledge items were significantly higher with the take home test, and that additional time spent looking up answers was important. An additional factor was the level of anxiety, perceived by students to be less with the take home test. Rampant cheating does not appear to be a problem with take home tests. 相似文献

8.

焊接冷裂纹临界冷却时间判据建立

张文钺许玉环徐德禄陈邦固杜则裕《天津大学学报(英文版)》1995,(1)

用平板刚性拘束裂纹试验（ＰＲＲＣ）研究了４种国产低合金高强钢焊接冷裂敏感性，探讨了焊接接头扩散氢含量、焊接线能量、局部预热温度、拘束度的大小及局部预热产生附加应力等对钢材冷裂纹敏感性的影响。建立了国产低合金高强钢的临界冷却时间判据ｔｃｒ。实验表明此判据可靠，可用于评定现行焊接工艺条件下是否产生冷裂纹及确定合理的焊接工艺。相似文献

9.

MEASURING THE ORGANIZATIONAL ASPECTS OF WRITING ABILITY

STEPHEN L. BENTON KENNETH A. KIEWRA 《Journal of Educational Measurement》1986,23(4):377-386

The present study assessed the relationship among holistic writing ability, the Test of Standard Written English (TSWE), and the following tests of organizational ability: anagram solving, word reordering, sentence reordering, and paragraph assembly. Based upon a sample of 105 undergraduate students, the main findings were that writing ability, as measured by the holistic method of scoring, was significantly correlated with performance on the TSWE and the four tests of organizational ability. A composite score on all four organizational tests was found to have the highest zero-order correlation with the measure of writing ability. A stepwise regression analysis, with the measure of writing ability as the criterion, also indicated that the composite score explained a significant proportion of the variance beyond that explained by the TSWE. The results are discussed in terms of the Kintsch and van Dijk model of strategic discourse processing, which suggests that different organizational strategies operate at the levels of words, sentences, and paragraphs. It is concluded that tests assessing organizational strategies ought to be included in assessments of writing ability. 相似文献

10.

JOINT CONFIRMATORY FACTOR ANALYSIS OF THE WOODCOCK‐JOHNSON TESTS OF COGNITIVE ABILITIES,THIRD EDITION,AND THE STANFORD‐BINET INTELLIGENCE SCALES,FIFTH EDITION,WITH A PRESCHOOL POPULATION

Mei Chang Sharon E. Paulson W. Holmes Finch David E. Mcintosh Barbara A. Rothlisberg 《Psychology in the schools》2014,51(1):32-57

This study examined the underlying constructs measured by the Woodcock‐Johnson Tests of Cognitive Abilities, Third Edition (WJ‐III COG) and the Stanford‐Binet Intelligence Scales, Fifth Edition (SB5), based on the Cattell‐Horn‐Carrol (CHC) theory of cognitive abilities. This study reports the results of the first joint confirmatory factor analysis of the WJ‐III COG and SB5 with an independently collected preschool‐aged sample. The WJ‐III COG and SB5 were administered to 200 preschool‐aged children of 4 to 5 with no known disorders or disabilities. Confirmatory factor analyses using maximum likelihood estimation were conducted to evaluate three models of increasing complexity and two alternative models to determine which model best describe the underlying constructs measured by the WJ‐III COG and the SB5. Though none of the models displayed a good fit to the data, results showed that the underlying construct of the two tests was best represented by a Three‐Stratum alternative CHC model in which the Gf factor and subtests were omitted. Exploratory factor analysis was conducted to provide further insights into the actual latent structure underlying the data. Implications of findings to guide school‐based practitioners in using cross‐battery assessment with preschool children were addressed. 相似文献

11.

THE VALIDITY AND RELIABILITY OF ORAL EXAMINATIONS IN ASSESSING COGNITIVE SKILLS IN MEDICINE1

HAROLD G. LEVINE CHRISTINE H. McGUIRE 《Journal of Educational Measurement》1970,7(2):63-74

In order to attempt to assess aspects of clinical competence, not adequately assessed by other means, the Center for the Study of Medical Education, University of Illinois College of Medicine together with the American Board of Orthopaedic Surgery developed oral examinations in formats specifically designed to yield information on high level cognitive functioning. The examinations were administered to 784 candidates for certification in January 1968. Reliability of the oral problem-solving component score pooled from four examiners was approximately .50. Assessment of content, construct, and concurrent validity made by questionnaire and factor analytic studies indicated that the oral tests identified factors not measured by multiple-choice tests and, therefore, significantly improved the relationship between supervisory evaluations and test scores. 相似文献

12.

THE INFLUENCE OF TEST TITLE ON TEST RESPONSE

JOHN A. JENSEN JOHN A. SCHMITT 《Journal of Educational Measurement》1970,7(4):241-245

This study was designed to determine the extent to which responses to test items of the type frequently found in personality inventories would be influenced by the title associated with the test. The basic hypothesis was that subjects respond to the test title by developing a particular response set which will be reflected in the individual responses. An instrument was constructed and administered to eight treatment groups. Each administration differed primarily in the title each group's tests bore. The dependent variables were measures of the tendency to lie, respond defensively, answer carefully, and complete questions. Subjects tended to lie and respond more defensively to titled tests than to a test having no title and administered under nonthreatening conditions. All other comparisons were not statistically significant. 相似文献

13.

Developing a Mathematics Education Quality Scale

S. Koza Ciftci 《Africa Education Review》2016,13(3-4):211-234

The aim of this study was to evaluate students’ perceptions of the quality of mathematics education and to develop a reliable and valid measurement tool. The research was conducted with 638 (first study) and 407 (second study) secondary school students in Eski?ehir, Turkey. Item discrimination, structural validity (exploratory factor analysis and confirmatory factor analysis) and internal consistency were examined. The findings showed that the Mathematics Education Quality Scale (MEQS) was formed by five factors, namely: (i) quality of instruction; (ii) quality of school environment; (iii) quality of teacher; (iv) quality of family cooperation; and (v) quality of school guidance, and it was valid and reliable. As a result, it could be argued that the MEQS can be used to measure secondary school students’ perceptions of the quality of mathematics education. 相似文献

14.

DIFFERENTIAL WEIGHTING BY JUDGED DEGREE1OF CORRECTNESS

DURGADAS PATNAIK ROSS E. TRAUB 《Journal of Educational Measurement》1973,10(4):281-286

Two conventional scores and a weighted score on a group test of general intelligence were compared for reliability and predictive validity. One conventional score consisted of the number of correct answers an examinee gave in responding to 69 multiple-choice questions; the other was the formula score obtained by subtracting from the number of correct answers a fraction of the number of wrong answers. A weighted score was obtained by assigning weights to all the response alternatives of all the questions and adding the weights associated with the responses, both correct and incorrect, made by the examinee. The weights were derived from degree-of-correctness judgments of the set of response alternatives to each question. Reliability was estimated using a split-half procedure; predictive validity was estimated from the correlation between test scores and mean school achievement. Both conventional scores were found to be significantly less reliable but significantly more valid than the weighted scores. (The formula scores were neither significantly less reliable nor significantly more valid than number-correct scores.) 相似文献

15.

THE EFFECT OF DIFFERENTIAL OPTION WEIGHTING ON MULTIPLE-CHOICE OBJECTIVE TESTS1

GERRY F. HENDRICKSON 《Journal of Educational Measurement》1971,8(4):291-296

The purpose of this study was to determine in what way Guttman weighting affected the internal consistency and intercorrelation of the suhtests of the Scholastic Aptitude Test. The tests were first scored with Guttman weights and then with conventional correction-for-guessing weights. The internal consistency of the tests increased markedly when Guttman weights were used. The correlation of the two verbal subtests increased somewhat when Guttman weights were used, but the correlation of the two mathematics subtests as well as the intercorrelation of all verbal and mathematics subtests decreased. Differences in the factor structure of the Guttman- and conventionally-weighted subtests were used to explain the result. 相似文献

16.

Chapter 1 Introduction,The Field Study

L. Joseph Lins Allan P. Abell H. Clifton Hutchins 《Journal of Experimental Education》2013,81(2):1-10

This study was an investigation of the effectiveness of eight types of visual illustrations used to complement oral instruction. An attempt was made to evaluate the instructional value of black and white and colored visual Illustrations. Two hundred sixty-four eleventh-grade students were randomly assigned by class to one of nine treatment groups. Student learning was evaluated by means of five criterion tests designed to measure different educational objectives. Comparisons among the means of the nine treatment groups yielded significant differences. It was found that the realism continuum for still visuals is not a reliable predictor of learning efficiency; however, visual illustrations were found to significantly increase student learning on three criterion tests. Color in visual illustrations did not appear to be an important variable in increasing student learning in is study. 相似文献

17.

THE RACIAL STRESS OF MEMBERSHIP: DEVELOPMENT OF THE FACULTY INVENTORY OF RACIALIZED EXPERIENCES IN SCHOOLS

Sherry Coleman Howard C. Stevenson 《Psychology in the schools》2013,50(6):548-566

Research on the experience of faculty of color in predominately White independent schools (PWIS) is limited. This study explored faculty of varying racial backgrounds and their initiation of, interactions with, and stress reactions to racial conflicts within the school settings using an online survey. Several measures were developed according to the Racial/Ethnic Coping Appraisal and Socialization Theory (recast) model and administered to 339 faculty participants. Using factor analytic and correlational analyses, results from the highly reliable measures found that Black faculty show significantly less trust in schools to manage racial conflict, lower sense of school membership, greater racial stress, and more racial socialization than their White counterparts do. With reliable and valid measures on the racialized experiences of faculty in PWIS, future research can ask better questions on how much racial politics influence diversity initiatives, including faculty and student recruitment and retention, supportive racial and intellectual climate, and organizational mission and professional development. Implications for the development of racially relevant measures in schools are discussed. 相似文献

18.

MEASURING SCIENCE INTEREST: RASCH VALIDATION OF THE SCIENCE INTEREST SURVEY 总被引：2，自引：0，他引：2

Richard Lawrence Lamb Leonard Annetta Jeannette Meldrum David Vallett 《International Journal of Science and Mathematics Education》2012,10(3):643-668

Students in the USA have fallen near the bottom in international competitions and tests in mathematics and science. It is thought that extrinsic factors such as family, community, and schools might be more influential than intrinsic attitudes toward science interest. However, there are relatively few valid and reliable measures of intrinsic factors such as interest relating to science. With the lack of intrinsic measures, it is difficult to determine the impact of extrinsic factors on the intrinsic construct. A fuller picture of the factors affecting intrinsic factors such as science interest will allow interventions to become more refined and targeted. Several studies suggest that student interest toward science affects the likelihood of the student pursuing advanced courses in science. The goal of this paper is to establish the validity and reliability of the Science Interest Survey and to determine if the survey meets the formal requirements of measurements as defined by the Rasch model. Results using both IRT and CRT analysis suggest that Science Interest Survey is an adequate measure of the unidimensional construct known as science interest. Results further suggest the Science Interest Survey is a valid and reliable measure for assessing science interest levels. 相似文献

19.

COGNITIVE,MOTIVATIONAL AND TEMPERAMENTAL INDICANTS OF CREATIVITY AND INTELLIGENCE1

B. B. ROSSMAN J. L. HORN 《Journal of Educational Measurement》1972,9(4):265-286

Relationships between measures of creativity and measures of intelligence were analysed to show that even when the intercorrelations among the former are not appreciably larger than correlations between creativity and intelligence measures, dimensions of creativity are found to be psychometrically and conceptually distinct from dimensions of intelligence. One hundred eighty-eight art and engineering college students were administered objective tests and questionnaires designed to tap a number of ability and nonability indicants of creativity and intelligence. Some of these measures were of a kind that some investigators would regard as “criterion” measures of creativity and intelligence. Data were analysed by means of correlational and factor analytic procedures. Eight major influences were indicated by independent factors rotated to approximate simple structure. The influences represented by these factors were identified as: fluid intelligence, crystallized intelligence, memory, fluency, rule-orientation versus intuitive thinking, and self-sufficient-calculated-risk-taking. The fourth, seventh, and eighth factors were discussed as representing distinct cognitive, motivational, and temperamental aspects of creativity. At the level of test intercorrelations discriminant validity did not obtain–measures of creativity did not intercorrelate more highly among themselves than they correlated with measures of intelligence. The putative factors of creativity did correlate more highly with creativity criteria, however, than did the factors identified as representing intelligence. Moreover, the intelligence factors were more highly related to intelligence criteria than to creativity criteria. Thus the evidence of this study suggests that it is useful to think of creativity and intelligence as the outgrowths of distinct (although overlapping) sets of influences, even when this is not well indicated by discriminant-convergent validation analyses carried out with respect to tests (in contrast to factors). 相似文献

20.

Identifying dyslexia in adults: an iterative method using the predictive value of item scores and self-report questions

Peter Tamboer Harrie C. M. Vorst Frans J. Oort 《Annals of dyslexia》2014,64(1):34-56

Methods for identifying dyslexia in adults vary widely between studies. Researchers have to decide how many tests to use, which tests are considered to be the most reliable, and how to determine cut-off scores. The aim of this study was to develop an objective and powerful method for diagnosing dyslexia. We took various methodological measures, most of which are new compared to previous methods. We used a large sample of Dutch first-year psychology students, we considered several options for exclusion and inclusion criteria, we collected as many cognitive tests as possible, we used six independent sources of biographical information for a criterion of dyslexia, we compared the predictive power of discriminant analyses and logistic regression analyses, we used both sum scores and item scores as predictor variables, we used self-report questions as predictor variables, and we retested the reliability of predictions with repeated prediction analyses using an adjusted criterion. We were able to identify 74 dyslexic and 369 non-dyslexic students. For 37 students, various predictions were too inconsistent for a final classification. The most reliable predictions were acquired with item scores and self-report questions. The main conclusion is that it is possible to identify dyslexia with a high reliability, although the exact nature of dyslexia is still unknown. We therefore believe that this study yielded valuable information for future methods of identifying dyslexia in Dutch as well as in other languages, and that this would be beneficial for comparing studies across countries. 相似文献