首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In matching-to-sample, comparison choice should be controlled by the identity of the sample and, when the sample is not available, by the overall probability of reinforcement associated with each of the comparisons. In the present research, pigeons were trained to match a frequent sample (appearing on 80% of the trials) to one comparison (C fr) and an infrequent sample (appearing on 20% of the trials) to the other (C inf), with the number of reinforcements associated with each sample equated. In Experiment 1, the task was identity matching; in Experiments 2 and 3, it was symbolic matching. We asked whether, when control of comparison choice by the sample was reduced (by inserting a delay between the sample and the comparisons), pigeons would choose comparisons on the basis of (1) the number of reinforcements per comparison (and thus show no comparison bias), (2) the comparison associated with the more frequent sample during training (and show a preference forC fr), or (3) the probability of reinforcement given a correct response (and show a preference forC inf), or (4) inhibition produced by nonreinforced choice of the more frequently correct comparison (and show a preference forC inf). Pigeons showed a significant tendency to chooseC fr. In Experiment 3, we showed that this bias did not result from the effects of intertrial facilitation or interference. Thus, it appears that when control of comparison choice by the sample is reduced, pigeons’ choice is controlled not merely by the probability of reinforcement but also by overall sample frequency.  相似文献   

2.
Pigeons responded in a two-component peak procedure in which the components differed in terms of reinforcement magnitude (Experiment 1), immediacy (Experiment 2), or probability (Experiment 3). The prediction of behavioral momentum theory that responding in the relatively richer component should be more resistant to change was tested by (1) presenting response-independent food in the intervals between components according to a variable-time (VT) schedule, (2) prefeeding, and (3) extinction. In all the experiments, peak location in baseline occurred earlier, relative to the schedule value in the richer component. Peak response rate was more resistant to change in the richer component during the VT and prefeeding tests, and change in peak rate was more sensitive to differential reinforcement than change in overall response rate. Changes in measures of performance on peak trials during the disruptor tests were partially consistent with predictions of the behavioral theory of timing. The results suggest that peak response rate provides a more sensitive index of resistance to change for fixed-interval schedules than does overall response rate and that reinforcement strengthens both peak responding and temporal control.  相似文献   

3.
Control of pigeons’ keypecking by a stimulus-reinforcer contingency was investigated in the context of a four-component multiple schedule. In each of three experiments, pigeons were exposed to a schedule consisting of two two-component sequences. Discriminative stimuli identifying each sequence were present only in Component 1, which was 4, 6, or 8 sec in duration, while reinforcers could be earned only in Component 2 (30 sec in duration). Control by a stimulus-reinforcer contingency was sought during Component 1 by arranging a differential relation between Component 1 cues and schedule of reinforcement in Component 2. In Experiment 1, rate of keypecking during Component 1 varied with the presence and absence of a stimulus-reinforcer contingency. When a contingency was introduced, rate of keypecking increased during the Component 1 cue associated with the availability of reinforcement in Component 2. In Experiment 2, the stimulus-reinforcer contingency was manipulated parametrically by varying the correlation between Component 1 cues and Component 2 schedules of reinforcement. Responding in Component 1 varied as a function of strength of the stimulus-reinforcer contingency. The relatively high rates of Component 1 responding observed in Experiments 1 and 2 pose difficulties for conceptions of stimulus-reinforcer control based on probability of reinforcement. In these two experiments, the stimulus-associated probabilities of reinforcement in Component 1 were invariant at zero. An alternate dimension of stimulus-reinforcer control was explored in Experiment 3, in which Component 1 cues were differentially associated with delay to reinforcement in Component 2, while probability of reinforcement was held constant across components. When the stimulus-reinforcer contingency was in force, rate of responding in Component 1 varied inversely with delay to reinforcement in Component 2. In a quantitative analysis of data from Experiments 2 and 3, relative rate of responding during Component 1 was strongly correlated with two measures of relative delay to reinforcement.  相似文献   

4.
In three experiments, we examined the effects of signaling reinforcement during operant responding in order to illuminate the factors underlying instrumental overshadowing and potentiation effects. Specifically, we examined whether signaling reinforcement produces an enhancement and attentuation of responding when the response-reinforcer correlation is weak and strong, respectively. In Experiment 1, rats responded on variable-ratio (VR) or variable-interval (VI) schedules that were equated for the number of responses emitted per reinforcer. A signal correlated with reinforcement enhanced response rates on the VR schedule, but attenuated response rates were produced by the signal on the VI schedule. In Experiment 2, two groups of rats responded on a VI schedule while the two other groups received a conjoint VI, negative fixed-ratio schedule in which the subjects lost the availability of reinforcements if they emitted high response rates. A reinforcement signal attenuated responding for the simple VI groups but not for the animals given the negative fixed-ratio component, although the signal improved response efficiency in both groups. In Experiment 3, a poor correlation between responding and reinforcement was produced by a VI schedule onto which the delivery of response-independent food was superimposed. A signal for reinforcement initially elevated responding on this schedule, relative to an unsignaled condition; however, this pattern was reversed with further training. In sum, the present experiments provide little support for the view that signaling reinforcement enhances responding when the response-reinforcer correlation is weak and attenuates responding when this correlation is strong.  相似文献   

5.
Gibbon  J.  Locurto  C.  Terrace  H. S. 《Learning & behavior》1975,3(4):317-324

Five groups of pigeons were studied in an auto-shaping procedure which programmed two types of trials represented by hues on the response key. Each signal was separated by a brief intertriai interval. Three groups were studied with a positive correlation between one of the signals and food (contingent groups). They differed with respect to the frequency with which the positive signal appeared. Two noncontingent groups were studied in which the correlation between the signals and food was eliminated by programming food with the same probability following either signal. One noncontingent group had a high density of reinforcement produced by adding reinforcement in the other signal, at the same rate as programmed in the positive signal for the contingent groups. The other noncontingent group experienced the same number of reinforcements in the session as the contingent group with the least frequent positive trial, but these reinforcements were distributed with equal probability across the signals. Birds in the contingent groups with intermediate or infrequent positive signals all acquired reliable pecking, with acquisition most rapid for the infrequent signal. Maintained responding covaried with the speed of acquisition. No birds in the noncontingent groups showed reliable responding. Birds in the contingent group with a frequent positive signal (approximately 3/4 of the session), also showed no reliable pecking. This result suggests that more than one noncontingent group is informative for assessing the role of differential reinforcement probability in the acquisition of auto-shaped keypecking. In particular, a noncontingent group which controls for the frequency of reinforced trials is an appropriate reference group.

  相似文献   

6.
Pigeons pecked on two response keys that delivered reinforcers on a variable-interval schedule. The proportion of reinforcers delivered by one key was constant for a few sessions and then changed, and subjects’ choice responses were recorded during these periods of transition. In Experiment 1, response proportions approached a new asymptote slightly more slowly when the switch in reinforcement proportions was more extreme. In Experiment 2, slightly faster transitions were found with higher overall rates of reinforcement. The results from the first session, after a switch in the reinforcement proportions, were generally consistent with a mathematical model that assumes that the strength of each response is increased by reinforcement and decreased by nonreinforcement. However, neither this model nor other similar models predicted the “spontaneous recovery” observed in later sessions: At the start of these sessions, response proportions reverted toward their preswitch levels. Computer simulations could mimic the spontaneous recovery by assuming that subjects store separate representations of response strength for each session, which are averaged at the start of each new session.  相似文献   

7.
Reed P 《Learning & behavior》2003,31(2):205-211
The effect of various relationships between a response (an investment made in the context of a game) and an outcome (a return on the investment) on judgments of the causal effectiveness of the response was examined. In Experiment 1, response rates and causal judgments were higher for a differential-reinforcement-of-high-rate (DRH) schedule relative to a variable-ratio (VR) schedule with the same probability of outcome following a response. Response rates were also higher for a DRH than for a variable-interval schedule matched for reinforcement rate. In Experiment 2, response rates and causal judgments were lower for a differential-reinforcement-of-low-rate schedule relative to a VR schedule with the same probability of outcome following a response. These results corroborate the view that schedules are a determinant of both response rates and causal judgments, and that few current theories of causal judgment explicitly predict this pattern of results.  相似文献   

8.
We present an algebraic model of resistance to extinction that is consistent with research on resistance to change. The model assumes that response strength is a power function of reinforcer rate and that extinction involves two additive, decremental processes: (1) the termination of the reinforcement contingency and (2) generalization decrement resulting from reinforcer omission. The model was supported by three experiments. In Experiment 1, 4 pigeons were trained on two-component multiple variable-interval (VI) 60-sec, VI 240-sec schedules. In two conditions, resistance to change was tested by terminating the response-reinforcer contingency and presenting response-independent reinforcers at the same rate as in training. In two further conditions, resistance to change was tested by prefeeding and by extinction. In Experiment 2, 6 pigeons were trained on two-component multiple VI 150-sec schedules with 8-sec or 2-sec reinforcers, and resistance to change was tested by terminating the response-reinforcer contingency in three conditions. In two of those conditions, brief delays were interposed between responses and response-independent reinforcers. In both Experiments 1 and 2, response rate was more resistant to change in the richer component, except for extinction in Experiment 1. In Experiment 3, 8 pigeons were trained on multiple VI 30-sec, VI 120-sec schedules. During extinction, half of the presentations of each component were accompanied by a novel stimulus to produce generalization decrement. The extinction data of Experiments 1 and 3 were well described by our model. The value of the exponent relating response strength and reinforcement was similar in all three experiments.  相似文献   

9.
Pigeons responded to changeover-key concurrent variable-interval variable-interval reinforcement schedules while there were intervals during which the changeover key was inoperative (no-choice intervals). In Experiment 1, a multiple schedule on the changeover key signaled choice and no-choice intervals. All subjects showed near-perfect discrimination during initial discrimination training and rapid reacquisition of discrimination following contingency reversals. In Experiment 2, the onset of no-choice intervals was unsignaled and contingent on interchangeover time. The temporal distribution of changeover-key responses conformed to the temporal distribution of choice intervals. The results of both experiments suggest that changeover responding is modifiable as a function of its immediate consequences. The results of Experiment 2, in particular, suggest that time or some correlate of time since the last changeover response can determine subsequent changeover behavior.  相似文献   

10.
Numerical competence has been studied in animals under a variety of conditions, but only a few experiments have reported animals’ ability to detect absolute number. Capaldi and Miller (1988) tested rats’ ability to detect absolute number by using biologically important events—the number of reinforced runs followed by a nonreinforced run—and found that the rats ran significantly slower on the nonreinforced run. In the present experiments, we used a similar procedure. Pigeons were given a sequence of trials in which responding on the first three trials ended in reinforcement but responding on the fourth trial did not (RRRN). When the response requirement on each trial was a single peck (Experiment 1), we found no significant increase in latency to peck on the fourth trial. When the response requirement was increased to 10 pecks (Experiment 2), however, the time to complete the peck requirement was significantly longer on the nonreinforced trial than on the reinforced trials. Tests for control by time, number of responses, and amount of food consumed indicated that the pigeons were using primarily the number of reinforcements obtained in each sequence as a cue for nonreinforcement. This procedure represents a sensitive and efficient method for studying numerical competence in animals.  相似文献   

11.
Four experiments assessed the role of reinforcement expectancies in the trial spacing effect obtained in delayed matching-to-sample by pigeons. In Experiment 1, a differential outcome (DO) group received reinforcement with a probability of 1.0 for correct comparison responses following one sample stimulus and a probability of 0.2 for correct comparison responses following the other sample stimulus. The nondifferential outcome (NDO) group received reinforcement with a probability of 0.6 for correct responses to either stimulus. While matching accuracy was higher for the DO group than for the NDO group, both groups showed an equivalent decline in accuracy as the intertriai interval (ITI) duration was decreased. However, within the DO group, ITI duration affected performance on low-probability-of-reinforcement trials but not on high-probability-of-reinforcement trials. In Experiment 2, delay interval (DI) duration was 5, 10, or 15 sec and accuracy was higher for the DO group than for the NDO group at all DI durations. In addition, accuracy decreased similarly on high- and low-probability-of-reinforcement trials for the DO group as DI was increased. In Experiment 3, all birds were studied under DO conditions and ITI duration was manipulated along with DI duration. At the short DI duration, decreasing ITI duration had a detrimental effect on low-probability-of-reinforcement trials but no effect on high-probability-of-reinforcement trials. At the long DI duration, decreasing ITI duration had detrimental effects on both types of trials. In Experiment 4, unsignaled ITI reinforcers disrupted accuracy when the DI was long and when the ITI was short. The applicability of scalar expectancy theory to these data is discussed.  相似文献   

12.
Pigeons were trained on a psychophysical choice task to make one response after a 2-sec signal and a different response after a 10-sec signal. Delayed dimensional control was assessed by presenting durations intermediate to the short and long signals and by introducing delays between the signals and choice opportunities. In Experiment 1, choices after intermediate durations were not reinforced; in Experiment 2, one choice was reinforced after the three shortest durations and another was reinforced after the three longest durations. In Experiment 1, the slopes of the psychophysical functions decreased with increases in delays, but the decrease in stimulus control was not unbiased; choice probabilities decreased for longer durations, but did not increase for shorter durations. Experiment 2 revealed the same generalized loss of stimulus control on the temporal dimension, but not the same pattern of bias; temporal control was relinquished equally for shorter and longer durations. These results are evaluated in the context of the subjective shortening model of remembered duration (Spetch & Wilkie, 1983) and Staddon’s theory of timing and remembering (Staddon, 1984).  相似文献   

13.
In Experiment 1, rats were trained to leverpress on a variable ratio (VR) 30 schedule with a 500-msec delay between the reinforced response and food delivery. Subjects that experienced a signal during the delay responded faster than did control subjects that received the stimulus un-correlated with reinforcement. Higher response rates were obtained when the stimulus used to signal reinforcement was auditory rather than visual. Experiments 2 and 3 compared the effects of signaling reinforcement with either a localized or a diffuse light on responding maintained by VR schedules of reinforcement. Elevated response rates were observed with the diffuse stimulus, but the localized stimulus failed to produce such potentiation. Experiment 3 also examined the conditioned reinforcing power of localized and diffuse visual stimuli. These results are discussed with reference to (1) theories of selective association and sign tracking and (2) their implications for current theories of signaling reinforcement.  相似文献   

14.
Pigeons pecked keys on concurrent-chains schedules that provided a variable interval 30-sec schedule in the initial link. One terminal link provided reinforcers in a fixed manner; the other provided reinforcers in a variable manner with the same arithmetic mean as the fixed alternative. In Experiment 1, the terminal links provided fixed and variable interval schedules. In Experiment 2, the terminal links provided reinforcers after a fixed or a variable delay following the response that produced them. In Experiment 3, the terminal links provided reinforcers that were fixed or variable in size. Rate of reinforcement was varied by changing the scheduled interreinforcer interval in the terminal link from 5 to 225 sec. The subjects usually preferred the variable option in Experiments 1 and 2 but differed in preference in Experiment 3. The preference for variability was usually stronger for lower (longer terminal links) than for higher (shorter terminal links) rates of reinforcement. Preference did not change systematically with time in the session. Some aspects of these results are inconsistent with explanations for the preference for variability in terms of scaling factors, scalar expectancy theory, risk-sensitive models of optimal foraging theory, and habituation to the reinforcer. Initial-link response rates also changed within sessions when the schedules provided high, but not low, rates of reinforcement. Within-session changes in responding were similar for the two initial links. These similarities imply that habituation to the reinforcer is represented differently in theories of choice than are other variables related to reinforcement.  相似文献   

15.
In three between-groups blocking experiments with rats, two concurrent and one forward, several common control procedures were employed: Reinforced trials with the putative blocking stimulus were either omitted entirely (Kamin control), replaced by unsignaled reinforcements (Wagner control), or replaced by reinforced trials with a different stimulus (C1 control). In each experiment, parallel treatments with the target stimulus absent during training served to examine the possibility that differential responding in tests with the target stimulus might be traced solely to differential exposure to the nontarget stimuli. In Experiment 1, responding by a concurrent blocking group during the test was no different than responding by a Kamin control group, and responding by a Wagner control group was greater than that of either of the other groups—a pattern of results, mirrored in the performance of the target-absent groups, that could be attributed to the elevation of contextual excitation by unsignaled reinforcement. In Experiment 2, responding in the test by a concurrent blocking group was no different than that by a C1 control group. In Experiment 3, a finding of less responding by a forward blocking group than by a C1 control group when the target stimulus was present during training, but not when it was absent, provided plausible evidence of blocking.  相似文献   

16.
Event-generated memory refers to the memory of a reinforcement (R) or nonreinforcement (N) event from an immediately preceding trial;signal-generated memory refers to the memory of a temporally remote R or N, retrieval of which is generated by presentation of a signal with which the memory is associated (Haggbloom, 1988). In each of three experiments, Group Signal-R received runway discrimination training in Phase 1 to establish a stimulus as a signal for R, and partial reinforcement training in Phase 2. An extinction test measured learning about the memory of nonreward (SN)—learning that occurs when SN is retrieved on R trials that follow N trials. In Group Signal-H, those R trials were accompanied by the signal for R, a treatment we hypothesized would generate retrieval of the memory of reinforcement (SR) so that signal-generated SR would replace event-generated SN as the operative memory, thereby eliminating the increased resistance to extinction normally produced by PRF training. In each experiment, Group Signal-R was less resistant to extinction than was a control group conditioned to respond to-event-generated SN. Extinction was as rapid in Group Signal-R as it was in a consistent reinforcement control group (Experiment 1) and in a group given intertrial reinforcements to interfere with learning about SN (Experiment 3). Experiment 2 tested two alternative interpretations of the failure to learn about SN in Group Signal-R. Those alternatives were found to be less viable than the hypothesis that the signal for R actively recruited retrieval of a competing memory.  相似文献   

17.
Three experiments investigated the effects of magnitude and schedule of reinforcement and level of training in instrumental escape learning at a 24-h intertriai interval. In Experiment I, two magnitudes of reinforcement were factorially combined with two schedules of reinforcement (CRF and PRF). Under PRF, large reward produced greater resistance to extinction than did small reward, while the reverse was true under CRF. In Experiment II, two levels of acquisition training were factorially combined with three schedules of reinforcement (CRF, single-alternation, and nonalternated PRF). Patterned running was observed late in acquisition in the single-alternation extended-training condition. Resistance to extinction was greater for the nonalternated PRF condition than for the single-alternation condition following extended acquisition, and the reverse was true following limited acquisition. Experiment III confirmed the extinction findings of Experiment II. The results of all three experiments supported an analysis of escape learning at spaced trials in terms of Capaldi’s (1967) sequential theory.  相似文献   

18.
In three experiments, we examined the effect on the patterns of responding noted on fixed interval (FI) schedules of prior exposure to a range of interval and ratio schedules. Rats leverpressed for food reinforcement on random ratio (RR), random interval (RI), or variable interval (VI) schedules prior to transfer to FI schedules. In Experiment 1, prior exposure to an RR schedule retarded the development of typical FI patterns of responding. Exposure to a yoked RI schedule produced even greater retardation of typical FI performance. This effect was replicated in Experiment 2, using a within-subjects design. Rats responded on a multiple RR-RI schedule prior to a multiple FI-FI schedule. Typical FI performance emerged more slowly in the component previously associated with the RI than with that associated with the RR. In Experiment 3, exposure to an RR schedule retarded the development of FI performance to a greater extent than did exposure to a VR schedule. The latter schedule was programmed to allow the possibility that inhibitory control would develop after reinforcement. These results confirm that ratio schedules independently result in the disruption of FI responding. This effect was not long lasting and cannot be used plausibly to explain species differences in responding to FI schedules. However, it does suggest that temporal control—as manifested by the transfer of inhibitory control from one schedule to another—could facilitate movement between interval schedules.  相似文献   

19.
In three experiments, we examined the effect of response-outcome relations on human ratings of causal efficacy and demonstrated that such efficacy ratings transfer to novel situations through derived stimulus relations. Causal efficacy ratings were higher, and probability of an outcome given a response was lower, for a differential reinforcement of high rate schedule than for either a differential reinforcement of low rate schedule (Experiment 1) or a variable interval schedule (Experiment 2). In Experiment 3, we employed schedules that were equated for outcome probability and noted that ratings of causal efficacy and the rate of response were higher on a variable ratio than on a variable interval schedule. For participants in all three experiments, causal efficacy ratings transferred to the stimulus present during each schedule and generalized to novel stimuli through derived relations. The results corroborate the view that schedules are a determinant of both response rates and causal efficacy ratings. In addition, the novel demonstration of a mechanism of generalization of these ratings via derived relations has clinical implications.  相似文献   

20.
Three experiments investigated the effects of percentage of reinforcement on the resistance to extinction of an instrumental running response. In Experiment 1, with N-length held constant, 47% reinforcement during acquisition generated greater resistance to extinction (Rn) than did 77%. In Experiment 2, this result was replicated with both functional N-length and number of N-R transitions held constant. In Experiment 3, Rn was shown to be a function of both N-length and percentage of reinforcement. The results of all three experiments were discussed in terms of Capaldi’s reinforcement level theory and possible alternative explanations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号