Second-Order Conditioning and Conditioned Inhibition: Influences of Speed versus Accuracy on Human Causal Learning

Jessica C. Lee; Evan J. Livesey

doi:10.1371/journal.pone.0049899

Abstract

In human causal learning, excitatory and inhibitory learning effects can sometimes be found in the same paradigm by altering the learning conditions. This study aims to explore whether learning in the feature negative paradigm can be dissociated by emphasising speed over accuracy. In two causal learning experiments, participants were given a feature negative discrimination in which the outcome caused by one cue was prevented by the addition of another. Participants completed training trials either in a self-paced fashion with instructions emphasising accuracy, or under strict time constraints with instructions emphasising speed. Using summation tests in which the preventative cue was paired with another causal cue, participants in the accuracy groups correctly rated the preventative cue as if it reduced the probability of the outcome. However, participants in the speed groups rated the preventative cue as if it increased the probability of the outcome. In Experiment 1, both speed and accuracy groups later judged the same cue to be preventative in a reasoned inference task. Experiment 2 failed to find evidence of similar dissociations in retrospective revaluation (release from overshadowing vs. mediated extinction) or learning about a redundant cue (blocking vs. augmentation). However in the same experiment, the tendency for the accuracy group to show conditioned inhibition and the speed group to show second-order conditioning was consistent even across sub-sets of the speed and accuracy groups with equivalent accuracy in training, suggesting that second-order conditioning is not merely a consequence of poorer acquisition. This dissociation mirrors the trade-off between second-order conditioning and conditioned inhibition observed in animal conditioning when training is extended.

Citation: Lee JC, Livesey EJ (2012) Second-Order Conditioning and Conditioned Inhibition: Influences of Speed versus Accuracy on Human Causal Learning. PLoS ONE 7(11): e49899. https://doi.org/10.1371/journal.pone.0049899

Editor: Reginald Frederick Westbrook, University of New South Wales, Australia

Received: September 9, 2012; Accepted: October 17, 2012; Published: November 28, 2012

Copyright: © 2012 Lee, Livesey. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Funding: This research was supported by a University of Sydney Bridging Support Grant awarded to EJL. No external sources of funding contributed to this research. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

In a typical human causal learning experiment, cues are presented that may increase or decrease the likelihood of a particular outcome and the participant’s task is to assess to what degree each cue either causes or prevents that outcome. Various authors have suggested that this process involves elementary associative learning mechanisms because results from many of these experiments bear strong resemblance to animal conditioning phenomena (e.g. [1], [2]). Conditioned inhibition – or learning about a cue that has a negative contingency with an outcome – is one such example.

Conditioned inhibition results from experience with a feature negative (FN) discrimination, where one cue leads to an outcome (A+), but when it is paired with a second cue, no outcome occurs (AX−). After sufficient training with these contingencies, the test stimulus (X) typically acquires inhibitory properties, such that its presence reduces responding in animal conditioning [3] or lowers ratings of causation or contingency in human learning [4]. In other words,×becomes a conditioned inhibitor as a consequence of its negative contingency with the outcome. When paired with another cue that has previously signaled the outcome (e.g. B+) the conditioned inhibitor reduces behavioral anticipation of the outcome that would normally be elicited by B (i.e. a summation test; [3]). In human causal learning, some doubt has been cast over several experiments that purport to show conditioned inhibition because of the choice of appropriate controls (see [5]). However, several experiments have found evidence of conditioned inhibition using a conservative test in which the ratings for the critical summation test compound BX are compared to ratings for a compound of B and a neutral or novel stimulus [5], [6]. In these studies, ratings for BX were substantially diminished, indicating that learning about×reduces causal ratings above and beyond what would be expected from a simple external inhibition effect; the reduction in ratings produced by pairing B with any other stimulus that has not been paired with the outcome [5]. Thus, like several other phenomena, conditioned inhibition appears to be common to a range of very different learning paradigms from Pavlovian conditioning to human causal judgment.

The general conclusion that human judgments of causation have an associative basis has been challenged on several grounds, including parsimony [7]. Humans display cognitive abilities such as deductive reasoning (e.g. [8]) and rule abstraction [9], [10] that could succinctly explain many of the causal learning results without recourse to primitive learning mechanisms. The task of separating the contributions of associative learning from other forms of cognition is made difficult by the fact that most experimental results in causal reasoning and contingency judgement are consistent with multiple explanations. Conditioned inhibition, for instance, could be explained as the formation of an inhibitory link between the conditioned inhibitor×and the outcome, which negates excitatory associations between other cues and the outcome. Such explanations follow naturally from the mechanisms described in many associative learning models (e.g. [11]). However, alternatively one could interpret this as the participant forming an inference that cue×prevents the occurrence of the outcome [12]. These explanations are by no means mutually exclusive but both effectively account for the learned properties of the conditioned inhibitor.

Given this general problem of dissociating psychological processes from one another, the FN paradigm is particularly interesting because under some circumstances, the cue (X) that possesses a negative contingency with the outcome actually appears to acquire excitatory rather than inhibitory properties [13], [14], [15]. This effect is often referred to as second-order conditioning because×acquires excitatory properties via its association with an excitatory cue (A) that is paired directly with the outcome. Several animal learning studies suggest that a transition from second-order conditioning to conditioned inhibition occurs through the course of training, with inhibition developing slowly. For example, Yin, Barnet, and Miller [15] have shown that conditioned inhibition manifests only with extended training with the FN contingencies intermixed, while second-order conditioning is evident with fewer training trials, presented either interspersed or in a blocked (A+ then AX−) design.

Second-order conditioning is noteworthy because normative and inferential models predict that×should not be treated as a cause of the outcome, given its negative contingency (X never appears with the outcome). For this reason, the mere fact that second-order conditioning occurs is viewed as being potentially diagnostic of the psychological mechanisms involved in learning [16]. Evidence for this effect in human causal learning can be found in a study reported by Karazinov and Boakes [17], who found second-order conditioning by limiting participants’ time to think on each trial. Each participant completed a causal scenario in which they played the role of a doctor attempting to discover which foods consumed by a fictitious patient were causing migraine headaches. Participants in one group completed the training phase of the experiment in a self-paced fashion (as is usually the case in causal learning tasks), whereas another group were limited to three seconds to respond on each training trial. Embedded amongst several other contingencies, the participants were given a FN discrimination (P+/PX−), where the addition of×to P prevented a migraine from occurring. However, instead of judging the test stimulus (X) to be preventative of the outcome, as did the self-paced group, in both experiments the paced group gave the test cue a higher causal rating than they did a non-causal control cue (M) trained in compound (LM−). Results from the typical summation tests – comparing×to M in compound with a trained excitor (T+) – suggested a similar pattern. Experiment 1 revealed a group interaction whereby TX was rated higher than TM in the paced group, but neither conditioned inhibition nor second-order conditioning was evident in the unpaced group. In Experiment 2, the unpaced group rated TX lower than TM (consistent with conditioned inhibition) but no group interaction was evident and the paced group did not rate TX higher than TM.

Shanks ([16]; see also Mitchell et al., [7]) has recently cited this result as a compelling example of causal learning taking a form that defies any obvious explanation in terms of rational inference, suggesting instead the operation of associative processes in human causal learning. The result is particularly noteworthy because effects indicating excitatory and inhibitory learning were revealed with training on the same contingencies, albeit not within the same experiment. Other cue competition effects are known to be sensitive to the conditions of learning in a seemingly similar fashion. These include retrospective revaluation effects (e.g. mediated extinction versus release from overshadowing; [18]) and the evaluation of a redundant cue (e.g. blocking versus augmentation; [19]), which will be briefly discussed in relation to Experiment 2. However, by and large, studies rarely observe cue contingency effects of this nature occurring in both excitatory and inhibitory directions on the basis of a single manipulation. Karazinov and Boakes’ [17] results constitute the best evidence for a non-rational second-order conditioning effect in human causal learning. However, even in their study, excitatory and inhibitory simple effects were not found in the same experiment. The potential significance of the effect and the somewhat equivocal nature of Karazinov and Boakes’ result make it all the more important to replicate this dissociation and to examine its properties.

The primary aim of this study was to garner further evidence for Karazinov and Boakes’ [17] dissociation in the FN paradigm by varying additional training parameters in addition to their pacing manipulation, providing a stronger impetus to respond either as quickly or as accurately as possible. However, unlike Karazinov and Boakes, we wished to obtain the dissociation using an identical set of test stimuli to find effects consistent with conditioned inhibition and second-order conditioning. Both experiments used a between-subjects design to manipulate trial time (unpaced versus paced trials), accompanied by instructions and feedback that emphasized the importance of either accuracy or speed during learning. Participants given self-paced trials and instructions to be as accurate as possible were expected to show learning consistent with conditioned inhibition, as has been observed in similar causal learning tasks previously (e.g. [4]). Participants given trial time limits and instructions emphasizing speed were expected to show second-order conditioning, consistent with Karazinov and Boakes’ [17] findings. In each experiment, participants assumed the role of a pharmaceutical researcher learning about the effects of different drugs that could cause potential side-effects. The cues were novel drug names (e.g. Slevoral, Melixil), and the possible outcomes were the occurrence of migraine (Experiments 1 and 2), nausea (Experiment 1 only), or no outcome. Experiment 1 focused on the feature negative contingencies in a complex causal learning task involving multiple outcomes. Experiment 2 examined the effect of trial time restriction on other cue contingency effects in addition to the FN discrimination. To test the claim that normative and inferential models do not predict second-order conditioning [17], an inference test in Experiment 1 aimed to show that conditioned inhibition was the rational judgement that should have resulted in the speed group.

Experiment 1

Experiment 1 primarily aimed to dissociate excitatory and inhibitory learning resulting from acquisition of the FN discrimination, using instructions, feedback and trial time limits to emphasise either speed or accuracy during training. In addition to the stimuli directly involved in the FN paradigm, other stimuli were included to assess transfer of learning and to function as filler cues (Table 1). The experiment used a scenario in which two possible side effects could occur as outcomes. Thus, each trial type was associated with “migraine”, “nausea”, or “no outcome”. Each participant completed two sets of FN discrimination and related control trials, one set involving migraine as the potential outcome, the other involving nausea (see Table 1).

Download:

Table 1. Cues and outcomes used in the training phase of Experiment 1.

https://doi.org/10.1371/journal.pone.0049899.t001

After training, both groups were given a self-paced ratings test, in which they were shown drug cues (or combinations of cues) and had to indicate the degree to which they expected each of the two side-effects to occur. The ratings test yielded two kinds of scores: outcome-specific ratings (specifically using the rating for the associated outcome during training) and the ratings difference scores (the difference between the ratings for the associated outcome and the alternative outcome). For example, the outcome-specific score for A₁ was the rating for outcome 1 only, and the difference score was obtained by subtracting the rating for outcome 2 from the rating for outcome 1. The difference scores were included as a means of gauging outcome specificity in learning, allowing for learning that “X causes/prevents O1” to be distinguished from the generalised learning of “X causes/prevents a side-effect”, which would manifest as a change in ratings for both scales (e.g. see [20]).

To assess learning, non-causal cues C₁ and C₂ were combined with trained excitors (B₁ and B₂) to form a novel control compound, which would then be compared with a novel compound consisting of the test cues (X₁ and X₂) and the same trained excitors (B₁ and B₂). Thus, the presence of conditioned inhibition or second-order conditioning was assessed via a summation test by comparing these critical test stimuli B₁X₁ and B₂X₂, to controls B₁C₁ and B₂C₂. If participants had genuinely learned that the test stimuli (X₁ and X₂) were inhibitors, they should rate the probability of their respective illnesses occurring as being low when they are paired with different excitors, compared to when the excitors are paired with the non-causal (but also non-preventative) control cues (C₁ and C₂). This was thought to be a conservative but necessary measure of conditioned inhibition, since it is known that combining a trained excitor with another stimulus results in lower predictive ratings due to reasons other than conditioned inhibition (see [5], [6], [21]). Since the aim was to obtain the group interaction on the same test cues, the choice of control cue was driven by the need to compare excitatory and inhibitory learning with an unambiguously non-causal cue.

Conversely, a higher rating for BX than for BC indicates second-order conditioning has occurred as it suggests that the presence of×has an excitatory rather than an inhibitory relationship with the outcome. This is an atypical measure for second-order conditioning, which has conventionally involved testing individual stimuli. However, it is appropriate in this case for two reasons. First, both BX and BC are novel compounds and any effect on ratings generated by uncertainty about new combinations of drugs will affect both. Second, it provides a direct comparison with the evidence for conditioned inhibition. By any conventional analysis based on associative learning principles, the excitatory strength of B should not inflate ratings of BX any more than BC and thus if BX receives a higher rating than BC, it should be based on the participant’s evaluation of×vs. C. Following from both the animal literature and Karazinov and Boakes’ [17] results, it was expected that conditioned inhibition would be evident in the accuracy group. The question of most interest was whether this effect would interact with the group manipulation and, more specifically, whether second-order conditioning would occur in the speed group, where the opportunity to reflect on each trial is restricted. A self-paced inference test at the end of the experiment sought to clarify whether conditioned inhibition was considered a rational judgement, and specifically, whether the speed group would still show second-order conditioning when given the opportunity to reason about the contingencies.

Method

Participants.

Fifty-two first-year psychology students from the University of Sydney participated in exchange for partial course credit. Five participants who scored below 35% (slightly above chance) accuracy for the feature negative stimuli (mean of A₁, A₂, A₁X₁ and A₂X₂) in the last quarter of the training phase were excluded, leaving 23 participants in the speed condition, and 24 in the accuracy condition (37 female, mean age = 19.8 years). All participants gave written informed consent and the procedure was approved by the University of Sydney Human Research Ethics Committee.

Apparatus.

The experiment was programmed using Psych Toolbox for Matlab [22], [23] and run on Apple Mac Mini desktop computers connected to 17 inch CRT monitors, refreshed at a rate of 85 Hz. Participants made their responses using a standard Apple keyboard and mouse. Testing was conducted in individual cubicles in groups of up to five, with sound feedback delivered via personal headphones.

Procedure.

In the training phase, participants were asked to assume the role of a pharmaceutical researcher whose job was to determine the effects of different drugs using trial and error. On each trial a drug or combination of drugs was presented and participants were asked to predict which of three possible outcomes they thought might occur (migraine, nausea or no outcome) by clicking on one of the buttons below the drug names. When an answer was selected, the box surrounding the outcome turned yellow, the three buttons disappeared and were replaced by the correct answer while the drug names remained on the screen. The drug names appeared in one of 3 colours (blue, green or red) and either a picture of a sad face or medicine was displayed on the feedback screen if the correct outcome was one of the illnesses. The choice of cue colour and picture was not systematically related to particular cues or outcomes.

Participants in the accuracy group were told to do the task as accurately as they could and to take their time, receiving a buzzer tone and the word ‘INCORRECT’ on the top of the screen if they made an error, as well as the word “correct” in smaller font if they chose correctly. Participants in the speed group were told to complete the task as fast as they could and were given only 1.5 seconds to respond, after which a buzzer tone was heard and the word ‘FASTER’ appeared at the top of the screen and no response recorded. The speed group were not given any feedback as to whether they were correct or incorrect and were only shown the correct answer. All contingencies were consistent throughout and therefore each stimulus presentation fully predicted a particular outcome. There were 8 blocks of 24 trials presented continuously without break for the entire training phase (192 trials in total). Within each block there were 2 repetitions of the 12 trial types (see Table 1), with their order of appearance randomised within each block. The spatial presentation of stimuli within each compound was counterbalanced so equal numbers of each were seen (e.g. AX and XA).

In the ratings test, participants were asked to rate the likelihood of each of the two outcomes occurring given the presence of one or two of the drug cues. On each trial, the drug name(s) appeared at the top of the screen, followed by two linear analogue scales appearing next to each of the outcome names (i.e. one scale for migraine, one for nausea). The end points of each scale were labelled “definitely will not occur” to “definitely will occur”. Participants could click anywhere on the scale, yielding ratings ranging from 0–100. The order of presentation was randomised, with each single-cue stimulus presented once, and each compound twice, again with the order of presentation within each compound counterbalanced. The ratings test was self-paced.

The last phase of the experiment (the inference test) aimed to extract a rational predictive judgement about the test stimuli by presenting all the relevant contingencies in the summation test at once on the screen. Participants were told that they would be viewing the results of the drugs again and could make another reasoned judgement which could be the same or different as before. Participants were shown that A₁ led to outcome 1, A₁X₁ led to no outcome, B₁ led to outcome 1 and C₁ led to no outcome (A₁+/A₁X₁−/B₁+/C₁−). They were then asked to rate how likely both outcomes 1 and 2 were to occur for the compounds B₁X₁ and B₁C₁ (the same compounds used in the summation test). These ratings were made in the same fashion as the predictive ratings, with all scores transformed to a scale of 0–100. This was then repeated for the corresponding stimuli with outcome 2 (A₂+/A₂X₂−/B₂+/C₂−, test C₂X₂ and C₂E₂). All drug name allocations and drug-illness contingencies were the same as in training, with all writing presented in white on a black background.

Results and Discussion

All analyses were performed with an alpha level of.05 and Greenhouse-Geisser adjusted p-values are reported where relevant.

Training.

Figure 1 shows accuracy for each stimulus type across training, averaged in four equal blocks. Over all stimuli, the accuracy group were more accurate throughout all training blocks, lowest F(1, 46) = 5.09, p = .029, and overall, F(1, 46) = 18.32, p<.001. As expected, the speed group responded faster overall, F(1, 46) = 61.78, p<.001, by a mean of 1.00 seconds per trial. Even in the last 24 trials when the accuracy group were at their best performance (about 90.5% correct), they were still slower than the speed group by 0.50 seconds, F(1, 46) = 28.51, p<.001. Thus the pacing and instructions succeeded in manipulating both accuracy and the time spent on each trial.

Download:

Figure 1. Training accuracy in Experiment 1 over blocks. Averaged over trial type (i.e. A₁ and A₂ averaged into A).

Upper panels: speed group, lower panels: accuracy group. Left panels: feature negative stimuli and controls, right panels: filler stimuli.

https://doi.org/10.1371/journal.pone.0049899.g001

Ratings test.

Table 2 shows the ratings given for all stimuli tested for both groups. For the summation test, variables were computed to represent the mean of the test stimuli (B₁X₁ and B₂X_2, henceforth BX) and the mean of the control stimuli (B₁C₁ and B₂C₂, henceforth BC) as difference scores (O1–O2 for B₁X₁ and B₁C₁, O2–O1 for B₂X₂ and B₂C₂), and as outcome-specific scores (O1 for B₁X₁ and B₁C₁, O2 for B₂X₂ and B₂C₂).

Download:

Table 2. Ratings for both outcomes in Experiment 1 (distractors omitted).

https://doi.org/10.1371/journal.pone.0049899.t002

For both these scores, in the accuracy group, the test stimuli were rated lower than the controls, whereas the reverse was found in the speed group (Figure 2). Two separate 2×(2) repeated measures ANOVAs with group as the between-subjects factor comparing the differences between test and control found a significant interaction with group using both the difference scores, F(1, 45) = 4.40, p = .042, and the outcome-specific scores, F(1, 45) = 4.32, p = .043 (Figure 2).

Download:

Figure 2. Summation tests comparing control stimuli (BC) against test stimuli (BX) for both outcomes.

Left panel: difference scores, calculated as the difference between O1 and O2 ratings for B₁X₁ and B₁C₁, and the difference between O2 and O1 ratings for B₂X₂ and B₂C₂. Right panel: outcome-specific scores, using the rating for O1 only for B₁X₁ and B₁C₁, and the rating for O2 only for B₂X₂ and B₂C₂. Error bars represent the standard error of the mean difference between test and control.

https://doi.org/10.1371/journal.pone.0049899.g002

To detect the presence of second-order conditioning in the speed group and conditioned inhibition in the accuracy group, the difference between the test (BX) and control (BC) stimuli was analysed for each group separately. Using the difference scores, there was no significant difference between the test and control stimuli for the speed group, F<1, but within the accuracy group, the test stimuli were rated significantly lower than the controls, F(1, 22) = 4.43, p = .046, consistent with conditioned inhibition (Figure 2, left panel). This meant that participants given instructions emphasising accuracy showed a reduced preference for predicting the outcome associated with the excitor relative to the unrelated outcome. Using the same analysis on the outcome-specific scores, evidence of second-order conditioning was found in the speed group with the test stimuli rated significantly higher than the controls, F(1, 22) = 7.83, p = .010 (Figure 2, right panel), but no evidence of conditioned inhibition in the accuracy group, F<1. This indicates that for the speed group, there was a general inflation of prediction ratings for both outcomes in the presence of the test stimulus rather than an increased preference towards predicting its related outcome.

Thus the main hypothesis that different instructions and trial timeouts emphasising speed or accuracy could produce opposing patterns of learning was supported, with the speed group displaying second-order conditioning and the accuracy group showing conditioned inhibition. Notably, group interactions were obtained on both summation test measures, extending the findings of Karazinov and Boakes [17], who detected second-order conditioning and conditioned inhibition across experiments but failed to find a significant interaction of the summation test with group. It is interesting that conditioned inhibition was found with the difference scores, a measure more sensitive to outcome specificity, and second-order conditioning was found on the outcome-specific scores where the unrelated outcome was not considered. This finding is perhaps a reflection of the nature of these learning effects. The learned properties of a conditioned inhibitor are thought to be bound to the outcome associated with the paired excitor [3], which may explain why conditioned inhibition manifested on the difference scores. Meanwhile second-order conditioning may have emerged in the speed group because participants regarded×as contributing to the likelihood of a side-effect occurring but could not remember which outcome this test stimulus had been paired with (in fact, it was never paired directly with either migraine or nausea). Hence, the observed effect was to inflate ratings of BX relative to BC on both outcome scales.

It is also worth noting that while all participants rated an unambiguously causal cue (B) quite high, and an unambiguously non-causal cue (C) quite low (see Table 2), ratings to the control compound BC were substantially lower than to B, indicating a strong generalization decrement resulting from the addition of a non-causal cue. Additional ANOVAs were used to compare the trained cues (B₁/B₂) against the summation controls (B₁C₁/B₂C₂), using the outcome-specific scores (O1 for B₁ and B₁C₁, O2 for B₂ and B₂C₂) in one analysis and the difference scores (O1–O2 for B₁ and B₁C₁, O2–O1 for B₂ and B₂C₂) in the other. Both analyses yielded significantly higher scores for B than for BC (smaller F(1,44) = 51.08, p<.001), but neither revealed an interaction with group (Fs<1), suggesting that the amount of generalization decrement resulting from the addition of the non-causal cue C was very similar in the two groups. While the ratings for BC seem to differ between groups, this is probably due to the fact that B itself received different ratings from the two groups.

Inference test.

Due to its similarity to the ratings test, the data from the inference test were analysed in the same way, with variables computed to represent the average of the test stimuli and the controls, using both difference scores and outcome-specific ratings. A 2×(2) repeated measures ANOVA on the difference scores (Figure 3, left panel) revealed a significant difference between the test stimuli and the controls, F(1,45) = 19.19, p<.001, and this did not interact with group, F<1. A 2×(2) repeated measures ANOVA was also conducted on the outcome-specific scores, yielding a significant difference between the test and control stimuli, F(1,45) = 18.21, p<.001, but again no interaction with group, F(1,45) = 1.49, p = .229 (Figure 3, right panel). Thus neither analyses revealed any group differences on the inference test, with both groups rating the test compounds lower than the controls, consistent with conditioned inhibition.

Download:

Figure 3. Results from summation test stimuli presented again during inference test.

Presented A+/AX−/C+/DE-, tested CX, CE, for both outcome 1 and 2. Left panel: difference scores, calculated as the difference between O1 and O2 ratings for B₁X₁ and B₁C₁, and the difference between O2 and O1 ratings for B₂X₂ and B₂C₂. Right panel: outcome-specific scores, using the rating for O1 only for B₁X₁ and B₁C₁, and the rating for O2 only for B₂X₂ and B₂C₂. Error bars represent the standard error of the mean difference between test and control.

https://doi.org/10.1371/journal.pone.0049899.g003

The above results indicate that both groups judged the test stimuli as inhibitory given the chance to reflect on, and make a rational inference about, the contingencies. This suggests that speeded learning conditions can result in learning that is non-rational. While this does not necessarily entail that learning was driven purely by associative mechanisms, it does suggest that a type of learning that is better explained by associative theory emerges when people learn under strict time constraints, processing information quickly and without careful thought. This experiment also shows that the speed group switched to judging the test stimuli as inhibitory once given the chance to reflect carefully on the contingencies, suggesting that second-order conditioning is not readily explained by a reasoned inference. Since the current manipulation was successful in dissociating learning in the FN paradigm, Experiment 2 sought to replicate this dissociation and examine its relationship with other cue competition effects that are thought to be mediated by cognitive resources.

Experiment 2

The results of Experiment 1 support Karazinov and Boakes’ [17] conclusion that limiting time to think during learning a FN discrimination may yield the conditions necessary to observe second-order conditioning rather than conditioned inhibition. Experiment 2 sought to replicate the dissociation found in Experiment 1, provide stronger evidence of second-order conditioning and conditioned inhibition using the same test measures, and examine whether other seemingly irrational learning effects could also be obtained using the speed manipulation. Recently, Vadillo and Matute [19] described such an effect using a blocking design. Blocking occurs when a target cue is paired with an outcome, but is always presented in compound with another cue that has previously been established as a strong predictor of that outcome. Typically, the target cue is rated as being less likely to cause the outcome than control cues that are only trained in compound. Thus, learning about the target cue is blocked by the presence of a strong predictor. This blocking effect is routinely observed in animal conditioning and, under many conditions, is also reliably found in causal learning experiments (e.g. [24], [25]). However, under time pressure, Vadillo and Matute observed augmentation rather than the typical blocking effect, where the target cue was given a higher rating than the control cues. This is a striking finding as it is not easy to explain in terms of either rational inference or associative learning principles, which normally predict blocking. Although, in theory, the target cue could acquire some excitatory strength via the within-compound association, demonstrations of augmentation are rare in the animal and human learning literature.

The time-to-think hypothesis proposed by Karazinov and Boakes [17] is also clearly relevant to another set of cue competition effects – retrospective revaluation effects – that are assumed to require the retrieval of learned cues in order for revaluation of learning to occur. An example relevant to the current design is release from overshadowing; after a compound of two cues are paired with the outcome, one cue is then presented individually and is shown not to cause the outcome. The individual presentations of this cue affect the ratings of the target cue that is not presented, relative to controls that are simply trained in compound. Typically, participants rate the target cue as being more likely to cause the outcome than the controls [26]. However, presentation of the non-causal cue in isolation can sometimes have the opposite effect on ratings of the target cue, an effect called mediated extinction. Using an AB+/A− design, Liljeholm & Balleine ([18], Experiment 1) found that encouraging the amount of generalization that occurred between A and B resulted in judgments consistent with mediated extinction. This was achieved by using visual cues that were joined spatially so as to encourage configural processing. Their study sheds some light on why mediated extinction is usually found in animal studies where the cues consist of flavours and odors (e.g. [27]), and release from overshadowing found in causal judgement tasks, where participants are more likely to treat the cues as independent causes of the outcome. As with second-order conditioning and augmentation, it seems difficult to account for mediated extinction in terms of a logical inference. But in this case, the associative explanation of this seemingly irrational effect relies upon inhibitory learning (or the weakening of associations) during the extinction of A- being transferred to B through within-compound associations. Both release from overshadowing and mediated extinction require retrospective revaluation of a cue that is not presented and therefore rely to some extent on retrieval of cues on the basis of within-compound associations.

In Experiment 2, the FN contingencies were retained and new contingencies were added to assess retrospective revaluation and blocking. Retrospective revaluation was examined using a blocked feature positive (FP) discrimination (FY+ followed by F−) where the critical revaluation effects were revealed by comparing ratings of the test cue Y with ratings of control cues (G and H) that were previously trained in compound (GH+). Learning about a redundant cue was examined using the conventional forward blocking design in which one of two cues is pretrained (I+) and followed by compound training with the target cue (IZ+).

Training in Experiment 2 was divided into two phases with the FP and blocking stimuli presented in separate phases, and the FN stimuli presented consistently throughout (see Table 3 for design). All other training parameters and instructions were the same as in Experiment 1. Due to the increased complexity of the design, the second outcome was omitted to reduce the total number of training stimuli so that the only outcomes were migraine (+) and no outcome (–). Karazinov and Boakes [17] used a control compound consisting of a trained excitor and a control cue (TM) where M was trained in compound (LM−), which differs slightly from the control used in Experiment 1, where we combined a trained excitor with a control cue (C−) which unambiguously predicted no outcome. While they were very similar, the subtle differences between these controls may prove to be important and as such, two control compounds were included in Experiment 2: BC, as used in Experiment 1, and BE where E was previously non-causal but trained in compound (DE−).

Download:

Table 3. Cues used in the training phase and ratings test of Experiment 2.

https://doi.org/10.1371/journal.pone.0049899.t003