Investigating how positive and negative feedback
affects performance
Learning, Memory & Perception
Tutor: Aileen Proudlock
Dawn McManus
0057466
Introduction
The subject of reinforcement
has been widely publicised and many studies and articles have probed the
subject. Probably the most famous researcher in this area is Skinner. Although
the title ‘Founding Father of Behaviour’ is attributed to B.F.Skinner, the
roots of the approach go back to before the 19th century (Cardwell, Clark & Meldrum 1996: p521).
Skinner himself acknowledged the influence of Thorndike in his work, especially
his description of a ‘law of effect’. This law states that any behaviour that
leads to a successful outcome will tend to be repeated in the future. (Thorndike 1911, cited in Cardwell, Clark
& Meldrum 1996: p521).
Outcomes that are dependent on behaviour can be
desirable or aversive. For instance, a positive stimulus is known as positive
reinforcement and could promote the behaviour. However, a negative stimulus
could be seen as a punishment and could serve to decrease the behaviour (Anderson 2000). The definitions of the
different types of reinforcements and punishments we are concentrating on are
simple. Positive reinforcement would be defined as a ‘pleasant stimulus that
follows a desired behaviour’ and would increase the likelihood of the desired
behaviour. Positive punishment would be defined as ‘Presentation of an
unpleasant stimulus after an undesired behaviour occurs’ and could decrease the
likelihood of the undesired behaviour (Atkinson
et al 2000: p245). The main focus of this research project covered the
aspects of positive reinforcement and positive punishment, or otherwise known
as positive and negative feedback.
Many studies have looked into the idea of feedback
and performance, and whether there is a relationship between the two. Berkowitz
et al conducted an experiment on 25 eighteen to twenty eight year olds, who
were asked to find the maximum height of an unseen hill by choosing a pair of
co-ordinates. They received different levels of feedback on the height
achieved. It was found that restricting the levels of feedback to the
participant, significantly reduced performance (Berkowitz, Lewis & Drury 1983). This would suggest that feedback is an important tool for
learning. Cohen & Herr found that out of 35 college teachers who
participated in their study, many were rated higher by their students if they
had administered interactive feedback, than those who were in the no feedback
group (Cohen & Herr 1982). However
feedback is not only a useful teaching tool. Philibin & Seidenstadt
discovered from their time production experiment on 42 female undergraduates
that partial feedback on 50% of the trials enhanced performance as effectively
as 100% feedback did (Philibin &
Seidenstadt 1983). This would indicate that feedback of any amount is
helpful to performance. Matsui et al also looked at feedback and performance.
They originally tested 87 female students who conducted a 10-minute arithmetic
trial. After 5 minutes they received feedback on the number attempted. From
this it was suggested that the feedback given induced a larger amount of effort
from the participants. They replicated the study using 103 male students and
this also found that feedback improved performance (Matsui Okada & Inoshita 1983). A similar study by Scriven
& Glynn also indicates that feedback results in major gains to performance
in given tasks (Scriven & Glynn
1983). It has to be said that although these studies indicate a positive
reaction to feedback, they don’t distinguish whether the feedback is positive
or negative. Pavett however did just this.
In her 1983 study on feedback and performance, it
was hypothesised that the frequency of positive and negative feedback would
make a significant contribution to the prediction of supervisory rated
performance. The results indicated that amongst the 203 staff tested, positive
rather than negative feedback served as an independent predictor of performance
(Pavett 1983). An earlier study by
Vitulli looked at not only positive and negative feedback, but also on no
feedback at all. This study looked at the varying degrees of feedback on the
number of correct guesses at a computerised ESP test. The computer program
provided immediate feedback. Vitulli found that there was a significant mean
score for each of the three feedback groups but a comparison test showed the
largest mean difference between positive and negative feedback, with results
indicating a close parallel between the effects of feedback and the number of
correct guesses (Vitulli 1982).
However a further Vitulli study on these three degrees of feedback on a
computerised psi study, showed no effect (Vitulli
1983). Overall, with these studies in mind, it is suggested that feedback,
whether positive or negative, does indeed have an impact on our learning
performance.
Aim
As a result of the above literature, the aim of this
study is to investigate the differential effects of positive and negative
feedback.
H1: There will be a difference in
reaction time between participants subjected to positive and negative feedback.
H0: There will be no difference in
reaction time between participants subjected to positive and negative feedback.
H1: There will be a difference in errors
made between participants subjected to positive and negative feedback.
H0: there will be no difference in errors
made between participants subjected to positive and negative feedback.
This study was conducted using a semantic categorisation task with eight blocks of stimuli, either auditory or visual. The stimuli were randomised within each block for each participant. The order of the blocks was counter balanced, and the participant paused after each block. Auditory and Visual stimuli were matched on the basis of their word frequency, or how often we encounter the words in real life; and divided into large and small on the basis of the ratings by judges. There was a 1000 msec gap between the presentation of each stimulus. Each participant was given the same on screen instructions and heard the same auditory voice as a control measure. The study implements an independent measures design as each of the participants were subjected to only one level of the independent variable. In this case the independent variable was the feedback given, running on two levels: positive feedback and negative feedback. There are two dependent variables in this design. DV1 being the reaction time in each of eight blocks of tests after the feedback was given. DV2 is the errors made occurring in each of the eight blocks of tests, i.e. correct or incorrect answers.
The study was conducted
using 78 naïve psychology undergraduates (N) of which 63 were female and 15
were male. The sample was opportunity as it was quick, easy and could be
carried out in the Mac lab.
The materials used were a
Macintosh computer with keyboard and monitor, a Superlab program, and suitable
headphones.
Each participant had to
perform a semantic categorisation task. They read on screen instructions and
responded to eight blocks of word stimuli by pressing relevant keys on the
keyboard. The ‘a’ key was pressed when the word represented an object, which
was smaller than the computer screen used, and the ‘l’ key was pressed when it
was larger. There were 60 auditory stimuli and 60 visual stimuli. Half of all
the stimuli represented objects that were larger, and the other half
represented smaller ones. The stimuli were repeated between the eight blocks
giving 360 experimental trials in total. Half of the participants received
positive feedback: a tick for each correct answer, and half of them received
negative feedback: a cross for each incorrect response. Their reaction times and
errors were recorded. In two blocks they saw only visual stimuli; in another
two blocks they heard only auditory stimuli, in the remaining four blocks they
received a mixture of auditory and visual stimuli. The feedback was given 300
msec after each response. Participants were debriefed in accordance with BPS
guidelines on the aims of the study during the next week’s seminar.
A descriptive analysis of
the participant’s group statistics was conducted. 78 participants took part and
produced interval level data in the form of overall totals. The tests will be
conducted using DV1 reaction time, DV2 errors and the IV
feedback on two levels, positive and negative. The results are shown in figures
1 and 1a below:
|
Feedback |
N |
Mean |
Standard. Deviation |
Standard. Error Mean |
|
Negative |
35 |
7080.9199 |
3459.3275 |
584.7331 |
| Positive |
43 |
6284.7531 |
1286.3117 |
196.1607 |
It can be seen from figure 1 above that there is a difference in means between those participants who were subjected to negative feedback, and those who were subjected to positive feedback; implying that participants receiving negative feedback reacted quicker. The standard deviations are also different with 3459.3 for negative and 1286.3 for positive showing that the negative group had more variation than the positive group.
|
Feedback |
N |
Mean |
Standard. Deviation |
Standard. Error Mean |
| Negative |
35 |
32.0286 |
15.6759 |
2.6497 |
|
Positive |
43 |
25.4651 |
11.7905 |
1.7980 |
It can be seen from the figure 1a above that there is a difference in means between those participants who were subjected to negative feedback, and those who were subjected to positive feedback; again implying that participants receiving negative feedback created more errors. The standard deviations are also different with 15.6 for negative and 11.7 for positive showing that the negative group had slightly more variation than the positive group.
For
further descriptive analysis, bar charts were created. Results are shown in
figures 2 and 2a below:
Figure 2: Bar chart showing differences in reaction time for the two
levels of the IV
Figure 2a: Bar chart showing differences in errors made for the two
levels of the IV
The charts in figures 2 and 2a above clearly show the scale of differences in mean reaction time scores and errors made between participants who were subjected to negative and positive feedback. However it should be noted that the charts do not start at zero and could give misleading representations of the results due to the dramatic size differences.
As the descriptive
statistics are inconclusive, further analysis was needed. The data is
interval level so parametric tests were used, in the form of t-tests (see appendix 1).
The first test conducted
was a Levene’s test. This is used to test for equality of variance. The
results for DV1 and DV2 are shown in figures 3 and 3a
below:
REACTION TIME
|
Levene's Test for Equality
of Variances |
|
| F |
Sig. |
|
| Equal variances assumed |
4.140 |
.045 |
| Equal variances not
assumed |
|
|
It can be seen from
figure 3 above that the significance is <0.05 (.045). This shows that the
test does not have equality of variance despite being very close, and the
results for the final t-test shown in figure 4 should be read from the
bottom line where equal variances are not assumed.
ERRORS MADE
|
Levene's
Test for Equality of Variances |
|
| F |
Sig. |
|
| Equal variances assumed |
1.787 |
.185 |
| Equal
variances not assumed |
|
|
It can be seen from
figure 3a that the significance is >0.05 (.185). This shows that the
test does have equality of variance, and the results for the final t-test in
figure 4a should be read from the top line where equal variances are assumed.
Results of the final t-tests for DV1 and DV2 are
shown in figures 4 and 4a below:
REACTION TIME
|
t-test for Equality of Means |
||
| t |
df |
Sig. (2-tailed) |
|
| Equal variances assumed |
1.397 |
76 |
.167 |
| |
1.291 |
41.656 |
.204 |
By reading the bottom line in figure 4 above, where equal variances are not assumed, it can be seen that the significance level is >0.05 (.204) for a two tailed test. Therefore the standard form results are: t = 1.291; df = 41.656; p >0.05 (0.204).
ERRORS MADE
|
t-test for Equality of Means |
||
| t |
df |
Sig. (2-tailed) |
|
| Equal variances assumed |
2.110 |
76 |
.038 |
| |
2.050 |
61.897 |
.045 |
By reading the top line in figure 4a above, where equal variances are assumed, it can be seen that the significance level is <0.05 (.038) for a two tailed test. Therefore the standard form results are: t = 2.110; df = 76; p <0.05 (0.038).
Discussion
The aim of this study was to investigate the differential effects of positive and negative feedback. The results showed that there was no significance for effects of feedback on DV1 reaction time in relation to the experimental hypothesis, so H1 was rejected and H0 was accepted. For DV2 errors made, the results showed that there was significance in the results for feedback and errors made. Therefore in relation to the experimental hypothesis for DV2, H1 was accepted and H0 was rejected. The results for both tests were based on a two-tailed test.
These findings are true in relation to the literature that was used as the basis for this study. The studies shown focused on feedback and performance, with the majority, including studies by Matsui et al (1983) and Scriven & Glynn (1983), getting significant results. In the case of Vitulli (1982) there was found to be a strong link between feedback and correct guesses at a computer aided task. This is much in keeping with this particular study and specifically the hypothesis for DV2.
Despite the significance of this result for errors made, it is worth mentioning that errors could have occurred for other reasons over and above positive and negative feedback. For instance, the auditory voice was in a thick northern dialect, and some words could have been difficult to decipher resulting in errors. There is also an anticipatory issue; each participant would have been positioned in such a way that their fingers would have been hovering over the specific keys needed, in this case ‘a’ and ‘l’. Almost by default, if the participant had frequently pressed down the ‘a’ key, anticipation could have taken over and they could have involuntarily depressed the ‘l’ by mistake. Though these limitations are taken into consideration, the result was still significant for DV2 in such that feedback does indeed affect performance.
The fact that this study has continued in the vein of previous studies as seen in the introduction, can only hold welcome implications for the school of learning, and indeed future studies in the area. Along the same lines as Philibin & Seidenstadt’s study in 1983 which discovered that even 50% of feedback can enhance performance as much as 100% feedback can, further studies could investigate exactly how much feedback, either positive or negative can actually effect performance, and to what extent. If these techniques are regularly applied in both teaching, as shown in the Cohen & Herr study (1982), and learning as seen in the study by Berkowitz et al (1983) then the outlook for teaching professionals and students alike can be seen to be exciting at the very least.
References
Anderson. J. R (2000) Learning and Memory: An Integrated Approach. 2nd Edition. USA: Wiley.
Atkinson. R. L; Atkinson. R. C; Smith. E. E; Benn. D. J & Nolen-Hoeksema. S (2000) Hilgard’s Introduction to Psychology. 13th Edition. USA: Harcourt Brace. p245.
Berkowitz. D; Lewis. R & Drury. C. G (1983) The Effect of Computer-Displayed Knowledge of Results on a Two-Variable Optimisation Task. Ergonomics. Vol. 26 (10) p 975-979 (Abstract).
Cardwell. M; Clark. L & Meldrum. C (1996) Psychology for A Level. London: Harper Collins. p 521.
Cohen. P. A & Herr. G (1982) Using an Interactive Feedback Procedure to Improve College Teaching. Teaching of Psychology. Vol. 9 (3) p 138-140. (Abstract).
Matsui. T Okada. A & Inoshita. O (1983) Mechanism of Feedback Affecting Task Performance. Organisational Behaviour & Human Decision Processes. Vol. 31 (1) p 114-122. (Abstract).
Philibin. T & Seidenstadt. R. M (1983) Feedback and Time Perception. Perceptual & Motor Skills. Vol. 57 (1) p 308. (Abstract).
Pavett. C. M (1983) Evaluation of the Impact of Feedback on Performance and Motivation. Human Relations. Vol. 36 (7) p 641-654. (Abstract).
Scriven. J & Glynn. T (1983) Performance Feedback on Written Tasks for Low Achieving Secondary students. New Zealand Journal of Educational Studies. Vol. 18 (2) p 134-145. (Abstract).
Vitulli. W. F (1982) Effects of Immediate Feedback on Computer Assisted Testing of ESP Performance. Psychological Reports. Vol. 51 (2) p 403-408. (Abstract).
Vitulli. W. F (1983) Immediate Feedback and Target Symbol Variation in Computer Assisted Psi Tests. Journal of Parapsychology. Vol. 47 (1) p 37-47. (Abstract).