Operant conditioning
Encyclopedia
Operant conditioning is a form of psychological
Psychology
Psychology is the study of the mind and behavior. Its immediate goal is to understand individuals and groups by both establishing general principles and researching specific cases. For many, the ultimate goal of psychology is to benefit society...

 learning
Learning
Learning is acquiring new or modifying existing knowledge, behaviors, skills, values, or preferences and may involve synthesizing different types of information. The ability to learn is possessed by humans, animals and some machines. Progress over time tends to follow learning curves.Human learning...

 during which an individual modifies the occurrence and form of its own behavior
Behavior
Behavior or behaviour refers to the actions and mannerisms made by organisms, systems, or artificial entities in conjunction with its environment, which includes the other systems or organisms around as well as the physical environment...

 due to the association
Association (psychology)
In psychology and marketing, two concepts or stimuli are associated when the experience of one leads to the effects of another, due to repeated pairing. This is sometimes called Pavlovian association for Ivan Pavlov's pioneering of classical conditioning....

 of the behavior with a stimulus
Stimulus (psychology)
In psychology, stimuli are energy patterns which are registered by the senses. In behaviorism and related stimulus–response theories, stimuli constitute the basis for behavior, whereas in perceptual psychology they constitute the basis for perception.In the second half of the 19th century, the...

. Operant conditioning is distinguished from classical conditioning
Classical conditioning
Classical conditioning is a form of conditioning that was first demonstrated by Ivan Pavlov...

 (also called respondent conditioning) in that operant conditioning deals with the modification of "voluntary behavior"
Behavior modification
Behavior modification is the use of empirically demonstrated behavior change techniques to increase or decrease the frequency of behaviors, such as altering an individual's behaviors and reactions to stimuli through positive and negative reinforcement of adaptive behavior and/or the reduction of...

 or operant behavior. Operant behavior "operates" on the environment and is maintained by its consequences, while classical conditioning deals with the conditioning of reflexive (reflex
Reflex
A reflex action, also known as a reflex, is an involuntary and nearly instantaneous movement in response to a stimulus. A true reflex is a behavior which is mediated via the reflex arc; this does not apply to casual uses of the term 'reflex'.-See also:...

) behaviors which are elicited by antecedent
Antecedent
An antecedent is a preceding event, condition, cause, phrase, or word. It may refer to:* Antecedent moisture, a hydrologic term describing the relative wetness condition of a sewershed.* Antecedent , the first half of a hypothetical proposition....

 conditions. Behaviors conditioned via a classical conditioning procedure are not maintained by consequences.

Reinforcement, punishment, and extinction

Reinforcement
Reinforcement
Reinforcement is a term in operant conditioning and behavior analysis for the process of increasing the rate or probability of a behavior in the form of a "response" by the delivery or emergence of a stimulus Reinforcement is a term in operant conditioning and behavior analysis for the process of...

 and punishment
Punishment
Punishment is the authoritative imposition of something negative or unpleasant on a person or animal in response to behavior deemed wrong by an individual or group....

, the core tools of operant conditioning, are either positive (delivered following a response), or negative (withdrawn following a response). This creates a total of four basic consequences, with the addition of a fifth procedure known as extinction
Extinction (psychology)
Extinction is the conditioning phenomenon in which a previously learned response to a cue is reduced when the cue is presented in the absence of the previously paired aversive or appetitive stimulus.-Fear conditioning:...

 (i.e. no change in consequences following a response).

It is important to note that actors are not spoken of as being reinforced, punished, or extinguished; it is the actions that are reinforced, punished, or extinguished. Additionally, reinforcement, punishment, and extinction are not terms whose use is restricted to the laboratory. Naturally occurring consequences can also be said to reinforce, punish, or extinguish behavior and are not always delivered by people.
  • Reinforcement
    Reinforcement
    Reinforcement is a term in operant conditioning and behavior analysis for the process of increasing the rate or probability of a behavior in the form of a "response" by the delivery or emergence of a stimulus Reinforcement is a term in operant conditioning and behavior analysis for the process of...

    is a consequence that causes a behavior to occur with greater frequency.
  • Punishment
    Punishment (psychology)
    In operant conditioning, punishment is any change in a human or animal's surroundings that occurs after a given behavior or response which reduces the likelihood of that behavior occurring again in the future. As with reinforcement, it is the behavior, not the animal, that is punished...

    is a consequence that causes a behavior to occur with less frequency.
  • Extinction
    Extinction (psychology)
    Extinction is the conditioning phenomenon in which a previously learned response to a cue is reduced when the cue is presented in the absence of the previously paired aversive or appetitive stimulus.-Fear conditioning:...

    is the lack of any consequence following a behavior. When a behavior is inconsequential (i.e., producing neither favorable nor unfavorable consequences) it will occur with less frequency. When a previously reinforced behavior is no longer reinforced with either positive or negative reinforcement, it leads to a decline in that behavior.

Four contexts of operant conditioning

Here the terms positive and negative are not used in their popular sense
Popular culture
Popular culture is the totality of ideas, perspectives, attitudes, memes, images and other phenomena that are deemed preferred per an informal consensus within the mainstream of a given culture, especially Western culture of the early to mid 20th century and the emerging global mainstream of the...

, but rather: positive refers to addition, and negative refers to subtraction.

What is added or subtracted may be either reinforcement or punishment. Hence positive punishment is sometimes a confusing term, as it denotes the "addition" of a stimulus or increase in the intensity of a stimulus that is aversive (such as spanking or an electric shock). The four procedures are:
  1. Positive reinforcement (Reinforcement): occurs when a behavior (response) is followed by a stimulus that is appetitive or rewarding
    Reward system
    In neuroscience, the reward system is a collection of brain structures which attempts to regulate and control behavior by inducing pleasurable effects...

    , increasing the frequency of that behavior. In the Skinner box
    Skinner box
    An operant conditioning chamber is a laboratory apparatus used in the experimental analysis of behavior to study animal behavior. The operant conditioning chamber was created by B. F. Skinner while he was a graduate student at Harvard University...

     experiment, a stimulus such as food or sugar solution can be delivered when the rat engages in a target behavior, such as pressing a lever.
  2. Negative reinforcement (Escape): occurs when a behavior (response) is followed by the removal of an aversive stimulus, thereby increasing that behavior's frequency. In the Skinner box experiment, negative reinforcement can be a loud noise continuously sounding inside the rat's cage until it engages in the target behavior, such as pressing a lever, upon which the loud noise is removed.
  3. Positive punishment (Punishment) (also called "Punishment by contingent stimulation"): occurs when a behavior (response) is followed by a stimulus, such as introducing a shock or loud noise, resulting in a decrease in that behavior.
  4. Negative punishment (Penalty) (also called "Punishment by contingent withdrawal"): occurs when a behavior (response) is followed by the removal of a stimulus, such as taking away a child's toy following an undesired behavior, resulting in a decrease in that behavior.


Also:
  • Avoidance learning is a type of learning in which a certain behavior results in the cessation of an aversive stimulus. For example, performing the behavior of shielding one's eyes when in the sunlight (or going outdoors) will help avoid the aversive stimulation of having light in one's eyes.
  • Extinction occurs when a behavior (response) that had previously been reinforced is no longer effective. In the Skinner box experiment, this is the rat pushing the lever and being rewarded with a food pellet several times, and then pushing the lever again and never receiving a food pellet again. Eventually the rat would cease pushing the lever.
  • Noncontingent reinforcement refers to delivery of reinforcing stimuli regardless of the organism's (aberrant) behavior. The idea is that the target behavior decreases because it is no longer necessary to receive the reinforcement. This typically entails time-based delivery of stimuli identified as maintaining aberrant behavior, which serves to decrease the rate of the target behavior. As no measured behavior is identified as being strengthened, there is controversy surrounding the use of the term noncontingent "reinforcement".

Operant conditioning is a form of psychological
Psychology
Psychology is the study of the mind and behavior. Its immediate goal is to understand individuals and groups by both establishing general principles and researching specific cases. For many, the ultimate goal of psychology is to benefit society...

 learning
Learning
Learning is acquiring new or modifying existing knowledge, behaviors, skills, values, or preferences and may involve synthesizing different types of information. The ability to learn is possessed by humans, animals and some machines. Progress over time tends to follow learning curves.Human learning...

 during which an individual modifies the occurrence and form of its own behavior
Behavior
Behavior or behaviour refers to the actions and mannerisms made by organisms, systems, or artificial entities in conjunction with its environment, which includes the other systems or organisms around as well as the physical environment...

 due to the association
Association (psychology)
In psychology and marketing, two concepts or stimuli are associated when the experience of one leads to the effects of another, due to repeated pairing. This is sometimes called Pavlovian association for Ivan Pavlov's pioneering of classical conditioning....

 of the behavior with a stimulus
Stimulus (psychology)
In psychology, stimuli are energy patterns which are registered by the senses. In behaviorism and related stimulus–response theories, stimuli constitute the basis for behavior, whereas in perceptual psychology they constitute the basis for perception.In the second half of the 19th century, the...

. Operant conditioning is distinguished from classical conditioning
Classical conditioning
Classical conditioning is a form of conditioning that was first demonstrated by Ivan Pavlov...

 (also called respondent conditioning) in that operant conditioning deals with the modification of "voluntary behavior"
Behavior modification
Behavior modification is the use of empirically demonstrated behavior change techniques to increase or decrease the frequency of behaviors, such as altering an individual's behaviors and reactions to stimuli through positive and negative reinforcement of adaptive behavior and/or the reduction of...

 or operant behavior. Operant behavior "operates" on the environment and is maintained by its consequences, while classical conditioning deals with the conditioning of reflexive (reflex
Reflex
A reflex action, also known as a reflex, is an involuntary and nearly instantaneous movement in response to a stimulus. A true reflex is a behavior which is mediated via the reflex arc; this does not apply to casual uses of the term 'reflex'.-See also:...

) behaviors which are elicited by antecedent
Antecedent
An antecedent is a preceding event, condition, cause, phrase, or word. It may refer to:* Antecedent moisture, a hydrologic term describing the relative wetness condition of a sewershed.* Antecedent , the first half of a hypothetical proposition....

 conditions. Behaviors conditioned via a classical conditioning procedure are not maintained by consequences.Domjan, Michael, Ed., The Principles of Learning and Behavior, Fifth Edition, Belmont, CA: Thomson/Wadsworth, 2003

Reinforcement, punishment, and extinction

Reinforcement
Reinforcement
Reinforcement is a term in operant conditioning and behavior analysis for the process of increasing the rate or probability of a behavior in the form of a "response" by the delivery or emergence of a stimulus Reinforcement is a term in operant conditioning and behavior analysis for the process of...

 and punishment
Punishment
Punishment is the authoritative imposition of something negative or unpleasant on a person or animal in response to behavior deemed wrong by an individual or group....

, the core tools of operant conditioning, are either positive (delivered following a response), or negative (withdrawn following a response). This creates a total of four basic consequences, with the addition of a fifth procedure known as extinction
Extinction (psychology)
Extinction is the conditioning phenomenon in which a previously learned response to a cue is reduced when the cue is presented in the absence of the previously paired aversive or appetitive stimulus.-Fear conditioning:...

 (i.e. no change in consequences following a response).

It is important to note that actors are not spoken of as being reinforced, punished, or extinguished; it is the actions that are reinforced, punished, or extinguished. Additionally, reinforcement, punishment, and extinction are not terms whose use is restricted to the laboratory. Naturally occurring consequences can also be said to reinforce, punish, or extinguish behavior and are not always delivered by people.
  • Reinforcement
    Reinforcement
    Reinforcement is a term in operant conditioning and behavior analysis for the process of increasing the rate or probability of a behavior in the form of a "response" by the delivery or emergence of a stimulus Reinforcement is a term in operant conditioning and behavior analysis for the process of...

    is a consequence that causes a behavior to occur with greater frequency.
  • Punishment
    Punishment (psychology)
    In operant conditioning, punishment is any change in a human or animal's surroundings that occurs after a given behavior or response which reduces the likelihood of that behavior occurring again in the future. As with reinforcement, it is the behavior, not the animal, that is punished...

    is a consequence that causes a behavior to occur with less frequency.
  • Extinction
    Extinction (psychology)
    Extinction is the conditioning phenomenon in which a previously learned response to a cue is reduced when the cue is presented in the absence of the previously paired aversive or appetitive stimulus.-Fear conditioning:...

    is the lack of any consequence following a behavior. When a behavior is inconsequential (i.e., producing neither favorable nor unfavorable consequences) it will occur with less frequency. When a previously reinforced behavior is no longer reinforced with either positive or negative reinforcement, it leads to a decline in that behavior.

Four contexts of operant conditioning

Here the terms positive and negative are not used in their popular sense
Popular culture
Popular culture is the totality of ideas, perspectives, attitudes, memes, images and other phenomena that are deemed preferred per an informal consensus within the mainstream of a given culture, especially Western culture of the early to mid 20th century and the emerging global mainstream of the...

, but rather: positive refers to addition, and negative refers to subtraction.

What is added or subtracted may be either reinforcement or punishment. Hence positive punishment is sometimes a confusing term, as it denotes the "addition" of a stimulus or increase in the intensity of a stimulus that is aversive (such as spanking or an electric shock). The four procedures are:
  1. Positive reinforcement (Reinforcement): occurs when a behavior (response) is followed by a stimulus that is appetitive or rewarding
    Reward system
    In neuroscience, the reward system is a collection of brain structures which attempts to regulate and control behavior by inducing pleasurable effects...

    , increasing the frequency of that behavior. In the Skinner box
    Skinner box
    An operant conditioning chamber is a laboratory apparatus used in the experimental analysis of behavior to study animal behavior. The operant conditioning chamber was created by B. F. Skinner while he was a graduate student at Harvard University...

     experiment, a stimulus such as food or sugar solution can be delivered when the rat engages in a target behavior, such as pressing a lever.
  2. Negative reinforcement (Escape): occurs when a behavior (response) is followed by the removal of an aversive stimulus, thereby increasing that behavior's frequency. In the Skinner box experiment, negative reinforcement can be a loud noise continuously sounding inside the rat's cage until it engages in the target behavior, such as pressing a lever, upon which the loud noise is removed.
  3. Positive punishment (Punishment) (also called "Punishment by contingent stimulation"): occurs when a behavior (response) is followed by a stimulus, such as introducing a shock or loud noise, resulting in a decrease in that behavior.
  4. Negative punishment (Penalty) (also called "Punishment by contingent withdrawal"): occurs when a behavior (response) is followed by the removal of a stimulus, such as taking away a child's toy following an undesired behavior, resulting in a decrease in that behavior.


Also:
  • Avoidance learning is a type of learning in which a certain behavior results in the cessation of an aversive stimulus. For example, performing the behavior of shielding one's eyes when in the sunlight (or going outdoors) will help avoid the aversive stimulation of having light in one's eyes.
  • Extinction occurs when a behavior (response) that had previously been reinforced is no longer effective. In the Skinner box experiment, this is the rat pushing the lever and being rewarded with a food pellet several times, and then pushing the lever again and never receiving a food pellet again. Eventually the rat would cease pushing the lever.
  • Noncontingent reinforcement refers to delivery of reinforcing stimuli regardless of the organism's (aberrant) behavior. The idea is that the target behavior decreases because it is no longer necessary to receive the reinforcement. This typically entails time-based delivery of stimuli identified as maintaining aberrant behavior, which serves to decrease the rate of the target behavior.Tucker, M., Sigafoos, J., & Bushell, H. (1998). Use of noncontingent reinforcement in the treatment of challenging behavior. Behavior Modification, 22, 529–547. As no measured behavior is identified as being strengthened, there is controversy surrounding the use of the term noncontingent "reinforcement".

Operant conditioning is a form of psychological
Psychology
Psychology is the study of the mind and behavior. Its immediate goal is to understand individuals and groups by both establishing general principles and researching specific cases. For many, the ultimate goal of psychology is to benefit society...

 learning
Learning
Learning is acquiring new or modifying existing knowledge, behaviors, skills, values, or preferences and may involve synthesizing different types of information. The ability to learn is possessed by humans, animals and some machines. Progress over time tends to follow learning curves.Human learning...

 during which an individual modifies the occurrence and form of its own behavior
Behavior
Behavior or behaviour refers to the actions and mannerisms made by organisms, systems, or artificial entities in conjunction with its environment, which includes the other systems or organisms around as well as the physical environment...

 due to the association
Association (psychology)
In psychology and marketing, two concepts or stimuli are associated when the experience of one leads to the effects of another, due to repeated pairing. This is sometimes called Pavlovian association for Ivan Pavlov's pioneering of classical conditioning....

 of the behavior with a stimulus
Stimulus (psychology)
In psychology, stimuli are energy patterns which are registered by the senses. In behaviorism and related stimulus–response theories, stimuli constitute the basis for behavior, whereas in perceptual psychology they constitute the basis for perception.In the second half of the 19th century, the...

. Operant conditioning is distinguished from classical conditioning
Classical conditioning
Classical conditioning is a form of conditioning that was first demonstrated by Ivan Pavlov...

 (also called respondent conditioning) in that operant conditioning deals with the modification of "voluntary behavior"
Behavior modification
Behavior modification is the use of empirically demonstrated behavior change techniques to increase or decrease the frequency of behaviors, such as altering an individual's behaviors and reactions to stimuli through positive and negative reinforcement of adaptive behavior and/or the reduction of...

 or operant behavior. Operant behavior "operates" on the environment and is maintained by its consequences, while classical conditioning deals with the conditioning of reflexive (reflex
Reflex
A reflex action, also known as a reflex, is an involuntary and nearly instantaneous movement in response to a stimulus. A true reflex is a behavior which is mediated via the reflex arc; this does not apply to casual uses of the term 'reflex'.-See also:...

) behaviors which are elicited by antecedent
Antecedent
An antecedent is a preceding event, condition, cause, phrase, or word. It may refer to:* Antecedent moisture, a hydrologic term describing the relative wetness condition of a sewershed.* Antecedent , the first half of a hypothetical proposition....

 conditions. Behaviors conditioned via a classical conditioning procedure are not maintained by consequences.Domjan, Michael, Ed., The Principles of Learning and Behavior, Fifth Edition, Belmont, CA: Thomson/Wadsworth, 2003

Reinforcement, punishment, and extinction

Reinforcement
Reinforcement
Reinforcement is a term in operant conditioning and behavior analysis for the process of increasing the rate or probability of a behavior in the form of a "response" by the delivery or emergence of a stimulus Reinforcement is a term in operant conditioning and behavior analysis for the process of...

 and punishment
Punishment
Punishment is the authoritative imposition of something negative or unpleasant on a person or animal in response to behavior deemed wrong by an individual or group....

, the core tools of operant conditioning, are either positive (delivered following a response), or negative (withdrawn following a response). This creates a total of four basic consequences, with the addition of a fifth procedure known as extinction
Extinction (psychology)
Extinction is the conditioning phenomenon in which a previously learned response to a cue is reduced when the cue is presented in the absence of the previously paired aversive or appetitive stimulus.-Fear conditioning:...

 (i.e. no change in consequences following a response).

It is important to note that actors are not spoken of as being reinforced, punished, or extinguished; it is the actions that are reinforced, punished, or extinguished. Additionally, reinforcement, punishment, and extinction are not terms whose use is restricted to the laboratory. Naturally occurring consequences can also be said to reinforce, punish, or extinguish behavior and are not always delivered by people.
  • Reinforcement
    Reinforcement
    Reinforcement is a term in operant conditioning and behavior analysis for the process of increasing the rate or probability of a behavior in the form of a "response" by the delivery or emergence of a stimulus Reinforcement is a term in operant conditioning and behavior analysis for the process of...

    is a consequence that causes a behavior to occur with greater frequency.
  • Punishment
    Punishment (psychology)
    In operant conditioning, punishment is any change in a human or animal's surroundings that occurs after a given behavior or response which reduces the likelihood of that behavior occurring again in the future. As with reinforcement, it is the behavior, not the animal, that is punished...

    is a consequence that causes a behavior to occur with less frequency.
  • Extinction
    Extinction (psychology)
    Extinction is the conditioning phenomenon in which a previously learned response to a cue is reduced when the cue is presented in the absence of the previously paired aversive or appetitive stimulus.-Fear conditioning:...

    is the lack of any consequence following a behavior. When a behavior is inconsequential (i.e., producing neither favorable nor unfavorable consequences) it will occur with less frequency. When a previously reinforced behavior is no longer reinforced with either positive or negative reinforcement, it leads to a decline in that behavior.

Four contexts of operant conditioning

Here the terms positive and negative are not used in their popular sense
Popular culture
Popular culture is the totality of ideas, perspectives, attitudes, memes, images and other phenomena that are deemed preferred per an informal consensus within the mainstream of a given culture, especially Western culture of the early to mid 20th century and the emerging global mainstream of the...

, but rather: positive refers to addition, and negative refers to subtraction.

What is added or subtracted may be either reinforcement or punishment. Hence positive punishment is sometimes a confusing term, as it denotes the "addition" of a stimulus or increase in the intensity of a stimulus that is aversive (such as spanking or an electric shock). The four procedures are:
  1. Positive reinforcement (Reinforcement): occurs when a behavior (response) is followed by a stimulus that is appetitive or rewarding
    Reward system
    In neuroscience, the reward system is a collection of brain structures which attempts to regulate and control behavior by inducing pleasurable effects...

    , increasing the frequency of that behavior. In the Skinner box
    Skinner box
    An operant conditioning chamber is a laboratory apparatus used in the experimental analysis of behavior to study animal behavior. The operant conditioning chamber was created by B. F. Skinner while he was a graduate student at Harvard University...

     experiment, a stimulus such as food or sugar solution can be delivered when the rat engages in a target behavior, such as pressing a lever.
  2. Negative reinforcement (Escape): occurs when a behavior (response) is followed by the removal of an aversive stimulus, thereby increasing that behavior's frequency. In the Skinner box experiment, negative reinforcement can be a loud noise continuously sounding inside the rat's cage until it engages in the target behavior, such as pressing a lever, upon which the loud noise is removed.
  3. Positive punishment (Punishment) (also called "Punishment by contingent stimulation"): occurs when a behavior (response) is followed by a stimulus, such as introducing a shock or loud noise, resulting in a decrease in that behavior.
  4. Negative punishment (Penalty) (also called "Punishment by contingent withdrawal"): occurs when a behavior (response) is followed by the removal of a stimulus, such as taking away a child's toy following an undesired behavior, resulting in a decrease in that behavior.


Also:
  • Avoidance learning is a type of learning in which a certain behavior results in the cessation of an aversive stimulus. For example, performing the behavior of shielding one's eyes when in the sunlight (or going outdoors) will help avoid the aversive stimulation of having light in one's eyes.
  • Extinction occurs when a behavior (response) that had previously been reinforced is no longer effective. In the Skinner box experiment, this is the rat pushing the lever and being rewarded with a food pellet several times, and then pushing the lever again and never receiving a food pellet again. Eventually the rat would cease pushing the lever.
  • Noncontingent reinforcement refers to delivery of reinforcing stimuli regardless of the organism's (aberrant) behavior. The idea is that the target behavior decreases because it is no longer necessary to receive the reinforcement. This typically entails time-based delivery of stimuli identified as maintaining aberrant behavior, which serves to decrease the rate of the target behavior.Tucker, M., Sigafoos, J., & Bushell, H. (1998). Use of noncontingent reinforcement in the treatment of challenging behavior. Behavior Modification, 22, 529–547. As no measured behavior is identified as being strengthened, there is controversy surrounding the use of the term noncontingent "reinforcement".Poling, A., & Normand, M. (1999). Noncontingent rein
  • Shaping is a form of operant conditioning in which the increasingly accurate approximations of a desired response are reinforced.
  • Chaining
    Chaining
    Chaining is an instructional procedure used in behavioral psychology, experimental analysis of behavior and applied behavior analysis. It involves reinforcing individual responses occurring in a sequence to form a complex behavior. It is frequently used for training behavioral sequences that are...

    is an instructional procedure which involves reinforcing individual responses occurring in a sequence to form a complex behavior.
  • Response cost is a form of punishment in which occurs the reducing in the occurrence of a response that is frequently followed by the annihilation of an appetitive stimulus. Carlson, Heth, Neil R. , C. Donald (2007). Psychology the Science of Behavious. New Jersey, USA: Pearson. pp. 700. ISBN 978-0-205-64524-4

Thorndike's law of effect

Operant conditioning, sometimes called instrumental conditioning or instrumental learning, was first extensively studied by Edward L. Thorndike (1874–1949), who observed the behavior of cats trying to escape from home-made puzzle boxes.Thorndike, E.L. (1901). Animal intelligence: An experimental study of the associative processes in animals. Psychological Review Monograph Supplement, 2, 1–109. When first constrained in the boxes, the cats took a long time to escape. With experience, ineffective responses occurred less frequently and successful responses occurred more frequently, enabling the cats to escape in less time over successive trials. In his law of effect
Law of effect
The law of effect basically states that “responses that produce a satisfying effect in a particular situation become more likely to occur again inthat situation, and responses that produce a discomforting effect become less likely to occur again in that...

, Thorndike theorized that successful responses, those producing satisfying consequences, were "stamped in" by the experience and thus occurred more frequently. Unsuccessful responses, those producing annoying consequences, were stamped out and subsequently occurred less frequently. In short, some consequences strengthened behavior and some consequences weakened behavior. Thorndike produced the first known learning curves through this procedure.



B.F. Skinner (1904–1990) formulated a more detailed analysis of operant conditioning based on reinforcement, punishment, and extinction. Following the ideas of Ernst Mach
Ernst Mach
Ernst Mach was an Austrian physicist and philosopher, noted for his contributions to physics such as the Mach number and the study of shock waves...

, Skinner rejected Thorndike's mediating structures required by "satisfaction" and constructed a new conceptualization of behavior without any such references. So, while experimenting with some homemade feeding mechanisms, Skinner invented the operant conditioning chamber which allowed him to measure rate of response as a key dependent variable using a cumulative record of lever presses or key pecks.Mecca Chiesa (2004) Radical Behaviorism: the philosophy and the science

Biological correlates of operant conditioning

The first scientific studies identifying neuron
Neuron
A neuron is an electrically excitable cell that processes and transmits information by electrical and chemical signaling. Chemical signaling occurs via synapses, specialized connections with other cells. Neurons connect to each other to form networks. Neurons are the core components of the nervous...

s that responded in ways that suggested they encode for conditioned stimuli came from work by Mahlon deLong"Activity of pallidal neurons during movement", M.R. DeLong, J. Neurophysiol., 34:414–27, 1971 and by R.T. "Rusty" Richardson. They showed that nucleus basalis neurons, which release acetylcholine broadly throughout the cerebral cortex
Cerebral cortex
The cerebral cortex is a sheet of neural tissue that is outermost to the cerebrum of the mammalian brain. It plays a key role in memory, attention, perceptual awareness, thought, language, and consciousness. It is constituted of up to six horizontal layers, each of which has a different...

, are activated shortly after a conditioned stimulus, or after a primary reward if no conditioned stimulus exists. These neurons are equally active for positive and negative reinforcers, and have been demonstrated to cause plasticity
Neuroplasticity
Neuroplasticity is a non-specific neuroscience term referring to the ability of the brain and nervous system in all species to change structurally and functionally as a result of input from the environment. Plasticity occurs on a variety of levels, ranging from cellular changes involved in...

 in many cortical
Cerebral cortex
The cerebral cortex is a sheet of neural tissue that is outermost to the cerebrum of the mammalian brain. It plays a key role in memory, attention, perceptual awareness, thought, language, and consciousness. It is constituted of up to six horizontal layers, each of which has a different...

 regions.PNAS 93:11219-24 1996, Science 279:1714–8 1998 Evidence also exists that dopamine is activated at similar times. There is considerable evidence that dopamine participates in both reinforcement and aversive learning.Neuron 63:244–253, 2009, Frontiers in Behavioral Neuroscience, 3: Article 13, 2009 Dopamine pathways project much more densely onto frontal cortex regions. Cholinergic
Cholinergic
The word choline generally refers to the various quaternary ammonium salts containing the N,N,N-trimethylethanolammonium cation. Found in most animal tissues, choline is a primary component of the neurotransmitter acetylcholine and functions with inositol as a basic constituent of lecithin...

 projections, in contrast, are dense even in the posterior cortical regions like the primary visual cortex. A study of patients with Parkinson's disease
Parkinson's disease
Parkinson's disease is a degenerative disorder of the central nervous system...

, a condition attributed to the insufficient action of dopamine, further illustrates the role of dopamine in positive reinforcement.Michael J. Frank, Lauren C. Seeberger, and Randall C. O'Reilly (2004) "By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism," Science 4, November 2004 It showed that while off their medication, patients learned more readily with aversive consequences than with positive reinforcement. Patients who were on their medication showed the opposite to be the case, positive reinforcement proving to be the more effective form of learning when the action of dopamine is high.

Factors that alter the effectiveness of consequences

When using consequences to modify a response, the effectiveness of a consequence can be increased or decreased by various factors. These factors can apply to either reinforcing or punishing consequences.
  1. Satiation/Deprivation: The effectiveness of a consequence will be reduced if the individual's "appetite" for that source of stimulation has been satisfied. Inversely, the effectiveness of a consequence will increase as the individual becomes deprived of that stimulus. If someone is not hungry, food will not be an effective reinforcer for behavior. Satiation is generally only a potential problem with primary reinforcers, those that do not need to be learned such as food and water.
  2. Immediacy: After a response, how immediately a consequence is then felt determines the effectiveness of the consequence. More immediate feedback will be more effective than less immediate feedback. If someone's license plate is caught by a traffic camera for speeding and they receive a speeding ticket in the mail a week later, this consequence will not be very effective against speeding. But if someone is speeding and is caught in the act by an officer who pulls them over, then their speeding behavior is more likely to be affected.
  3. Contingency: If a consequence does not contingently (reliably, or consistently) follow the target response, its effectiveness upon the response is reduced. But if a consequence follows the response consistently after successive instances, its ability to modify the response is increased. The schedule of reinforcement, when consistent, leads to faster learning. When the schedule is variable the learning is slower. Extinction is more difficult when learning occurs during intermittent reinforcement and more easily extinguished when learning occurs during a highly consistent schedule.
  4. Size: This is a "cost-benefit" determinant of whether a consequence will be effective. If the size, or amount, of the consequence is large enough to be worth the effort, the consequence will be more effective upon the behavior. An unusually large lottery jackpot, for example, might be enough to get someone to buy a one-dollar lottery ticket (or even buying multiple tickets). But if a lottery jackpot is small, the same person might not feel it to be worth the effort of driving out and finding a place to buy a ticket. In this example, it's also useful to note that "effort" is a punishing consequence. How these opposing expected consequences (reinforcing and punishing) balance out will determine whether the behavior is performed or not.


Most of these factors exist for biological reasons. The biological purpose of the Principle of Satiation is to maintain the organism's homeostasis
Homeostasis
Homeostasis is the property of a system that regulates its internal environment and tends to maintain a stable, constant condition of properties like temperature or pH...

. When an organism has been deprived of sugar, for example, the effectiveness of the taste of sugar as a reinforcer is high. However, as the organism reaches or exceeds their optimum blood-sugar levels, the taste of sugar becomes less effective, perhaps even aversive.

The Principles of Immediacy and Contingency exist for neurochemical reasons. When an organism experiences a reinforcing stimulus, dopamine
Dopamine
Dopamine is a catecholamine neurotransmitter present in a wide variety of animals, including both vertebrates and invertebrates. In the brain, this substituted phenethylamine functions as a neurotransmitter, activating the five known types of dopamine receptors—D1, D2, D3, D4, and D5—and their...

 pathways in the brain are activated. This network of pathways "releases a short pulse of dopamine onto many dendrites, thus broadcasting a rather global reinforcement signal to postsynaptic neurons."Schultz, Wolfram (1998). Predictive Reward Signal of Dopamine Neurons. The Journal of Neurophysiology, 80(1), 1–27. This results in the plasticity of these synapses allowing recently activated synapses to increase their sensitivity to efferent signals, hence increasing the probability of occurrence for the recent responses preceding the reinforcement. These responses are, statistically, the most likely to have been the behavior responsible for successfully achieving reinforcement. But when the application of reinforcement is either less immediate or less contingent (less consistent), the ability of dopamine to act upon the appropriate synapses is reduced.

Operant variability

Operant variability is what allows a response to adapt to new situations. Operant behavior is distinguished from reflexes in that its response topography (the form of the response) is subject to slight variations from one performance to another. These slight variations can include small differences in the specific motions involved, differences in the amount of force applied, and small changes in the timing of the response. If a subject's history of reinforcement is consistent, such variations will remain stable because the same successful variations are more likely to be reinforced than less successful variations. However, behavioral variability can also be altered when subjected to certain controlling variables.Neuringer, A. (2002). Operant variability: Evidence, functions, and theory. Psychonometric Bulletin & Review, 9(4), 672–705.

Avoidance learning

Avoidance learning belongs to negative reinforcement schedules. The subject learns that a certain response will result in the termination or prevention of an aversive stimulus. There are two kinds of commonly used experimental settings: discriminated and free-operant avoidance learning.

Discriminated avoidance learning

In discriminated avoidance learning, a novel stimulus such as a light or a tone is followed by an aversive stimulus such as a shock (CS-US, similar to classical conditioning). During the first trials (called escape-trials) the animal usually experiences both the CS (Conditioned Stimulus) and the US (Unconditioned Stimulus), showing the operant response to terminate the aversive US. During later trials, the animal will learn to perform the response already during the presentation of the CS thus preventing the aversive US from occurring. Such trials are called "avoidance trials."

Free-operant avoidance learning

In this experimental session, no discrete stimulus is used to signal the occurrence of the aversive stimulus. Rather, the aversive stimulus (mostly shocks) are presented without explicit warning stimuli. There are two crucial time intervals determining the rate of avoidance learning. This first one is called the S-S-interval (shock-shock-interval). This is the amount of time which passes during successive presentations of the shock (unless the operant response is performed). The other one is called the R-S-interval (response-shock-interval) which specifies the length of the time interval following an operant response during which no shocks will be delivered. Note that each time the organism performs the operant response, the R-S-interval without shocks begins anew.

Two-process theory of avoidance

This theory was originally established to explain learning in discriminated avoidance learning. It assumes two processes to take place:
a) Classical conditioning of fear.
During the first trials of the training, the organism experiences both CS and aversive US (escape-trials). The theory assumed that during those trials classical conditioning takes place by pairing the CS with the US. Because of the aversive nature of the US the CS is supposed to elicit a conditioned emotional reaction (CER) – fear. In classical conditioning, presenting a CS conditioned with an aversive US disrupts the organism's ongoing behavior.

b) Reinforcement of the operant response by fear-reduction.
Because during the first process, the CS signaling the aversive US has itself become aversive by eliciting fear in the organism, reducing this unpleasant emotional reaction serves to motivate the operant response. The organism learns to make the response during the US, thus terminating the aversive internal reaction elicited by the CS. An important aspect of this theory is that the term "avoidance" does not really describe what the organism is doing. It does not "avoid" the aversive US in the sense of anticipating it. Rather the organism escapes an aversive internal state, caused by the CS.

Verbal Behavior

In 1957, Skinner
B. F. Skinner
Burrhus Frederic Skinner was an American behaviorist, author, inventor, baseball enthusiast, social philosopher and poet...

 published Verbal Behavior
Verbal Behavior (book)
Verbal Behavior is a 1957 book by psychologist B.F. Skinner, in which he analyzes human behavior, encompassing what is traditionally called language, linguistics, or speech...

, a theoretical extension of the work he had pioneered since 1938. This work extended the theory of operant conditioning to human behavior previously assigned to the areas of language, linguistics and other areas. Verbal Behavior is the logical extension of Skinner's ideas, in which he introduced new functional relationship categories such as intraverbals, autoclitics
Autoclitics (psychology)
Autoclitics are verbal responses that modify the effect on the listener of the primary operants that comprise B.F. Skinner's classification of Verbal Behavior.-Autoclitics:...

, mands, tacts and the controlling relationship of the audience. All of these relationships were based on operant conditioning and relied on no new mechanisms despite the introduction of new functional categories.

Four term contingency

Applied behavior analysis, which is the name of the discipline directly descended from Skinner's work, holds that behavior is explained in four terms: conditional stimulus (SC), a discriminative stimulus (Sd), a response (R), and a reinforcing stimulus (Srein or Sr for reinforcers, sometimes Save for aversive stimuli).Pierce & Cheney (2004) Behavior Analysis and Learning

Operant hoarding

Operant hoarding is a referring to the choice made by a rat, on a compound schedule called a multiple schedule, that maximizes its rate of reinforcement
Reinforcement
Reinforcement is a term in operant conditioning and behavior analysis for the process of increasing the rate or probability of a behavior in the form of a "response" by the delivery or emergence of a stimulus Reinforcement is a term in operant conditioning and behavior analysis for the process of...

 in an operant conditioning context. More specifically, rats were shown to have allowed food pellets to accumulate in a food tray by continuing to press a lever on a continuous reinforcement schedule instead of retrieving those pellets. Retrieval of the pellets always instituted a one-minute period of extinction
Extinction
In biology and ecology, extinction is the end of an organism or of a group of organisms , normally a species. The moment of extinction is generally considered to be the death of the last individual of the species, although the capacity to breed and recover may have been lost before this point...

 during which no additional food pellets were available but those that had been accumulated earlier could be consumed. This finding appears to contradict the usual finding that rats behave impulsively in situations in which there is a choice between a smaller food object right away and a larger food object after some delay. See schedules of reinforcement.Cole, M.R. (1990). Operant hoarding: A new paradigm for the study of self-control. Journal of the Experimental Analysis of Behavior, 53, 247–262.

An alternative to the law of effect

However, an alternative perspective has been proposed by R. Allen and Beatrix Gardner.Gardner, R.A., & Gardner, B.T. (1988). Feedforward vs feedbackward: An ethological alternative to the law of effect. Behavioral and Brain Sciences. 11:429–447.Gardner, R.A. & Gardner, B.T. (1998). The structure of learning from sign stimuli to sign language. Mahwah NJ: Lawrence Erlbaum Associates. Under this idea, which they called "feedforward," animals learn during operant conditioning by simple pairing of stimuli, rather than by the consequences of their actions. Skinner asserted that a rat or pigeon would only manipulate a lever if rewarded for the action, a process he called "shaping" (reward for approaching then manipulating a lever).Skinner, B.F. (1953). Science and human behavior. Oxford, England: Macmillan. However, in order to prove the necessity of reward (reinforcement) in lever pressing, a control condition where food is delivered without regard to behavior must also be conducted. Skinner never published this control group. Only much later was it found that rats and pigeons do indeed learn to manipulate a lever when food comes irrespective of behavior. This phenomenon is known as autoshaping. Brown, P., & Jenkins, H.M. (1968). Autoshaping of the pigeon's key-peck. J. Exp. Anal. Behav. 11:1–8. Autoshaping demonstrates that consequence of action is not necessary in an operant conditioning chamber, and it contradicts the law of effect. Further experimentation has shown that rats naturally handle small objects, such as a lever, when food is present.Timberlake, W. (1983). Rats' responses to a moving object related to food or water: A behavior-systems analysis. Animal Learning & Behavior. 11(3):309–320. Rats seem to insist on handling the lever when free food is available (contra-freeloading)Jensen, G.D. (1963). Preference for bar pressing over 'freeloading' as a function of number of rewarded presses. Journal of Experimental Psychology. 65:451–454.Neuringer, A.J. (1969). Animals respond for food in the presence of free food. Science. 166:399-401. and even when pressing the lever leads to less food (omission training).Williams, D.R. and Williams, H. (1969). Auto-maintenance in the pigeon: sustained pecking despite contingent non-reinforcement. J. Exper. Analys. of Behav. 12:511–520.Peden, B.F., Brown, M.P., & Hearst, E. (1977). Persistent approaches to a signal for food despite food omission for approaching. Journal of Experimental Psychology: Animal Behavior Processes. 3(4):377–399. Whenever food is presented, rats handle the lever, regardless if lever pressing leads to more food. Therefore, handling a lever is a natural behavior that rats do as preparatory feeding activity, and in turn, lever pressing cannot logically be used as evidence for reward or reinforcement to occur. In the absence of evidence for reinforcement during operant conditioning, learning which occurs during operant experiments is actually only Pavlovian (classical) conditioning. The dichotomy between Pavlovian and operant conditioning is therefore an inappropriate separation.

See also

  • Animal testing
    Animal testing
    Animal testing, also known as animal experimentation, animal research, and in vivo testing, is the use of non-human animals in experiments. Worldwide it is estimated that the number of vertebrate animals—from zebrafish to non-human primates—ranges from the tens of millions to more than 100 million...

  • Applied behavior analysis
    Applied Behavior Analysis
    Applied behavior analysis is a science that involves using modern behavioral learning theory to modify behaviors. Behavior analysts reject the use of hypothetical constructs and focus on the observable relationship of behavior to the environment...

    , the application of operant behaviorism
  • Behaviorism
    Behaviorism
    Behaviorism , also called the learning perspective , is a philosophy of psychology based on the proposition that all things that organisms do—including acting, thinking, and feeling—can and should be regarded as behaviors, and that psychological disorders are best treated by altering behavior...

    , the family of philosophies behind operant conditioning
  • Cognitivism (psychology)
    Cognitivism (psychology)
    In psychology, cognitivism is a theoretical framework for understanding the mind that came into usage in the 1950s. The movement was a response to behaviorism, which cognitivists said neglected to explain cognition...

    , a competing theory that invokes internal mechanisms without reference to behavior
  • Educational psychology
    Educational psychology
    Educational psychology is the study of how humans learn in educational settings, the effectiveness of educational interventions, the psychology of teaching, and the social psychology of schools as organizations. Educational psychology is concerned with how students learn and develop, often focusing...

  • Educational technology
    Educational technology
    Educational technology is the study and ethical practice of facilitating learning and improving performance by creating, using and managing appropriate technological processes and resources." The term educational technology is often associated with, and encompasses, instructional theory and...

  • Experimental analysis of behavior
    Experimental analysis of behavior
    The experimental analysis of behavior is the name given to the school of psychology founded by B.F. Skinner, and based on his philosophy of radical behaviorism. A central principle was the inductive, data-driven examination of functional relations, as opposed to the kinds of hypothetico-deductive...

  • Exposure therapy
    Exposure therapy
    Exposure therapy is a technique in behavior therapy intended to treat anxiety disorders and involves the exposure to the feared object or context without any danger in order to overcome their anxiety. Procedurally it is similar to the fear extinction paradigm in rodent work...

  • Habituation
    Habituation
    Habituation can be defined as a process or as a procedure. As a process it is defined as a decrease in an elicited behavior resulting from the repeated presentation of an eliciting stimulus...


  • Learned industriousness
    Learned industriousness
    Learned industriousness is a behaviorally rooted theory developed by Robert Eisenberger to explain the differences in general work effort among people of equivalent ability. According to Eisenberger, individuals who are reinforced for exerting high effort on a task are also secondarily reinforced...

  • Matching law
    Matching law
    In operant conditioning, the matching law is a quantitative relationship that holds between the relative rates of response and the relative rates of reinforcement in concurrent schedules of reinforcement...

  • Negative (positive) contrast effect
    Negative (positive) contrast effect
    -Negative contrast effect in operant conditioning:In the behavioral theory of operant conditioning, the negative contrast effect is evident when an attempt to reinforce a particular behavior through reward; when the rewards are finally withdrawn or reduced the subject is even less likely to...

  • Premack principle
  • Reinforcement learning
    Reinforcement learning
    Inspired by behaviorist psychology, reinforcement learning is an area of machine learning in computer science, concerned with how an agent ought to take actions in an environment so as to maximize some notion of cumulative reward...

  • Reward system
    Reward system
    In neuroscience, the reward system is a collection of brain structures which attempts to regulate and control behavior by inducing pleasurable effects...

  • Sensitization
    Sensitization
    Sensitization is an example of non-associative learning in which the progressive amplification of a response follows repeated administrations of a stimulus. An everyday example of this mechanism is the repeated tonic stimulation of peripheral nerves that will occur if a person rubs his arm...

  • Social conditioning
    Social conditioning
    Social conditioning refers to the sociological process of training individuals in a society to act or respond in a manner generally approved by the society in general and peer groups within society. The concept is stronger than that of socialization, which refers to the process of inheriting norms,...

  • Spontaneous recovery
    Spontaneous recovery
    Spontaneous recovery is a phenomenon first seen in Pavlovian conditioning and then later discovered in memory functioning. The general pattern of spontaneous recovery found in Pavlovian conditioning in animals essentially encompasses two varying habits learned by the animal where there is an...


  • Jerzy Konorski
    Jerzy Konorski
    Jerzy Konorski was a Polish neurophysiologist who further developed the work of Ivan Pavlov by discovering secondary conditioned reflexes and also operant conditioning...


External links

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK