Learning

Information about Learning

Published on November 23, 2007

Author: Junyo

Source: authorstream.com

Content

Learning:  Learning Learning:  Learning Learning refers to relatively permanent changes in behavior resulting from practice or experience Learning can be unlearned Observation can lead to learning Learning requires an operational memory system Classical Conditioning:  Classical Conditioning Classical condition is learning by association it is sometimes called “reflexive learning” it is sometimes called respondent conditioning The Russian physiologist, Ivan Pavlov, and his dogs circa 1905 discovered classical conditioning by serendipity received the Nobel Prize in science for discovery Pavlov’s Experiment:  Pavlov’s Experiment Analysis of Pavlov’s Study:  Analysis of Pavlov’s Study Classical Conditioning:  Classical Conditioning Association: the KEY element in classical conditioning Pavlov considered classical conditioning to be a form of learning through association, in time, of a neutral stimulus and a stimulus that incites a response. Any stimulus can be paired with another to make an association if it is done in the correct way (following the classical conditioning paradigm) Classical Conditioning:  Classical Conditioning Terminology of Classical Conditioning Unconditioned Stimulus (UCS): any stimulus that will always and naturally ELICIT a response Unconditioned Response (UCR): any response that always and naturally occurs at the presentation of the UCS Neutral Stimulus (NS): any stimulus that does not naturally elicit a response associated with the UCR Classical Conditioning:  Classical Conditioning Terminology of Classical Conditioning (continued) Conditioned Stimulus (CS): any stimulus that will, after association with an UCS, cause a conditioned response (CR) when present to a subject by itself Conditioned Response (CR): any response that occurs upon the presentation of the CS Classical Conditioning:  Classical Conditioning Certain stimuli can elicit a reflexive response Air puff produces an eye-blink Smelling a grilled steak can produce salivation The reflexive stimulus (UCS) and response (UCR) are unconditioned The neutral stimulus is referred to as the conditioned stimulus (CS) In classical conditioning, the CS is repeatedly paired with the reflexive stimulus (UCS) Conditioning is best when the CS precedes the UCS Eventually the CS will produce a response (CR) similar to that produced by the UCS Classical Conditioning:  Classical Conditioning The Classical Conditioning “paradigm” “paradigm” is a scientific word similar to using the word “recipe” in a kitchen, I.e., this is how you do it UCS--------------------->UCR NS------------->UCS--------------------->UCR CS------------------------------------------>CR That’s all there is to it. I’ll show you a fleshed-out example on the next slide Classical Conditioning:  Classical Conditioning Here’s a fleshed out example: UCS----------------->UCR (food powder) --------------> (salvating) NS--------------->UCS----------------->UCR (bell)--->(food powder) -------------> (salvating) CS---------------------------------------->CR (bell)------------------------------------> (salvating) Classical Conditioning:  Classical Conditioning Here’s another example: UCS------------------>UCR (onion juice) -----------------> (crying) NS --------------> UCS ----------------->UCR (whistle)-->(onion juice)---------------> (crying) CS ---------------------------------------->CR (whistle)----------------------------------> (crying) Importance of Classical Conditioning:  Importance of Classical Conditioning Classical conditioning is involved in many of our behaviors wherever stimuli are paired together over time we come to react to one of them as if the other were present a particular song is played and you immediately think of a particular romantic partner a particular cologne is smelled and you immediately think of a romantic partner Classical Conditioning:  Classical Conditioning Some pointers on effective conditioning NS and UCS pairings must not be more than about 1/2 second apart for best results Repeated NS/UCS pairings are called “training trials” Presentations of CS without UCS pairings are called “extinction trials” Intensity of UCS effects how many training trials are necessary for conditioning to occur Operant Conditioning: Skinner Box:  Operant Conditioning: Skinner Box Operant Conditioning:  Operant Conditioning Organisms make responses that have consequences The consequences serve to increase or decrease the likelihood of making that response again The response can be associated with cues in the environment We put coins in a machine to obtain food But we refrain when an Out of Order sign is placed on the machine Operant Conditioning:  Operant Conditioning Operant conditioning is simply learning from the consequences of your behavior the “other side” of the psychologist’s tool box, operant conditioning is a form of learning in which the consequences of behavior lead to changes in the probability of a behavior’s occurrence. Key Aspects of Operant Conditioning:  Key Aspects of Operant Conditioning In operant conditioning, the stimulus is a cue, it does not elicit the response Operant responses are voluntary In operant conditioning, the response elicits a reinforcing stimulus, whereas in classical conditioning, the UCS elicits the reflexive response Key Terms of Operant Conditioning:  Key Terms of Operant Conditioning Reinforcement is any procedure that increases the response Punishment is any procedure that decreases the response Types of reinforcers: Primary: e.g. food or water Secondary: money or power Operant Conditioning:  Operant Conditioning The Operant Conditioning paradigm: SD ------> Response -----> Consequence where “SD” is the “discriminative stimulus” where “Response” is the subject’s behavior where “Consequence” is what happens to the subject after EMITTING the response What consequences can follow a subject’s response? Operant Conditioning:  Operant Conditioning Consequences to behavior can be: nothing happens: extinction something happens the “something” can be pleasant the “something” can be aversive Consequences include positive and negative reinforcement, time out, and punishment. We’ll examine each of these now. Reinforcement/Punishment:  Reinforcement/Punishment Schedules of Reinforcement:  Schedules of Reinforcement Continuous: reinforcement occurs after every response Produces rapid acquisition and is subject to rapid extinction Partial: reinforcement occurs after some, but not all, responses Responding on a partial reinforcement schedule is more resistant to extinction Partial Reinforcement Schedules:  Partial Reinforcement Schedules Ratio: every nth response is reinforced Fixed: every nth response Variable: on average, every nth response Interval: first response after some interval results in reinforcement Fixed: interval is x in length (e.g. 1 min) Variable: the average interval is x Reinforcement Schedules:  Reinforcement Schedules Positive Reinforcement:  Positive Reinforcement What is a reinforcer? Definition: a reinforcer is any stimulus which, when delivered to a subject, increases the probability that a subject will emit a response. Primary reinforcers, e.g., food Secondary reinforcers, e.g., praise One can only know if a stimulus is a reinforcer based on the increased probability of occurrence of a subject’s behavior Slide29:  Money: a secondary reinforcer Positive Reinforcement:  Positive Reinforcement What is positive reinforcement? a procedure where a pleasant stimulus is delivered to a subject contingent upon the subject’s emitting a desired behavior Schedules of reinforcement reinforcement schedules may be used to decrease the probability that a response pattern in a subject will extinguish Positive Reinforcement:  Positive Reinforcement Schedules of reinforcement there are 4 types of reinforcement schedules fixed ratio schedule of reinforcement fixed interval schedule of reinforcement variable ratio schedule of reinforcement variable interval schedule of reinforcement each of these schedules will produce different response patterns in subjects; the variable ratio schedule best for most resistant to extinction Positive Reinforcement:  Positive Reinforcement Shaping behaviors the use of positive reinforcement in the differential reinforcement of successive approximations is called “shaping” shaping can be used to create a new response pattern in a subject shaping must be done carefully and one must rely on the differential reinforcement of successive approximations to the target behavior Negative Reinforcement:  Negative Reinforcement Negative reinforcement a procedure where an aversive stimulus is removed from a subject contingent upon the subject’s emitting a desired behavior the reinforcing consequence is the removal or avoidance of an aversive stimulus Escape conditioning: the behavior is reinforced because it stops an aversive stimulus Avoidance conditioning: behavior reinforced because aversive stimulus is prevented Negative Reinforcement:  Negative Reinforcement Examples of negative reinforcement in the real world include: taking out the trash to avoid your mother yelling at you taking an aspirin to get rid of a headache using a condom to avoid contracting a fatal disease paying your car insurance on time to prevent cancellation of your policy Punishment:  Punishment Punishment:  Punishment Punishment defined a procedure where an aversive stimulus is presented to a subject contingent upon the subject emitting an undesired behavior. punishment should be used as a last resort in behavior engineering; positive reinforcement should be used first examples include spanking, verbal abuse, electrical shock, etc. Punishment:  Punishment Dangers in use of punishment punishment is often reinforcing to a punisher (resulting in the making of an abuser) punishment often has a generalized inhibiting effect on the punished individual (they stop doing ANY behavior at all) we learn to dislike the punisher (a result of classical conditioning) Punishment:  Punishment Dangers in use of punishment what the punisher thinks is punishment may, in fact, be a reinforcer to the “punished” individual punishment does not teach more appropriate behavior; it merely stops a behavior from occurring punishment can cause emotional damage in the punished individual (antisocial behavior) Punishment:  Punishment Dangers in use of punishment punishment only stops the behavior from occurring in the presence of the punisher; when the punisher is not present then the behavior will often reappear and with a vengeance the best tool for engineering behavior is positive reinforcement Punishment:  Punishment Guidelines for the effective use of punishment use the least painful stimulus possible; if you spank your child, do it on the child’s bottom with an open hand never more than twice and NEVER so hard as to leave any marks on your child. That would be classified as child abuse. reinforce the appropriate behavior to take the place of the inappropriate behavior Punishment:  Punishment Guidelines make it clear to the individual which behavior you are punishing and remove all threat of punishment immediately as soon as the undesired behavior stops. do not give punishment mixed with rewards for a given behavior; be consistent! once you have begun to administer punishment do not back out but use punishment wisely Contrasting Classical and Operant Conditioning:  Contrasting Classical and Operant Conditioning Classical conditioning usually involves reflexive behavior (eliciting a response) whereas operant condition involves instrumental behavior (emitting a response) Classical conditioning elicits a response whereas operant conditioning manipulates the probability that a given response will be emitted by the subject. Extinction: the process of unlearning:  Extinction: the process of unlearning Extinction is the process of unlearning a learned response because of a change on the part of the environment (reinforcement or punishment or stimulus pairing contingencies) Removing the source of learning in CC, not pairing the NS with the UCS will result in extinction in OC, not providing consequences causes ext. Summary of Conditioning:  Summary of Conditioning

Related presentations


Other presentations created by Junyo

workshop evaporative
08. 11. 2007
0 views

workshop evaporative

Black Holes
29. 11. 2007
0 views

Black Holes

The Precious Panda in Dan
01. 12. 2007
0 views

The Precious Panda in Dan

CHAPTER 6
07. 12. 2007
0 views

CHAPTER 6

HellenisticWorld
29. 10. 2007
0 views

HellenisticWorld

introcicero
30. 10. 2007
0 views

introcicero

The Catholic Counter Reformation
30. 10. 2007
0 views

The Catholic Counter Reformation

Sadowski Thin Films Legnaro2006
31. 10. 2007
0 views

Sadowski Thin Films Legnaro2006

EhrlichCharlemagne1
01. 11. 2007
0 views

EhrlichCharlemagne1

New Septic Tank Sizing
06. 11. 2007
0 views

New Septic Tank Sizing

ETOMS
14. 11. 2007
0 views

ETOMS

GDC Video Game Graphics
15. 11. 2007
0 views

GDC Video Game Graphics

E97 Presentation Sewers
01. 11. 2007
0 views

E97 Presentation Sewers

groundwater 101
17. 12. 2007
0 views

groundwater 101

VL24 25januar2006PPadj
24. 12. 2007
0 views

VL24 25januar2006PPadj

SoCalWaterDialogue
28. 12. 2007
0 views

SoCalWaterDialogue

Child Study Movement
28. 12. 2007
0 views

Child Study Movement

Autotrader 1416
19. 11. 2007
0 views

Autotrader 1416

EXLP Sample 5
31. 12. 2007
0 views

EXLP Sample 5

W 31BodyParagraphs
02. 01. 2008
0 views

W 31BodyParagraphs

Biomass part 1 2
03. 01. 2008
0 views

Biomass part 1 2

ch8Jan99
04. 01. 2008
0 views

ch8Jan99

Actuation and Control System
04. 01. 2008
0 views

Actuation and Control System

bisnovaty kogan 1
13. 11. 2007
0 views

bisnovaty kogan 1

YDL 2007 final slides
10. 12. 2007
0 views

YDL 2007 final slides

Mugabe
07. 01. 2008
0 views

Mugabe

Sensors for Robots 3
07. 01. 2008
0 views

Sensors for Robots 3

20021010 1600C
27. 09. 2007
0 views

20021010 1600C

Chp5 part1
06. 11. 2007
0 views

Chp5 part1

media 26044 en
21. 11. 2007
0 views

media 26044 en

1 COMPOUND
16. 11. 2007
0 views

1 COMPOUND

20074 HCompanionAnimalUpda te
19. 11. 2007
0 views

20074 HCompanionAnimalUpda te

Lin Erda 2
04. 10. 2007
0 views

Lin Erda 2

High Yield 2005
24. 02. 2008
0 views

High Yield 2005

Chapter09
27. 02. 2008
0 views

Chapter09

Chp5 part2
06. 11. 2007
0 views

Chp5 part2

11 02 01Quality improvement
13. 04. 2008
0 views

11 02 01Quality improvement

meta models unification
29. 10. 2007
0 views

meta models unification

Qiaogj
14. 11. 2007
0 views

Qiaogj

Foders 060323
01. 10. 2007
0 views

Foders 060323

EncodinginLTM
17. 12. 2007
0 views

EncodinginLTM

Scheinin
25. 12. 2007
0 views

Scheinin

06 Burkard
28. 11. 2007
0 views

06 Burkard

TenNecessities
02. 01. 2008
0 views

TenNecessities

Probhodini KnowYourProjs
06. 12. 2007
0 views

Probhodini KnowYourProjs

Jiping Xie
06. 11. 2007
0 views

Jiping Xie

Purchasing Train
23. 11. 2007
0 views

Purchasing Train