This is something that many people new to positive reinforcement (R+) training struggle with. Why does ‘fixing’ behaviour problems (or any training problem) seem so much harder using positive techniques than using punishment or force? I mean, you’re rewarding the hell out of the behaviours you want so why are things still turning to custard? Because using positive reinforcement to change established behaviour requires that we become PRO-active rather than RE-active trainers – and that is the challenging bit.
Take a look at these two video sequence stills:
The arrows indicate the time of the training intervention. The first clip shows when a trainer would apply a reactive punishment to reduce the jumping behaviour; during or after the jumping.
In contrast, a reinforcement-based trainer would be proactive; interrupt and cue an alternative BEFORE the feet come off the ground.
To change a behaviour, you have to prevent the dog from gaining reinforcement for doing the unwanted action. For example, with jumping, for many dogs, just the fact that they got their feet off the ground and on you is enough! Add to that the attention it gets and how much fun it is to do and you can see why it persists even in the face of ‘firm’ corrections.
The reinforcement trainer interrupts the unwanted behaviour so it doesn’t get any chance to be strengthened and then either asks for, or engineers, a suitable alternative that can take its place and gives the dog the same reinforcement. In the case of jumping, it might be a handful of food stuffed in the dog’s face to interrupt the jump and lure/cue a sit, which is then followed by calm stroking, praise and lovin’.
The Recipe for using R+ to Change Established Behaviours
To use reinforcement training to ‘fix’ problems the trainer MUST be able to prevent the unwanted behaviour so a new, more acceptable alternative can be ‘inserted’ into the sequence. So, when you are trying to ‘fix’ a problem, ask yourself the following:
- Where do I need to ‘edit’ my behaviour sequence?
- How can I best do that edit? Management? Interrupt early? Both?
- What can I replace the unwanted sequence with?
Once you have decided your edit point and created (taught) your replacement behaviour, you can then insert it into the problem ‘video’.
However, a pro-active approach is not as simple as punishment-based solutions. It takes thought, planning and some effort to be able to predict and interrupt an unwanted behaviour. So much easier to just react after the fact with yelling or corrections. The problem is, unless the punishment was strong enough to override any reinforcement the dog received from the behaviour, it’s just not going to work! And if it is strong enough to get the job done, there’s likely to be fallout to deal with later.
Like so many things in life, training with a proactive approach is almost always more successful in the long run. If you want something to change, DO something active to change it, don’t just continue reacting after the event and expect a miracle. Edit that video and choose what you are going to insert instead!
– Sarah & The Gang
Like what you’ve just read? Don’t forget to share this post so others can enjoy it too 🙂