I have discovered that VoiceOver recognises some gestures, and carries out the intended action, even when they are not performed exactly as prescribed. For example, if you are listening to music and wish to pause it, you do not need a clean two-finger double tap: a one-finger tap followed by a two-finger tap, or vice versa, is sufficient. The same is true of other gestures as well.