Hello! I’m working on a version that fixes this. The problem arises from the way the expected position the audio should be at is calculated. Until I finish polishing this future version, all I can suggest is standardizing the frame durations. Sorry for the inconvenience!