Turned out there were no locks around the atomic commit so it was only atomic if it page flipping was synchronous. If another commit came in it would clobber the commit obj and/or abandon the completion event. Vendor solution in the unholy fork was to disable async page flipping to a terrible performance hit. But they also actually fixed the bug later. :facepalm:
I wonder how many kernel patches are like "this other party fork is impossible to merge but they fixed this one thing for which I am including the patch" ... or is that... uncouth?