Related to #5646, the current `test_step_continuous` in `test_step.py` is a bit flaky and I'm not sure what it's designed to test. Can I get some insight into test so I can refactor it to be more robust?