Abstract: Human skeleton sequences are widely used for action recognition because they are robust to background noise and computationally efficient. In this paper, we propose a transformer-based method ...