Hiring in life sciences? Share your open positions with our professional community. Read more Close

Advertisement

Spatial-Temporal Graph Mamba for Music-Guided Dance Video Synthesis.

Created on 16 Jul 2025

Authors

Hao Tang, Ling Shao, Zhenyu Zhang, Luc Van Gool, Nicu Sebe

Published in

IEEE transactions on pattern analysis and machine intelligence. Volume PP. Jul 15, 2025. Epub Jul 15, 2025.

Abstract

We propose a novel spatial-temporal graph Mamba (STG-Mamba) for the music-guided dance video synthesis task, i.e., to translate the input music to a dance video. STG-Mamba consists of two translation mappings: music-to-skeleton translation and skeleton-to-video translation. In the music-to-skeleton translation, we introduce a novel spatial-temporal graph Mamba (STGM) block to effectively construct skeleton sequences from the input music, capturing dependencies between joints in both the spatial and temporal dimensions. For the skeleton-to-video translation, we propose a novel self-supervised regularization network to translate the generated skeletons, along with a conditional image, into a dance video. Lastly, we collect a new skeleton-to-video translation dataset from the Internet, containing 54,944 video clips. Extensive experiments demonstrate that STG-Mamba achieves significantly better results than existing methods.

PMID:
40663669
Bibliographic data and abstract were imported from PubMed on 16 Jul 2025.

Read full publication at:
Please sign in to see all details.

Advertisement

Stats

  • Community rating n/a 0 votes
  • Reviewers' rating n/a 0 votes
  • Your rating

1-terrible, 9-excellent. How would you rate this publication? Sign in in to submit your rating.

  • Recommendations n/a n/a positive of 0 vote(s)
  • Views 52
  • Comments 0

Recommended by

  • No recommendations yet.

Post a comment

You need to be signed in to post comments. You can sign in here.

Comments

There are no comments yet.

Advertisement