Abstract: Recent human image animation methods predominantly rely on motion signals extracted from driving videos, which limits the diversity and flexibility of generated actions. To overcome these ...