Camera Distance-aware Top-down Approach for 3D Multi-person Pose Estimation from a Single RGB Image read more
MuseGAN: Multi-track Sequential Generative Adversarial Networks for Symbolic Music Generation and Accompaniment read more
Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks read more