Unsupervised Representation Disentanglement Using Cross Domain Features and Adversarial Learning in Variational Autoencoder Based Voice Conversion
Learning to combine top-down and bottom-up signals in recurrent neural networks with attention over modules
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations (NeurIPS, 2020)