Wav2vec Unsupervised (wav2vec-U) is a framework for building speech recognition systems without any labeled training data as described in Unsupervised Speech Recognition (Baevski et al., 2021). The ...
examples/speech_recognition is implementing ASR task in Fairseq, along with needed features, datasets, models and loss functions to train and infer model described in Transformers with convolutional ...