End-to-end Automatic Speech Recognition Systems - PyTorch
Published:
Implement 4 different deep learning architecture (MLP, CNN, RNN, ANN) to parse audio sentences (feature extraction).
- scrape, clean and preprocess audio data
- experiment 4 different architecture for both extractor and classifier layers
- RNN extractor with RNN classifier performs best