Welcome to the Audio Learning repository! This project serves as a structured sandbox for mastering the fundamentals of digital signal processing (DSP) and applying deep learning techniques to audio data.
The goal is to bridge the gap between raw sound waves and state-of-the-art neural network architectures.
We utilize the following core libraries for audio processing and model training:
- Librosa: The go-to library for audio and music analysis in Python.
- Torchaudio / Tf.signal: High-performance audio processing for deep learning.
- Matplotlib & Librosa.display: Advanced audio visualization.
- PyTorch or TensorFlow/Keras: Building and training neural networks.
- The Audio Programmers Podcast
- Valerio Velardo - The Sound of AI (Highly recommended YouTube channel)
- Python for Audio Signal Processing
Since this is a personal learning repository, contributions are mostly for my own growth, but if you find a bug or have a suggestion for a better processing technique, feel free to open an issue!
Happy Learning! ๐ถ