Convolutional neural networks for audio processing: starting pack

TitleConvolutional neural networks for audio processing: starting pack
Publication TypeMiscellaneous
Year of Publication2017
AuthorsMiron, M., & Slizovskaia O.
preprint/postprint document
Full Text

Neural networks are increasingly popular in audio signal processing for topics as speech recognition or denoising. Scientific papers are usually accompanied by code repositories which rely on libraries as Theano or Tensorflow that can be interfaced from python. However, adapting a system to different tasks and data must take into account a set of pre-training routines and parameter debugging which we will discuss in this tutorial. Starting from the audio signals we introduce the data pre-training steps (feature computation, batch generation, normalization( with examples in numpy or scipy. We summarize the core concepts in neural networks and we code an architecture with the Keras library. Finally, we learn how to visualize and debug parameters with TensorBoard.

The slides can be found here.

The notebooks can be found at the github repository.