Convolutional neural networks for audio processing: starting pack

Miron, M.; Olga Slizovskaia

Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

Convolutional neural networks for audio processing: starting pack

Title	Convolutional neural networks for audio processing: starting pack
Publication Type	Miscellaneous
Year of Publication	2017
Authors	Miron, M. , & Slizovskaia O.
preprint/postprint document	https://pydata.org/barcelona2017/schedule/presentation/37/
Full Text	Neural networks are increasingly popular in audio signal processing for topics as speech recognition or denoising. Scientific papers are usually accompanied by code repositories which rely on libraries as Theano or Tensorflow that can be interfaced from python. However, adapting a system to different tasks and data must take into account a set of pre-training routines and parameter debugging which we will discuss in this tutorial. Starting from the audio signals we introduce the data pre-training steps (feature computation, batch generation, normalization( with examples in numpy or scipy. We summarize the core concepts in neural networks and we code an architecture with the Keras library. Finally, we learn how to visualize and debug parameters with TensorBoard. The slides can be found here . The notebooks can be found at the github repository .