The performance of neural networks has been dramatically improved since the method called “deep leaning” was developed around 2006[1][2]. Mainly, neural networks have been used for classification problems such as visual pattern recognition and speech recognition. However, there are not so many studies of sound source separation using neural networks. To apply neural networks to separation problems, separation problems require to be transformed into classification problems. To realize it, we referred to spectrogram analysis by specialists. Specialists can separate each source signal from the spectrogram of mixed signals by focusing on each local area of the spectrogram. In this study, we developed a novel method for sound source separation using spectrogram analysis by neural networks. As a result of the simulation, we successfully separated male and female voices from their mixed sound. The proposed method is superior to conventional methods in separation problems with sound reflection on walls and convolutional mixture which includes the difference of traveling time from a sound source to microphones because the method does not require to identify the mixture process in space.
Skip Nav Destination
ASME 2017 International Mechanical Engineering Congress and Exposition
November 3–9, 2017
Tampa, Florida, USA
Conference Sponsors:
- ASME
ISBN:
978-0-7918-5848-6
PROCEEDINGS PAPER
Sound Source Separation Using Spectrogram Analysis by Neural Networks
Tomoki Doura,
Tomoki Doura
Yokohama National University, Hodogaya-ku, Japan
Search for other works by this author on:
Toshihiko Shiraishi
Toshihiko Shiraishi
Yokohama National University, Hodogaya-ku, Japan
Search for other works by this author on:
Tomoki Doura
Yokohama National University, Hodogaya-ku, Japan
Toshihiko Shiraishi
Yokohama National University, Hodogaya-ku, Japan
Paper No:
IMECE2017-71583, V013T01A015; 5 pages
Published Online:
January 10, 2018
Citation
Doura, T, & Shiraishi, T. "Sound Source Separation Using Spectrogram Analysis by Neural Networks." Proceedings of the ASME 2017 International Mechanical Engineering Congress and Exposition. Volume 13: Acoustics, Vibration and Phononics. Tampa, Florida, USA. November 3–9, 2017. V013T01A015. ASME. https://doi.org/10.1115/IMECE2017-71583
Download citation file:
10
Views
Related Proceedings Papers
Related Articles
A Hybrid Analysis Method for Vibration Signals Based on Neural Networks and Pattern Recognition Techniques
J. Vib. Acoust (January,2001)
Multi-Mode Driving Control of a Parallel Hybrid Electric Vehicle Using Driving Pattern Recognition
J. Dyn. Sys., Meas., Control (March,2002)
Application of Fiber-Optical Microphone for Thermo-Acoustic Measurements
J. Eng. Gas Turbines Power (January,2011)
Related Chapters
Non-Correlated Character Recognition Using Hopfield Network: A Study
International Conference on Computer and Computer Intelligence (ICCCI 2011)
An Automatic Speaker Recognition System
International Conference on Advanced Computer Theory and Engineering (ICACTE 2009)
A Novel Approach for LFC and AVR of an Autonomous Power Generating System
International Conference on Mechanical Engineering and Technology (ICMET-London 2011)