Deep Learning based Sound Source Localization with Microphone Array

Shivenkumar M Parmar

Back

Deep Learning based Sound Source Localization with Microphone Array

Thesis

Open access

Deep Learning based Sound Source Localization with Microphone Array

Shivenkumar M Parmar

Master of Science (MS), California State University, Sacramento

11/01/2021

Handle:

https://hdl.handle.net/20.500.12741/rep:1966

Abstract

Localization implies tracking objects in the given environment. Sound source localization is a prominent research area to improve hearing sense in human-machine interaction. It has numerous applications including smart speakers and robots. The microphone array is capable to record sound to build powerful applications using audio data. In this project, we propose deep learning-based sound source localization with a microphone array. Using speaker and microphone array, we collect training and testing audio data for different user locations in two home environments. We mark each user location in the 2D plane using x and y coordinates. Test locations are within the range of 1 meter from the corresponding training locations. we extract features from audio data for each location using Short Time Fourier Transform (STFT) and convert audio data into spectrogram images for each location. Then, we apply deep convolutional neural networks on training locations to classify the user audio location. we use the same trained model on the test dataset to estimate user locations. We calculate distance error between the test and predicted user locations. In the end, experimental results show that our proposed system can obtain good accuracy and less error in classifying the user locations.

Files and links (1)

pdf

ParmarShivenkumar_Spring20212.65 MBDownload View

TextProject Open Access

Metrics

77 File views/ downloads

211 Record Views

Details

Title: Deep Learning based Sound Source Localization with Microphone Array
Creators: Shivenkumar M Parmar
Contributors: Xuyu Wang (Advisor)
Jinsong Ouyang (Committee Member)
Academic Unit: Computer Science Department
Theses and Dissertations: Master of Science (MS); Computer Science; California State University, Sacramento; 05/04/2021; 2021
Publication Details: 11/01/2021
Identifiers: 99257898415501671; https://hdl.handle.net/20.500.12741/rep:1966
Resource Type: Masters Project
Language: English
Number of pages: 44
Comment: The accessibility of this document has been verified by Sacramento State University Library. For questions, please contact lib-508Accessibility@csus.edu.