Voice Fingerprinting for Indoor Localization with a Single Microphone Array and Deep Learning

Shivenkumar Parmar; Xuyu Wang; Chao Yang; Shiwen Mao; ACM

doi:10.1145/3522783.3529528

Back

Conference proceeding

Voice Fingerprinting for Indoor Localization with a Single Microphone Array and Deep Learning

Shivenkumar Parmar, Xuyu Wang, Chao Yang, Shiwen Mao and ACM

PROCEEDINGS OF THE 2022 ACM WORKSHOP ON WIRELESS SECURITY AND MACHINE LEARNIG (WISEML '22), pp.21-26

01/01/2022

DOI: https://doi.org/10.1145/3522783.3529528

Abstract

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Information Systems

Science & Technology

Technology

Telecommunications

With the fast development of the Internet of Things (IoT), smart speakers for voice assistance have become increasingly important in smart homes, which offers a new type of human-machine interaction interface. Voice localization with microphone arrays can improve smart speaker's performance and enable many new IoT applications. To address the challenges of complex indoor environments, such as non-line-of-sight (NLOS) and multi-path propagation, we propose voice fingerprinting for indoor localization using a single microphone array. The proposed system consists of a ReSpeaker 6-mic circular array kit connected to a Raspberry Pi and a deep learning model, and operates in offline training and online test stages. In the offline stage, the models are trained with spectrogram images obtained from audio data using short-time Fourier transform (STFT). Transfer learning is used to speed up the training process. In the online stage, a top- K probabilistic method is used for location estimation. Our experimental results demonstrate that the Inception-ResNet-v2 model can achieve a satisfactory localization performance with small location errors in two typical home environments.

Metrics

20 Record Views

Details

Title: Voice Fingerprinting for Indoor Localization with a Single Microphone Array and Deep Learning
Creators: Shivenkumar Parmar - California State University, Sacramento
Xuyu Wang - California State University, Sacramento
Chao Yang - Auburn University
Shiwen Mao - Auburn University
ACM
Academic Unit: Computer Science Department
Publisher: Assoc Computing Machinery
Publication Details: 01/01/2022
Grant note: ECCS-1923163; CNS2107190; CNS-2105416; CNS-2107164 / NSF; National Science Foundation (NSF)
Identifiers: 99258037154801671; https://doi.org/10.1145/3522783.3529528
Language: English
Number of pages: 6

Voice Fingerprinting for Indoor Localization with a Single Microphone Array and Deep Learning

Abstract

Related links

Metrics

Details