Paper
4 March 2015 The non-trusty clown attack on model-based speaker recognition systems
Author Affiliations +
Proceedings Volume 9409, Media Watermarking, Security, and Forensics 2015; 940904 (2015) https://doi.org/10.1117/12.2083412
Event: SPIE/IS&T Electronic Imaging, 2015, San Francisco, California, United States
Abstract
Biometric detectors for speaker identification commonly employ a statistical model for a subject’s voice, such as a Gaussian Mixture Model, that combines multiple means to improve detector performance. This allows a malicious insider to amend or append a component of a subject’s statistical model so that a detector behaves normally except under a carefully engineered circumstance. This allows an attacker to force a misclassification of his or her voice only when desired, by smuggling data into a database far in advance of an attack. Note that the attack is possible if attacker has access to database even for a limited time to modify victim’s model. We exhibit such an attack on a speaker identification, in which an attacker can force a misclassification by speaking in an unusual voice, and replacing the least weighted component of victim’s model by the most weighted competent of the unusual voice of the attacker’s model. The reason attacker make his or her voice unusual during the attack is because his or her normal voice model can be in database, and by attacking with unusual voice, the attacker has the option to be recognized as himself or herself when talking normally or as the victim when talking in the unusual manner. By attaching an appropriately weighted vector to a victim’s model, we can impersonate all users in our simulations, while avoiding unwanted false rejections.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Alireza Farrokh Baroughi and Scott Craver "The non-trusty clown attack on model-based speaker recognition systems", Proc. SPIE 9409, Media Watermarking, Security, and Forensics 2015, 940904 (4 March 2015); https://doi.org/10.1117/12.2083412
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Databases

Speaker recognition

Data modeling

Systems modeling

Expectation maximization algorithms

Statistical analysis

Sensors

Back to Top