Audio Spoof Detection from Theory to Practical Application
暫譯: 音頻偽造檢測:從理論到實踐應用

Dua, Mohit, Chakravarty, Nidhi, Dua, Shelza

  • 出版商: CRC
  • 出版日期: 2026-05-21
  • 售價: $2,240
  • 貴賓價: 9.5$2,128
  • 語言: 英文
  • 頁數: 242
  • 裝訂: Quality Paper - also called trade paper
  • ISBN: 1032912642
  • ISBN-13: 9781032912646
  • 相關分類: 語音辨識 Speech-recognition
  • 尚未上市,無法訂購

商品描述

Audio Spoof Detection (ASD) systems play a pivotal role in evaluating whether the input speech signal has been manipulated by an imposter attempting unauthorized access to an authentic user's account or if it genuinely originates from the declared user. Primarily used for person authentication, these systems strive to verify the speaker's claimed identity. Despite substantial technological advancements, recent testing has revealed persistent vulnerabilities to spoofing, commonly referred to as a spoof attack. Various techniques such as mimicry, replay, text to speech (TTS), and voice conversion (VC) are frequently employed in ASV systems to execute logical access (LA) or physical access (PA) spoofing attacks. To secure an ASD system from these attacks many researchers have proposed effective security models as countermeasures. In addition, numerous review papers by different researchers have discussed various countermeasures developed to secure ASD systems. However, there is a notable absence of an authored book that comprehensively addresses this critical research topic, encompassing frontend, backend, dataset and types of attacks considerations. Therefore, there is an urgent need for a book that serves as a valuable resource for upcoming researchers, offering insights into securing ASD systems and bridging the existing gap in the literature. Hence, this book is an effort by the authors in such direction.

商品描述(中文翻譯)

音頻欺騙檢測(Audio Spoof Detection, ASD)系統在評估輸入語音信號是否被試圖未經授權訪問真實用戶帳戶的冒名者操縱方面扮演著關鍵角色,或該信號是否真正來自所聲明的用戶。這些系統主要用於個人身份驗證,旨在驗證說話者所聲稱的身份。儘管技術上有了顯著的進步,最近的測試仍然揭示了對欺騙的持續脆弱性,通常稱為欺騙攻擊(spoof attack)。各種技術,如模仿(mimicry)、重播(replay)、文本轉語音(text to speech, TTS)和語音轉換(voice conversion, VC),經常在自動語音驗證(ASV)系統中被用來執行邏輯訪問(logical access, LA)或物理訪問(physical access, PA)欺騙攻擊。為了保護ASD系統免受這些攻擊,許多研究人員提出了有效的安全模型作為對策。此外,許多不同研究人員的綜述論文討論了為保護ASD系統而開發的各種對策。然而,缺乏一本全面探討這一關鍵研究主題的專著,涵蓋前端、後端、數據集和攻擊類型的考量。因此,迫切需要一本能為即將到來的研究人員提供寶貴資源的書籍,提供有關保護ASD系統的見解,並填補文獻中的現有空白。因此,這本書是作者在這方面努力的結果。

作者簡介

Mohit Dua earned his Ph.D. in Automatic Speech Recognition from the National Institute of Technology, Kurukshetra, India, in 2018. He is presently working as an assistant professor in the Department of Computer Engineering at NIT Kurukshetra, India. He has more than 17 years of teaching and research experience. He is a member of Institute of Electrical and Electronics Engineers (IEEE), and life member of the Computer Society of India (CSI) and Indian Society for Technical Education (ISTE). His research interests include speech processing, chaos-based cryptography, information security, theory of formal languages, statistical modelling and natural language processing. He has published approximately 100+ research papers including abroad paper presentations in the USA, Canada, Australia, Singapore, Mauritius, and Dubai.

Nidhi Chakravarty earned her Ph.D. in Audio Spoof Detection from the National Institute of Technology, Kurukshetra, India, in 2024. She is presently working as an assistant professor in the Department of Computer Science and Engineering at Thapar Institute of Engineering and Technology, Patiala, India. She has published approximately 20+ research papers in various reputed journals and international conferences.

Shelza Dua earned her Ph.D. in Image Encryption from Banasthali Vidyapith, Banasthali Rajasthan, India, in 2019. She is presently working as a research associate in the Department of Electrical Engineering at NIT Kurukshetra, India. She has more than 20+ years of teaching and research experience. She is a life member of IETE. Her research interests include chaos-based cryptography and image encryption. She has published approximately 30+ research papers in various reputed journals and international conferences.

作者簡介(中文翻譯)

Mohit Dua 於2018年在印度庫魯克舍特拉國立技術學院獲得自動語音識別的博士學位。他目前在印度庫魯克舍特拉國立技術學院的計算機工程系擔任助理教授。他擁有超過17年的教學和研究經驗。他是電氣和電子工程師學會(IEEE)的會員,並且是印度計算機學會(CSI)和印度技術教育學會(ISTE)的終身會員。他的研究興趣包括語音處理、基於混沌的密碼學、信息安全、形式語言理論、統計建模和自然語言處理。他已發表約100篇研究論文,包括在美國、加拿大、澳大利亞、新加坡、毛里求斯和迪拜的國際會議上發表的論文。

Nidhi Chakravarty 於2024年在印度庫魯克舍特拉國立技術學院獲得音頻欺騙檢測的博士學位。她目前在印度帕提亞拉的塔帕爾工程與技術學院的計算機科學與工程系擔任助理教授。她已在各種知名期刊和國際會議上發表約20篇研究論文。

Shelza Dua 於2019年在印度拉賈斯坦邦的巴納斯塔利維迪亞皮特獲得圖像加密的博士學位。她目前在印度庫魯克舍特拉國立技術學院的電氣工程系擔任研究助理。她擁有超過20年的教學和研究經驗。她是IETE的終身會員。她的研究興趣包括基於混沌的密碼學和圖像加密。她已在各種知名期刊和國際會議上發表約30篇研究論文。