Machine Learning in Multimedia: Unlocking the Power of Visual and Auditory Intelligence

Suman Kumar Swarnkar, Annu Sharma, J. Somasekar

  • Publisher: CRC
  • Publication date: 2024-12-10
  • Price: $3,570
  • Member price: $3,392 (5% off)
  • Language: English
  • Pages: 154
  • Binding: Hardcover - also called cloth, retail trade, or trade
  • ISBN: 1032761482
  • ISBN-13: 9781032761480
  • Category: Machine Learning
  • Not available for order

Product description

This book explores the interdisciplinary nature of machine learning in multimedia, highlighting its intersections with fields such as computer vision, natural language processing, and audio signal processing.

Machine Learning in Multimedia: Unlocking the Power of Visual and Auditory Intelligence serves as a comprehensive guide to navigating this exciting terrain, where artificial intelligence meets the rich tapestry of visual and auditory data. At its core, this book seeks to unravel the mysteries and unveil the potential of machine learning in the realm of multimedia. Whether it's enhancing user experiences in virtual environments, revolutionizing medical diagnostics, or shaping the future of entertainment, the impact of machine learning in multimedia is profound and far-reaching.

The journey begins with a thorough exploration of the foundational principles of machine learning, providing readers with a solid understanding of algorithms, models, and techniques tailored specifically for multimedia data. Through clear explanations and illustrative examples, readers will gain insights into how machine learning algorithms can be trained to extract meaningful patterns and insights from diverse forms of multimedia content.

Moving beyond theory, this book delves into practical implementations and real-world applications of machine learning in multimedia. Through a series of case studies and examples, readers will witness firsthand how machine learning algorithms are transforming industries and reshaping the way we interact with multimedia content. Whether it's improving image recognition accuracy in autonomous vehicles, enabling personalized recommendations in streaming platforms, or enhancing speech recognition systems for better accessibility, the possibilities are limitless.

This book will be helpful to computer science, data science, and artificial intelligence researchers, students, and professionals looking to unlock the full potential of visual and auditory intelligence through the power of machine learning.

About the authors

Suman Kumar Swarnkar received a Ph.D. (CSE) degree in 2021 from Kalinga University, Naya Raipur, Chhattisgarh, and an M.Tech. (CSE) degree in 2015 from Rajiv Gandhi Proudyogiki Vishwavidyalaya, Bhopal, India. He has more than 12 years of experience as an Assistant Professor in educational institutes and is currently an Assistant Professor in the Computer Science & Engineering Department at Shri Shankaracharya Institute of Professional Management & Technology, Raipur. He has guided more than 10 M.Tech. scholars. He holds published and granted Indian/Australian patents, and others are awaiting grant. He has authored and co-authored more than 50 journal articles, including WoS- and Scopus-indexed papers, and has presented research papers at 10 international conferences. He has completed many FDPs, training programs, webinars, and workshops, as well as a two-week comprehensive online Patent Information Course. He is proficient in handling teaching, research, and administrative activities. He has contributed extensively to the literature in the fields of Intelligent Data Analysis, Nature-Inspired Computing, Machine Learning, and Soft Computing.

Annu Sharma is an Associate Professor in the Department of Computer Applications at RajaRajeswari College of Engineering (RRCE). She holds a Master's degree in Computer Science and Applications from the Department of Computer Science and Applications, University of Jammu, J&K, and a Ph.D. from the Department of Computer Science, Gurukul Kangri University, Haridwar, Uttarakhand. She has more than 20 years of experience teaching at the Master's and Bachelor's levels, including teaching working executives. Before joining RRCE, she worked at Bangalore University; IMT Faridabad, Haryana; Central University of Jammu, J&K; and Arya College Ludhiana, Punjab. Her research interests include Biometrics, Image Processing, Bioinformatics, IoT, Cyber Security, and Machine Learning. She has published in various reputed Scopus-indexed international journals and leading international conferences.

J. Somasekar received a Ph.D. degree in CSE from JNTUA, Andhra Pradesh, and an M.Tech. degree from the National Institute of Technology Karnataka (NITK), Surathkal. He is currently a Professor in the CSE Department at JAIN (Deemed-to-be University), Bangalore, and a postdoctoral researcher at the University of South Florida, USA. As a resource person, he has delivered 195 technical talks for FDPs, workshops, and webinars in 13 states of India. He secured an All India Rank of 43 in the GATE exam. He has 16 years of teaching experience and 6 years of research experience. He has published more than 35 research articles in leading journals indexed in SCI and Scopus and in conference proceedings, as well as 3 international textbook chapters. He is guiding five CSE Ph.D. research scholars. His research interests include Image Processing, Data Science, Machine Learning, Big Data Analytics, and ML for Cyber Security.

Bharat Bhushan is an Assistant Professor in the Department of Computer Science and Engineering (CSE) at the School of Engineering and Technology, Sharda University, Greater Noida, India. He received his undergraduate degree (B.Tech. in Computer Science and Engineering) with distinction in 2012, his postgraduate degree (M.Tech. in Information Security) with distinction in 2015, and his doctorate (Ph.D. in Computer Science and Engineering) in 2021, all from Birla Institute of Technology, Mesra, India. For three consecutive years (2021 to 2023), Stanford University (USA) listed Dr. Bharat Bhushan in its top 2% scientists list. He has earned numerous international certifications, including CCNA, MCTS, MCITP, RHCE, and CCNP. He has published more than 150 research papers in various renowned international conferences and SCI-indexed journals. He has contributed more than 50 book chapters to various books and has edited 30 books for well-known publishers. He is a series editor of two Scopus-indexed book series, CMIA (Computational Methods for Industrial Applications) and FGIS (Future Generation Information System), published by CRC Press, Taylor and Francis, USA. He has served as a keynote speaker (resource person) at numerous faculty development programs and international conferences held in countries including India, Iraq, Morocco, China, Belgium, and Bangladesh. He has served as a reviewer and editorial board member for several international journals. Previously, he worked as an Assistant Professor at HMR Institute of Technology and Management, New Delhi, and as a Network Engineer at HCL Infosystems Ltd., Noida.
