Vision-Based Interaction (Synthesis Lectures on Computer Vision)

Matthew Turk, Gang Hua

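  • Publisher: Morgan & Claypool
  • Publication date: 2013-10-01
  • List price: $1,890
  • VIP price: $1,796 (5% off)
  • Language: English
  • Pages: 134
  • Binding: Paperback
  • ISBN: 1608452417
  • ISBN-13: 9781608452415
  • Related category: Computer Vision
  • Overseas special-order title (requires separate checkout)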

Product description

In its early years, the field of computer vision was largely motivated by researchers seeking computational models of biological vision and solutions to practical problems in manufacturing, defense, and medicine. For the past two decades or so, there has been an increasing interest in computer vision as an input modality in the context of human-computer interaction. Such vision-based interaction can endow interactive systems with visual capabilities similar to those important to human-human interaction, in order to perceive non-verbal cues and incorporate this information in applications such as interactive gaming, visualization, art installations, intelligent agent interaction, and various kinds of command and control tasks. Enabling this kind of rich, visual and multimodal interaction requires interactive-time solutions to problems such as detecting and recognizing faces and facial expressions, determining a person's direction of gaze and focus of attention, tracking movement of the body, and recognizing various kinds of gestures. In building technologies for vision-based interaction, there are choices to be made as to the range of possible sensors employed (e.g., single camera, stereo rig, depth camera), the precision and granularity of the desired outputs, the mobility of the solution, usability issues, etc. Practical considerations dictate that there is not a one-size-fits-all solution to the variety of interaction scenarios; however, there are principles and methodological approaches common to a wide range of problems in the domain. While new sensors such as the Microsoft Kinect are having a major influence on the research and practice of vision-based interaction in various settings, they are just a starting point for continued progress in the area. 
In this book, we discuss the landscape of history, opportunities, and challenges in this area of vision-based interaction; we review the state of the art and seminal works in detecting and recognizing the human body and its components; we explore both static and dynamic approaches to "looking at people" vision problems; and we place the computer vision work in the context of other modalities and multimodal applications. Readers should gain a thorough understanding of current and future possibilities of computer vision technologies in the context of human-computer interaction.
