Computer Vision: From 3D Reconstruction to Visual Recognition (Synthesis Lectures on Computer Vision)
Li Fei-fei, Silvio Savarese
- Publisher: Morgan & Claypool
- Publication date: 2018-10-15
- List price: $1,620
- VIP price: 5% off, $1,539
- Language: English
- Pages: 120
- Binding: Paperback
- ISBN: 1627050515
- ISBN-13: 9781627050517
Related categories:
Computer Vision
Overseas special-order books (separate checkout required)
Product description
When a 3-dimensional world is projected onto a 2-dimensional image, such as the human retina or a photograph, reconstructing the layout and contents of the real world becomes an ill-posed problem that is extremely difficult to solve. Humans possess the remarkable ability to navigate and understand the visual world by solving this inverse problem of going from 2D to 3D. Computer vision seeks to imitate these human abilities to recognize objects, navigate scenes, reconstruct layouts, and understand the geometric space and semantic meaning of the visual world. These abilities are critical in many applications, including robotics, autonomous driving and exploration, photo organization, image or video retrieval, and human-computer interaction.
This book delivers a systematic overview of computer vision, comparable to that presented in an advanced graduate-level class. The authors emphasize two key issues in modeling vision, space and meaning, and focus on the main problems vision needs to solve, including:
• mapping out the 3D structure of objects and scenes
• recognizing objects
• segmenting objects
• recognizing the meaning of scenes
• understanding movements of humans
Motivated by these important problems and centered on the understanding of space and meaning, the book explores the fundamental theories and important algorithms of computer vision, starting from the analysis of 2D images and culminating in the holistic understanding of a 3D scene.