The Computer Vision Workshop: Develop the skills you need to use computer vision algorithms in your own artificial intelligence projects
暫譯: 電腦視覺工作坊:培養在人工智慧專案中使用電腦視覺演算法所需的技能

Asad, Hafsa, Shrimali, Vishwesh Ravi, Singh, Nikhil

相關主題

商品描述

Explore the potential of deep learning techniques in computer vision applications using the Python ecosystem, and build real-time systems for detecting human behavior

Key Features

  • Understand OpenCV and select the right algorithm to solve real-world problems
  • Discover techniques for image and video processing
  • Learn how to apply face recognition in videos to automatically extract key information

Book Description

Computer Vision (CV) has become an important aspect of AI technology. From driverless cars to medical diagnostics and monitoring the health of crops to fraud detection in banking, computer vision is used across all domains to automate tasks. The Computer Vision Workshop will help you understand how computers master the art of processing digital images and videos to mimic human activities.

Starting with an introduction to the OpenCV library, you'll learn how to write your first script using basic image processing operations. You'll then get to grips with essential image and video processing techniques such as histograms, contours, and face processing. As you progress, you'll become familiar with advanced computer vision and deep learning concepts, such as object detection, tracking, and recognition, and finally shift your focus from 2D to 3D visualization. This CV course will enable you to experiment with camera calibration and explore both passive and active canonical 3D reconstruction methods.

By the end of this book, you'll have developed the practical skills necessary for building powerful applications to solve computer vision problems.

What you will learn

  • Access and manipulate pixels in OpenCV using BGR and grayscale images
  • Create histograms to better understand image content
  • Use contours for shape analysis, object detection, and recognition
  • Track objects in videos using a variety of trackers available in OpenCV
  • Discover how to apply face recognition tasks using computer vision techniques
  • Visualize 3D objects in point clouds and polygon meshes using Open3D

Who this book is for

If you are a researcher, developer, or data scientist looking to automate everyday tasks using computer vision, this workshop is for you. A basic understanding of Python and deep learning will help you to get the most out of this workshop.

商品描述(中文翻譯)

探索深度學習技術在計算機視覺應用中的潛力,並構建實時系統以檢測人類行為

主要特點


  • 了解 OpenCV 並選擇合適的算法來解決現實世界中的問題

  • 發現圖像和視頻處理的技術

  • 學習如何在視頻中應用人臉識別,自動提取關鍵信息

書籍描述

計算機視覺 (CV) 已成為人工智慧技術的重要組成部分。從無人駕駛汽車到醫療診斷,從監測作物健康到銀行詐騙檢測,計算機視覺在各個領域中被用來自動化任務。《計算機視覺工作坊》將幫助您理解計算機如何掌握處理數位圖像和視頻的技術,以模仿人類活動。

本書將從 OpenCV 庫的介紹開始,您將學習如何使用基本的圖像處理操作編寫您的第一個腳本。接著,您將掌握基本的圖像和視頻處理技術,如直方圖、輪廓和人臉處理。隨著學習的深入,您將熟悉進階的計算機視覺和深度學習概念,如物體檢測、追蹤和識別,並最終將重點從 2D 轉向 3D 可視化。本課程將使您能夠實驗相機校準,並探索被動和主動的典範 3D 重建方法。

到本書結束時,您將具備構建強大應用程序以解決計算機視覺問題所需的實用技能。

您將學到什麼


  • 使用 BGR 和灰階圖像在 OpenCV 中訪問和操作像素

  • 創建直方圖以更好地理解圖像內容

  • 使用輪廓進行形狀分析、物體檢測和識別

  • 使用 OpenCV 中的各種追蹤器在視頻中追蹤物體

  • 發現如何使用計算機視覺技術應用人臉識別任務

  • 使用 Open3D 可視化點雲和多邊形網格中的 3D 物體

本書適合誰

如果您是研究人員、開發人員或數據科學家,並希望使用計算機視覺自動化日常任務,那麼這個工作坊適合您。對 Python 和深度學習的基本理解將幫助您充分利用本工作坊。

作者簡介

Hafsa Asad graduated in Mechatronics Engineering from NUST, Pakistan. She worked at EVEATI Pvt Ltd for 5 years as a Machine Learning Engineer and Trainer.

Vishwesh Ravi Shrimali graduated from BITS Pilani, where he studied mechanical engineering. Since then he has been working with BigVision LLC in deep learning and computer vision and is also involved in creating official OpenCV courses. He has a keen interest in programming and AI and has applied that interest in mechanical engineering projects. He has also written multiple blogs on OpenCV, deep learning on LearnOpenCV, and on computer vision. When he is not writing blogs or working on projects, he likes to go on long walks or play his acoustic guitar.

Nikhil Singh is a computer vision and natural language processing engineer who likes to apply his knowledge of machine learning and deep learning to solve intriguing problems. He currently works as a data scientist for Alixpartners, London. After getting satisfactory results, he believes his work will help Alixpartners to achieve more excellence in their field. He is also the prime author of the book "Video Analytics using TensorFlow" for Apress Publication.

 

作者簡介(中文翻譯)

哈夫莎·阿薩德畢業於巴基斯坦國立科技大學的機電工程學系。她在EVEATI私人有限公司擔任機器學習工程師和培訓師,工作了5年。

維什維什·拉維·施里馬利畢業於比拉尼的比爾拉科技學院,主修機械工程。自那時起,他一直在BigVision LLC從事深度學習和計算機視覺的工作,並參與創建官方的OpenCV課程。他對程式設計和人工智慧有濃厚的興趣,並將這種興趣應用於機械工程項目中。他還在LearnOpenCV上撰寫了多篇有關OpenCV、深度學習和計算機視覺的博客。當他不在寫博客或從事項目時,他喜歡長時間散步或彈奏他的原聲吉他。

尼基爾·辛格是一位計算機視覺和自然語言處理工程師,喜歡將他的機器學習和深度學習知識應用於解決有趣的問題。他目前在倫敦的Alixpartners擔任數據科學家。在獲得令人滿意的結果後,他相信他的工作將幫助Alixpartners在其領域中取得更大的卓越。他也是Apress出版的《使用TensorFlow進行視頻分析》一書的主要作者。

目錄大綱

  1. Basics of Image Processing
  2. Common Operations When Working with Images
  3. Working with Histograms
  4. Working with Contours
  5. Face Processing in Image and Video
  6. Object Tracking
  7. Object Detection and Face Recognition
  8. OpenVINO with OpenCV

目錄大綱(中文翻譯)


  1. Basics of Image Processing

  2. Common Operations When Working with Images

  3. Working with Histograms

  4. Working with Contours

  5. Face Processing in Image and Video

  6. Object Tracking

  7. Object Detection and Face Recognition

  8. OpenVINO with OpenCV