Webbots, Spiders, and Screen Scrapers: A Guide to Developing Internet Agents with PHP/CURL, 2/e (Paperback)

Michael Schrenk

  • Publisher: No Starch Press
  • Publication date: 2012-03-15
  • List price: $1,940
  • VIP price: $1,843 (5% off)
  • Language: English
  • Pages: 392
  • Binding: Paperback
  • ISBN: 1593273975
  • ISBN-13: 9781593273972
  • Related category: PHP
  • Not available for order

Product description

There's a wealth of data online, but sorting and gathering it by hand can be tedious and time-consuming. Rather than click through page after endless page, why not let bots do the work for you?

Webbots, Spiders, and Screen Scrapers will show you how to create simple programs with PHP/CURL to mine, parse, and archive online data to help you make informed decisions. Michael Schrenk, a highly regarded webbot developer, teaches you how to develop fault-tolerant designs, how best to launch and schedule the work of your bots, and how to create Internet agents that:

  • Send email or SMS notifications to alert you to new information quickly
  • Search different data sources and combine the results on one page, making the data easier to interpret and analyze
  • Automate purchases, auction bids, and other online activities to save time

Sample projects for automating tasks like price monitoring and news aggregation will show you how to put the concepts you learn into practice.

This second edition of Webbots, Spiders, and Screen Scrapers includes tricks for dealing with sites that are resistant to crawling and scraping, writing stealthy webbots that mimic human search behavior, and using regular expressions to harvest specific data. As you discover the possibilities of web scraping, you'll see how webbots can save you precious time and give you much greater control over the data available on the Web.
