Python 2.6 Text Processing: Beginners Guide

Jeff McNeil

  • 出版商: Packt Publishing
  • 出版日期: 2010-12-18
  • 售價: $1,840
  • 貴賓價: 9.5$1,748
  • 語言: 英文
  • 頁數: 380
  • 裝訂: Paperback
  • ISBN: 1849512124
  • ISBN-13: 9781849512121
  • 相關分類: Python程式語言
  • 海外代購書籍(需單獨結帳)

相關主題

商品描述

With a basic knowledge of Python you have the potential to undertake time-saving text processing. This book is a great introduction to the various techniques, and teaches through practical examples and clear explanations. Overview The easiest way to learn text processing with Python Deals with the most important textual data formats you will encounter Learn to use the most popular text processing libraries available for Python Packed with examples to guide you through What you will learn from this book Know the options available for processing text in Python Parse JSON data that is often used as a data delivery mechanism on the Internet Organize a log-processing application via modules and packages to make it more extensible Perform conditional matches via look-ahead and look-behind assertions by using basic regular expressions Process XML and HTML documents in a variety of ways based on the needs of your application Implement callback methods to perform SAX processing and walk in-memory DOM structures Understand Unicode, character encoding, internationalization, and localization Lay out a Mako template-based project by using techniques such as template inheritance, additional tags, and custom filters Install and use the Mako templating system to create your own Mako templates Process a large number of e-mail messages using the Python standard library and index them with Nucular for fast searching Fix common exceptions that occur while dealing with different types of text encoding Build simple PDF output using the ReportLab toolkit's high-level PLATYPUS framework Generate Microsoft Excel output using the xlwt module Open and edit existing Open Document files to use them as template sources Understand supporting functions and classes, such as the Python IO system and packaging components Approach This book is part of the Beginner's Guide series. Each chapter covers the steps for various tasks to process data followed

商品描述(中文翻譯)

擁有基本的 Python 知識,您就有潛力進行節省時間的文本處理。本書是各種技術的絕佳入門,並通過實際範例和清晰的解釋進行教學。

概述
學習使用 Python 進行文本處理的最簡單方法
處理您將遇到的最重要的文本數據格式
學會使用 Python 中最受歡迎的文本處理庫
充滿範例以指導您

您將從本書中學到的內容
了解在 Python 中處理文本的可用選項
解析 JSON 數據,這通常用作互聯網上的數據傳遞機制
通過模組和套件組織日誌處理應用程序,使其更具擴展性
通過使用基本的正則表達式執行前瞻和後顧斷言的條件匹配
根據應用程序的需求以多種方式處理 XML 和 HTML 文檔
實現回調方法以執行 SAX 處理並遍歷內存中的 DOM 結構
理解 Unicode、字符編碼、國際化和本地化
使用模板繼承、附加標籤和自定義過濾器等技術佈局基於 Mako 模板的項目
安裝和使用 Mako 模板系統來創建自己的 Mako 模板
使用 Python 標準庫處理大量電子郵件消息,並使用 Nucular 進行快速搜索索引
修復處理不同類型文本編碼時出現的常見異常
使用 ReportLab 工具包的高級 PLATYPUS 框架構建簡單的 PDF 輸出
使用 xlwt 模組生成 Microsoft Excel 輸出
打開和編輯現有的 Open Document 文件,以用作模板來源
理解支持函數和類,例如 Python IO 系統和打包組件

方法
本書是初學者指南系列的一部分。每一章涵蓋處理數據的各種任務的步驟。