-
用Python写爬虫-源码.rar下载
资源介绍
This repository contains source code of examples from the book *Web Scraping with Python*, published by Packt Publishing.
Examples have been tested with Python 2.7 and depend on:
* [BeautifulSoup](http://www.crummy.com/software/BeautifulSoup/) (Ch 2)
* [lxml](http://lxml.de/) (Ch 2-9)
* [pymongo](http://api.mongodb.org/python/current/) (Ch 3-5, 9)
* [PyQt](http://www.riverbankcomputing.co.uk/software/pyqt/intro) / [PySide](https://pypi.python.org/pypi/PySide) (Ch 5)
* [ghost](http://jeanphix.me/Ghost.py/) (Ch 5)
* [Selenium WebDriver](http://www.seleniumhq.org/projects/webdriver/) (Ch 5, 9)
* [mechanize](http://wwwsearch.sourceforge.net/mechanize/) (Ch 6)
* [PIL](http://www.pythonware.com/products/pil/) / [Pillow](https://python-pillow.github.io/) (Ch 7)
* [pytesseract](https://github.com/madmaze/pytesseract) (Ch 7)
* [scrapy](http://scrapy.org/) (Ch 8)
* [portia](https://github.com/scrapinghub/portia) (Ch 8)
* [scrapely](https://github.com/scrapy/scrapely) (Ch 8)
This examples will break in future as websites change and dependencies are updated, so [bug reports and patches](https://bitbucket.org/wswp/code/issues?status=new&status=open) are welcome.
- 上一篇: ssm图书推荐系统毕业设计程序
- 下一篇: NCM转MP3转换器