Screen-scraping app: Faculty-App-Scraper
Faculty-App-Scraper
A quick/cheap way to scrape PDF documents from a legacy website.
## Purpose: Download graduate applications in PDF format from GCAST
Tools:
- python 2.7
- selenium: http://www.seleniumhq.org/
- beautiful soup: http://www.crummy.com/software/BeautifulSoup/
Process:
- Use Selenium to drive Firefox and open the GCAST page, pausing so the user can enter their HUID
- Navigate to the list of applications
- "Click" on each application to download the PDF file
- The PDF file
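
The process above can be sketched roughly as follows. This is a minimal illustration, not the project's actual script: it assumes Python 3 with Selenium 4 and the `beautifulsoup4` package rather than the Python 2.7 setup listed under Tools, and `start_url`, `listing_url`, `download_dir`, and the `href_contains` filter are hypothetical placeholders, since the real GCAST URLs and page markup are not described here.

```python
import os


def application_links(html, href_contains=""):
    """Return (link text, href) pairs for links on the listing page.

    href_contains is a hypothetical filter; the real GCAST markup may
    need a different rule to tell applications from other links.
    """
    from bs4 import BeautifulSoup  # lazy import: needs beautifulsoup4

    soup = BeautifulSoup(html, "html.parser")
    links = []
    for a in soup.find_all("a", href=True):
        text = a.get_text(strip=True)
        if text and href_contains in a["href"]:
            links.append((text, a["href"]))
    return links


def run(start_url, listing_url, download_dir, href_contains=""):
    """Drive Firefox through the steps above (all URLs are placeholders)."""
    from selenium import webdriver
    from selenium.webdriver.common.by import By
    from selenium.webdriver.firefox.options import Options

    # Ask Firefox to save PDFs to download_dir instead of opening them.
    options = Options()
    options.set_preference("browser.download.folderList", 2)
    options.set_preference("browser.download.dir", os.path.abspath(download_dir))
    options.set_preference("browser.helperApps.neverAsk.saveToDisk", "application/pdf")
    options.set_preference("pdfjs.disabled", True)

    driver = webdriver.Firefox(options=options)
    try:
        driver.get(start_url)
        # Pause so the user can log in by hand.
        input("Enter your HUID in the browser, then press Enter here... ")
        driver.get(listing_url)
        for text, _href in application_links(driver.page_source, href_contains):
            # Assumes each application's link text is unique and matches
            # exactly what the browser displays.
            driver.find_element(By.LINK_TEXT, text).click()
        # A real script would wait here until the downloads have finished.
    finally:
        driver.quit()
```

Beautiful Soup is used only to read the listing page; the clicking itself goes through Selenium so that the logged-in browser session handles each download.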