整理《web scraping with python》书中代码笔记
Cleaning Your Dirty Data
|
|
Reading and Writing Natural Languages
Summarizing Data
|
|
Markov Models
|
|
natural language toolkit
|
|
Crawling Through Forms and Logins
|
|
Scraping JavaScript
Executing JavaScript in Python with Selenium
|
|
Handling Redirects
|
|
Image Processing and Text Recognition
library
|
|
Processing Well-Formatted Text
|
|
Retrieving CAPTCHAs and Submitting Solutions
|
|
PySocks
|
|
Testing Your Website with Scrapers
Testing Wikipedia
|
|
Interacting with the Site
|
|
Handling Cookies
|
|