Skip to content

Scrapy middleware module which uses image parsing to submit a captcha response to amazon.

License

Notifications You must be signed in to change notification settings

ziplokk1/scrapy-amazon-robot-middleware

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Description

This module uses image parsing to submit a captcha response to amazon using scrapy middleware.

It is accurate roughly 60% of the time.

The middleware checks for "Robot Check" in the title of the page and if the string is found, then it attempts to decode the captcha and submit the response.

Usage

In settings.py

# Keep the middleware as close to the spiders as possible (less than 100)
DOWNLOADER_MIDDLEWARES = {
    'captchabuster.RobotMiddleware': 90
}

Installation

Clone the repository locally.

pip install -e /path/to/captchabuster/

OR

pip install git+https://github.com/ziplokk1/scrapy-amazon-robot-middleware.git

About

Scrapy middleware module which uses image parsing to submit a captcha response to amazon.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published