ApifyHttpProxyMiddleware
Index
Methods
__init__
Create a new instance.
Parameters
proxy_settings: dict
Dictionary containing proxy settings, provided by the Actor input.
Returns None
from_crawler
Create an instance of ApifyHttpProxyMiddleware from a Scrapy Crawler.
Parameters
crawler: Crawler
Scrapy Crawler object.
Returns ApifyHttpProxyMiddleware
ApifyHttpProxyMiddleware: Instance of the class.
process_exception
Process an exception that occurs during request processing.
Parameters
request: Request
Scrapy Request object.
exception: Exception
Exception object.
spider: Spider
Scrapy Spider object.
Returns None | Request
If a TunnelError occurs, return the request object to halt its processing in the middleware pipeline. Return None otherwise to allow the continuation of request processing.
process_request
Process a Scrapy request by assigning a new proxy.
Parameters
request: Request
Scrapy Request object.
spider: Spider
Scrapy Spider object.
Returns None
None: The request is processed and middleware pipeline can continue.
Apify HTTP proxy middleware for Scrapy.
This middleware enhances request processing by adding a 'proxy' field to the request's meta and an authentication header. It draws inspiration from the
HttpProxyMiddlewareincluded by default in Scrapy projects. The proxy URL is sourced from the settings under theAPIFY_PROXY_SETTINGSkey. The value of this key, a dictionary, should be provided by the Actor input. An example of the proxy settings:proxy_settings = {'useApifyProxy': true, 'apifyProxyGroups': []}