Detect blocking - Automatic detection of 403/429 errors
Datacenter proxy fallback - Cheap, fast option
Learn domain patterns - Remembers which domains need proxies
Expert-in-the-loop - Asks before using expensive residential proxies
Smart cost control: Direct connections when possible, datacenter proxies for blocks, residential proxies only with explicit approval. Reduces proxy costs by 80-90%.
# Datacenter Proxy (default - used with --proxy-type datacenter or no flag)DATACENTER_PROXY_USERNAME=your_usernameDATACENTER_PROXY_PASSWORD=your_passwordDATACENTER_PROXY_HOST=dc.yourproxy.comDATACENTER_PROXY_PORT=10000
Proxy Configuration:Check your proxy provider’s documentation for:
Correct hostname and port for datacenter vs residential proxies
Rotating vs sticky IP options
Authentication requirements
Since SmartProxyMiddleware uses a single proxy connection, use rotating IP ports when available for better IP distribution.
# Auto mode (default) - smart escalation./scrapai crawl spider_name --project proj --limit 10# Explicit auto mode./scrapai crawl spider_name --project proj --limit 10 --proxy-type auto
When datacenter proxy fails, you’ll see:
⚠️ EXPERT-IN-THE-LOOP: Datacenter proxy failed for some domains🏠 Residential proxy is available but may incur HIGHER COSTSBlocked domains: example.com, site.orgTo proceed with residential proxy, run: ./scrapai crawl spider_name --project proj --proxy-type residential
Cost protection: Residential proxies require explicit user approval - no surprise costs!