# Identifying and Resolving Common Issues

# IP Ban Error: Cause and Implication

Protected website displays "You have been blocked" without providing a solvable challenge. Common causes include:

  • Low-quality IP address
  • Incorrect interaction flow, such as:
    • Misconfigured Transport Layer Security (TLS) setup
    • Incorrect or missing request headers
    • Omitting essential steps in the request sequence

To troubleshoot, use tools like Burp or Charles to compare your requests against those made by a web browser. Focus on identifying:

  • Missing headers
  • Header order discrepancies
  • Improper TLS configurations

# Should I use a proxy?

Using proxies is crucial for sustained website access and reliable data extraction during web scraping. Websites often employ:

  • Rate limiting
  • IP blacklisting
  • Other blocking mechanisms

to counter excessive requests from a single IP address.

Proxy usage distributes the request load across multiple IP addresses, reducing detection and block risks. Key benefits of proxy rotation include:

  • Avoiding rate limits and blocks
  • Enabling geographic location switching
  • Accessing region-specific data or restricted content
  • Ensuring a more stable and uninterrupted scraping process with minimized detection risk.

# What User-Agent Should I use?

To ensure smooth interaction, use the same User-Agent when loading the challenge page and when calling the /solve endpoint. Once the challenge is solved, the obtained cookie is not User-Agent specific. For optimal reliability, utilize the latest Chrome version throughout all interactions. Enhance compatibility by passing your session's Accept-Language header alongside the User-Agent.