Skip to content
Accueil

/

Glossaire

/

Navigateur

/

CAPTCHA

Navigateur

3 min de lecture

Qu'est-ce que CAPTCHA ?

CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) is a challenge-response mechanism designed to distinguish human users from automated bots on websites.

What is a CAPTCHA?

CAPTCHA stands for Completely Automated Public Turing test to tell Computers and Humans Apart. It is a security mechanism that websites use to verify that a user is human rather than an automated bot. CAPTCHAs present challenges that are easy for humans to solve but difficult for machines, such as identifying distorted text, selecting images containing specific objects, or solving simple puzzles.

Types of CAPTCHAs

CAPTCHAs have evolved significantly since their introduction:

  • Text-based CAPTCHAs: Distorted letters and numbers that users must type correctly. Increasingly rare due to advances in OCR technology.
  • Image-based CAPTCHAs: Tasks like "select all images with traffic lights" that require visual understanding.
  • reCAPTCHA v2: Google's checkbox ("I'm not a robot") that analyzes mouse movement and browsing behavior.
  • reCAPTCHA v3: Invisible scoring system that rates user interactions on a 0-to-1 scale without presenting a visible challenge.
  • hCaptcha: A privacy-focused alternative that uses image classification tasks.
  • Audio CAPTCHAs: Accessibility alternatives that play distorted audio for users to transcribe.
  • CAPTCHAs and Browser Automation

    CAPTCHAs represent one of the most significant obstacles in browser automation. They are specifically designed to block automated access, which creates friction for legitimate automation use cases like data extraction, automated testing, and RPA workflows.

    Common approaches to handling CAPTCHAs in automation:

  • Avoidance: Structuring automation to minimize triggering CAPTCHA challenges by mimicking human browsing patterns, using realistic timing, and maintaining consistent sessions.
  • Human-in-the-loop: Pausing automation when a CAPTCHA appears and alerting a human operator to solve it before resuming.
  • API-based access: Bypassing the browser entirely by using official APIs that don't require CAPTCHA verification.
  • CAPTCHA solving services: Third-party services that use human workers or AI to solve CAPTCHAs programmatically.
  • Pourquoi c'est important

    CAPTCHAs can completely block automated workflows, causing data extraction jobs to fail and business processes to stall. Understanding CAPTCHA types and mitigation strategies is essential for building reliable browser automation that operates within ethical and legal boundaries.

    Comment Autonoly resout ce probleme

    Autonoly's Site Quirks system automatically detects and records CAPTCHA encounters across sessions. When the AI agent encounters a CAPTCHA, it can pause and notify the user for human-in-the-loop resolution. The platform remembers which domains use CAPTCHAs and proactively warns agents, allowing them to adjust their approach or suggest API-based alternatives.

    En savoir plus

    Exemples

    • A web scraping job pauses when reCAPTCHA v2 appears and sends a notification for human intervention

    • Site Quirks data warns the agent that a target website uses hCaptcha, triggering a slower, more human-like browsing pattern

    • An automation workflow falls back to API-based data access after repeated CAPTCHA encounters on a website

    Questions frequemment posees

    Modern AI can solve many CAPTCHA types, including image recognition and text-based challenges. However, CAPTCHA systems continuously evolve to stay ahead of AI capabilities. Google's reCAPTCHA v3, for example, uses behavioral analysis rather than explicit challenges, making it harder to bypass with pure AI. Ethically, automating CAPTCHA solving may violate website terms of service.

    Use realistic browsing patterns with human-like delays between actions, maintain consistent browser sessions with cookies, avoid making too many requests in rapid succession, use residential IP addresses rather than data center IPs, and ensure your browser fingerprint matches a real user's browser configuration.

    Arretez de lire sur l'automatisation.

    Commencez a automatiser.

    Decrivez ce dont vous avez besoin en francais simple. L'agent IA d'Autonoly cree et execute l'automatisation pour vous, sans code.