CAPSOLVER
Blog
How to Solve CAPTCHA in UI.Vision with CapSolver API

How to Solve CAPTCHA in UI.Vision with CapSolver API

Logo of CapSolver

Ethan Collins

Pattern Recognition Specialist

27-Aug-2025

UI.Vision is a popular open-source RPA and browser automation extension for Chrome and Firefox. It’s widely used for web scraping, data extraction, automated testing, and form filling, allowing both developers and non-coders to build macros or run Selenium IDE scripts directly in the browser. However, when dealing with websites protected by CAPTCHAs, these automations often break and require manual solving.

CapSolver is an AI-powered CAPTCHA-solving service designed for scraping, crawling, and browser automation at scale. It can automatically solve reCAPTCHA, Cloudflare Turnstile, and other common challenges. By combining the CapSolver extension with UI.Vision, you can create fully hands free automation workflows without trouble.

This guide shows how to quickly integrate CapSolver into UI.Vision, including setup, a ready-to-use macro example, and best practices for reliable web automation.

UI.Vision Overview & Use Cases

UI.Vision RPA is a versatile browser automation tool that combines visual scripting with advanced features like computer vision (via XModules), API calls, and desktop automation. As an extension, it runs directly in the browser, supporting Chrome, Firefox, and Edge, with options for headless mode and command-line execution. It's free for unlimited use, with optional paid add-ons for enhanced capabilities.

Use Cases

UI.Vision is suitable for a range of automation tasks, including:

  • Web Scraping: Extracting data from sites for research, monitoring, or reporting.
  • Form Automation: Filling and submitting online forms, such as registrations or surveys.
  • Testing: Running UI tests, regression checks, or load testing on web apps.
  • RPA Workflows: Automating business processes like email handling or data entry.

CAPTCHAs often appear in these scenarios, especially on login pages or forms, making a solving integration essential for uninterrupted runs.

Why CAPTCHA Solving is Needed

Websites use CAPTCHAs to block automated bots, which can stop UI.Vision macros mid-execution. These defenses require human verification, leading to failures in headless or scheduled automations.

Common CAPTCHA types include:

CAPTCHA Type Description
reCAPTCHA v2 Requires users to check a box or select images based on a prompt.
reCAPTCHA v3 Uses a scoring system to assess user behavior, often invisible to users.
Cloudflare Turnstile A privacy-focused CAPTCHA alternative that minimizes user interaction.

CapSolver's browser extension auto-detects and solves these, allowing UI.Vision to wait for resolution before proceeding.

How to Use CapSolver to Handle CAPTCHAs

CapSolver offers a browser extension that automatically solves CAPTCHAs on pages. Integrate it with UI.Vision by installing the extension and using wait/assert commands in your macro to detect the "solved" state.

Steps to Integrate CapSolver with UI.Vision

  1. Sign Up for CapSolver: Create an account at CapSolver Dashboard, add funds, and get your API key
  2. Install CapSolver Extension: Add the CapSolver extension to Chrome or Firefox from the Chrome Web Store or Firefox Add-ons.
  3. Configure Extension: Enter your CapSolver API key in the extension settings to enable auto-solving.
  4. Install UI.Vision: Add the UI.Vision RPA extension to your browser.
  5. Create Macro: Build a UI.Vision macro that navigates to the page, waits for CapSolver to solve the CAPTCHA (via element presence), and proceeds.
  6. Run the Macro: Execute in UI.Vision, ensuring the extension handles CAPTCHAs transparently.

Demo Code

UI.Vision uses JSON-based macros. The example below waits for CapSolver's "solved" indicator (an element with id='capsolver-solver-tip-button' and data-state='solved').

Complete Code Example + Step-by-Step Explanation

Below is a complete UI.Vision macro (in JSON format) that demonstrates solving a reCAPTCHA v2 on a demo page using CapSolver's extension.

Prerequisites

Complete Code Example

json Copy
{
  "Name": "solve captcha",
  "CreationDate": "2025-8-20",
  "Commands": [
    {
      "Command": "open",
      "Target": "https://www.google.com/recaptcha/api2/demo",
      "Value": "",
      "Description": ""
    },
    {
      "Command": "waitForPageToLoad",
      "Target": "",
      "Value": "",
      "Description": ""
    },
    {
      "Command": "waitForElementPresent",
      "Target": "xpath=//*[@id='capsolver-solver-tip-button' and @data-state='solved']",
      "Value": "30000",
      "Description": ""
    },
    {
      "Command": "assertElementPresent",
      "Target": "xpath=//*[@id='capsolver-solver-tip-button' and @data-state='solved']",
      "Value": "",
      "Description": ""
    },
    {
      "Command": "click",
      "Target": "id=recaptcha-demo-submit",
      "Value": "",
      "Description": ""
    }
  ]
}

Step-by-Step Explanation

Step Description
1. Open Page Uses open to navigate to the reCAPTCHA demo site where the CAPTCHA appears.
2. Wait for Page Load waitForPageToLoad ensures the page is fully loaded before proceeding.
3. Wait for CAPTCHA Solved waitForElementPresent checks for CapSolver's solved indicator (XPath for the button with data-state='solved'), timing out after 30 seconds. The extension auto-solves in the background.
4. Assert Solved assertElementPresent verifies the CAPTCHA is solved; if not, the macro fails.
5. Submit Form click on the submit button to complete the action once solved.
6. Run and Monitor Play the macro in UI.Vision; the extension handles solving, and logs show progress.

This macro relies on the CapSolver extension to inject the solution automatically.

Demo Walkthrough

This macro automates CAPTCHA solving on the reCAPTCHA demo:

  1. Navigation: Opens the demo page, triggering the CAPTCHA.
  2. Auto-Solving: CapSolver extension detects and solves the reCAPTCHA, updating the tip button to 'solved'.
  3. Waiting and Assertion: The macro waits up to 30 seconds for the solved state and asserts its presence.
  4. Submission: Clicks the submit button, completing the form.
  5. Observation: In UI.Vision's playback, watch the extension solve (visible icon), then the macro proceeds to success.

For real-world use, adjust timeouts or add error handling (e.g., loops) if solving takes longer.

FAQ Section

Question Answer
What types of CAPTCHAs can CapSolver solve? CapSolver supports reCAPTCHA v2/v3,Cloudflare Turnstile, and more. Refer to the CapSolver documentation for a complete list.
How do I handle different CAPTCHA types? The extension auto-detects types; configure supported types in CapSolver settings. Update XPath if the indicator changes.
What if CapSolver fails to solve the CAPTCHA? Add retry loops in the macro or check extension logs. Increase wait time for complex CAPTCHAs.
Can I use CapSolver with other RPA tools? Yes, CapSolver's extension works with any browser-based tool; API integration available for code-based setups.
Do I need proxies with CapSolver in UI.Vision? Proxies help with IP blocks; configure in the extension or UI.Vision for advanced scenarios.

Conclusion

Integrating CapSolver's browser extension with UI.Vision simplifies CAPTCHA handling in RPA workflows, allowing macros to run autonomously. This combination leverages UI.Vision's visual automation with CapSolver's AI solving for efficient, reliable tasks.

To begin, sign up for CapSolver and install the extension. Download UI.Vision and import the example macro. For more, explore CapSolver docs and UI.Vision docs. Automate your CAPTCHAs today!

Bonus for ui.vision Users: Use the promo code uivision when recharging your CapSolver account and receive an exclusive 6% bonus credit—no limits, no expiration.

Supported Browsers and Tools

  • UI.Vision: Works on Chrome, Firefox, Edge; supports visual commands, JavaScript, and Selenium IDE imports.
  • CapSolver: Browser extension for Chrome/Firefox; API for custom integrations.

References

Compliance Disclaimer: The information provided on this blog is for informational purposes only. CapSolver is committed to compliance with all applicable laws and regulations. The use of the CapSolver network for illegal, fraudulent, or abusive activities is strictly prohibited and will be investigated. Our captcha-solving solutions enhance user experience while ensuring 100% compliance in helping solve captcha difficulties during public data crawling. We encourage responsible use of our services. For more information, please visit our Terms of Service and Privacy Policy.

More

What Is a CAPTCHA Solving API? How It Works and When to Use It
What Is a CAPTCHA Solving API? How It Works and When to Use It

Learn what a CAPTCHA solving API is, how it works, and when to use it for automation. Discover the benefits of AI-powered CAPTCHA resolution for web scraping.

The other captcha
Logo of CapSolver

Sora Fujimoto

02-Apr-2026

Choose CAPTCHA Solving API
How to Choose CAPTCHA Solving API? 2026 Buyer's Guide & Comparison

Learn how to choose CAPTCHA solving API for web scraping and AI agents. Compare accuracy, speed, and cost to find the best automated solution for your needs.

web scraping
Logo of CapSolver

Ethan Collins

02-Apr-2026

Web Scraping Security: Best Practices to Protect Data & Avoid Detection
Web Scraping Security: Best Practices to Protect Data & Avoid Detection

Learn best practices for web scraping security, including ethical considerations, anti-bot system bypass, and CAPTCHA solutions to protect data and avoid detection risks. Essential for data professionals.

web scraping
Logo of CapSolver

Nikolai Smirnov

02-Apr-2026

Why CAPTCHA Blocks Users: Triggers, Avoidance & Solutions
Why CAPTCHA Blocks Users: Triggers, Avoidance & Solutions

Explore why CAPTCHA blocks legitimate users, common triggers like bad IP reputation and browser issues, and effective avoidance strategies. Learn how professional solutions like CapSolver handle CAPTCHA at scale for automation.

The other captcha
Logo of CapSolver

Nikolai Smirnov

02-Apr-2026

Elevating Enterprise Automation:
Elevating Enterprise Automation: LLM-Powered Infrastructure for Seamless CAPTCHA Recognition & Operational Efficiency

Discover how LLM-powered AI Automation Infrastructure revolutionizes CAPTCHA recognition, enhancing business process efficiency and reducing manual intervention. Optimize your automated operations with advanced verification solutions.

AI
Logo of CapSolver

Ethan Collins

30-Mar-2026

Scaling Data Collection for LLM Training: Solving CAPTCHAs at Scale
Scaling Data Collection for LLM Training: Solving CAPTCHAs at Scale

Learn how to scale data collection for LLM training by solving CAPTCHAs at scale. Discover automated strategies to build high-quality datasets for AI models.

AI
Logo of CapSolver

Lucas Mitchell

27-Mar-2026