[New Extension] Gemini Captcha Solver: AI-Powered Image Captcha Resolution for Chrome

By mahmood, 27 February, 2026

Gemini Captcha Solver: An AI-powered solution for screen reader users

Hello everyone,

Following the release of Vision Assistant Pro, which I first shared with this amazing community, I have been working on a specialized tool to tackle one of the most persistent barriers on the web: image-based captchas.

I am proud to introduce Gemini Captcha Solver, a Chrome extension specifically designed to help screen reader users navigate those frustrating alphanumeric image challenges using the power of Google Gemini AI. I will also soon release this feature for iOS via UserScript.

While many websites are moving towards accessible alternatives, others still rely on visual-only captchas. This tool is my latest effort to bridge that gap and ensure a more independent browsing experience for the visually impaired community.


Key Features

  • AI-Driven Recognition: Leverages Google's latest Gemini models to accurately interpret and solve alphanumeric captchas.
  • Built for Accessibility: Fully optimized for NVDA and JAWS using ARIA live regions. The extension provides real-time spoken notifications about the status of the captcha-solving process.
  • Proxy Support: Integrated proxy settings to ensure reliable API connectivity for users in regions with restricted access to Google services.
  • Privacy-Centric: All API keys and configuration data are stored locally within your browser's secure storage.
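
For anyone curious how the recognition step might look in code, here is a minimal sketch, not the extension's actual source. It assumes the captcha image is already available as a base64 string and that the request goes to the Gemini REST `generateContent` endpoint. The helper names (`buildCaptchaRequest`, `extractCaptchaText`) and the prompt wording are hypothetical; only the `contents` / `parts` / `inline_data` and `candidates` field names come from the public API.

```typescript
// Hypothetical helpers for illustration only; not taken from the extension.
interface GeminiPart {
  text?: string;
  inline_data?: { mime_type: string; data: string };
}
interface GeminiRequest {
  contents: { parts: GeminiPart[] }[];
}
interface GeminiResponse {
  candidates?: { content?: { parts?: { text?: string }[] } }[];
}

// Assumed prompt wording; the real extension may phrase this differently.
const CAPTCHA_PROMPT =
  "Read the characters in this captcha image. Reply with the characters only.";

// Build the JSON body for a generateContent call: one text part, one image part.
function buildCaptchaRequest(base64Image: string, mimeType = "image/png"): GeminiRequest {
  return {
    contents: [
      {
        parts: [
          { text: CAPTCHA_PROMPT },
          { inline_data: { mime_type: mimeType, data: base64Image } },
        ],
      },
    ],
  };
}

// Join the text parts of the first candidate and keep only letters and digits.
// Returns null when the model produced no usable characters.
function extractCaptchaText(response: GeminiResponse): string | null {
  const parts = response.candidates?.[0]?.content?.parts ?? [];
  const raw = parts.map((p) => p.text ?? "").join("");
  const cleaned = raw.replace(/[^A-Za-z0-9]/g, "");
  return cleaned.length > 0 ? cleaned : null;
}
```

Stripping everything except letters and digits is a deliberate simplification for alphanumeric captchas. It would also glue together a chatty reply such as "The answer is ...", which is why the prompt asks for the characters only.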

Installation and Links

The extension is currently available for desktop Chrome via manual installation.


Quick Setup Guide

  1. Get an API Key: Obtain a free Gemini API key from Google AI Studio.
  2. Installation:
    • Download and extract the ZIP file.
    • Open Google Chrome and navigate to chrome://extensions.
    • Enable Developer mode (toggle switch in the top right).
    • Press the Load unpacked button and select the extracted folder.
  3. Configuration:
    • Open the extension settings from your toolbar.
    • Paste your API key.
    • Click the Fetch Models button to retrieve the available AI models.
    • Select a model from the list (e.g., Gemini 3.0 Flash).
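
As a rough idea of what a Fetch Models button could do behind the scenes, here is a small sketch. It assumes the extension calls the public Gemini REST model-listing endpoint with your API key and keeps only models that support `generateContent`. The function names are hypothetical, and the extension's real code may differ (for example, when a proxy is configured).

```typescript
// Hypothetical sketch; function names are illustrative, not the extension's own.
interface ModelInfo {
  name: string; // e.g. "models/<model-id>"
  displayName?: string;
  supportedGenerationMethods?: string[];
}
interface ListModelsResponse {
  models?: ModelInfo[];
}

const LIST_MODELS_URL = "https://generativelanguage.googleapis.com/v1beta/models";

// Keep only models usable for generateContent and drop the "models/" prefix.
function selectableModels(response: ListModelsResponse): string[] {
  return (response.models ?? [])
    .filter((m) => m.supportedGenerationMethods?.includes("generateContent") ?? false)
    .map((m) => m.name.replace(/^models\//, ""));
}

// Request the model list with the user's API key and return the selectable IDs.
async function fetchModels(apiKey: string): Promise<string[]> {
  const res = await fetch(`${LIST_MODELS_URL}?key=${encodeURIComponent(apiKey)}`);
  if (!res.ok) {
    throw new Error(`Model list request failed: HTTP ${res.status}`);
  }
  return selectableModels((await res.json()) as ListModelsResponse);
}
```

If the list comes back empty or the request fails, the usual suspects are a mistyped API key or a blocked connection to Google services.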

Feedback Welcome

Your feedback was instrumental in the success of Vision Assistant Pro, and I hope for the same here. If you encounter any bugs, have questions about the setup, or have suggestions to improve its accessibility, please feel free to reach out or open an issue on the GitHub repository.

I hope this tool makes your daily web navigation a little easier!

Best regards,

Mahmood Hozhabri
