The link to the app is as follows: https://colab.research.google.com/drive/1tBnudPYg9Ze7uriQwbwQESd5zl2UL4Y1
The AI Testing Agent streamlines software testing by using AI to compare expected vs. actual results through uploaded documents and screenshots.
Originally Designed for evaluating my AI Document search solutions, it delivers clear evaluations and concise, ready-to-paste summaries for reporting.
The AI agent helps developers make testing faster, smarter, and more consistent — all within the simple Google Colab interface.
By taking the manual guesswork out of testing analysis, the agent allows developers to focus more on building and improving software rather than sifting through results.
It uses Gemini’s powerful language capabilities to offer thoughtful insights based on both context and content — making it especially helpful for testing apps with complex logic or AI-driven features.
Whether you’re working solo or presenting your findings in a larger QA report, the ready-made summary feature ensures your conclusions are clear and shareable in seconds.
Designed with ease of use in mind, the workflow fits smoothly into any developer’s routine, and it’s fully adaptable for future testing needs.
AI Testing Agent is an automated quality assurance tool designed to evaluate software test results using AI-powered comparison and summarization.
Originally built to test document search solutions developed with Vertex AI on Google Cloud, this tool enables intuitive evaluation using uploaded documents and screenshots.
Expected vs. Actual Comparison
Uses Gemini API to semantically compare expected test outcomes (PDF) against actual results (image screenshot).
Contextual Evaluation Report
Generates a detailed evaluation of how the tested application performed based on the project context and query used.
Summary Generator for Reporting
Outputs a concise one- or two-line summary optimized for pasting into formal QA/test documentation.
Interactive File Uploads
Utilizes Google Colab’s widget interface for uploading expected results (PDF) and screenshots (PNG/JPG).
Command-Line Integration
Executes seamlessly via Colab’s built-in CLI, maintaining a lightweight and flexible development workflow.
Clipboard Automation
Automatically copies the generated summary to the user clipboard upon request to streamline reporting.
| Component | Tech/Tool Used |
|---|---|
| AI Model | Gemini API (via Google Cloud Workspace) |
| Environment | Google Colab Notebook |
| UI Elements | Colab Widgets (File Uploads) |
| Text Processing | Python, Gemini Language Model |
| Image Handling | PIL / OpenCV |
Open the Colab Notebook
Clone or open the project via Google Colab.
Install Requirements (Optional)
Colab includes most dependencies, but custom ones can be installed via standard !pip install commands.
Testing the accuracy and relevance of a document search query for a Vertex AI-powered app: