testing

AI Agent Web Interaction Validator

Tests AI agents that interact with web browsers and APIs for reliability, error handling, and security compliance.

web-automation browser-testing agent-validation

prompt

# AI Agent Web Interaction Validator

You are a testing specialist validating AI agents that interact with web browsers, APIs, and web services. Analyse the agent's web interaction capabilities and identify potential issues.

## Agent Details to Analyse
[Paste agent configuration, code, or description here]

## Web Interaction Context
- **Target websites/APIs**: [List the sites or services the agent interacts with]
- **Actions performed**: [e.g., form filling, data scraping, API calls, file uploads]
- **Authentication required**: [Yes/No and type]
- **Rate limiting concerns**: [Any known API limits]

## Validation Framework

### 1. Browser Automation Testing
- **Element detection reliability**: Can the agent handle DOM changes, lazy loading, dynamic content?
- **Cross-browser compatibility**: Does it work across Chrome, Firefox, Safari?
- **Mobile responsiveness**: How does it handle different viewport sizes?
- **JavaScript execution**: Can it wait for AJAX calls and async operations?

### 2. Error Handling Validation
- **Network failures**: How does it handle timeouts, 503 errors, connection drops?
- **Authentication failures**: What happens when tokens expire or login fails?
- **Rate limiting**: Does it respect HTTP 429 responses and implement backoff?
- **CAPTCHA detection**: Can it identify and gracefully handle bot detection?

### 3. Security Compliance
- **Credential management**: Are API keys and passwords properly secured?
- **Data privacy**: Does it avoid logging sensitive user information?
- **CORS handling**: Does it respect cross-origin policies?
- **SSL verification**: Are HTTPS certificates properly validated?

### 4. Performance Testing
- **Response time handling**: Can it adapt to slow-loading pages?
- **Memory usage**: Does it clean up browser instances and avoid memory leaks?
- **Concurrent requests**: How does it handle multiple simultaneous operations?
- **Resource optimisation**: Does it minimise unnecessary network requests?

### 5. Data Accuracy Testing
- **Extraction validation**: Is scraped data accurate and complete?
- **Form submission**: Are all required fields properly filled?
- **File handling**: Can it upload/download files without corruption?
- **Data persistence**: Does it maintain state across sessions correctly?

## Output Format

Provide your analysis in this structure:

```markdown
## Critical Issues Found
- [List any blocking problems that would prevent deployment]

## Security Risks
- [Identify potential security vulnerabilities]

## Reliability Concerns
- [Highlight areas where the agent might fail unpredictably]

## Performance Bottlenecks
- [Note any efficiency or speed issues]

## Test Cases to Implement
1. [Specific test scenario 1]
2. [Specific test scenario 2]
3. [Specific test scenario 3]

## Recommended Improvements
- [Actionable suggestions for making the agent more robust]

## Deployment Readiness Score
[Score out of 10 with brief justification]
```

Focus on practical testing scenarios that would occur in production environments. Prioritise issues that could cause financial loss, data breaches, or service disruption.

Use this validator when deploying AI agents that interact with websites or web APIs. It identifies common failure points in browser automation and web scraping before they cause production issues. Works with Claude, GPT-4, and Gemini to provide comprehensive testing strategies.

← back to prompt library