Web Search Agent Evals

MCP • Open Source • 19.0

by youdotcom-oss • Uncategorized

Extensible benchmarking suite for evaluating AI coding agents on web search tasks. It compares native search against MCP servers (currently You.com, with more planned) across multiple agents (currently Claude Code, Gemini, Droid, and Codex, with more planned), using automated Docker workflows and statistical analysis.

Quick Stats

Service Type: MCP

Pricing Model: Free

Capabilities: 0 Tools / 0 Prompts / 0 Resources

Owner: youdotcom-oss

Category: Uncategorized
