23 tasks sorted by pass rate (easiest first).

Task Lang Pass Rate Cost Time Cheapest Fastest
cpp-simple 76%
$0.07 2m $0.01 Google 1m Google
go-microservices-traces 55%
$0.33 7m $0.03 Grok 4m Google
go-grpc-fix 50%
$0.47 17m $0.06 Google 12m Google
cpp-advanced 33%
$0.70 7m $0.04 Google 2m Google
python-microservices 31%
$0.44 7m $0.08 Google 3m Google
go-microservices-logs 26%
$0.74 8m $0.07 Google 2m Google
js-microservices 17%
$0.70 11m $0.23 Z.ai 7m Anthropic
go-microservices 10%
$0.63 16m $0.46 OpenAI 13m OpenAI
net-microservices 10%
$0.19 13m $0.05 DeepSeek 7m OpenAI
php-distributed-context-propagation 10%
$0.50 12m $0.16 Google 6m Google
cpp-distributed-context-propagation 2%
$0.27 15m $0.27 Z.ai 15m Z.ai
go-distributed-context-propagation 2%
$0.31 17m $0.31 Z.ai 17m Z.ai
php-microservices 2%
$1.24 10m $1.24 Anthropic 10m Anthropic
rust-distributed-context-propagation 2%
$0.91 19m $0.91 OpenAI 19m OpenAI
erlang-microservices 0%
go-microservices-traces-simple 0%
go-workflow-tracing 0%
java-distributed-context-propagation 0%
java-microservices 0%
python-distributed-context-propagation 0%
ruby-microservices 0%
rust-microservices 0%
swift-microservices 0%

Cost and Time show median values computed only from successful runs. Cheapest and Fastest show the best single run for each task.

All product names, logos, and brands (™/®) are the property of their respective owners; they're used here solely for identification and comparison, and their use does not imply affiliation, endorsement, or sponsorship.