Quesma Benchmarks

Open-source benchmarks for evaluating AI coding agents on real-world software engineering tasks.

Stay tuned for new benchmarks and results.