OpenAI Introduces Software Engineering Benchmark

InfoQ

OpenAI has introduced the SWE-Lancer benchmark to evaluate how well advanced AI language models perform real-world freelance software engineering tasks.

By Daniel Dominguez