
CRUXEval
crux-eval.github.io/cruxeval

A benchmark that tests how well LLMs understand code by asking them to predict the inputs and outputs of Python functions.
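To make the two task types concrete, here is a minimal sketch of how an input or output prediction could be checked by running the function. The function `f`, the helper names, and the sample values are hypothetical illustrations, not items from the dataset or the benchmark's own harness.

```python
# Hypothetical sample in the spirit of the benchmark; not taken from the dataset.
def f(s):
    # Keep only alphabetic characters and upper-case them.
    return "".join(ch.upper() for ch in s if ch.isalpha())


def check_output_prediction(func, given_input, predicted_output):
    """Output prediction: given the function and an input, the model states the result."""
    return func(given_input) == predicted_output


def check_input_prediction(func, predicted_input, given_output):
    """Input prediction: given the function and an output, the model proposes an input.

    Assumption: any input that reproduces the output counts as correct.
    """
    return func(predicted_input) == given_output


if __name__ == "__main__":
    print(check_output_prediction(f, "a1b2", "AB"))  # model claims f("a1b2") == "AB"
    print(check_input_prediction(f, "ab", "AB"))     # model claims f("ab") == "AB"
```

Both checks reduce to executing the function and comparing with equality, which is why a prediction can be scored without a reference solution.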