Join the Great Agent Hack 2025 - Win big from £50,000 in cash & credits (£30,000 secured)!
Register Now
Learn more about EU AI Act

Back to Glossary

Back to Glossary

Benchmarking

Method of Model Evaluation which assesses AI system performance against a predefined task, such as mapping AI system outputs to a dataset of prompts and responses.

Unlock the Future with AI Governance.

Get a demo

Get a demo