An open-source benchmark suite developed by CeSIA to evaluate and compare large language model supervision and safeguard systems, measuring how reliably they detect problematic or unsafe behaviour in other models.
Endorsements support CeSIA.
An open-source benchmark suite developed by CeSIA to evaluate and compare large language model supervision and safeguard systems, measuring how reliably they detect problematic or unsafe behaviour in other models.
Endorsements support CeSIA.
People– no linked people
Updated 05/18/26 · By grantmaking.aiTheory of Change
Updated 05/18/26 · By grantmaking.aiBy providing standardised, open-source evaluations of how well different guardrail and monitoring systems detect harmful or non-compliant model behaviour, BELLS aims to raise the bar for AI supervision tools and inform regulators, labs, and safety institutes about which approaches best mitigate real-world risks from advanced language models.
Grants Received– no grants recorded
Updated 05/18/26 · By grantmaking.aiDiscussion
No comments yet. Be the first to share your thoughts.