grantmaking.ai Launch Round
What is this project?
We're working on making AI more reliable under adverse conditions, situations where uncertainity, conflicting evidence, corrupted inputs, distribution shifts, or specialist failures can cause otherwise AI systems to make unsafe decisions.
To investigate this problem, we're developing MAVS-GC (Multi Adaptive Vetting Systems-Governance Core), a governance first AI architecture that seperates prediction from output governance. Instead of relying solely on model confidence or a single prediction pathway; MAVS-GC allows multiple specialists to evaluate the same input simultaneously while an independent governance layer evaluates diagnostic signals, contextual evidence, mitigation, and explicit threshold policies before deciding whether an output should be trusted to have real world influence or not.
The project has evolved into a structured research program, with emperical benchmarks, mathematical foundations, public implementations, mechanistic analysis and reproducibility studies.
What has been done?
- Formalizing MAVS-GC as a governance first AI Architecture with proper mathematical definitions and governance calculus.
- Building synthetic benchmark environments to evaluate that the governance mechanisms behave according to their intended semantics.
- Evaluating the framework across multiple real-world datasets spanning different domains under both clean and corrupted conditions.
- Measuring robustness under multiple corruption families to study how governance changes AI behavior as evidence quality deteriorates
- Performing mechanistic ablation studies to identify which governance components are responsible for the observed safety behavior.
- Building reproducible evaluation pipelines, governance traces, public documentation, and research infrastructure so that every result can be independently examined.
Across these stages, the same pattern emerged consistently that explicit output governance appears to substantially reduce unsafe acceptance outputs (about 144-200 times less) whilst maintaing a high predictive accuracy (85%+) and the framework's decision becomes increasingly stable as corruption severity increases.
However, we still do not know if this behavior can survive under industrial grade pressure, therefore; this justifies the next stage of the research, which is industrial grade validation. With funding, we'll expand the evaluation from the current benchmark program to substantially larger cross-domain benchmarks suits, additional corruption families, frontier AI models, and more realistic deployment conditions.
The objective is to rigorously determine whether safety and stability properties observed in current research continue to hold at a much larger scale.
\
What will happen if it succeeds?
If the project succeeds, it's primary output would be a validated governance-first architecture that enables AI systems to be explicitly governed before their outputs are allowed to have any real world influence whatsoever in adverse conditions. Instead of relying on model's prediction and confidence, AI systems would be able to incorporate a governance layer that evaluates diagnostic signals, contextual evidence, mitigation, and risk before determining whether an output should be trusted, rejected, accepted, or escalated.
If the industrial scale validation confirms that the MAVS-GC formalization generalizes beyond the current benchmarks, it would provide evidence that explicit output governance can become a reusable architectural layer for AI systems operating in safety critical environments.
https://docs.google.com/document/d/1nFhljkCd_xqalUxwrA_u30Sv8kVs0MOxwHSMShCffRg/edit?usp=sharing
The document above speaks about the details regarding the money spent in detail.