I'm developing a system to keep advanced AI models safe, I've done some tests, but need more compute to do deeper tests.
I'm developing a system to keep advanced AI models safe, I've done some tests, but need more compute to do deeper tests.
Project Details
Updated 07/03/26 · By grantmaking.aiProject summary
I have a simple hypothesis: As AI agents get more autonomous, we'll need infrastructure that watches what they are doing, instead of only evaluating the models themselves. I want to build the smallest version of that idea and find out if it actually works.
I've aldready done adversial attacks on my system. In one batch I run 233 adaptive attacks, with a strong model as attacker, against my model and Claude. My system catched all attacks and let normal calls pass.
In another batch I run hundreds of iterative tries to cause harmful output with similar results.
I've thus tested if the system discovers attacks.
Now I want to test if my system works to supervide an agent doing tasks, follow what an independent agent does and react if something goes wrong.
I want to build an MVP, as a minimal version of the system, to see if my system works an a supervising infrastructure.
x.com/New_AI_Safety
linkedin.com/in/pedro-bentancour-garin-22a0a215
What are this project's goals? How will you achieve them?
Build a simple MVP and test my system.
The long-term goal of my complete projext is to build a global AI governance and safety layer. I belive I have come a ling way with the architechture and the patents I've filed for it. I'm now in a testing and fund raising phase.
How will this funding be used?
Compute and three months runway.
Who is on your team? What's your track record on similar projects?
I'm a PHD candidate at Stockholm university and a serial founder, founded Treehoo. com in 2008, read about that project (from some fans of mine) here:
What are the most likely causes and outcomes if this project fails?
I would need to fine-tune and adapt my system.
How much money have you raised in the last 12 months, and from where?
0\
People
Updated 07/03/26 · By grantmaking.aicreator
Funding Details
- -
- -
- -
- -
- -
- -
- -
- $10,000
- -
- -
Grants Received– no grants recorded
Updated 07/03/26 · By grantmaking.aiDiscussion
@Lisa-Intel Hey, Austin responded. You aren't supposed to label it under Leo's microgranting program unless you were already accepted. Disregard if you were.
Best,
Nicholas Volta
Hi sorry, didn't see your messages until now. So you can't just apply you mean? @nicholas
@Lisa-Intel No, you can't label it as "Leo’s experimental microgranting program" unless you got accepted.
But how do you now if you're accepted? I have 2 options to chose, above nomal grant and Leo's experimental @nicholas
Hey, do you know if Leo closed his microgrant program? I got removed from the category, and when I added it back, it got removed again. I tried asking Austin last night and have yet to receive a response.
@nicholas Nevermind. The logs have just been updated; he deposited $50,000 USD more into his account.