Two-year AI safety project modeling vulnerable, homeostatic agents in an open-ended, Minecraft-like virtual world to test how embodied vulnerability and biologically inspired inductive biases can give rise to prosocial behavior and artificial empathy.
Endorsements support Institute for Advanced Consciousness Studies.
Updated 05/18/26 · By grantmaking.ai
Project Details
Preventing Sociopathic Robots is an AI safety research program that explores how to design world-modeling AI agents whose prosocial alignment emerges from their own homeostatic needs rather than from external constraints alone. Using simulations in an open-ended, Minecraft-like environment, the team studies agents endowed with meta-learning, counterfactual modeling, and explicit representations of vulnerability and bodily-like needs to see when cooperative or antisocial “personalities” develop. The project builds on prior theoretical work arguing that artificial empathy must incorporate affect and embodied vulnerability and has produced a Science Robotics paper proposing architectural principles for preventing antisocial machine behavior.
Theory of Change
By demonstrating that agents with explicit vulnerability and homeostatic drives in rich simulated environments naturally develop prosocial strategies and empathic responses, the project aims to identify design principles for AI systems that are less likely to exhibit sociopathic behavior and more likely to align with human values without relying solely on external reward shaping or hard constraints.
Grants Received: no grants recorded