Neel Nanda → Mitigating Reward Hacking Through RL Training Interventions
Amount: $7,900
Date Awarded: Mar 2, 2026
Details
Funder: Neel Nanda
Recipient: Mitigating Reward Hacking Through RL Training Interventions
Purpose: -
Source: https://manifund.org/projects/mitigating-reward-hacking-through-rl-training-interventions
Metadata
Created: May 30, 2026, 2:06 AM UTC