Democratic Fine-Tuning (DFT) is an OpenAI-funded project of the Meaning Alignment Institute that uses values cards and a moral graph in a short democratic deliberation process to elicit people’s values and aggregate them into training data for fine-tuning language models, providing an alternative to Constitutional AI and standard RLHF.
Endorsements support Meaning Alignment Institute.
Democratic Fine-Tuning (DFT) is an OpenAI-funded project of the Meaning Alignment Institute that uses values cards and a moral graph in a short democratic deliberation process to elicit people’s values and aggregate them into training data for fine-tuning language models, providing an alternative to Constitutional AI and standard RLHF.
Endorsements support Meaning Alignment Institute.
People– no linked people
Updated 05/18/26 · By grantmaking.aiGrants Received– no grants recorded
Updated 05/18/26 · By grantmaking.aiDiscussion
No comments yet. Be the first to share your thoughts.