Flood the internet with stories about AI being nice, to nudge future AI training data nicer
Flood the internet with stories about AI being nice, to nudge future AI training data nicer
People
Updated 06/25/26 · By grantmaking.aicreator
Project Details
Updated 06/25/26 · By grantmaking.aiDescription of proposed project
Nostalgebrist has a vision of humanity accidentally memeing itself into doom. We're constantly writing stories about how the AI will betray us, and these stories wind up in training data from which the AI extrapolates a self-concept. See Gwern's "Clippy" story; https://gwern.net/fiction/clippy .
Training sets are about a trillion words. Of those, maybe 10B or fewer are science fiction. Of *those*, perhaps 1B words are about AI? I would further guesstimate ~70% of those involve the AI misbehaving. So, we have maybe 700M words of naughty AI fic to 300M of Good!AI.
I'm working on setting up an experimental AI publishinghouse; it costs us a few dollars to recursively prompt a 100k word book out of the models. This suggests we could double the amount of Good!AI fic on the internet for, like, $7,000.
Now, any alignment strategy which hinges on the internet being nice is already doomed. But we live in a clown world, and who knows what farcical causality pachinko will seal our fates.
Let us lay our fingers 'pon the scale.
Why are you qualified to work on this?
Sometimes an idea bothers me for years and I can't not
Other links
What would you do if not funded?
This is poor sales to say, but I'll just probably pay for it myself and do it anyway.
How much money do you need?
$5000 or so. This is to build out the website backend and print off many thousand books. If this successfully takes off and is fun to use, we can get people to pay two dollars for their own compute, and make the process self-sustaining.
[Final report]
Description of subprojects and results, including major changes from the original proposal
Here's the second progress report; in short, we generated the intended corpus and then a second bonus "victory-lap" corpus testing the Turntrout hypothesis. Here: https://www.lesswrong.com/posts/hkzw97Y73yWMS7BFd/special-persona-training-hyperstition-progress-report-2
Spending breakdown
$5000 was a useful drop in the bucket towards generation costs of creating ~1B tokens worth of synthetic training corpora. Thank you! \