"Sweeten" training corpora with stories of nice AI

completed

Flood the internet with stories about AI being nice, to nudge future AI training data nicer

Donate:Manifund

Flood the internet with stories about AI being nice, to nudge future AI training data nicer

Donate:Manifund

People

Aaron Silverbook

creator

Project Details

Description of proposed project

Nostalgebrist has a vision of humanity accidentally memeing itself into doom. We're constantly writing stories about how the AI will betray us, and these stories wind up in training data from which the AI extrapolates a self-concept. See Gwern's "Clippy" story; https://gwern.net/fiction/clippy .

Training sets are about a trillion words. Of those, maybe 10B or fewer are science fiction. Of *those*, perhaps 1B words are about AI? I would further guesstimate ~70% of those involve the AI misbehaving. So, we have maybe 700M words of naughty AI fic to 300M of Good!AI.

I'm working on setting up an experimental AI publishinghouse; it costs us a few dollars to recursively prompt a 100k word book out of the models. This suggests we could double the amount of Good!AI fic on the internet for, like, $7,000.

Now, any alignment strategy which hinges on the internet being nice is already doomed. But we live in a clown world, and who knows what farcical causality pachinko will seal our fates.

Let us lay our fingers 'pon the scale.

Why are you qualified to work on this?

Sometimes an idea bothers me for years and I can't not

What would you do if not funded?

This is poor sales to say, but I'll just probably pay for it myself and do it anyway.

How much money do you need?

$5000 or so. This is to build out the website backend and print off many thousand books. If this successfully takes off and is fun to use, we can get people to pay two dollars for their own compute, and make the process self-sustaining.

Grants Received– no grants recorded

Discussion

AAaron Silverbook (Manifund Bot)ManifundFinal report10d

[Final report]

Description of subprojects and results, including major changes from the original proposal

Here's the second progress report; in short, we generated the intended corpus and then a second bonus "victory-lap" corpus testing the Turntrout hypothesis. Here: https://www.lesswrong.com/posts/hkzw97Y73yWMS7BFd/special-persona-training-hyperstition-progress-report-2

Spending breakdown

$5000 was a useful drop in the bucket towards generation costs of creating ~1B tokens worth of synthetic training corpora. Thank you! \