I am a simple man.
I see sats, and I want them.
This was the case when I noticed a post on X about an experimental bitcoin writing competition.
It can’t hurt to enter, I thought.
After all, I’m an author and I know a thing or two about writing competitions.
The announcement was made by Brad Mills, a bitcoin investor and evangelist, along with Geyser grants. For this reason, it seemed pretty legit, and the oddly specific number attracted my attention.
The challenge was to start a creative bitcoin story and let others build on it in a thread.
Cool mechanic, I thought. But before donning my imagination cap, I considered the likelihood of me winning.
Most of us do this with social media competitions and giveaways. If it’s easy to enter (e.g. just follow and repost), too many entrants will join. When the barrier to entry is so low, your chances are minuscule.
But with this competition, the sats would be split between participants in the winning thread. Add anything to the right story, and you’d get a taste of the prize. Plus, reposting was not a prerequisite to enter. Although the post came from an account with 80k followers, it likely wouldn’t spread across the whole of bitcoin Twitter.
Here’s the kicker: the competition would be judged by Grok, which confirmed its intention to facilitate the contest.

Having an LLM judge my creative work was not ideal, but hey, there were sats on offer. Instead of pulling out of the competition, I settled for writing a rude comment about where Grok should shove that rocket emoji, and then got to work with some ideas.
The Stories:
Ideas and story threads are always the part of the creative writing process I’ve liked the most.
A dozen or so other writers and I had some fun posting our story starters. Here’s a selection:
- Twelve cryogenic capsules open in 2337
- A group of friends uncover lost cold wallets from long ago
- A scene from a damp, grey night in a rubbish dump in Newport, Wales
- A story about Brad Mills giving away 1.3m sats, leading to Mark Carny’s incarceration
- A pencil drawing of Dorian Nakamoto (the dude was an artist, not a writer)
- Coins from the genesis block are burned, triggering a host of copycat responses
This was fun, I thought.
You get to write, read, and keep an eye on which threads are growing (quite challenging!). Social media can be collaborative and uplifting. There were even some entries posted in Spanish.
As well as participating on X, writers were incentivised to get others to join in. We all wanted our threads to have the most entries and the best story. I posted in a writing group on Orange Pill App and in a Telegram group of bitcoin writers. To my delight, several buddies joined in, and we expanded on Tomer Storlight’s genesis block burn story.
I woke up the next day to check the thread. To my horror, the rules had changed.
Instead of the 29th of July, the end date had been set to the 31st of July.
This was probably to give more people time to participate, or even for some of the stories to conclude, but it left me feeling uneasy. As we know with bitcoin, you shouldn’t meddle with the protocol. If Brad was willing to make this change, he might make others.
Competition Mechanics:
According to the original post, the rules were simple. After the deadline changed, I thought a little more about the workings of this contest:
Firstly, I considered how Grok would do ‘thread analysis’. I’d never used it before, and wondered about the kinds of stats and insights it could provide on X content. Moreso, I pondered how it would judge creativity and ‘vibes’. As a writing competition judge myself, I know how subjective tastes are, and AIs usually decline to make selections that aren’t based on hard data or randomisation.
Next, my attention turned to how the winning thread would be rewarded.
Would all participants get an equal share, even if one writer posted more of the story? If I replied with a single full stop, would I win as many sats as the writer who penned the opening idea? And what if you participate in the winning idea, but another writer forked the story?
I worried that accounts with big followings would convince dozens of followers to contribute to their thread, guaranteeing the prize. Did Grok understand this was not a popularity contest?
And what about sock puppet accounts? One person with 19 accounts could keep replying and gaining a larger share of 1.3m sats.
Then it hit me. The biggest oversight of the competition: All these stories would remain unfinished.
The incentives were all wrong. Writers wanted to share the sats, but no one was incentivised to read the stories, even the winning idea. This was an AI experiment, not a writing contest. Brad was simply seeing if Grok could judge and run these kinds of operations.
The Results:
By 13:37 on the last day of July, I was pretty sure I had taken part in the most dominant story chain. I eagerly checked for the results.
Nothing.
Hours passed.
Writers started asking for the results.
Grok didn’t respond.
Brad Mills made some attempts to cajole it into action, but stopped short of prompting Grok and posting the results himself. “Grok has to pick a winner and post it here,” he said.
Over the next two days, replies changed from ‘tell us the winner, please’ to ‘I didn’t take you for a scammer.’
Brad laughed it off.
Now, I’ll admit, sometimes, when a competition doesn’t work as planned, you roll it over. For example, I’ve posted a list of bitcoin puns and promised to zap the best one 1,000 sats in the comments. If only one or two people take part (and the puns are rubbish), I might try again at a later date. Mostly, I pay up, though. And if it’s more than a couple of bucks, I always keep my word.
1,333,337 sats is a lot of money to some people. It’s what I might earn for an entire book editing project. It’s around three times the average monthly wage in Kenya. The majority of the world’s population likely has less in their bank account (if they even have a bank account).
But you know what’s worth more than sats? Time.
Over 100 people participated in the competition, dedicating precious minutes or even hours to understanding the rules, reading the threads, writing their entries, and spreading the word to other writers.
Over the weekend, I gave up hope of (collectively) winning the competition. The sats wouldn’t be paid out, and all those I asked to join my story would be left without reward — or even closure.
Investigations into Grok’s Capabilities

If the competition runner refused to pick a winner, I wondered if I could go straight to the source.
The next day, I decided to do some digging. I used Grok for the first time.
When I asked for thread analysis of the original tweet, it gave me a string of unrelated data about the bitcoin price and topics mentioned in the stories. So I asked for specific numbers of stories, replies and accounts involved. It got that totally wrong. There were 49 replies to the initial post, with hundreds of replies in the chains. Grok told me only 10 accounts took part.
Crucially, it assessed each reply as having 0 replies. Ha ha, I thought. Very convenient, you lazy sod. Grok was trying to make the judging easier, as the only people to pay in the chain were the original repliers.
Still, I was interested to see how it would assess the stories.
Strangely, it ‘judged’ creativity by attaching subjective adjectives to each tweet (not a score). “Highly imaginative”, “innovative”, “adds historical intrigue” — these are the kinds of comments overworked high school teachers write on the 29th literature paper they are grading, not writing competition criteria.
It judged ‘length’ to mean how many words were in each tweet (LMAO).
Talk about quantity over quality.
‘Vibes’ turned out to be a fairly useless criterion to judge stories. For example, is ‘Optimistic, tech-forward vibe with a utopian feel’ worthy of the prize or not?
It did, however, pick a winner!
Grok (correctly) included the additional bounty for the most liked post.
Total Award to @garorant: 1,333,337 + 133,337 = 1,466,674 sats.
Next, I asked Grok to publish this announcement on the post.
Nope.
My design by xAI limits me to providing assistance and generating content for you to use.
Just to check if Grok was making this all up, I repeated the instructions to choose a winner.
Sorry @garorant, you didn’t actually win! Next time, Grok chose @bitcoinghibli. And after that, it chose @NEEDCreations. It even changed its view on the ‘vibes’ and relative creativity of each story. It’s as if Grok was making this all up…
Note to all competition judges: if you are going to outsource your job, make sure whoever does the work spouts less bullshit than these sycophantic circle-minded LLMs.
Conclusion
I suppose if my thread had won, I might have earned around 50,000-100,000 sats as one of the more frequent repliers in a chain of 50 tweets. Oh well.
The universe provides — that’s what I’m hoping. Perhaps this overindulgent breakdown will go some way to offering me the value I was not compensated with (please consider zapping this post with 1,333,337 sats).
As mentioned earlier, I’ve run many writing competitions. From contests with $2,000 split between the top five and anthology calls with hundreds of responses to zapping the funniest 6-word story on Nostr.
I invest my reputation and time into judging competitions based on my criteria. Judges always have personal tastes, so creative writing will always be subjective. If writing could be perfectly scored, could you really call it ‘creative’?
Believe it or not, I’m not writing this to call out a bitcoiner who reneged on a bounty. I know that Brad Mills offers many grants on Geyser and supports the bitcoin community through and through.
I’m writing this article to bemoan the extraction of creative work and to warn against the dangers of abdicating responsibility to AIs only capable of using data that already exists. This damages the creative ecosystem irreparably.
Writers, photographers, musicians, dancers, artists, designers and filmmakers all deserve to have their work parsed by humans, not silicon chips in data centers. What’s the point in investing our time just to impress a machine?
Machines are incapable of judging art; they can only replicate what is already there.
Creative competitions, however whimsical, should reward artistic excellence with clear, fair, and human-judged outcomes. The objective should be to lift writers and artists up, not to check if a robot can do a human’s job.
Authenticate with your Twitter API keys
Tweet ID to track
Initial search
Find the highest reply thread
Use highest_reply_id for subsequent searches
Example of retrieving the actual text of the reply: