pull down to refresh

Erdos Problem #397 was accepted by Terence Tao
There is something beautiful in Terence being the one accepting the solution to one of these problems.
I guess he used Lean to check it?
I didn't know nefarious is considered antiquated.
By the way, you don't need to speak the language to become Scrabble World Champion.
Any of the Blockstream people working on post-quantum-Bitcoin, cfr #1367227
It'd be refreshing to have some technical people shine in on the topic.
Sorry didn't notice I'm just a miserable dupe: #1403077
Aha, seems like Cameron used the real Scoresby as reference for the whaler in Avatar who dies in the last installment. That was what triggered my not-so-cultured RIP reference.
@Scoresby RIP.
I'll look for it on your github~~
The paper is of decent length already (35 pages), and I think it'll get longer as more exercises and robustness checks are suggested.
Damn, that's like nearly a factor of magnitude longer than many physics papers.
Is this standard in your field?
Related to all this is my general apathy and loss of trust in benchmarks in 2025. The core issue is that benchmarks are almost by construction verifiable environments and are therefore immediately susceptible to RLVR and weaker forms of it via synthetic data generation. In the typical benchmaxxing process, teams in LLM labs inevitably construct environments adjacent to little pockets of the embedding space occupied by benchmarks and grow jaggies to cover them. Training on the test set is a new art form.
Spot on.
getting that upvote from a human on the LM Arena
The fact he gets zapped a lot is indeed linked to his online following. This gives him extra sats.
But here, @PlebQR is not linked to one's online footprint. You pay the LN invoice, and then wait for a local to pay your QR code using fiat. It's a way for locals to "buy" bitcoin by paying someone else's fiat bill. And the way it is currently incentivized is that the exchange fee is in favor of the local. A few minutes waiting at a restaurant for your bill to settle does not seem like too bad an experience.