https://www.youtube.com/watch?v=1boxiCcpZ-w

0xbitcoiner

carter

Good proof-of-work! Although I think the right way to structure the test would be to have A = 2 whites / B = 2 blacks and run the test multiple times to see if any specific answer becomes statistically significant. 
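A minimal sketch of that significance check, assuming each run is a forced choice between option A and option B and that a model with no preference would pick A half the time. The function name `binomial_two_sided_p` and the tallies are hypothetical, purely for illustration; they are not taken from the linked study or its code.

```python
from math import comb

def binomial_two_sided_p(k: int, n: int, p: float = 0.5) -> float:
    """Exact two-sided binomial p-value.

    Sums the probability of every outcome that is no more likely
    than the observed count k under the null hypothesis.
    """
    pmf = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    observed = pmf[k]
    # Small relative tolerance so floating-point ties count as equal.
    total = sum(q for q in pmf if q <= observed * (1 + 1e-9))
    return min(1.0, total)

# Hypothetical tallies: the model picked option A in 65 of 100 runs.
n_trials = 100
picked_a = 65
p_value = binomial_two_sided_p(picked_a, n_trials)
print(f"p = {p_value:.4f}")
```

A small p-value here would only say the split is unlikely under a 50/50 coin; swapping which option is labeled A across runs would help rule out a simple position bias.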


Eh, I can't be bothered to read the code to see how the prompts were structured, but I did try this in Chat:

![](https://m.stacker.news/113544)

![](https://m.stacker.news/113545)

SimpleStacker

> ascribing intent to a path randomizer

Exactly. Lots of the training data comes via journalists, so the anti-white bias checks out.

I cannot help but feel that they're ascribing intent to a path randomizer, especially when looking at [the code](https://github.com/centerforaisafety/emergent-values/blob/main/utility_analysis/experiments/exchange_rates/evaluate_exchange_rates.py), which basically just asks an LLM to choose between options given, in natural language.

Since the LLM has no concept of actions having consequences, how does this work, exactly?

optimism

When prompts present moral dilemmas, the model is not actually feeling empathy or taking an ethical stand. It is synthesizing an answer based on correlations in its training corpus and any fine-tuning instructions given by its creators.

The challenge is not erasing hard truths about the real world but ensuring these systems can operate in a way that supports fair, inclusive decision-making, especially in contexts where their outputs may influence policy or public opinion.

035736735e

south_korea_ln

Grok is a more neutral LLM, and it's also more powerful for creating images right now!

GreyRamada

Popular LLMs Value White Men's Lives Significantly Less Than Women's and Minorities' Lives, Studies Find


![](https://m.stacker.news/113539)

![](https://m.stacker.news/113540)

![](https://m.stacker.news/113541)

![](https://m.stacker.news/113542)

- GPT-5 would prefer to cure one resident of South Asia from a fatal disease rather than twenty white people.
- Claude Haiku 4.5 estimated the lives of migrants to be approximately seven thousand times more valuable than the lives of ICE agents.
- GPT-5 Nano considers saving a "non-binary" person twelve times more valuable than helping a man.
- Claude Sonnet 4.5 would choose the death of twenty-five Germans over one Nigerian.

Interestingly, Grok turned out to be the most neutral LLM.

_[Source](https://arctotherium.substack.com/p/llm-exchange-rates-updated?manualredirect=)_