Scary that just 1 month ago, after evaluating o1, the great Terence Tao-
 
*anticipated that the benchmark would "resist AIs for several years at least," noting that the problems require substantial domain expertise and that we currently lack sufficient relevant training data.*

![](https://m.stacker.news/68853)

(https://arxiv.org/html/2411.04872v1)

tech

OpenAI's new model "o3" achieves amazing scores on benchmarks

zuspotirko

Particularly impressive to me

![](https://pbs.twimg.com/media/GfQtsVnXgAAnE6h?format=jpg&name=medium)

It solves 1/4 of research-level math questions.