Sam Altman
@sama · 3d ago
here is a little hint: 🎁
Important new eval!
We’re releasing a new eval to measure expert-level scientific reasoning: FrontierScience.
This benchmark measures PhD-level scientific reasoning across physics, chemistry, and biology.
It contains hard, expert-written questions (both olympiad-style problems and longer research-style tasks) designed to reveal where models succeed and where they fall short.
openai.com/index/frontier…
GPT-5.2-Codex launches today.
It is trained specifically for agentic coding and terminal use, and people at OpenAI have been having great success with it.
Images 1.5 launches today in ChatGPT and the API!
Much better images in tons of ways, faster, and new editing capability.
Chain-of-thought monitorability:
openai.com/index/evaluati…
Last week, a security researcher using our previous model found and disclosed a vulnerability in React that could lead to source code exposure.
I believe these models will be a net win for cybersecurity, but we are in the 'real impact phase' as they improve.
Also, there is a very fun way to use it to easily get fun images in ChatGPT:
For example:
Codex is getting extremely good and will rapidly improve. If you want to help make it 100x better next year, the team is hiring. Crazy adventure guaranteed; success probable.