1-dan master of the unyielding fist of Bayesian inference
5834 stories · 1 follower

[D] OpenAI o3 87.5% High Score on ARC Prize Challenge

1 Share

https://arcprize.org/blog/oai-o3-pub-breakthrough

OpenAI's new o3 system - trained on the ARC-AGI-1 Public Training set - has scored a breakthrough 75.7% on the Semi-Private Evaluation set at our stated public leaderboard $10k compute limit. A high-compute (172x) o3 configuration scored 87.5%.

submitted by /u/currentscurrents to r/MachineLearning
clumma
10 hours ago
Berkeley, CA

Deimos, first critical experiment using HALEU in decades

1 Share
submitted by /u/NukesDoItAllNight to r/nuclear
clumma
20 hours ago
Berkeley, CA

The Year in Computer Science

1 Share

The end of 2024 seems a particularly uncertain time in history, and theoretical computer science is no exception. Amid several breakthroughs and new findings, the field also confronted its own doubts and limitations. For example, artificial intelligence once again dominated the popular discourse this year. Researchers have begun to understand what might be going on within the “black boxes” of…

clumma
1 day ago
Berkeley, CA

The Year in Biology

2 Shares

Many types of discoveries can surprise and delight, but few findings are more exciting than the overturned assumption — when scientists, sometimes accidentally, stumble upon a way to flip received wisdom on its head. For example, biologists have assumed for decades that the immune system regulates itself, without the intervention of our brains. But this year they discovered that a neural circuit…

clumma
2 days ago
Berkeley, CA

FACTS Grounding: A new benchmark for evaluating the factuality of large language models

1 Share
Our comprehensive benchmark and online leaderboard offer a much-needed measure of how accurately LLMs ground their responses in provided source material and avoid hallucinations

clumma
3 days ago
Berkeley, CA

The Year in Physics

1 Share

Will 2024 be remembered as a banner year in the quest to understand the universe, or just an average one? That depends on whether a result from this spring turns out to be real. In April, physicists detected a hint of a signal suggesting that dark energy, the mysterious energy of space itself, may be weakening. “Hint” is the preferred term because the sign in the heavens isn’t quite robust…

clumma
3 days ago
Berkeley, CA