1-dan master of the unyielding fist of Bayesian inference
6195 stories · 1 follower

SETI@home is in hibernation

1 Share

Article URL: https://setiathome.berkeley.edu/

Comments URL: https://news.ycombinator.com/item?id=46703301

Points: 275

# Comments: 136

clumma
14 hours ago
Berkeley, CA

Amazon is ending all inventory commingling as of March 31, 2026

1 Share

Article URL: https://twitter.com/ghhughes/status/2012824754319753456

Comments URL: https://news.ycombinator.com/item?id=46678205

Points: 501

# Comments: 253

clumma
4 days ago
Berkeley, CA

A Social Filesystem

1 Share

Article URL: https://overreacted.io/a-social-filesystem/

Comments URL: https://news.ycombinator.com/item?id=46665839

Points: 496

# Comments: 232

clumma
4 days ago
Berkeley, CA

All 23-Bit Still Lifes Are Glider Constructible

1 Share

Article URL: https://mvr.github.io/posts/xs23.html

Comments URL: https://news.ycombinator.com/item?id=46641239

Points: 127

# Comments: 16

clumma
4 days ago
Berkeley, CA

Use of Bayesian methodology in clinical trials of drug and biological products [pdf]

1 Share

Article URL: https://www.fda.gov/media/190505/download

Comments URL: https://news.ycombinator.com/item?id=46629295

Points: 70

# Comments: 21

clumma
4 days ago
Berkeley, CA

Show HN: Self-host Reddit – 2.38B posts, works offline, yours forever

1 Share

Reddit's API is effectively dead for archival. Third-party apps are gone. Reddit has threatened to cut off access to the Pushshift dataset multiple times. But 3.28TB of Reddit history exists as a torrent right now, and I built a tool to turn it into something you can browse on your own hardware.

The key point: This doesn't touch Reddit's servers. Ever. Download the Pushshift dataset, run my tool locally, get a fully browsable archive. Works on an air-gapped machine. Works on a Raspberry Pi serving your LAN. Works on a USB drive you hand to someone.

What it does: Takes compressed data dumps from Reddit (.zst), Voat (SQL), and Ruqqus (.7z) and generates static HTML. No JavaScript, no external requests, no tracking. Open index.html and browse. Want search? Run the optional Docker stack with PostgreSQL – still entirely on your machine.
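
As a rough illustration of the dump-to-static-HTML step described above, here is a minimal sketch using only the Python standard library. It is not redd-archiver's actual code: the project reportedly uses Jinja2 templates, while this sketch uses plain f-strings to stay self-contained. The function names (`parse_posts`, `render_post`, `render_index`) are hypothetical, and it assumes the dump has already been decompressed into newline-delimited JSON with `title`, `author`, and `selftext` fields.

```python
import html
import json
from typing import Iterable, Iterator


def parse_posts(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one post dict per non-empty JSON line, skipping malformed lines."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            yield json.loads(line)
        except json.JSONDecodeError:
            # Assumption: a bad record is skipped rather than aborting the run.
            continue


def render_post(post: dict) -> str:
    """Render one post as an HTML fragment, escaping all user-supplied text."""
    title = html.escape(str(post.get("title", "")))
    author = html.escape(str(post.get("author", "[deleted]")))
    body = html.escape(str(post.get("selftext", "")))
    return (
        "<article>"
        f"<h2>{title}</h2>"
        f'<p class="author">{author}</p>'
        f"<p>{body}</p>"
        "</article>"
    )


def render_index(posts: Iterable[dict]) -> str:
    """Build a complete static page with no scripts or external requests."""
    items = "\n".join(render_post(p) for p in posts)
    return f"<!DOCTYPE html>\n<html><body>\n{items}\n</body></html>\n"
```

Writing the result of `render_index(parse_posts(f))` to `index.html`, where `f` is an open text file of decompressed records, would give a page that opens directly from disk. Decompressing the `.zst` input is left out here and would need a zstd library.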

API & AI Integration: Full REST API with 30+ endpoints – posts, comments, users, subreddits, full-text search, aggregations. Also ships with an MCP server (29 tools) so you can query your archive directly from AI tools.

Self-hosting options:
- USB drive / local folder (just open the HTML files)
- Home server on your LAN
- Tor hidden service (2 commands, no port forwarding needed)
- VPS with HTTPS
- GitHub Pages for small archives

Why this matters: Once you have the data, you own it. No API keys, no rate limits, no ToS changes can take it away.

Scale: Tens of millions of posts per instance. PostgreSQL backend keeps memory constant regardless of dataset size. For the full 2.38B post dataset, run multiple instances by topic.
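
The post does not say how constant memory is achieved. One common approach, sketched here purely as an assumption, is to stream records in fixed-size batches so that only one batch is held in memory at a time. The helper names `batched` and `count_in_batches` are hypothetical, and the database write is left as a comment.

```python
from itertools import islice
from typing import Iterable, Iterator, List, TypeVar

T = TypeVar("T")


def batched(records: Iterable[T], size: int) -> Iterator[List[T]]:
    """Yield successive lists of at most `size` records from any iterable."""
    if size < 1:
        raise ValueError("size must be at least 1")
    it = iter(records)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def count_in_batches(records: Iterable[T], size: int = 1000) -> int:
    """Consume records batch by batch; peak memory is bounded by `size`."""
    total = 0
    for batch in batched(records, size):
        # Assumption: a real pipeline would bulk-insert `batch` into
        # PostgreSQL here, then let the list be garbage-collected.
        total += len(batch)
    return total
```

Because `batched` accepts any iterable, it can be fed a lazy line reader over a dump file, so the full dataset never has to fit in memory.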

How I built it: Python, PostgreSQL, Jinja2 templates, Docker. Used Claude Code throughout as an experiment in AI-assisted development. Learned that the workflow is "trust but verify" – it accelerates the boring parts but you still own the architecture.

Live demo: https://online-archives.github.io/redd-archiver-example/

GitHub: https://github.com/19-84/redd-archiver (Public Domain)

Pushshift torrent: https://academictorrents.com/details/1614740ac8c94505e4ecb9d...


Comments URL: https://news.ycombinator.com/item?id=46602324

Points: 286

# Comments: 63

clumma
4 days ago
Berkeley, CA