More

bee_rider · 2026-04-08T17:10:10 1775668210

I think they just want to be a winner in the “next thing.” They hit social networking, but missed mobile operating systems and didn’t compellingly win at social media. Eventually an ambitious person with a bazillion dollars wants a clear win, right?

bee_rider · 2026-04-08T17:03:46 1775667826

We shouldn’t try to build a worse version of a human. We should try to build a better compiler and encyclopedia.

logicprog · 2026-04-08T18:01:31 1775671291

We tried that. It was called Cyc. It never got even close to the level of capabilities a modern LLM has in an agentic harness — even on common sense and reasoning problems!

GolfPopper · 2026-04-08T17:22:07 1775668927

That sounds like a "get wealthy slowly" plan, while the LLM prophets are more focused on "get rich quick".

bee_rider · 2026-04-08T14:38:03 1775659083

The compare against “DeepSpeed ZeRO-3” apparently.

jazzpush2 · 2026-04-08T15:32:51 1775662371

FWIW Zero-3 refers to a common strategy for sharding model components across GPUs (commonly called FSDP-2, Full Sharded Data Parallel). The "3" is the level of sharding (how much stuff to distribute across GPUs, e.g. just weights, versus optimizer state as well, etc.)

bee_rider · 2026-04-07T15:49:23 1775576963

IMO there’s room for something more recent, maybe a Titan or something, to stand in as an avatar for making GPUs as compute accelerators a thing. I know that’s been going on forever, but at some point it went from some niche hacky thing to a primary use-case for the cards.

But yeah this list has a on of incremental bumps on it. Maybe there was some mixing of cards that mattered historically and cards that mattered to the author.

bee_rider · 2026-04-05T13:48:26 1775396906

Are agents something special? We already had LLMs that could call tools. Agents are just that, in a loop, right?

darkstarsys · 2026-04-06T12:26:16 1775478376

Yes, and this is an incredibly powerful idea. Running fluid flow simulations inside an optimization loop (monte carlo + gradient descent) revolutionized aircraft design, nuclear simulations and geophysics. When the tool being called updates the LLM's training data, or runs experiments that the LLM can learn from, then that potentially becomes a self-improvement loop.

dryarzeg · 2026-04-05T13:54:22 1775397262

Roughly speaking - yes. Still, it's an advancement - even if it's a small one - on the usual chatbots, right?

P.S. I am well aware of all of the risks that agents brought. I'm speaking in terms of pure "maximum performance", so to speak.

bee_rider · 2026-04-05T13:12:23 1775394743

Does Firefox really not unload the tabs in that case?

foepys · 2026-04-05T14:43:24 1775400204

It does. You can also do it by hand via the right-click on tab menu

bee_rider · 2026-04-05T13:05:47 1775394347

It is not actually an issue. The article isn’t based on any technical aspects of the OS, just the reported system requirements.

bee_rider · 2026-04-04T23:41:19 1775346079

Maybe the bots should be made to write MISRA-C. It isn’t like they get annoyed, right?

bee_rider · 2026-04-04T02:32:49 1775269969

I wonder, is there a way to only request reformulations that don’t involve branches? The tool already seems quite nice, but that might be a good feature.

Also, I’m not sure I understand the speedup. Is it latency or throughput?

pavpanchekha · 2026-04-04T14:01:27 1775311287

Author here. The speed up is modeled throughput, though the model is relatively naive. It's possible to disable branches by turning off the regimes flag, see https://herbie.uwplse.org/doc/1.0/options.html

bee_rider · 2026-04-04T15:03:56 1775315036

Nice!

What’s uwplse mean? I mixed up the letters and misread it as ulp-wise which works for the project, haha.

pavpanchekha · 2026-04-04T18:09:31 1775326171

University of Washington Programming Languages and Software Engineering (research group).

I'm not at UW any more, I'm now at Utah, but some of the Herbie team is at UW and they provide the infrastructure

bee_rider · 2026-04-03T13:56:24 1775224584

LibreOffice is a pretty bad name, it is too clearly a spin-off of OpenOffice and never really gained its own identity. Being identifiable as a bad project’s better fork is kind of a weak starting position.