Software security heavily favours the attacker (e.g. it's much easier to find a single vulnerability than to patch every vulnerability). Thus, even with better tools and ample time to reach steady state, we would expect software to remain insecure.
If we think in the context of LLMs, why is it easier to find a single vulnerability than to patch every vulnerability? If the defender and the attacker are using the same LLM, the defender can run "find a critical vulnerability in my software" until it comes up empty, and then the attacker will find nothing.
Defenders are favored here too, especially for closed-source applications where the defender's LLM has access to all the source code while the attacker's LLM doesn't.
That generally makes sense to me, but I wonder if it's different when the attacker and defender are using the same tool (Mythos in this case).
Maybe you just spend more on tokens by some factor than the attackers do combined, and end up mostly okay. Put another way, if there are 20 vulnerabilities that Mythos is capable of finding, maybe it's reasonable to find all of them?
"Most security tooling has historically benefitted defenders more than attackers. When the first software fuzzers were deployed at large scale, there were concerns they might enable attackers to identify vulnerabilities at an increased rate. And they did. But modern fuzzers like AFL are now a critical component of the security ecosystem: projects like OSS-Fuzz dedicate significant resources to help secure key open source software.
We believe the same will hold true here too—eventually. Once the security landscape has reached a new equilibrium, we believe that powerful language models will benefit defenders more than attackers, increasing the overall security of the software ecosystem. The advantage will belong to the side that can get the most out of these tools. In the short term, this could be attackers, if frontier labs aren't careful about how they release these models. In the long term, we expect it will be defenders who will more efficiently direct resources and use these models to fix bugs before new code ever ships."
This is only true if your approach is security through correctness. This never works in practice. Try security through compartmentalization. Qubes OS provides it reasonably well.
Thunderbolt is its own protocol, electrically incompatible with PCIe. Its purpose is to encapsulate data traffic from multiple other protocols (PCIe, DisplayPort, USB) and multiplex that over the same wires. It cannot function exactly like an external PCIe port because it's solving a bigger, more complicated problem.
I am currently implementing S3, with correct error handling for edge cases. It's been about a month and I have almost the whole API surface done, with some gaps (you actually need a broader surface, e.g. some IAM; KMS encryption needs a persistence layer, so I've only done SSE-C so far). It's close to the point where you could drop it in as a local emulation tool, and it's a sub-9 MB binary (Rust).
Literally as I write this, the AI code review said "suspended-bucket current-version selection is still wrong when the numbered version and the newer null version land in the same millisecond" - that's the level of detail you have to deal with.
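To make that edge case concrete, here is a minimal sketch of the kind of "current version" selection logic involved. The `ObjectVersion` struct and the tie-break rule (prefer the null version on equal millisecond timestamps, since a suspended bucket overwrites the null version) are illustrative assumptions, not a claim about the emulator's actual code or AWS's documented behavior.

```rust
// Illustrative sketch: choosing the "current" version of a key when bucket
// versioning is suspended and timestamps only have millisecond precision.
#[derive(Debug)]
struct ObjectVersion {
    version_id: Option<String>, // None models S3's "null" version
    last_modified_ms: u64,      // millisecond precision, so ties can happen
}

fn current_version(versions: &[ObjectVersion]) -> Option<&ObjectVersion> {
    versions.iter().max_by(|a, b| {
        a.last_modified_ms
            .cmp(&b.last_modified_ms)
            // Hypothetical tie-break for the same-millisecond case: the null
            // version wins, because a suspended bucket writes new objects as
            // the null version.
            .then_with(|| a.version_id.is_none().cmp(&b.version_id.is_none()))
    })
}

fn main() {
    // A numbered version and a null version written in the same millisecond.
    let versions = vec![
        ObjectVersion { version_id: Some("v1".to_string()), last_modified_ms: 1_700_000_000_000 },
        ObjectVersion { version_id: None, last_modified_ms: 1_700_000_000_000 },
    ];
    let current = current_version(&versions).unwrap();
    assert!(current.version_id.is_none());
    println!("current = {:?}", current);
}
```

Without the explicit tie-break, the winner would depend on iteration order, which is exactly the sort of bug an AI reviewer (or a differential test against real AWS) can flag.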
Actually no, because they have to worry about their millions of customer deployments not randomly breaking every other week. So they are generally very good about not breaking compatibility unless they have a really good reason to. And when they do, they typically give people years of time to deal with any backwards-compatibility-breaking changes, because otherwise they are dealing with lots of support overhead and angry customers. That, plus the multiple independently developed S3-compatible clients and alternatives, makes this probably the easiest thing to simulate in the AWS universe, as it is probably the most widely used thing in AWS.
Not really. The main thing they could change is the order in which errors are checked when there are multiple errors, but even a lot of that is not likely to change, e.g. HTML-based stuff comes first. There are actual reasons for a lot of the subtleties, plus they have to keep a stable contract for all the success cases, as people rely on it. And it's not hard to keep updated: I have 800 or so tests against AWS that run in a few minutes.