More

Myrmornis · 2026-05-11T03:11:47 1778469107

> I typed :rs pods to switch back to the pods view. Nothing rendered. The table was empty... > now something was fundamentally broken and I couldn't just prompt my way out of it.

Hey I don't want to over simplify, I'm sure it was complicated, but did the author have functional tests for these broken views? As long as there are functional tests passing on the previous commit I'd have thought that claude could look at the end situation and work out how to get the desired feature without breaking the other stuff.

TUIs aren't an exception, it's still essential to have a way to end-to-end test each view.

jvuygbbkuurx · 2026-05-11T06:40:16 1778481616

The problem wasn't the view didn't work. The problem was the view didn't work after something else had been done.

You can't test every permutation of app usage. You actually need good architechture so you can trust your test and changes to be local with minimal side-effects.

Myrmornis · 2026-05-11T12:03:30 1778501010

> The problem was the view didn't work after something else had been done.

In that situation you have two choices:

1. Tell claude to iterate until the tests for the new view and the old views are all passing.

2. git reset --hard back to the previous commit at which all tests are passing and tell claude to try again, making sure not to break any tests.

It's essential to use tests when vibecoding anything non trivial. Almost certainly in a TDD style.

Myrmornis · 2026-05-03T01:41:57 1777772517

On the one hand I'm not sure Dawkins has read/thought enough about how LLMs actually work. I'm getting the impression he doesn't fully appreciate or is somehow forgetting that it's a text completion algorithm with a vast number of parameters and that even if the patterns of learned parameter tunings are not really comprehendible, the architecture was very deliberately designed.

But on the other hand his thoughts at the end are interesting. Summary:

Maybe our "consciousness" is like an LLM's intelligence. But if not, then it raises the question of why do we even have this "extra" consciousness, since it appears that something like a humanoid LLM would be decent at surviving. His suggestions: maybe our extra thing is an evolutionary accident (and maybe there _are_ successful organisms out there with the LLM-style non-conscious intelligence), or maybe as evolved organisms it's necessary that we really feel things like pain, so that evolutionary mechanisms like pain (and desire for food, sex etc) had strong adaptive benefits.

collyw · 2026-05-03T09:30:06 1777800606

"But if not, then it raises the question of why do we even have this "extra" consciousness"

Keep chipping away Dawkins, you might arrive at God eventually.

mmustapic · 2026-05-03T18:59:59 1777834799

The brain uses a lot less energy than an LLM, so most probably it is something completely different. Maybe consciousness is a byproduct of the architecture of the brain, so there is no version of a humanoid with no consciousness.

thrownthatway · 2026-05-03T10:33:28 1777804408

GTP’s, or transformers more generally can be trained on data other than language (text / audio).

They can operate on data other than natural language.

So can humans.

Myrmornis · 2026-05-03T01:25:04 1777771504

I don't think you read carefully what he said. At the end he gave three quite interesting thoughts about what might be true assuming LLMs are less conscious than we are (i.e. assuming our consciousness is not a purely algorithmic phenomenon as we obviously know LLMs are).

Myrmornis · 2026-04-25T15:07:11 1777129631

Your notes look really interesting, thanks. I'm curious --from the prose style it's clear they were written by an LLM. For design notes like this do you sort of have a mental TODO to go back and write them up in your own words to make sure they really capture your own opinions?

zby · 2026-04-25T15:18:57 1777130337

For the design notes like: https://zby.github.io/commonplace/notes/designing-agent-memo... - I iterate over and over to clean them. This one is also a compilation with many intermediate documents.

But the reviews are written automatically - here are the instructions: https://github.com/zby/commonplace/blob/main/kb/agent-memory...

Overall the knowledgebase is a mixture of these. I have this disclaimer on the first page:

This KB is itself agent-operated: a human directs the inquiry, AI agents draft, connect, and maintain the notes. The framework for building knowledge bases is documented using that framework.

I hope it is enough - I've seen many people get angry with publishing LLM generated work.

Myrmornis · 2026-04-18T11:58:09 1776513489

The article is still missing the most important point about a "trust system" -- you have to explain what it is and convince me that I even care about the problem you're trying to solve. It's my machine, what is a "trusted" or "untrusted" file? If people just force security "solutions" on me without asking me whether I understand or agree with their problem diagnosis then I will immediately disable the protection if I can or blanket accept all prompts without thinking.

This is good, but it doesn't go far enough:

> ... the problem with security measures that cause too much friction is that users tend to disable them in order to get on with their work. To fulfill its security purposes, a good trust system needs to stay out of your way.

Myrmornis · 2026-04-15T15:21:12 1776266472

https://d2lang.com/ is a nicer language than Mermaid with much nicer visual appearance. It would be great if it became more widely supported.

erajasekar · 2026-04-15T15:41:47 1776267707

I reached the same conclusion after comparing diagram-as-code tools — D2 feels cleaner and more expressive than Mermaid.

I’ve been working on an AI diagramming tool built around D2: https://aidiagrammaker.com/ You describe a system in plain English, and it generates architecture diagrams, flowcharts, and sequence diagrams in D2.

Edits can be made either directly in the D2 code or via a context-aware editor.

wps · 2026-04-15T18:29:04 1776277744

Has anyone here used https://pikchr.org/ from the creator of SQLite?

neuronexmachina · 2026-04-15T16:17:44 1776269864

I agree that it's nicer and more powerful, but it's a little concerning it hasn't had any commits in the past 6 months: https://github.com/terrastruct/d2/commits/master/

16bitvoid · 2026-04-15T18:18:54 1776277134

I think the founder/lead developer, Alexander Wang, works at OpenAI now.

Plus, according to this comment on an issue, folks in their discord say it's not being actively maintained.

https://github.com/terrastruct/d2/issues/2735#issuecomment-4...

vladsanchez · 2026-04-15T19:10:31 1776280231

Never imagined Scale.ai CEO was D2Lang creator nor that he joined his room-mate at OpenAI (giggle).

Thanks for sharing

16bitvoid · 2026-04-16T04:24:57 1776313497

Scale AI is a different Alex. Their first names are spelled slightly different. Alexandr vs Alexander

nine_k · 2026-04-15T17:58:05 1776275885

Maybe it doesn't need constant fixing? Are there many issues in the tracker?

jauntywundrkind · 2026-04-15T17:00:29 1776272429

How good is the LLM at creating d2? What if any skills/material can folks recommend? (Follow-up: D2-mcp has a cheat sheet, https://github.com/h0rv/d2-mcp/blob/main/d2/CHEATSHEET.md)

And, does GitHub support it? (Follow up: alas not! Sadness. Please add!)

elcritch · 2026-04-21T11:58:50 1776772730

For me that's the deal breaker :/

lugao · 2026-04-15T15:23:24 1776266604

Does it produce real svgs as opposed to foreign object html in svg mess that mermaid compilers produce?

troyvit · 2026-04-16T15:08:20 1776352100

d2 produces real svgs but I've found them to have a hard time displaying in other svg editors. The d2 folks talk about that somewhere and they have some fixes for it.

spacecow · 2026-04-15T16:48:11 1776271691

Oh, finally, something that supports actual hierarchical state diagrams (that isn't Graphviz, no offense)... Mermaid's "You cannot define transitions between internal states belonging to different composite states" [1] has driven me up a wall for years.

  parentA.childA -> parentB.childB: voop
  parentB.childB -> parentA: vorp

shouldn't be that hard!

[1] https://mermaid.ai/open-source/syntax/stateDiagram.html#comp...

pastelsky · 2026-04-15T23:24:29 1776295469

Thanks for sharing! I haven’t looked into that before but looks neat.

growrow · 2026-04-15T15:23:08 1776266588

What makes it nicer?

Myrmornis · 2026-04-15T15:40:49 1776267649

Take a look at https://d2lang.com/examples/dagre/ and https://d2lang.com/tour/intro/

The language is richer and all diagram types are implemented consistently in the same language in a way that can be composed, as opposed to being a collection of unrelated DSLs.

The improved visual appearance is clear from inspecting example diagrams, I believe.

Myrmornis · 2026-04-11T16:38:36 1775925516

I read the first couple of posts in the series. The essay is full of criticism of LLMs, and in a couple of places the author distances himself, as if he himself isn't using them ("some people I respect tell me that...").

It's certainly worth discussing the fact that the entire industry is starting to outsource large amounts of our thinking and writing work to non-sentient statistical algorithms, but this discussion needs to honestly confront the extent to which they are successfully completing useful tasks today.

Myrmornis · 2026-04-01T19:25:25 1775071525

This is really cool! Is there an alternative way of thinking about it involving a hidden markov model, looking for a change in value of an unknown latent P(fail)? Or does your approach end up being similar to whatever the appropriate Bayesian approach to the HMM would be?

tazsat0512 · 2026-04-02T06:10:23 1775110223

[flagged]

basilikum · 2026-04-02T12:05:52 1775131552

Please stop spamming HN wih AI slop.

Myrmornis · 2026-03-29T20:16:27 1774815387

Doubtless the current LLMs aren't the last word. But this author sounds like they would get more out of the current LLMs if they put their energies into that rather than into criticism.

Myrmornis · 2026-03-29T00:52:16 1774745536

The rounded corners are nothing compared to the notch camera making part of the top of my applications invisible.