azeirah · 2026-04-20T20:15:37 1776716137

reMarkable is doing a decent job: their first-generation device launched in 2017 and is still getting updates. It is discontinued for sale, but there is no reason to believe reMarkable will stop updating their other devices if they're _still_ updating a device they don't even sell anymore.

On top of that, their aftermarket and open source situation is pretty good.

They're not ideal e-readers, but if you're in the market for a good e-ink device with long-term support that works well with calibre? Might be worth a look.

Insanity · 2026-04-20T20:23:12 1776716592

Until the software compatibility with the older model compromises the newer models. Kudos to them for still updating a device no longer on sale, but Apple does the same, until it doesn't. The fact that they are still updating the first generation today does not mean they will do so tomorrow.

That said, reMarkable devices are great as well.

azeirah · 2026-04-16T15:11:19 1776352279

There is no hallucination benchmark currently.

I was researching how to predict hallucinations using the literature (Fastowski et al., 2025; Cecere et al., 2025), and the general-ish situation is that there are ways to introspect a model's certainty levels by probing it from the outside. That gets you the same certainty metric you _would_ have gotten if the model had been trained as a Bayesian model, i.e., one that knows what it knows and knows what it doesn't know.

This significantly improves claim-level false-positive rates (which are measured with the AUARC metric, i.e., abstention rates: have the model shut up when it is actually uncertain).
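A minimal sketch of how an abstention metric like this could be computed, assuming AUARC means the area under the accuracy-rejection curve and using a simple discretization. The function name `auarc` and the toy data are made up for illustration; the cited papers may define the details differently.

```python
def auarc(confidences, correct):
    """Approximate area under the accuracy-rejection curve.

    Assumption (not from the cited papers): answers are rejected from
    least to most confident, and the area is approximated as the mean
    accuracy over every possible number of retained answers.
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        raise ValueError("need at least one answer")

    # Most confident answers first; these are the last to be rejected.
    order = sorted(range(n), key=lambda i: confidences[i], reverse=True)

    hits = 0
    accuracies = []
    for kept, i in enumerate(order, start=1):
        hits += 1 if correct[i] else 0
        # Accuracy when only the `kept` most confident answers are retained.
        accuracies.append(hits / kept)

    return sum(accuracies) / n


# Toy data: the same four answers, scored by two hypothetical confidence signals.
correct = [True, True, False, False]
well_ranked = [0.9, 0.8, 0.3, 0.1]    # confident exactly where it is right
badly_ranked = [0.1, 0.3, 0.8, 0.9]   # confident exactly where it is wrong

print(auarc(well_ranked, correct))    # about 0.79
print(auarc(badly_ranked, correct))   # about 0.21
```

Both signals have the same raw accuracy (50%), but the score only goes up when the model is unsure about exactly the answers it gets wrong, which is the "shut up when uncertain" behaviour.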

This would be great to include as a metric in benchmarks, because right now the benchmark just says "it solves x% of benchmarks", whereas what real-world developers actually care about is "it solves x% of benchmarks *reliably*" AND "it produces false positives y% of the time".

So the answer to your question is: we don't know. It might be a cherry-picked result, it might be fewer hallucinations (better metacognition), or it might be the capability to solve more difficult problems (better intelligence).

The benchmarks don't make this explicit.

azeirah · 2026-04-05T08:22:28 1775377348

This sounds lovely! This definitely sounds like the kind of thing that would really work well to let all the stresses of the everyday wash off!

It reminds me a lot of the YouTube channel "life in jars", he normally makes videos about microbiology and freshwater ecology in... jars!

But on top of that he also had a short series on gaining the trust of and befriending crows in his city.

Good incentive for me to try this out!

azeirah · 2026-03-11T21:38:12 1773265092

Cerebras?

azeirah · 2026-03-04T21:29:14 1772659754

I wrote a program with programmable brushes about ten years ago. It's a bit different from moss in that it has a physics simulation underneath rather than a sort of shader, but I've always thought this kind of approach has a lot of potential.

It feels _amazing_ to draw a bird in a single stroke!

Maybe this can give you some inspiration!

https://laura.fm/generative-art/wind/wind.html

frisia · 2026-03-05T13:11:24 1772716284

This was very interesting to read! My drawing program of choice is now Rebelle, which does have a "swarm" brush (they call them bristle brushes, designed to emulate real paintbrushes). Together with its physical simulation, where paint applied to the canvas has a thickness instead of an opacity, the results can look absolutely stunning. This has given me the itch to experiment with simulation-based drawing programs too.

vunderba · 2026-03-04T21:43:57 1772660637

Very cool. Side note, but I've always thought that a pixel art editor like Aseprite coupled with a "cellular sand sim" would be really cool.

nextaccountic · 2026-03-05T01:48:19 1772675299

This looks like Krita brushes

azeirah · 2026-02-25T10:06:25 1772013985

Sorry, but what do you mean by crystal clear requirements?

I don't particularly think "y7u8888888ftrg34BC" would pass as a crystal clear requirement at my workplace :<

Do you mean something different?

dormento · 2026-02-25T13:42:25 1772026945

> y7u8888888ftrg34BC

This is more information than the average user gives you when requesting new features.

wazHFsRy · 2026-02-25T12:02:01 1772020921

I mean, you get a random game in the author's example :) But in real life you do not want a random game. That's what I mean: you need the great scaffolding + exact requirements. Then the prompt to do the implementation does not matter too much.

withinboredom · 2026-02-25T13:14:25 1772025265

Recent research has shown that specs actually degrade output, so I’m more inclined to think you’ve got the slot machine mentality mentioned up-thread.

dovirk · 2026-02-26T18:12:27 1772129547

Source?

Last I checked, specs improved output quality. There are hiccups with complex, difficult specs but I guess that much is obvious.

withinboredom · 2026-02-27T07:03:56 1772175836

https://arxiv.org/abs/2602.11988

azeirah · 2026-02-10T22:44:18 1770763458

If I understand the author correctly, he chose the hyperbolic model specifically because the story of "the singularity" _requires_ a function that hits infinity.

He's looking for a model that works for the story in the media and runs with it.

Your criticism seems to be aimed at the story, not the author's attempt to take it "seriously".
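For concreteness, a small worked comparison of why a hyperbolic model "hits infinity" while an exponential one never does. This is a textbook illustration, not taken from the article; the symbols $x_0$ (starting value) and $k$ (growth constant) are assumptions introduced here.

```latex
% Exponential growth: rate proportional to x. Finite for every finite t.
\frac{dx}{dt} = kx, \quad x(0) = x_0
\;\;\Longrightarrow\;\;
x(t) = x_0 e^{kt}

% Hyperbolic growth: rate proportional to x^2. Diverges at a finite time t^*.
\frac{dx}{dt} = kx^2, \quad x(0) = x_0 > 0
\;\;\Longrightarrow\;\;
x(t) = \frac{x_0}{1 - k x_0 t},
\qquad
x(t) \to \infty \ \text{as}\ t \to t^* = \frac{1}{k x_0}
```

Only the second form has a date on which the curve blows up, which is what the "singularity" story needs.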

marifjeren · 2026-02-10T23:15:45 1770765345

Correct.

azeirah · 2026-02-07T21:09:54 1770498594

My friend, this is amazing, thank you!

azeirah · 2026-02-05T16:40:32 1770309632

I made a service to sync reMarkable to Obsidian. It's not free, but it _is_ open-source, so you can self-host for free.

https://scrybble.ink

azeirah · 2026-02-02T22:28:27 1770071307

This is being actively researched (in the open!). https://github.com/open-spaced-repetition/srs-benchmark