Hacker Newsnew | past | comments | ask | show | jobs | submit | lovasoa's commentslogin

I use the latest codex with gpt5.4 and Claude opus every day. they hallucinate every day. If you think they don't, you are probably being gaslighted by the models.


I was interested in the human results, so I had an llm build a visualization for them: https://codepen.io/lovasoaaa/pen/QwKWGBd

You can see that 17% of answers come from India alone and that software developers got below average results, for instance.


This is amazing thanks for sharing!


The author of this project is also the author of redis. He knows what he is doing.

Running inference for a model, even when you have all the weights, is not trivial.


I use linux at home (with a HiDPI screen) and MacOS for work. The screen works well with both computers. I mostly just use a text editor, a browser, and a terminal though.

Linux has bugs, bug MacOS does too. I feel like for a dev like me, the linux setup is more comfortable.


Same here. I stick to 100% scaling and side step the whole hi dpi issue. I even have a single USB type c cable that connects my laptop to the laptop stand and that laptop stand is what connects to the monitor, keyboard, and mouse.

I know people will say meh but coming from the world of hurt with drivers and windows based soft modems — I was on dial up even as late as 2005! — I think the idea that everything works plug and play is amazing.

Compare with my experience on Windows — maybe I did something wrong, I don't know but the external monitor didn't work over HDMI when I installed windows without s network connection and maybe it was a coincidence but it didn't work until I connected to the Internet.


How do you call the opposite of green washing? When you want to show that you are burning as much energy on training models as the others.


Because we can


I use a fork of sqlx in SQLPage [1]. I think my main complaint about it is runtime errors (or worse, values decoded as garbage) when decoding SQL values to the wrong rust type.

* [1] https://sql-page.com/


A consortium of investigative journalists investigated and attributed the attacks the Ukraine's secret service: https://en.wikipedia.org/wiki/Nord_Stream_pipelines_sabotage


No. The LLM's answer is correct.



> No it is not…

That’s a queue, not a stack. The LLM response was correct.


But a stack is commonly LIFO, not FIFO?!


This reads like a line from a QwQ or Qwen3 CoT chain :)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: