Hacker News
XCSme 28 days ago | on: Qwen3.5: Towards Native Multimodal Agents
I just started building my own benchmarks: questions that are very simple for humans but tricky for AI, along the lines of "how many r's are in strawberry?" (still a WIP).
Qwen3.5 is doing OK on my limited tests:
https://aibenchy.com
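
For illustration only, a letter-counting question of the "strawberry" kind could be scored roughly like this. This is a minimal sketch, not how aibenchy.com works: the function names `count_letter` and `score_answer` are hypothetical, and it assumes the model's reply contains its final count as a digit.

```python
import re


def count_letter(word: str, letter: str) -> int:
    """Ground truth: case-insensitive count of `letter` in `word`."""
    return word.lower().count(letter.lower())


def score_answer(model_answer: str, word: str, letter: str) -> bool:
    """Return True if the last integer in the reply matches the true count.

    Assumption: the model states its final answer as a digit somewhere
    in the reply; replies with no digits are scored as wrong.
    """
    numbers = re.findall(r"\d+", model_answer)
    if not numbers:
        return False
    return int(numbers[-1]) == count_letter(word, letter)


if __name__ == "__main__":
    reply = "There are 3 r's in strawberry."  # hypothetical model output
    print(score_answer(reply, "strawberry", "r"))
```

Taking the last integer in the reply is a simplification; a stricter harness would ask the model for a bare number and reject anything else.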