jamescham · on Oct 21, 2024

Pete Warden and team just published a paper on Moonshine, their speech-to-text model.

Key features include:

- 1.7x overall speed boost compared to Whisper
- Flexible-sized input window, allowing for more efficient processing of shorter audio clips
- Up to 5x faster performance on 10-second audio clips
- Matches or exceeds Whisper's accuracy

jamescham · on March 17, 2023

How is this not getting any news?

jamescham · on Feb 6, 2023

The actual interview is kind of amazing.

https://twitter.com/priyadclemens/status/1621229630765125639...

jamescham · on Jan 29, 2023

This is a great point! (And kind of terrifying.)

jamescham · on Sept 10, 2022

“Some good companies this year”—observations from my friend Joshua.

jamescham · on Aug 3, 2020

This is exactly right—we now live in a world in which most jobs are knowledge work, and we should look to those who are the most productive (and lazy) knowledge workers: software developers.

jamescham · on May 28, 2020

My understanding is that Google’s big advantage is that they’ve collected so much good, annotated voice data.

jamescham · on May 28, 2020

Yeah. I’m convinced the current model is just too confusing. But I really wish there were new interaction patterns that took advantage of low-latency speech recognition...

jamescham · on May 28, 2020

Oh! Good to know!

jamescham · on March 17, 2013

ardit33 is right. My experience is that most successful startups are so hungry for talent that they really don't care where you come from or who you know.