Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

mlb game data, within 24 hours of each game


Isn't that already done? You can even get raw Pitch F/X data.

http://gd2.mlb.com/components/game/mlb/year_2009/

Here's an example day's scoreboard in JSON:

http://gd2.mlb.com/components/game/mlb/year_2009/month_07/da...


i wrote a parser for NBA games, get the data straight from yahoo. Using Python and BeautifulSoup, piece of cake. I would think MLB could work the same.


Yeah, HTML scraping ESPN.com is not very hard. I did it for college basketball once in a strange sort of database that attempted to predict March Madness brackets. It worked really well for teams that had encountered each other during the season... except only 2 or 3 pairs of teams had ever played each other during the regular season. Works much better on NBA and NFL games, where there are far fewer teams, they see each other more often, and the rosters are a little more static.


> Yeah, HTML scraping ESPN.com is not very hard.

I've seen a few people respond with this "you can just html scrape that." Sure you can HTML-scrape for the information, but the topic of this "Ask HN" is about what APIs you would like to see. Maybe he already HTML-scrapes ESPN.com, but would prefer that there was an official API for it, no?


Yeah, but I prefer real solutions, regardless of how ugly they are, rather than wishful thinking for so-called "elegant" solutions.


No offense, but isn't this wrong discussion for you then?


Care to share? Sounds awesome.


Alright, will send you an email




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: