Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Apache Spark


I would not use Apache Spark when the amount of data is this little. I have myself done this mistake when processing only a few gigabytes.

Take a look at this: http://aadrake.com/command-line-tools-can-be-235x-faster-tha...

IMHO I think people abuse Spark and I would be truly impressed if anybody could write a Spark program faster then just a regular Scala program for processing this.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: