Apache Spark - Lightning-fast cluster computing

Apache Spark is an open-source cluster computing system that aims to make data analytics fast: both fast to run and fast to write.

To run programs faster, Spark offers a general execution model that can optimize arbitrary operator graphs, and it supports in-memory computing, which lets it query data faster than disk-based engines such as Hadoop MapReduce.
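As a minimal sketch of what in-memory computing looks like in practice (assuming a Spark shell where the SparkContext is available as sc, and a hypothetical input file data.txt):

    // Load a text file as a distributed dataset and mark it to be cached in memory
    val lines = sc.textFile("data.txt").cache()

    // The first action reads from disk; later queries reuse the in-memory copy
    lines.count()
    lines.filter(_.contains("ERROR")).count()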

To make programming faster, Spark provides clean, concise APIs in Python, Scala and Java. You can also use Spark interactively from the Scala and Python shells to rapidly query big datasets.
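For example, a short interactive session from the Scala shell might count words in a file; this sketch assumes the shell exposes a SparkContext as sc and that a README.md file exists in the working directory:

    // In the interactive shell, a SparkContext is already available as sc
    val words = sc.textFile("README.md").flatMap(_.split(" "))
    val counts = words.map(w => (w, 1)).reduceByKey(_ + _)
    counts.take(10).foreach(println)   // print the first ten (word, count) pairs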