Beyond Parallelize and Collect

Holden Karau presented this work at Spark Summit East in NYC in February 2016. The slides are available on SlideShare.


Effectively testing Apache Spark programs is notoriously complex, and it’s a challenge some of us would rather avoid completely. In this presentation, author and researcher Holden Karau makes the case for rigorous testing, especially at full scale, with workloads that are too large for a single machine.
