While graphs are often the most natural way to represent the connections among data, the complexity of large graphs makes them conceptually difficult and computationally expensive to explore, query, and analyze. GraphX, a graph processing API for the Apache Spark analytics engine, makes it possible to explore and interpret large-scale graph data efficiently at near-real-time speeds. GraphX builds on Spark's in-memory distributed framework to offer the speed and capacity needed for analyzing social media data, performing complex textual analysis, running machine learning algorithms, and much more.

Spark GraphX in Action begins with an overview of Apache Spark and the GraphX graph processing API. This example-based tutorial explains how to configure GraphX and use it interactively. It gives a clear introduction to the graph elements needed to build big data graphs, then explores the problems and possibilities of graph algorithm implementations. Along the way, it details practical techniques for enhancing applications and applying machine learning algorithms to graph data.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
Author: Michael Malak
Release: 2016-05-01
Publisher: Manning Publications
ISBN: 9781617292521
File Size: 36.84 MB
Format: PDF, ePub, Mobi