Preface

This book is intended to present the GraphX library for Apache Spark and to teach the fundamental techniques and recipes to process graph data at scale. It is intended to be a self-study step-by-step guide for anyone new to Spark with an interest in or need for large-scale graph processing.

Distinctive features

The focus of this book is on large-scale graph processing with Apache Spark. The book teaches a variety of graph processing abstractions and algorithms and provides concise and sufficient information about them. You can confidently learn all of it and put it to use in different applications.

  • Step-by-step guide: Each chapter teaches important techniques for every stage of the pipeline, from loading and transforming graph data to implementing graph-parallel operations and machine learning algorithms.
  • Hands-on approach: We show how each technique works using the Scala REPL with simple examples and by building standalone Spark applications.
  • Detailed code: All the Scala code in the book is available for download from the book webpage of Packt Publishing.
  • Real-world examples: We apply these techniques on open datasets collected from a broad variety of applications ranging from social networks to food science and sports analytics.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset