Home » Nonfiction » Sandy Ryza » Advanced Analytics with Spark

March 10 , 2009

Advanced Analytics with Spark

Patterns for Learning from Data at Scale


In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example.

Download LIT

Read also

Developments in International Bridge Engineering

The book includes peer-reviewed contributions selected from presentations given at the Istanbul Bridge Conference 2014, held from August 11 – 13 in Istanbul, Turkey. It reports on the current challenges in bridge engineering faced by professionals around the globe, giving a special emphasis to recently developed techniques, innovations and…

You’ll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques—classification, collaborative filtering, and anomaly detection among others—to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you’ll find these patterns useful for working on your own data applications.

Patterns include:

  • Recommending music and the Audioscrobbler data set
  • Predicting forest cover with decision trees
  • Anomaly detection in network traffic with K-means clustering
  • Understanding Wikipedia with Latent Semantic Analysis
  • Analyzing co-occurrence networks with GraphX
  • Geospatial and temporal data analysis on the New York City Taxi Trips data
  • Estimating financial risk through Monte Carlo simulation
  • Analyzing genomics data and the BDG project
  • Analyzing neuroimaging data with PySpark and Thunder
Show more...

How to download book

Buy this book

You can buy this book now only for $30.59. This is the lowest price for this book.

Buy book

Download book free

If you want to download this book for free, please register, approve your account and get one book for free.


After that you may download book «Advanced Analytics with Spark»:

Download DOCX:


Download Amazon Kindle:


Download ePub:


Download Mobi: