Advanced Analytics with Spark: Patterns for Learning from Data

Published: 2015-10-19 Views: 668

Product Description

In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example.

You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques (classification, collaborative filtering, and anomaly detection, among others) to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find these patterns useful for working on your own data applications.

Patterns include:
- Recommending music and the Audioscrobbler data set
- Predicting forest cover with decision trees
- Anomaly detection in network traffic with K-means clustering
- Understanding Wikipedia with Latent Semantic Analysis
- Analyzing co-occurrence networks with GraphX
- Geospatial and temporal data analysis on the New York City Taxi Trips data
- Estimating financial risk through Monte Carlo simulation
- Analyzing genomics data and the BDG project
- Analyzing neuroimaging data with PySpark and Thunder

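Popular Books

利用Python进行数据分析 (Python for Data Analysis)
Published 2015-10-20, 1585 views

Spark快速大数据分析 (Fast Big Data Analysis with Spark)
Published 2015-10-16, 1287 views

统计学基础 (Fundamentals of Statistics)
Published 2015-10-26, 1170 views

数据挖掘:实用案例分析 (Data Mining: Practical Case Analysis)
Published 2015-10-19, 1050 views

R软件及其在金融定量分析中的应用 (R Software and Its Application in Quantitative Financial Analysis)
Published 2016-01-13, 1008 views

R语言实战 (R in Action)
Published 2015-10-16, 986 views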