Guide to High Performance Distributed Computing, Softcover reprint of the original 1st ed. 2015
Case Studies with Hadoop, Scalding and Spark
Computer Communications and Networks Series

Authors: Srinivasa K.G., Muppalla Anil Kumar

Language: English

Cover of the book Guide to High Performance Distributed Computing

Subjects for Guide to High Performance Distributed Computing:

Keywords

Algorithms; Case Studies; Hadoop; High Performance Computing; Spark

Approximative price 52.74 €

In Print (Delivery period: 15 days).

Add to cart

the print on demand of Srinivasa K.G., Muppalla Anil Kumar

Guide to High Performance Distributed Computing
Publication date: 10-2016
Support: Print on demand

Approximative price 52.74 €

In Print (Delivery period: 15 days).

Add to cart

the book of Srinivasa K.G., Muppalla Anil Kumar

Guide to High Performance Distributed Computing
Publication date: 03-2015
304 p. · 15.5x23.5 cm · Hardback

Description
/li>
Contents
/li>
Comment
/li>

This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies. Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks. Features: describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing; presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution; Reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding; Provides detailed case studies on approaches to clustering, data classification and regression analysis; Explains the process of creating a working recommender system using Scalding and Spark.

Part I: Programming Fundamentals of High Performance Distributed Computing

Introduction

Getting Started with Hadoop

Getting Started with Spark

Programming Internals of Scalding and Spark

Part II: Case studies using Hadoop, Scalding and Spark

Case Study I: Data Clustering using Scalding and Spark

Case Study II: Data Classification using Scalding and Spark

Case Study III: Regression Analysis using Scalding and Spark

Case Study IV: Recommender System using Scalding and Spark

Provides a guide to the distributed computing technologies of Hadoop and Spark, from the perspective of industry practitioners

Supports the theory with case studies taken from a range of disciplines, including data mining, machine learning, graph processing and image processing

Supplies working source code to aid understanding through step-by-step implementation

Includes supplementary material: sn.pub/extras

Guide to High Performance Distributed Computing, Softcover reprint of the original 1st ed. 2015Case Studies with Hadoop, Scalding and SparkComputer Communications and Networks Series

Authors: Srinivasa K.G., Muppalla Anil Kumar

Subjects for Guide to High Performance Distributed Computing:

Keywords

Description

Contents

Comment

These books may interest you

Guide to High Performance Distributed Computing, Softcover reprint of the original 1st ed. 2015
Case Studies with Hadoop, Scalding and Spark
Computer Communications and Networks Series