Big Data Analytics with Hadoop 3 상세페이지

Big Data Analytics with Hadoop 3

Name: Big Data Analytics with Hadoop 3
Price: 19000 KRW
Availability: OnlineOnly
Author: Sridhar Alla

Build highly effective analytics solutions to gain valuable insight into your big data

0명참여

Sridhar Alla 저

Packt (GCO Science) 출판

소장	전자책 정가	19,000원
소장	판매가	19,000원

출간 정보
- 2018.05.31. 전자책 출간
파일 정보
- PDF
- 34.3MB
- 471쪽
ISBN
- 9781788624954
ECN
- -

듣기 기능미지원
지원 기기
- PAPER
- iOS
- Android
- PC
- Mac

Big Data Analytics with Hadoop 3작품 소개

<Big Data Analytics with Hadoop 3> ▶Book Description
Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. Big Data Analytics with Hadoop 3 shows you how to do just that, by providing insights into the software as well as its benefits with the help of practical examples.

Once you have taken a tour of Hadoop 3's latest features, you will get an overview of HDFS, MapReduce, and YARN, and how they enable faster, more efficient big data processing. You will then move on to learning how to integrate Hadoop with the open source tools, such as Python and R, to analyze and visualize data and perform statistical computing on big data. As you get acquainted with all this, you will explore how to use Hadoop 3 with Apache Spark and Apache Flink for real-time data analytics and stream processing. In addition to this, you will understand how to use Hadoop to build analytics solutions on the cloud and an end-to-end pipeline to perform big data analysis using practical use cases.

By the end of this book, you will be well-versed with the analytical capabilities of the Hadoop ecosystem. You will be able to build powerful solutions to perform big data analytics and get insight effortlessly.

▶What You Will Learn
⦁ Explore the new features of Hadoop 3 along with HDFS, YARN, and MapReduce
⦁ Get well-versed with the analytical capabilities of Hadoop ecosystem using practical examples
⦁ Integrate Hadoop with R and Python for more efficient big data processing
⦁ Learn to use Hadoop with Apache Spark and Apache Flink for real-time data analytics
⦁ Set up a Hadoop cluster on AWS cloud
⦁ Perform big data analytics on AWS using Elastic Map Reduce

▶Key Features
⦁ Learn Hadoop 3 to build effective big data analytics solutions on-premise and on cloud
⦁ Integrate Hadoop with other big data tools such as R, Python, Apache Spark, and Apache Flink
⦁ Exploit big data using Hadoop 3 with real-world examples

▶Who This Book Is For
Big Data Analytics with Hadoop 3 is for you if you are looking to build high-performance analytics solutions for your enterprise or business using Hadoop 3's powerful features, or you're new to big data analytics. A basic understanding of the Java programming language is required.

▶What this book covers
⦁ Chapter 1, Introduction to Hadoop, introduces you to the world of Hadoop and its core components, namely, HDFS and MapReduce.

⦁ Chapter 2, Overview of Big Data Analytics, introduces the process of examining large datasets to uncover patterns in data, generating reports, and gathering valuable insights.

⦁ Chapter 3, Big Data Processing with MapReduce, introduces the concept of MapReduce, which is the fundamental concept behind most of the big data computing/processing systems.

⦁ Chapter 4, Scientific Computing and Big Data Analysis with Python and Hadoop, provides an introduction to Python and an analysis of big data using Hadoop with the aid of Python packages.

⦁ Chapter 5, Statistical Big Data Computing with R and Hadoop, provides an introduction to R and demonstrates how to use R to perform statistical computing on big data using Hadoop.

⦁ Chapter 6, Batch Analytics with Apache Spark, introduces you to Apache Spark and demonstrates how to use Spark for big data analytics based on a batch processing model.

⦁ Chapter 7, Real-Time Analytics with Apache Spark, introduces the stream processing model of Apache Spark and demonstrates how to build streaming-based, real-time analytical applications.

⦁ Chapter 8, Batch Analytics with Apache Flink, covers Apache Flink and how to use it for big data analytics based on a batch processing model.

⦁ Chapter 9, Stream Processing with Apache Flink, introduces you to DataStream APIs and stream processing using Flink. Flink will be used to receive and process real-time event streams and store the aggregates and results in a Hadoop cluster.

⦁ Chapter 10, Visualizing Big Data, introduces you to the world of data visualization using various tools and technologies such as Tableau.

⦁ Chapter 11, Introduction to Cloud Computing, introduces Cloud computing and various concepts such as IaaS, PaaS, and SaaS. You will also get a glimpse into the top Cloud providers.

⦁ Chapter 12, Using Amazon Web Services, introduces you to AWS and various services in AWS useful for performing big data analytics using Elastic Map Reduce (EMR) to set up a Hadoop cluster in AWS Cloud.

출판사 서평

▶ Preface
Apache Hadoop is the most popular platform for big data processing, and can be combined with a host of other big data tools to build powerful analytics solutions. Big Data Analytics with Hadoop 3 shows you how to do just that, by providing insights into the software as well as its benefits with the help of practical examples.

Once you have taken a tour of Hadoop 3's latest features, you will get an overview of HDFS, MapReduce, and YARN, and how they enable faster, more efficient big data processing. You will then move on to learning how to integrate Hadoop with open source tools, such as Python and R, to analyze and visualize data and perform statistical computing on big data. As you become acquainted with all of this, you will explore how to use Hadoop 3 with Apache Spark and Apache Flink for real-time data analytics and stream processing. In addition to this, you will understand how to use Hadoop to build analytics solutions in the cloud and an end-to-end pipeline to perform big data analysis using practical use cases.

By the end of this book, you will be well-versed with the analytical capabilities of the Hadoop ecosystem. You will be able to build powerful solutions to perform big data analytics and get insights effortlessly.

저자 소개

⦁Sridhar Alla
Sridhar Alla is a big data expert helping companies solve complex problems in distributed computing, large scale data science and analytics practice. He presents regularly at several prestigious conferences and provides training and consulting to companies. He holds a bachelor's in computer science from JNTU, India.

He loves writing code in Python, Scala, and Java. He also has extensive hands-on knowledge of several Hadoop-based technologies, TensorFlow, NoSQL, IoT, and deep learning.

▶TABLE of CONTENTS
1: INTRODUCTION TO HADOOP
2: OVERVIEW OF BIG DATA ANALYTICS
3: BIG DATA PROCESSING WITH MAPREDUCE
4: SCIENTIFIC COMPUTING AND BIG DATA ANALYSIS WITH PYTHON AND HADOOP
5: STATISTICAL BIG DATA COMPUTING WITH R AND HADOOP
6: BATCH ANALYTICS WITH APACHE SPARK
7: REAL-TIME ANALYTICS WITH APACHE SPARK
8: BATCH ANALYTICS WITH APACHE FLINK
9: STREAM PROCESSING WITH APACHE FLINK
10: VISUALIZING BIG DATA
11: INTRODUCTION TO CLOUD COMPUTING
12: USING AMAZON WEB SERVICES

리뷰

구매자 별점

0.0 점

점수비율

0명이 평가함

리뷰 작성 영역

이 책을 평가해주세요!

내가 남긴 별점 0.0

별로예요

그저 그래요

보통이에요

최고예요

별점 취소

별 1개 별 2개 별 3개 별 4개 별 5개

스포일러가 있습니다.

구매자 표시 기준은 무엇인가요?

'구매자' 표시는 리디에서 유료도서 결제 후 다운로드 하시거나 리디셀렉트 도서를 다운로드하신 경우에만 표시됩니다.

무료 도서 (프로모션 등으로 무료로 전환된 도서 포함): '구매자'로 표시되지 않습니다.
시리즈 도서 내 무료 도서: '구매자’로 표시되지 않습니다. 하지만 같은 시리즈의 유료 도서를 결제한 뒤 리뷰를 수정하거나 재등록하면 '구매자'로 표시됩니다.
영구 삭제: 도서를 영구 삭제해도 ‘구매자’ 표시는 남아있습니다.
결제 취소: ‘구매자’ 표시가 자동으로 사라집니다.

Big Data Analytics with Hadoop 3

Big Data Analytics with Hadoop 3작품 소개

출판사 서평

저자 소개

목차

리뷰

리뷰 및 사용자 신고하기

댓글 및 사용자 신고하기

사용자 차단

사용자 차단 해제

이 책과 함께 구매한 책

이 책과 함께 둘러본 책

성인 인증 안내

성인 재인증 안내