본문 바로가기

리디 접속이 원활하지 않습니다.
강제 새로 고침(Ctrl + F5)이나 브라우저 캐시 삭제를 진행해주세요.
계속해서 문제가 발생한다면 리디 접속 테스트를 통해 원인을 파악하고 대응 방법을 안내드리겠습니다.
테스트 페이지로 이동하기

[체험판] Large Scale Machine Learning with Python 상세페이지

컴퓨터/IT 개발/프로그래밍 ,   컴퓨터/IT 컴퓨터/앱 활용

[체험판] Large Scale Machine Learning with Python

Learn to build powerful machine learning models quickly and deploy large-scale predictive applications
판매가 무료
[체험판] Large Scale Machine Learning with Python 표지 이미지

리디 info

* 이 책은 본권의 일부를 무료로 제공하는 체험판입니다.
* 본권 구입을 원하실 경우, [이 책의 시리즈]→[책 선택] 후 구매해주시기 바랍니다.


[체험판] Large Scale Machine Learning with Python작품 소개

<[체험판] Large Scale Machine Learning with Python> ▶About This Book
⦁ Design, engineer and deploy scalable machine learning solutions with the power of Python
⦁ Take command of Hadoop and Spark with Python for effective machine learning on a map reduce framework
⦁ Build state-of-the-art models and develop personalized recommendations to perform machine learning at scale

▶Who This Book Is For
This book is for anyone who intends to work with large and complex data sets. Familiarity with basic Python and machine learning concepts is recommended. Working knowledge in statistics and computational mathematics would also be helpful.

▶What You Will Learn
⦁ Apply the most scalable machine learning algorithms
⦁ Work with modern state-of-the-art large-scale machine learning techniques
⦁ Increase predictive accuracy with deep learning and scalable data-handling techniques
⦁ Improve your work by combining the MapReduce framework with Spark
⦁ Build powerful ensembles at scale
⦁ Use data streams to train linear and non-linear predictive models from extremely large datasets using a single machine

▶In Detail
Large Python machine learning projects involve new problems associated with specialized machine learning architectures and designs that many data scientists have yet to tackle. But finding algorithms and designing and building platforms that deal with large sets of data is a growing need. Data scientists have to manage and maintain increasingly complex data projects, and with the rise of big data comes an increasing demand for computational and algorithmic efficiency. Large Scale Machine Learning with Python uncovers a new wave of machine learning algorithms that meet scalability demands together with a high predictive accuracy.

Dive into scalable machine learning and the three forms of scalability. Speed up algorithms that can be used on a desktop computer with tips on parallelization and memory allocation. Get to grips with new algorithms that are specifically designed for large projects and can handle bigger files, and learn about machine learning in big data environments. We will also cover the most effective machine learning techniques on a map reduce framework in Hadoop and Spark in Python.

▶Style and approach
This efficient and practical title is stuffed full of the techniques, tips and tools you need to ensure your large scale Python machine learning runs swiftly and seamlessly.

Large-scale machine learning tackles a different issue to what is currently on the market. Those working with Hadoop clusters and in data intensive environments can now learn effective ways of building powerful machine learning models from prototype to production.

This book is written in a style that programmers from other languages (R, Julia, Java, Matlab) can follow.



출판사 서평

▶Customer Review
⦁Python + Machine Learning + Scalability Under One Cover
: Many books when their subject is Machine Learning with Python concentrate on a few most known and used libraries to explain Machine Learning tasks and solutions. Although I don't want to say that such books are useless for readers, they may still leave gaps in understanding of how a certain method or library would work in real-world scenarios. Authors of the book "Large Scale Machine Learning with Python" set up an ambitious goal to teach readers how to solve real-world Machine Learning problems by employing a variety of libraries, frameworks, and tools relying on Python. This advantageously differentiates a given book from many other books on the same subject.

The following practical situations are considered and their solutions are presented:

- Tall datasets when the number of cases is large, compared to the number of features.
- Wide datasets when the number of features is large, compared to the number of cases.
- Both tall and wide datasets when both the number of features and the number of cases are large.
- Sparse datasets when there are many zero-valued elements.

The book treats the problem of scalability from different angles, such as fast batch (offline) processing, incremental online processing (one instance at a time arrives), streaming processing (a chunk of instances at a time arrives) and distributed processing. Popular libraries and frameworks, such as Gensim, H2O, XGBoost, TensorFlow, Theano, Theanets, Keras, Vowpal Wabbit, and Spark and their applications are explained through numerous Python snippets. In my opinion, this is one of the first books presenting all these tools under one cover.

In addition to Python code, the book also covers such advanced topics like Deep Learning, Ensemble Learning, validation of streaming algorithm performance, and GPU processing.

I recommend this book as a good companion to any Machine Learning practitioner who already has fairly good understanding of theory behind Machine Learning algorithms.
(-by Oleg Okun)

⦁Excellent data science resource, even if you are not an adept in Python
: This is the best book for Python-based data science, focusing on ML and big data I have encountered (and I’ve been around!). The authors cover a wide-range of intermediate and advanced topics, which they explain in terms of theory and applications. I particularly liked the Unsupervised Learning chapter, where they not only covered the quite popular k-means algorithm, but also provided a couple of heuristics for finding the optimum number of clusters while they wrote a few words about one of its most powerful variants (k-means++) too.

Although Python falls short when it comes to handling large data sets or multiple CPUs/GPUs on its own, the authors describe the various solutions to these issues via the use of large scale frameworks, such as Spark, making Python a versatile tool for big data scenarios. Also, they introduce the various packages required to accomplish all the analytics-related tasks, making this book also a great reference manual for all data scientists who veer towards this language.

Personally I lean towards more elegant and more modern programming tools, such a s Julia and Scala, but I found this book quite refreshing and insightful, definitely a great addition to my data science library. If you are someone who takes data science seriously and has learned the basics, I would highly recommend this book for you.
(-by Z.V. )


저자 소개

▶About the Author
⦁ Bastiaan Sjardin
Bastiaan Sjardin is a data scientist and founder with a background in artificial intelligence and mathematics. He has a MSc degree in cognitive science obtained at the University of Leiden together with on campus courses at Massachusetts Institute of Technology (MIT). In the past 5 years, he has worked on a wide range of data science and artificial intelligence projects. He is a frequent community TA at Coursera in the social network analysis course from the University of Michigan and the practical machine learning course from Johns Hopkins University. His programming languages of choice are Python and R. Currently, he is the cofounder of Quandbee (http://www.quandbee.com/), a company providing machine learning and artificial intelligence applications at scale.

⦁ Luca Massaron
Luca Massaron is a data scientist and marketing research director specializing in multivariate statistical analysis, machine learning, and customer insight, with over a decade of experience of solving real-world problems and generating value for stakeholders by applying reasoning, statistics, data mining, and algorithms. From being a pioneer of web audience analysis in Italy to achieving the rank of a top ten Kaggler, he has always been very passionate about every aspect of data and its analysis, and also about demonstrating the potential of data-driven knowledge discovery to both experts and non-experts. Favoring simplicity over unnecessary sophistication, Luca believes that a lot can be achieved in data science just by doing the essentials.

⦁ Alberto Boschetti
Alberto Boschetti is a data scientist, with strong expertise in signal processing and statistics. He holds a Ph.D. in telecommunication engineering and currently lives and works in London. In his work projects he daily faces challenges spanning among natural language processing (NLP), machine learning and distributed processing. He is very passionate about his job and he always tries to be updated on the latest development of data science technologies, attending meetups, conferences and other events.

목차

▶TABLE of CONTENTS
1: FIRST STEPS TO SCALABILITY
2: SCALABLE LEARNING IN SCIKIT-LEARN
3: FAST SVM IMPLEMENTATIONS
4: NEURAL NETWORKS AND DEEP LEARNING
5: DEEP LEARNING WITH TENSORFLOW
6: CLASSIFICATION AND REGRESSION TREES AT SCALE
7: UNSUPERVISED LEARNING AT SCALE
8: DISTRIBUTED ENVIRONMENTS – HADOOP AND SPARK
9: PRACTICAL MACHINE LEARNING WITH SPARK


리뷰

구매자 별점

0.0

점수비율
  • 5
  • 4
  • 3
  • 2
  • 1

0명이 평가함

리뷰 작성 영역

이 책을 평가해주세요!

내가 남긴 별점 0.0

별로예요

그저 그래요

보통이에요

좋아요

최고예요

별점 취소

구매자 표시 기준은 무엇인가요?

'구매자' 표시는 리디에서 유료도서 결제 후 다운로드 하시거나 리디셀렉트 도서를 다운로드하신 경우에만 표시됩니다.

무료 도서 (프로모션 등으로 무료로 전환된 도서 포함)
'구매자'로 표시되지 않습니다.
시리즈 도서 내 무료 도서
'구매자’로 표시되지 않습니다. 하지만 같은 시리즈의 유료 도서를 결제한 뒤 리뷰를 수정하거나 재등록하면 '구매자'로 표시됩니다.
영구 삭제
도서를 영구 삭제해도 ‘구매자’ 표시는 남아있습니다.
결제 취소
‘구매자’ 표시가 자동으로 사라집니다.

이 책과 함께 구매한 책


이 책과 함께 둘러본 책



본문 끝 최상단으로 돌아가기

spinner
모바일 버전