Scalable data analysis in Python with Dask

Understand the concept of block algorithms and how Dask uses them to load large data. Implement various examples using Dask Arrays, Bags, and DataFrames for efficient parallel computing. Combine Dask with existing Python packages such as NumPy and pandas. See how Dask works under the hood a...

Bibliographic Details
Main Author: Mohey, Ahmad (Author)
Other Authors: Kashif, Mohammed (Speaker)
Format: Video
Language: English
Published: Birmingham, England : PACKT Publishing, 2019.
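Subjects: Python (Computer program language); Data mining; Electronic data processing, Distributed processing; Information visualization; Instructional films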
Online Access: CONNECT
LEADER 04280ngm a22005413i 4500
001 mig00005870030
003 VaAlASP
005 20200325093954.0
006 m|||||o||c||||||||
007 cr |n||||||||a
007 vz |za|z|
008 200325s2019 enk222 e |o v|eng d
020 |z 9781789808926 
024 8 |a ASP4740648/marc 
035 |a (OCoLC)1138949915 
035 |a (VaAlASP)ASP4740648/marc 
035 0 0 |a on199239574000971 
040 |a VaAlASP  |b eng  |e rda  |c VaAlASP 
099 |a Streaming video 
245 0 0 |a Scalable data analysis in Python with Dask /  |c Mohammed Kashif. 
264 1 |a Birmingham, England :  |b PACKT Publishing,  |c 2019. 
300 |a 1 online resource (222 minutes) 
306 |a 034153 
336 |a two-dimensional moving image  |b tdi  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
337 |a video  |b v  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a video file  |2 rda 
500 |a Title from resource description page (viewed March 25, 2020). 
511 0 |a Presenter, Mohammed Kashif. 
520 |a Understand the concept of block algorithms and how Dask uses them to load large data. Implement various examples using Dask Arrays, Bags, and DataFrames for efficient parallel computing. Combine Dask with existing Python packages such as NumPy and pandas. See how Dask works under the hood and the various built-in algorithms it has to offer. Leverage the power of Dask in a distributed setting and explore its various schedulers. Implement an end-to-end machine learning pipeline in a distributed setting using Dask and scikit-learn. Use Dask Arrays, Bags, and DataFrames for parallel and larger-than-memory computations. About: Data analysts, machine learning professionals, and data scientists often use tools such as pandas, scikit-learn, and NumPy for data analysis on their personal computers. However, when they want to apply their analyses to larger datasets, these tools fail to scale beyond a single machine, and so the analyst is forced to rewrite the computation. If you work on big data and you're using pandas, you know you can end up waiting up to a whole minute for a simple average of a series. And that's just for a couple of million rows! In this course, you'll learn to scale your data analysis. First, you will execute distributed data science projects from data ingestion through data manipulation and visualization using Dask. Then, you will explore the Dask framework. After that, you'll see how Dask can be used with other common Python tools such as NumPy, pandas, Matplotlib, scikit-learn, and more. You'll work on large datasets, performing exploratory data analysis to investigate each dataset and then report the findings. You'll learn by applying data analysis principles and statistical techniques to the same massive datasets across different systems. Throughout the course, we'll go over the various techniques, modules, and features that Dask has to offer. 
Finally, you'll learn to use its dedicated machine learning offering, the Dask-ML package. You'll also start using parallel processing in your data tasks on your own system without moving to a distributed environment. All the code files and related files are uploaded on GitHub. 
546 |a In English. 
650 0 |a Python (Computer program language) 
650 0 |a Data mining. 
650 0 |a Electronic data processing  |x Distributed processing. 
650 0 |a Information visualization. 
655 7 |a Instructional films.  |2 lcgft 
700 1 |a Kashif, Mohammed,  |e speaker. 
700 1 |a Mohey, Ahmad,  |e author. 
710 2 |a Packt Publishing,  |e production company. 
730 0 |a Alexander Street Press: discovery records 
776 0 8 |i DVD version:  |z 9781789808926 
856 4 0 |u https://ezproxy.mtsu.edu/login?url=http://www.aspresolver.com/aspresolver.asp?MARC;4740648  |z CONNECT  |t 0 
907 |a 5294127  |b 03-05-21  |c 02-08-21 
998 |a wi  |b 03-05-21  |c m  |d g   |e -  |f eng  |g enk  |h 0  |i 2 
999 f f |i 29458b0e-6f99-4602-8ebb-3eec56f21600  |s a1d26f96-d796-4547-a4e1-46f8d538ea90  |t 0 
952 f f |t 1  |h Library of Congress classification 