Scaling Python with Dask

This post was published 2 years ago. Download links are most likely obsolete. If that's the case, try asking the uploader to re-upload.

Scaling Python with Dask

English | ISBN: 9781098119867 | 18 pages | EPUB | 2022 | 1 Mb

Dask is a free and open source library for parallel computing in Python that helps you scale your data science and machine learning workflows. With this quick but thorough resource, data scientists and Python programmers will learn how Dask provides APIs that make it easy to parallelize PyData libraries like NumPy, pandas, and scikit-learn.

Author Holden Karau shows you how you can use Dask computations in local systems and then scale to the cloud for heavier workloads. This practical book explains why Dask is popular among industry experts and academics and used by organizations that include Walmart, Capital One, Harvard Medical School, and NASA.

With this book, you'll learn about

What is Dask is, where you can use it, and how it compares to other tools
Batch data parallel processing
Key distributed system concepts for Dask users
Higher-level APIs and building blocks
Integrated libraries, such as scikit-learn, pandas, and PyTorch
How to use Dask with GPUs

No comments have been posted yet. Please feel free to comment first!

    Load more replies

    Join the conversation!

    Log in or Sign up
    to post a comment.