Dask provides advanced parallelism for analytics, enabling performance at scale for Pandas, Numpy or even low-level functions.
Source codeOverview
Dask is a Python library for parallel and distributed computing. Dask is: - Easy to use and set up (it’s just a Python library) - Powerful at providing scale, and unlocking complex algorithms - and Fun 🎉
Usage
See 10 minutes to Dask.
Installation
See installation page.
At CNES
Available in Conda environment on CNES HPC Cluster.
Resources
- Dask documentation page
- License File
- Website
- Dask Distributed repo
- Dask-jobqueue documentation
- Community forum