User guide
Installation
Usage
First, you have to import pandarallel:
Then, you have to initialize it.
This method takes 5 optional parameters:
shm_size_mb: Deprecated - Do not use.nb_workers: Number of workers used for parallelization. (int) If not set, default to the number of cores available.progress_bar: Display progress bars if set toTrue. (bool,Falseby default)verbose: The verbosity level (int,2by default)0- don't display any logs1- display only warning logs2- display all logs
use_memory_fs: (bool,Noneby default)- If set to None and if memory file system is available,
pandarallelwill use it to transfer data between the main process and workers. If memory file system is not available,pandarallelwill default on multiprocessing data transfer (pipe). - If set to
True,pandarallelwill use memory file system to transfer data between the main process and workers and will raise aSystemErrorif memory file system is not available. - If set to
False,pandarallelwill use multiprocessing data transfer (pipe) to transfer data between the main process and workers.
- If set to None and if memory file system is available,
Using memory file system reduces data transfer time between the main process and workers, especially for big data.
Memory file system is considered as available only if the directory /dev/shm exists
and if the user has read and write rights on it.
Basically, memory file system is only available on some Linux distributions (including Ubuntu).