
How do I parallelize a simple Python loop?

python
parallelization
asyncio
joblib
by Alex Kataev · Feb 13, 2025
TLDR

To parallelize a loop quickly, reach for Python's concurrent.futures package. If the tasks are CPU-intensive, go for a ProcessPoolExecutor; if they spend most of their time waiting on I/O, ThreadPoolExecutor is your best friend. Here's a snippet that parallelizes with ProcessPoolExecutor:

```python
from concurrent.futures import ProcessPoolExecutor

# The function we're going to execute in parallel
def compute_square(number):
    # This is a critical function. Lives depend on it,
    # especially those using graph paper 😜
    return number ** 2

if __name__ == "__main__":  # Required where workers are spawned rather than forked (Windows/macOS)
    numbers = [1, 2, 3, 4, 5]

    with ProcessPoolExecutor() as executor:
        # Squaring all numbers simultaneously.
        # Because we don't like waiting.
        squares = executor.map(compute_square, numbers)
        print(list(squares))
```

This neat piece of code maps the function over the list in parallel, computing all the squares in one go.

Parallelizing for advanced users

Taking it up a notch, let's look at additional tools and patterns beyond concurrent.futures, and at how to keep overhead low so the parallel version actually delivers a speed-up.

Overhead and performance gains

Joblib is an excellent Python library for easy parallel loops. With its simple Parallel/delayed syntax and transparent batching of iterations, joblib can sometimes outperform hand-rolled concurrent.futures code.
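Here's a minimal sketch of the same squaring loop written with joblib.Parallel and delayed (assuming joblib is installed via pip install joblib):

```python
from joblib import Parallel, delayed

def compute_square(number):
    # Same toy workload as the TLDR example
    return number ** 2

if __name__ == "__main__":  # Protects against recursive spawning on Windows
    numbers = [1, 2, 3, 4, 5]
    # n_jobs=-1 uses every available core; prefer="processes" sidesteps the GIL
    squares = Parallel(n_jobs=-1, prefer="processes")(
        delayed(compute_square)(n) for n in numbers
    )
    print(squares)  # [1, 4, 9, 16, 25]
```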

Handling non-CPU-bound tasks with asyncio

Non-CPU-bound tasks usually spend their time waiting on I/O or network operations. For those, Python's asyncio is a fantastic tool. Here's how you can use asyncio.gather() to run network requests concurrently (together with the third-party aiohttp library):

```python
import asyncio
import aiohttp  # Third-party: pip install aiohttp

# Asynchronous fetcher
async def get_data(session, url):
    async with session.get(url) as response:
        # If I had a nickel for every byte fetched...
        return await response.text()

# Task orchestrator gathering all fetcher results
async def gather_requests(urls):
    async with aiohttp.ClientSession() as session:
        # Fetch all URLs concurrently
        return await asyncio.gather(*(get_data(session, url) for url in urls))

urls = ['http://example.com', 'http://example.org']
results = asyncio.run(gather_requests(urls))
```

This code efficiently handles multiple tasks concurrently, eliminating excessive thread or process creation.

Compatibility with Windows

Windows users should note that creating processes carries extra overhead, because Windows has no fork() and must spawn each worker from scratch (which is also why the if __name__ == "__main__": guard above is mandatory). For I/O-bound work, or loops where each task is cheap, ThreadPoolExecutor avoids that start-up cost and can outperform ProcessPoolExecutor; for genuinely CPU-bound work, processes are still the way around the GIL.
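As a sketch, here is the TLDR loop rewritten with ThreadPoolExecutor. It skips process spawning entirely, which only pays off because the per-item work here is trivial:

```python
from concurrent.futures import ThreadPoolExecutor

def compute_square(number):
    # Tiny workload: threads avoid process start-up overhead on Windows
    return number ** 2

numbers = [1, 2, 3, 4, 5]

with ThreadPoolExecutor(max_workers=4) as executor:
    squares = list(executor.map(compute_square, numbers))

print(squares)  # [1, 4, 9, 16, 25]
```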

Error tracking with Joblib

Joblib also propagates exceptions raised in worker processes back to the caller, traceback included, making parallel code easier to debug and maintain.
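For illustration, a small sketch with a hypothetical risky_divide worker: recent joblib versions re-raise the worker's original exception type in the parent process, so a plain try/except around the Parallel call catches it:

```python
from joblib import Parallel, delayed

def risky_divide(x):
    # Hypothetical worker: raises ZeroDivisionError for x == 0
    return 10 / x

if __name__ == "__main__":
    try:
        Parallel(n_jobs=2)(delayed(risky_divide)(x) for x in [1, 2, 0, 4])
    except ZeroDivisionError as err:
        # The worker's exception resurfaces here, in the parent process
        print(f"Worker failed: {err}")
```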

Advanced patterns with asyncio

You can wrap blocking functions with a custom @background decorator built on asyncio.get_event_loop().run_in_executor() so they run in a thread pool without blocking the event loop. This comes in handy when mixing synchronous and asynchronous code.
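Note that @background is not part of asyncio itself; you define it yourself. A minimal sketch of the pattern, with slow_task standing in for any blocking function:

```python
import asyncio
import time

def background(func):
    # Calling the wrapped function schedules it on the default
    # thread-pool executor instead of blocking the event loop
    def wrapper(*args, **kwargs):
        loop = asyncio.get_event_loop()
        return loop.run_in_executor(None, lambda: func(*args, **kwargs))
    return wrapper

@background
def slow_task(i):
    time.sleep(1)  # Stand-in for blocking, synchronous work
    return i * i

async def main():
    # All five blocking calls run concurrently in threads (~1s total, not 5s)
    results = await asyncio.gather(*(slow_task(i) for i in range(5)))
    print(results)

asyncio.run(main())
```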

Choosing the right tool

Your choice of parallelization approach depends on the type of task:

  • For CPU-bound tasks, opt for ProcessPoolExecutor or joblib.Parallel with prefer="processes".
  • I/O-bound tasks are best handled by ThreadPoolExecutor, joblib.Parallel with prefer="threads", or asyncio for network-heavy work.

Monitoring and fine-tuning

It's essential to benchmark your parallelization strategies rather than assume they help: for small, cheap tasks the overhead of spawning workers can outweigh the speed-up. Python's native time module (time.perf_counter()) is enough for a quick comparison, as sketched below.
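A quick timing sketch; the toy workload and sizes are placeholders for your own loop:

```python
import time
from concurrent.futures import ProcessPoolExecutor

def compute_square(number):
    return number ** 2

if __name__ == "__main__":
    numbers = list(range(10_000))

    start = time.perf_counter()
    serial = [compute_square(n) for n in numbers]
    print(f"Serial:   {time.perf_counter() - start:.4f}s")

    start = time.perf_counter()
    with ProcessPoolExecutor() as executor:
        parallel = list(executor.map(compute_square, numbers))
    print(f"Parallel: {time.perf_counter() - start:.4f}s")
    # For work this cheap, expect the parallel run to be slower:
    # process start-up and pickling dominate the runtime.
```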

Using asyncio in Jupyter

If you're using Jupyter Notebook, remember that the notebook already runs its own event loop, so calls like asyncio.run() fail with "event loop is already running". Applying nest_asyncio.apply() patches the loop to allow nesting.
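A minimal notebook sketch (assuming pip install nest_asyncio):

```python
import asyncio
import nest_asyncio

nest_asyncio.apply()  # Patch the already-running Jupyter event loop to allow nesting

async def say_hi():
    await asyncio.sleep(0.1)
    return "hi"

# Inside a notebook, this line raises a RuntimeError without nest_asyncio
print(asyncio.run(say_hi()))
```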

Error handling in parallel loops

It's always smart to handle exceptions inside the functions you parallelize. With concurrent.futures, an exception raised in a worker is re-raised in the parent when you consume the result, which can abort the whole loop and lead to frustrating, costly crashes!
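For instance, a sketch with a hypothetical safe_square worker that catches bad input inside the worker instead of letting the exception bubble up and abort the loop:

```python
from concurrent.futures import ProcessPoolExecutor

def safe_square(number):
    try:
        return number ** 2
    except TypeError as err:
        # Handle bad input in the worker rather than crashing the whole pool
        return f"skipped {number!r}: {err}"

if __name__ == "__main__":
    values = [1, 2, "three", 4]
    with ProcessPoolExecutor() as executor:
        print(list(executor.map(safe_square, values)))
```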