Combining asyncio with a multi-worker ProcessPoolExecutor and async for

Problem description

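My question is very similar to Combining asyncio with a multi-worker ProcessPoolExecutor, but one slight change (I believe it is the async for) means I cannot use the excellent answers given there.

I am trying the following MWE: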

import concurrent.futures
import asyncio
import time

async def mygen(u: int = 2):
    i = 0
    while i < u:
        yield i
        i += 1

def blocking(delay):
    time.sleep(delay+1)
    return('EXECUTOR: Completed blocking task number ' + str(delay+1))

async def non_blocking(loop):
    with concurrent.futures.ProcessPoolExecutor() as executor:
        async for i in mygen():
            print('MASTER: Sending to executor blocking task number ' + str(i+1))
            result = await loop.run_in_executor(executor, blocking, i)
            print(result)
            print('MASTER: Well done executor - you seem to have completed blocking task number ' + str(i+1))

loop = asyncio.get_event_loop()
loop.run_until_complete(non_blocking(loop))

As expected, the output is not asynchronous:

MASTER: Sending to executor blocking task number 1
EXECUTOR: Completed blocking task number 1
MASTER: Well done executor - you seem to have completed blocking task number 1
MASTER: Sending to executor blocking task number 2
EXECUTOR: Completed blocking task number 2
MASTER: Well done executor - you seem to have completed blocking task number 2

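I would like to adjust the code so that the tasks run in two concurrent processes and each result is printed as soon as it is available. The desired output is: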

MASTER: Sending to executor blocking task number 1
MASTER: Sending to executor blocking task number 2
EXECUTOR: Completed blocking task number 1
MASTER: Well done executor - you seem to have completed blocking task number 1
EXECUTOR: Completed blocking task number 2
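MASTER: Well done executor - you seem to have completed blocking task number 2

I understand from Combining asyncio with a multi-worker ProcessPoolExecutor that, as things stand, my await loop.run_in_executor() call is effectively blocking: the async for loop cannot advance until the awaited result comes back. I don't know how to replace it so that async for can move on to the next generated value while the executor is still working. Note: I am not using asyncio.gather as in their example.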


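Solution

If you want at most two processes running your tasks, the simplest approach is to create the executor with max_workers=2. You can then submit tasks as fast as they are generated, i.e. proceed to the next iteration of async for without waiting for the previous task to finish. At the end, gather the results of all the tasks to make sure exceptions do not go unnoticed (and possibly to obtain the actual results).

The following code produces the expected output: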

from concurrent.futures import ProcessPoolExecutor
import asyncio
import time

async def mygen(u: int = 2):
    i = 0
    while i < u:
        yield i
        i += 1

def blocking(delay):
    time.sleep(delay+1)
    return('EXECUTOR: Completed blocking task number ' + str(delay+1))

async def run_blocking(executor, task_no, delay):
    print('MASTER: Sending to executor blocking task number '
          + str(task_no))
    # look up the running loop here instead of relying on a global `loop`
    loop = asyncio.get_running_loop()
    result = await loop.run_in_executor(executor, blocking, delay)
    print(result)
    print('MASTER: Well done executor - you seem to have completed '
          'blocking task number ' + str(task_no))

async def non_blocking():
    tasks = []
    with ProcessPoolExecutor(max_workers=2) as executor:
        async for i in mygen():
            # spawn the task and let it run in the background
            tasks.append(asyncio.create_task(
                run_blocking(executor, i + 1, i)))
        # wait for all tasks; if one raised an exception, it surfaces here
        await asyncio.gather(*tasks)

if __name__ == '__main__':
    # the guard keeps worker processes from re-running this when they
    # import the main module (spawn start method)
    asyncio.run(non_blocking())
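
The remark about gathering at the end "to get the actual results" can be illustrated with a small variant. This is only a sketch under assumptions: the names work, numbers and collect are hypothetical and not part of the original answer, and the failing input -1 exists only to show how an exception comes back. It keeps the futures returned by run_in_executor directly and passes return_exceptions=True to asyncio.gather, so that a failure in one task is returned in its slot instead of aborting the whole collection.

```python
import asyncio
import concurrent.futures
import time


def work(task_no):
    # Hypothetical blocking job. It is defined at module level so that
    # a process pool can pickle a reference to it.
    if task_no < 0:
        raise ValueError(f"invalid task number {task_no}")
    time.sleep(0.1 * task_no)
    return task_no * 10


async def numbers(values):
    # Hypothetical async generator standing in for mygen().
    for value in values:
        yield value


async def collect(executor, values):
    loop = asyncio.get_running_loop()
    futures = []
    async for value in numbers(values):
        # run_in_executor submits the job and returns a future right
        # away; not awaiting it here lets the loop fetch the next value.
        futures.append(loop.run_in_executor(executor, work, value))
    # Results come back in submission order. With return_exceptions=True
    # a raised exception is returned in place of that task's result.
    return await asyncio.gather(*futures, return_exceptions=True)


if __name__ == "__main__":
    with concurrent.futures.ProcessPoolExecutor(max_workers=2) as pool:
        for outcome in asyncio.run(collect(pool, [1, -1, 2])):
            print(repr(outcome))
```

Because collect accepts any concurrent.futures executor, the same function can be exercised with a ThreadPoolExecutor when process start-up cost or pickling gets in the way during experiments.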
