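Combining asyncio with a multi-worker ProcessPoolExecutor and async for
Question
My question is very similar to Combining asyncio with a multi-worker ProcessPoolExecutor, but a slight change (I believe it is the async for) means I cannot use the excellent answers there.
I am trying the following MWE: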
import concurrent.futures
import asyncio
import time

async def mygen(u: int = 2):
    i = 0
    while i < u:
        yield i
        i += 1

def blocking(delay):
    time.sleep(delay+1)
    return('EXECUTOR: Completed blocking task number ' + str(delay+1))

async def non_blocking(loop):
    with concurrent.futures.ProcessPoolExecutor() as executor:
        async for i in mygen():
            print('MASTER: Sending to executor blocking task number ' + str(i+1))
            result = await loop.run_in_executor(executor, blocking, i)
            print(result)
            print('MASTER: Well done executor - you seem to have completed blocking task number ' + str(i+1))

loop = asyncio.get_event_loop()
loop.run_until_complete(non_blocking(loop))
As expected, its output is not asynchronous:
MASTER: Sending to executor blocking task number 1
EXECUTOR: Completed blocking task number 1
MASTER: Well done executor - you seem to have completed blocking task number 1
MASTER: Sending to executor blocking task number 2
EXECUTOR: Completed blocking task number 2
MASTER: Well done executor - you seem to have completed blocking task number 2
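I would like to adjust the code so that the tasks run in two concurrent processes and each output is printed as soon as it becomes available. The desired output is: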
MASTER: Sending to executor blocking task number 1
MASTER: Sending to executor blocking task number 2
EXECUTOR: Completed blocking task number 1
MASTER: Well done executor - you seem to have completed blocking task number 1
EXECUTOR: Completed blocking task number 2
MASTER: Well done executor - you seem to have completed blocking task number 2
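I understand from Combining asyncio with a multi-worker ProcessPoolExecutor that, as things stand, my await loop.run_in_executor() call makes each iteration wait for its result before the loop continues. I don't know how to replace it so that async for can move on to the next generated value while the executor is still working. Note: I am not using asyncio.gather as in their example.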
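Answer
If you want at most two processes running your tasks, the simplest approach is to create the executor with max_workers=2. You can then submit tasks as fast as possible, i.e. proceed to the next iteration of async for without waiting for the previous task to finish. At the end you can gather the results of all tasks, to make sure exceptions don't go unnoticed (and possibly to get the actual results).
The following code produces the expected output: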
from concurrent.futures import ProcessPoolExecutor
import asyncio
import time

async def mygen(u: int = 2):
    i = 0
    while i < u:
        yield i
        i += 1

def blocking(delay):
    time.sleep(delay+1)
    return('EXECUTOR: Completed blocking task number ' + str(delay+1))

async def run_blocking(executor, task_no, delay):
    print('MASTER: Sending to executor blocking task number '
          + str(task_no))
    # look up the running loop here instead of relying on a global
    loop = asyncio.get_running_loop()
    result = await loop.run_in_executor(executor, blocking, delay)
    print(result)
    print('MASTER: Well done executor - you seem to have completed '
          'blocking task number ' + str(task_no))

async def non_blocking():
    tasks = []
    with ProcessPoolExecutor(max_workers=2) as executor:
        async for i in mygen():
            # spawn the task and let it run in the background
            tasks.append(asyncio.create_task(
                run_blocking(executor, i + 1, i)))
        # wait for all tasks while the executor is still open;
        # if there was an exception, retrieve it now
        await asyncio.gather(*tasks)

# the guard keeps worker processes from re-running the script
# on platforms that start them by re-importing this module
if __name__ == '__main__':
    asyncio.run(non_blocking())
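If you also want the actual return values rather than just printing them, one option is to keep the futures returned by run_in_executor and let asyncio.gather hand back their results. The following is only a minimal sketch under a few assumptions: collect_results and its delays argument are hypothetical names introduced for illustration, and blocking is simplified to return a short string.

```python
import asyncio
import time
from concurrent.futures import Executor, ProcessPoolExecutor
from typing import List, Sequence


def blocking(delay: float) -> str:
    # Stand-in for the real blocking work (hypothetical, simplified).
    time.sleep(delay)
    return f'slept {delay}s'


async def collect_results(executor: Executor,
                          delays: Sequence[float]) -> List[str]:
    loop = asyncio.get_running_loop()
    # run_in_executor returns a future immediately, so all jobs are
    # submitted before any of them is awaited.
    futures = [loop.run_in_executor(executor, blocking, d) for d in delays]
    # gather returns results in submission order, not completion order,
    # and re-raises the first exception raised by any job.
    return await asyncio.gather(*futures)


if __name__ == '__main__':
    with ProcessPoolExecutor(max_workers=2) as executor:
        # prints ['slept 0.2s', 'slept 0.1s']
        print(asyncio.run(collect_results(executor, [0.2, 0.1])))
```

Because gather keeps submission order, the second job's result comes second in the list even though it finishes first. If you need results in completion order instead, asyncio.as_completed is the usual alternative.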