OpenMP 如何处理嵌套循环?

2021-12-08 00:00:00 loops parallel-processing openmp c++

以下代码只是并行化第一个(外部)循环,还是并行化整个嵌套循环?

Does the following code just parallelize the first (outer) loops, or it parallelize the entire nested loops?

    #pragma omp parallel for
    for (int i=0;i<N;i++)
    { 
      for (int j=0;j<M;j++)
      {
       //do task(i,j)//
      }
    }

我只想确定上面的代码是否会并行化整个嵌套的 for 循环(因此一个线程直接与 task(i,j) 相关),或者它只并行化外部 for 循环(从而确保,对于每个循环索引为 i 的并行线程,其内部循环将在单个线程中依次完成,这非常重要).

I just want to make sure if the above code will parallelize the entire nested for-loops (thus one thread directly related task(i,j)), or it only parallelizes the outer for-loop (thus it ensures that, for each parrallel thread with loop index i, its inner loop will be done sequentially in a single thread, which is very import).

推荐答案

您编写的行将仅并行化外循环.要并行化两者,您需要添加一个 collapse 子句:

The lines you have written will parallelize only the outer loop. To parallelize both you need to add a collapse clause:

#pragma omp parallel for collapse(2)
    for (int i=0;i<N;i++)
    { 
      for (int j=0;j<M;j++)
      {
       //do task(i,j)//
      }
    }

您可能需要查看 OpenMP 3.1 规范(第 2.5.1 节)以了解更多详细信息.

You may want to check OpenMP 3.1 specifications (sec 2.5.1) for more details.

相关文章