使用 2 个索引列表对 2D Numpy 数组进行索引

2022-01-31 00:00:00 python numpy arrays

问题描述

我遇到了一个奇怪的情况.

I've got a strange situation.

我有一个 2D Numpy 数组,x:

I have a 2D Numpy array, x:

x = np.random.random_integers(0,5,(20,8))

我有 2 个索引器——一个带有行索引,一个带有列索引.为了索引 X,我必须执行以下操作:

And I have 2 indexers--one with indices for the rows, and one with indices for the column. In order to index X, I am having to do the following:

row_indices = [4,2,18,16,7,19,4]
col_indices = [1,2]
x_rows = x[row_indices,:]
x_indexed = x_rows[:,column_indices]

不仅仅是:

x_new = x[row_indices,column_indices]

(失败并出现:错误,不能用(2,)广播(20,))

(which fails with: error, cannot broadcast (20,) with (2,))

我希望能够使用广播在一行中进行索引,因为这样可以使代码保持清洁和可读性......而且,我对 python 的底层知识不太了解,但是据我了解,在一行中完成它应该更快(我将使用相当大的数组).

I'd like to be able to do the indexing in one line using the broadcasting, since that would keep the code clean and readable...also, I don't know all that much about python under the hood, but as I understand it, it should be faster to do it in one line (and I'll be working with pretty big arrays).

测试用例:

x = np.random.random_integers(0,5,(20,8))

row_indices = [4,2,18,16,7,19,4]
col_indices = [1,2]
x_rows = x[row_indices,:]
x_indexed = x_rows[:,col_indices]

x_doesnt_work = x[row_indices,col_indices]


解决方案

如果你用 np.newaxis 编写你的第一次尝试会成功

Your first try would work if you write it with np.newaxis

x_new = x[row_indices[:, np.newaxis],column_indices]

相关文章