如何进行逆“范围",即根据一组数字创建一个紧凑的范围?

2022-01-24 00:00:00 python numbers range pcre

问题描述

Python 有一个 range 方法,它允许使用以下内容:

Python has a range method, which allows for stuff like:

>>> range(1, 6)
[1, 2, 3, 4, 5]

我正在寻找的是相反的:获取一个数字列表,然后返回开始和结束.

What I’m looking for is kind of the opposite: take a list of numbers, and return the start and end.

>>> magic([1, 2, 3, 4, 5])
[1, 5] # note: 5, not 6; this differs from `range()`

对于上面的示例来说,这很容易做到,但是是否也可以允许间隙或多个范围,以类似 PCRE 的字符串格式返回范围? 像这样:

This is easy enough to do for the above example, but is it possible to allow for gaps or multiple ranges as well, returning the range in a PCRE-like string format? Something like this:

>>> magic([1, 2, 4, 5])
['1-2', '4-5']
>>> magic([1, 2, 3, 4, 5])
['1-5']

我正在寻找 Python 解决方案,但我也欢迎使用其他语言的工作示例.它更多地是关于找出一个优雅、高效的算法.额外的问题:是否有任何编程语言对此具有内置方法?

I’m looking for a Python solution, but I welcome working examples in other languages as well. It’s more about figuring out an elegant, efficient algorithm. Bonus question: is there any programming language that has a built-in method for this?


解决方案

简化代码的一个好技巧是查看排序列表的每个元素及其索引的差异:

A nice trick to simplify the code is to look at the difference of each element of the sorted list and its index:

a = [4, 2, 1, 5]
a.sort()
print [x - i for i, x in enumerate(a)]

打印

[1, 1, 2, 2]

每个相同数字的运行对应于 a 中的一系列连续数字.我们现在可以使用 itertools.groupby() 来提取这些运行.这是完整的代码:

Each run of the same number corresponds to a run of consecutive numbers in a. We can now use itertools.groupby() to extract these runs. Here's the complete code:

from itertools import groupby

def sub(x):
    return x[1] - x[0]

a = [5, 3, 7, 4, 1, 2, 9, 10]
ranges = []
for k, iterable in groupby(enumerate(sorted(a)), sub):
     rng = list(iterable)
     if len(rng) == 1:
         s = str(rng[0][1])
     else:
         s = "%s-%s" % (rng[0][1], rng[-1][1])
     ranges.append(s)
print ranges

打印

['1-5', '7', '9-10']

相关文章