How to do an inverse "range", i.e. create a compact range from a set of numbers?

2022-01-24 00:00:00 python numbers range pcre

Question

Python has a range function, which allows for things like:

>>> list(range(1, 6))
[1, 2, 3, 4, 5]

What I'm looking for is roughly the opposite: take a list of numbers and return the start and end.

>>> magic([1, 2, 3, 4, 5])
[1, 5] # note: 5, not 6; this differs from `range()`

This is easy enough to do for the above example, but is it possible to allow for gaps or multiple ranges as well, returning the ranges in a PCRE-like string format? Something like this:

>>> magic([1, 2, 4, 5])
['1-2', '4-5']
>>> magic([1, 2, 3, 4, 5])
['1-5']

I'm looking for a Python solution, but I welcome working examples in other languages as well. It's more about figuring out an elegant, efficient algorithm. Bonus question: is there any programming language that has a built-in method for this?

Solution

A nice trick to simplify the code is to look at the difference between each element of the sorted list and its index:

a = [4, 2, 1, 5]
a.sort()
# value minus index stays constant within a run of consecutive numbers
print([x - i for i, x in enumerate(a)])

This prints:

[1, 1, 2, 2]

Each run of equal differences corresponds to a run of consecutive numbers in a. We can now use itertools.groupby() to extract these runs. Here's the complete code:

from itertools import groupby

def sub(x):
    # x is an (index, value) pair produced by enumerate()
    return x[1] - x[0]

a = [5, 3, 7, 4, 1, 2, 9, 10]
ranges = []
for k, iterable in groupby(enumerate(sorted(a)), sub):
    rng = list(iterable)
    if len(rng) == 1:
        # a run of one number is printed on its own
        s = str(rng[0][1])
    else:
        s = "%s-%s" % (rng[0][1], rng[-1][1])
    ranges.append(s)
print(ranges)

This prints:

['1-5', '7', '9-10']
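One caveat worth noting: the value-minus-index trick assumes the sorted list has no repeated values, since a duplicate shifts the difference and splits a run (for example, [1, 2, 2, 3] would yield '1-2' and '2-3'). The following is a minimal sketch, not part of the original answer, that wraps the same approach in a function and removes duplicates first. The name magic is borrowed from the question; using sorted(set(...)) to deduplicate is an assumption about the desired behavior.

```python
from itertools import groupby


def magic(numbers):
    """Collapse integers into 'start-end' strings (hypothetical helper)."""
    # Assumption: duplicates should be ignored. sorted(set(...)) drops them,
    # so value minus index stays constant across each consecutive run.
    unique = sorted(set(numbers))
    ranges = []
    for _, group in groupby(enumerate(unique), key=lambda pair: pair[1] - pair[0]):
        run = [value for _, value in group]
        if len(run) == 1:
            ranges.append(str(run[0]))
        else:
            ranges.append("%s-%s" % (run[0], run[-1]))
    return ranges


print(magic([1, 2, 4, 5]))               # ['1-2', '4-5']
print(magic([5, 3, 7, 4, 1, 2, 9, 10]))  # ['1-5', '7', '9-10']
print(magic([1, 2, 2, 3]))               # ['1-3']
```

An empty input simply returns an empty list, because groupby() yields no groups.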
