在 Python 中测量带参数的函数的时间
问题描述
我正在尝试测量 raw_queries(...)
的时间,但到目前为止没有成功.我发现我应该使用 timeit 模块.问题是我不能(= 我不知道如何)将参数从环境传递给函数.
I am trying to measure the time of raw_queries(...)
, unsuccessfully so far. I found that I should use the timeit module. The problem is that I can't (= I don't know how) pass the arguments to the function from the environment.
重要提示:在调用raw_queries
之前,我们必须执行phase2()
(环境初始化).
Important note: Before calling raw_queries
, we have to execute phase2()
(environment initialization).
旁注:代码在 Python 3 中.
Side note: The code is in Python 3.
def raw_queries(queries, nlp):
""" Submit queries without getting visual response """
for q in queries:
nlp.query(q)
def evaluate_queries(queries, nlp):
""" Measure the time that the queries need to return their results """
t = Timer("raw_queries(queries, nlp)", "?????")
print(t.timeit())
def phase2():
""" Load dictionary to memory and subsequently submit queries """
# prepare Linguistic Processor to submit it the queries
all_files = get_files()
b = LinguisticProcessor(all_files)
b.loadDictionary()
# load the queries
queries_file = 'queries.txt'
queries = load_queries(queries_file)
if __name__ == '__main__':
phase2()
感谢您的帮助.
更新:我们可以使用 Timer
的第二个参数调用 phase2()
.问题是我们需要环境中的参数 (queries, nlp)
.
UPDATE: We can call phase2()
using the second argument of Timer
. The problem is that we need the arguments (queries, nlp)
from the environment.
更新:迄今为止最好的解决方案,在 unutbu 的帮助下(仅更改了什么):
UPDATE: The best solution so far, with unutbu's help (only what has changed):
def evaluate_queries():
""" Measure the time that the queries need to return their results """
t = Timer("main.raw_queries(queries, nlp)", "import main;
(queries,nlp)=main.phase2()")
sf = 'Execution time: {} ms'
print(sf.format(t.timeit(number=1000)))
def phase2():
...
return queries, b
def main():
evaluate_queries()
if __name__ == '__main__':
main()
解决方案
首先,永远不要使用time模块对函数进行计时.很容易得出错误的结论.有关示例,请参阅 timeit 与计时装饰器.
First, never use the time module to time functions. It can easily lead to wrong conclusions. See timeit versus timing decorator for an example.
为函数调用计时的最简单方法是使用 IPython 的 %timeit 命令.在那里,您只需启动一个交互式 IPython 会话,调用 phase2()
,定义 queries
,然后运行
The easiest way to time a function call is to use IPython's %timeit command.
There, you simply start an interactive IPython session, call phase2()
, define queries
,
and then run
%timeit raw_queries(queries,nlp)
我知道使用 timeit 的第二个最简单的方法是从命令行调用它:
The second easiest way that I know to use timeit is to call it from the command-line:
python -mtimeit -s"import test; queries=test.phase2()" "test.raw_queries(queries)"
(在上面的命令中,我假设脚本名为 test.py
)
(In the command above, I assume the script is called test.py
)
这里的成语是
python -mtimeit -s"SETUP_COMMANDS" "COMMAND_TO_BE_TIMED"
为了能够将 queries
传递给 raw_queries
函数调用,您必须定义 queries
变量.在您发布的代码中,queries
是在 phase2()
中定义的,但仅限于本地.因此,要将 queries
设置为全局变量,您需要执行一些操作,例如让 phase2
return queries
:
To be able to pass queries
to the raw_queries
function call, you have to define the queries
variable. In the code you posted queries
is defined in phase2()
, but only locally. So to setup queries
as a global variable, you need to do something like have phase2
return queries
:
def phase2():
...
return queries
如果你不想这样弄乱phase2
,创建一个虚拟函数:
If you don't want to mess up phase2
this way, create a dummy function:
def phase3():
# Do stuff like phase2() but return queries
return queries
相关文章