How to avoid loading comments when parsing XML with lxml

2022-01-10 00:00:00 python xml xml-parsing comments lxml

Question

I am trying to parse an XML file in Python using lxml like this:

objectify.parse(xmlPath, parserWithSchema)

However, the XML file may contain comments in strange places:

<root>
    <text>Sample text</text>
    <float>1.23456</float>
</root>

Is there a way to skip loading comments, or to remove them, before parsing?

Answer

Set remove_comments=True on the parser (documentation):

from lxml import etree, objectify

parser = etree.XMLParser(remove_comments=True)
tree = objectify.parse(xmlPath, parser=parser)
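The effect of remove_comments=True can be checked on a small in-memory document. The following is only a sketch: the SAMPLE bytes, the tag names title and price, and the helper count_comments are made up for illustration and do not come from the question.

```python
from io import BytesIO
from lxml import etree, objectify

# Hypothetical sample document with one comment between elements
# and one comment in the middle of a value.
SAMPLE = (
    b"<root>"
    b"<title>Sample text</title>"
    b"<!-- a comment between elements -->"
    b"<price>1.23<!-- a comment inside a value -->456</price>"
    b"</root>"
)


def count_comments(parser):
    """Parse SAMPLE with the given parser and count comment nodes in the tree."""
    tree = objectify.parse(BytesIO(SAMPLE), parser=parser)
    return sum(1 for _ in tree.getroot().iter(etree.Comment))


# A parser with default settings keeps comment nodes in the tree.
kept = count_comments(etree.XMLParser())

# With remove_comments=True the parser discards comments while parsing.
stripping_parser = etree.XMLParser(remove_comments=True)
removed = count_comments(stripping_parser)

# Text that was split by a comment is read back as one string.
root = objectify.parse(BytesIO(SAMPLE), parser=stripping_parser).getroot()
price_text = root.findtext("price")

print(kept, removed, price_text)
```

Because the comments are dropped during parsing, no separate clean-up pass over the tree is needed afterwards.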

Alternatively, use the objectify.makeparser() function:

parser = objectify.makeparser(remove_comments=True)
tree = objectify.parse(xmlPath, parser=parser)
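The question passes a parser named parserWithSchema, which suggests the file is also validated against a schema. How that parser was built is not shown, so the following is a sketch under assumptions: the XSD, the DOC bytes, and the tag names are hypothetical. It relies on objectify.makeparser() forwarding its keyword arguments to etree.XMLParser, so schema and remove_comments can be set together.

```python
from io import BytesIO
from lxml import etree, objectify

# Hypothetical schema; the real one from the question is not shown.
XSD = b"""<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="root">
    <xs:complexType>
      <xs:sequence>
        <xs:element name="title" type="xs:string"/>
        <xs:element name="price" type="xs:decimal"/>
      </xs:sequence>
    </xs:complexType>
  </xs:element>
</xs:schema>"""

# Hypothetical document with a comment between two elements.
DOC = b"<root><title>Sample text</title><!-- note --><price>1.23456</price></root>"

schema = etree.XMLSchema(etree.fromstring(XSD))

# One parser that both validates against the schema and drops comments.
parser_with_schema = objectify.makeparser(schema=schema, remove_comments=True)

root = objectify.parse(BytesIO(DOC), parser=parser_with_schema).getroot()
comment_count = sum(1 for _ in root.iter(etree.Comment))

print(comment_count, root.findtext("title"))
```

With a schema attached to the parser, a document that violates the schema should be rejected at parse time with etree.XMLSyntaxError, so the call may need a try/except around it.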

Hope this helps.
