如何使用 LXML 递归查找 XML 标记?

2022-01-20 00:00:00 python xml find lxml

问题描述

<?xml version="1.0" ?>
<data>
    <test >
        <f1 />
    </test >
    <test2 >
        <test3>
         <f1 />
        </test3>
    </test2>
    <f1 />
</data>

使用 lxml 是否可以递归地找到标签"f1 "?我尝试了 findall 方法,但它只适用于直系子女.

Using lxml is it possible to find recursively for tag " f1 "? I tried findall method but it works only for immediate children.

我想我应该为此选择 BeautifulSoup !!!

I think I should go for BeautifulSoup for this !!!


解决方案

可以使用XPath递归搜索:

You can use XPath to search recursively:

>>> from lxml import etree
>>> q = etree.fromstring('<xml><hello>a</hello><x><hello>b</hello></x></xml>')
>>> q.findall('hello')     # Tag name, first level only.
[<Element hello at 414a7c8>]
>>> q.findall('.//hello')  # XPath, recursive.
[<Element hello at 414a7c8>, <Element hello at 414a818>]

相关文章