解析XPath语法之在C#中使用XPath的示例详解

程序员文章站 2023-12-18 00:01:34

xpath可以快速定位到xml中的节点或者属性。xpath语法很简单，但是强大够用，它也是使用xslt的基础知识。示例xml：复制代码代码如下:

xpath可以快速定位到xml中的节点或者属性。xpath语法很简单，但是强大够用，它也是使用xslt的基础知识。
示例xml：

<?xml version="1.0" encoding="utf-8" ?>
<pets>
  <cat color="black" weight="10">
    <price>100</price>
    <desc>this is a black cat</desc>
  </cat>
  <cat color="white" weight="9">
    <price>80</price>
    <desc>this is a white cat</desc>
  </cat>
  <cat color="yellow" weight="15">
    <price>80</price>
    <desc>this is a yellow cat</desc>
  </cat>

 
  <dog color="black" weight="10">
    <price>100</price>
    <desc>this is a black dog</desc>
  </dog>
  <dog color="white" weight="9">
    <price>80</price>
    <desc>this is a white dog</desc>
  </dog>
  <dog color="yellow" weight="15">
    <price>80</price>
    <desc>this is a yellow dog</desc>
  </dog>
</pets>

xpath的语法：
1. xpath中的符号

符号

说明

示例

示例说明

表示从根节点开始选择

/pets

选择根节点pets

表示节点和子节点之间的间隔符

/pets/dog

选择pets节点下的dog节点

//xx

表示从整个xml文档中查找，而不考虑当前节点位置

//price

选择文档中所有的price节点

单个英文半角句点表示选择当前节点

/pets/.

选择pets节点

双点，表示选择父节点

/pets/dog[0]/..

表示pets节点，也就是第一个dog节点的父节点

@xx

表示选择属性

//dog/@color

表示选择所有dog节点的color属性集合

[…]

中括号表示选择条件，括号内为条件

//dog[@color='white']

所有color为white的dog节点

//dog[/price<100]

所有price字节点值小于100的dog节点

中括号内数字为节点索引，类似c#等语言中的数组,数组下标是从1开始的

//dog[1]

第1个dog节点

//dog[last()]

最后一个dog节点，last()是xpath内置函数

单竖杠表示合并节点结合

//dog[@color='white'] | //cat[@color='white']

color属性为white的dog节点和color属性为white的cat节点

星号表示任何名字的节点或者属性

//dog/*

表示dog节点的所有子节点

//dog/@*

表示dog节点的所有属性节点

2. xpath数学运算符
+ 加号表示加
- 表示数字相减
* 表示乘以
div 表示除以，这里数学上的除号/已经被用作节点之间分隔符了
mod 表示取余
3. xpath逻辑运算符
= 等于，相当于c#中的 ==
!= 不等于
> 大于
>= 大于等于
< 小于
<= 小于等于
and 并且与关系
or 或者或关系
4. xpath axes 从字面翻译这个是xpath轴的意思，但根据我的理解这个翻译成xpath节点关系运算关键字更合适，就是一组关键字加上::双冒号表示和当前节点有关系的一个或者一组节点.
使用语法： axisname::nodetest[predicate] 即轴名字::节点名字[取节点条件]
具体说明如下：

关键字

说明

示例

示例说明

ancestor

当前节点的父祖节点

ancestor::pig

当前节点的祖先节点中的pig节点

ancestor-or-self

当前节点以及其父祖节点

ancestor::pig

attribute

当前节点的所有属性

attribute::weight

相当于@weight，attribute::和@是等价的

child

当前节点的所有字节点

child::*[name()!='price']

选择名字不是price的子节点

descendant

子孙节点

descendant::*[@*]

有属性的子孙节点

descendant-or-self

子孙节点以及当前节点

descendant-or-self::*

following

xml文档中当前节点之后的所有节点

following::*

following-sibling

当前节点的同父弟弟节点

following-sibling::

preceding

xml文档中当前节点之前的所有节点

preceding::*

namespace

选取当前节点的所有命名空间节点

namespace::*

parent

当前节点的父节点

parent::

相当于双点..

preceding-sibling

当前节点之后的同父兄节点

preceding-sibling::*

self

当前节点

self::*

相当于单点.

5. 常用的xpath函数介绍：

在xpath表达式中常用的函数有下面两个：

position() 表示节点的序号例如 //cat[position() = 2] 表示取序号为2的dog节点

last() 表示取最后一个节点 //cat[last()]

name() 表示当前节点名字 /pets/*[name() != 'pig'] 表示/pets下名字不是pig的子节点

xpath的函数还有很多，包括字符串函数，数字函数和时间函数等，具体可以参考w3的网站。

以上是xpath的语法，下面我们看下如何在.net中使用xpath

在.net中可以通过xpathdocument或者xmldocument类使用xpath。xpathdocument是只读的方式定位xml节点或者属性文本等，而xmldocument则是可读写的。

如下代码示例展示了如何使用xpathdocument和xmldocument。

复制代码代码如下:

using system;
using system.collections.generic;
using system.linq;
using system.text;
using system.xml.xpath;
using system.xml;

namespace usexpathdotnet
{
    class program
    {
        static void main(string[] args)
        {
            usexpathwithxpathdocument();

            usexpathwithxmldocument();

            console.read();
        }

        static void usexpathwithxmldocument()
        {
            xmldocument doc = new xmldocument();
            doc.load("//www.jb51.net");
            //使用xpath选择需要的节点
            xmlnodelist nodes = doc.selectnodes("/rss/channel/item[position()<=10]");
            foreach (xmlnode item in nodes)
            {
                string title = item.selectsinglenode("title").innertext;
                string url = item.selectsinglenode("link").innertext;
                console.writeline("{0} = {1}", title, url);
            }
        }

        static void usexpathwithxpathdocument()
        {
            xpathdocument doc = new xpathdocument("//www.jb51.net");
            xpathnavigator xpathnav = doc.createnavigator();
            //使用xpath取rss中最新的10条随笔
            xpathnodeiterator nodeiterator = xpathnav.select("/rss/channel/item[position()<=10]");
            while (nodeiterator.movenext())
            {
                xpathnavigator itemnav = nodeiterator.current;
                string title = itemnav.selectsinglenode("title").value;
                string url = itemnav.selectsinglenode("link").value;
                console.writeline("{0} = {1}",title,url);
            }

        }
    }
}

xpath使用示例，请看下面的代码注释　

复制代码代码如下:

using system;
using system.collections.generic;
using system.linq;
using system.text;
using system.io;
using system.xml;

namespace usexpath1
{
    class program
    {
        static void main(string[] args)
        {
            string xml = @"<?xml version=""1.0"" encoding=""utf-8"" ?>
<pets>
  <cat color=""black"" weight=""10"" count=""4"">
    <price>100</price>
    <desc>this is a black cat</desc>
  </cat>
  <cat color=""white"" weight=""9"" count=""5"">
    <price>80</price>
    <desc>this is a white cat</desc>
  </cat>
  <cat color=""yellow"" weight=""15"" count=""1"">
    <price>110</price>
    <desc>this is a yellow cat</desc>
  </cat>

 
  <dog color=""black"" weight=""10"" count=""7"">
    <price>114</price>
    <desc>this is a black dog</desc>
  </dog>
  <dog color=""white"" weight=""9"" count=""4"">
    <price>80</price>
    <desc>this is a white dog</desc>
  </dog>
  <dog color=""yellow"" weight=""15"" count=""15"">
    <price>80</price>
    <desc>this is a yellow dog</desc>
  </dog>

    <pig color=""white"" weight=""100"" count=""2"">
    <price>8000</price>
    <desc>this is a white pig</desc>   
    </pig>
</pets>";

            using (stringreader rdr = new stringreader(xml))
            {
                xmldocument doc = new xmldocument();
                doc.load(rdr);

                //取所有pets节点下的dog字节点
                xmlnodelist nodelistalldog = doc.selectnodes("/pets/dog");

                //所有的price节点
                xmlnodelist allpricenodes = doc.selectnodes("//price");

                //取最后一个price节点
                xmlnode lastpricenode = doc.selectsinglenode("//price[last()]");

                //用双点号取price节点的父节点
                xmlnode lastpriceparentnode = lastpricenode.selectsinglenode("..");

                //选择weight*count=40的所有动物，使用通配符*
                xmlnodelist nodelist = doc.selectnodes("/pets/*[@weight*@count=40]");

                //选择除了pig之外的所有动物,使用name()函数返回节点名字
                xmlnodelist animalsexceptpignodes = doc.selectnodes("/pets/*[name() != 'pig']");

 
                //选择价格大于100而不是pig的动物
                xmlnodelist pricegreaterthan100s = doc.selectnodes("/pets/*[price div @weight >10 and name() != 'pig']");
                foreach (xmlnode item in pricegreaterthan100s)
                {
                    console.writeline(item.outerxml);
                }

                //选择第二个dog节点
                xmlnode theseconddognode = doc.selectsinglenode("//dog[position() = 2]");

                //使用xpath ，axes 的 parent 取父节点
                xmlnode parentnode = theseconddognode.selectsinglenode("parent::*");

                //使用xpath选择第二个dog节点前面的所有dog节点
                xmlnodelist dogpresibling = theseconddognode.selectnodes("preceding::dog");

                //取文档的所有子孙节点price
                xmlnodelist childrennodes = doc.selectnodes("descendant::price");
            }

            console.read();
        }
    }
}

解析XPath语法之在C#中使用XPath的示例详解