(Python) XPath Grammar

March 01, 2018

XPath is short for "XML Path Language".

It is a query language for navigating an XML document and selecting parts of it.

The W3C published it in 1999, and it can be used from languages such as Java, Python, and C#.

Unfortunately, BeautifulSoup does not support XPath.
XPath usage is similar to CSS selector usage (for example, my#idname).

It is built on four concepts:

Root node vs. non-root nodes

- /div selects the div node only when it is the root node of the document.

- //div selects every div node in the document, at any depth.
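
The difference between the two can be sketched with Python's standard-library xml.etree.ElementTree. This is only a rough illustration: ElementTree supports a limited subset of XPath and evaluates paths relative to an element, so the root check is emulated in plain Python here. The sample document and the variable names are made up for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document: a root <div> that contains a nested <div>.
doc = ET.fromstring(
    "<div id='root'>"
    "<p>intro</p>"
    "<section><div id='inner'>nested</div></section>"
    "</div>"
)

# "/div": ElementTree cannot evaluate absolute paths on an element,
# so this is emulated by checking the tag of the root element itself.
root_divs = [doc] if doc.tag == "div" else []

# "//div": iter("div") walks the whole tree in document order,
# including the root element.
all_divs = list(doc.iter("div"))

root_ids = [d.get("id") for d in root_divs]
all_ids = [d.get("id") for d in all_divs]

print(root_ids)  # ['root']
print(all_ids)   # ['root', 'inner']
```

A full XPath 1.0 engine, such as the one in the third-party lxml package, would accept /div and //div as written.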

Selecting attributes

- //@href selects every href attribute node in the document.

ex) //a[@href='http://google.com'] selects every a node in the document whose href is "http://google.com".
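
Here is a small sketch of attribute selection, again with the standard-library xml.etree.ElementTree. ElementTree accepts attribute predicates such as [@href] and [@href='...'], but it cannot return attribute nodes directly, so //@href is approximated by reading the attribute from every element that has one. The sample links are invented for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document with two links and one plain anchor.
doc = ET.fromstring(
    "<body>"
    "<a href='http://google.com'>Google</a>"
    "<a href='http://example.com'>Example</a>"
    "<a name='anchor-only'>No link</a>"
    "</body>"
)

# Approximation of "//@href": find every element that has an href
# attribute, then read the attribute value from each one.
hrefs = [el.get("href") for el in doc.findall(".//*[@href]")]

# "//a[@href='http://google.com']": ElementTree needs the leading "."
# because its paths are relative to the element being searched.
google_links = doc.findall(".//a[@href='http://google.com']")
google_texts = [a.text for a in google_links]

print(hrefs)         # ['http://google.com', 'http://example.com']
print(google_texts)  # ['Google']
```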

Selecting nodes by position

- (//a)[3] selects the third link in the document.

- (//table)[last()] selects the last table in the document.

- (//a)[position()<3] selects the first and second links in the document.
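
The positional expressions above can be imitated in Python as follows. Note the assumptions: the standard-library xml.etree.ElementTree does not support the parenthesized (//a)[n] form or position(), so this sketch collects the nodes in document order and uses list indexing instead. XPath positions start at 1, while Python indexes start at 0. The sample markup is invented.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document: three links spread over two paragraphs,
# followed by two tables.
doc = ET.fromstring(
    "<body>"
    "<p><a>first</a><a>second</a></p>"
    "<p><a>third</a></p>"
    "<table id='t1'/><table id='t2'/>"
    "</body>"
)

# findall(".//tag") returns matches in document order.
links = doc.findall(".//a")
tables = doc.findall(".//table")

# (//a)[3]: XPath counts from 1, so the third link is index 2.
third_link = links[2].text

# (//table)[last()]: the last table is index -1.
last_table = tables[-1].get("id")

# (//a)[position()<3]: positions 1 and 2 are the slice [:2].
first_two = [a.text for a in links[:2]]

print(third_link)  # third
print(last_table)  # t2
print(first_two)   # ['first', 'second']
```

The parentheses matter in real XPath: //a[3] without them means "every a that is the third a child of its own parent", which in this sample document would match nothing.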

The asterisk (*) is a wildcard that matches any element (or, written as @*, any attribute), which makes it useful in many situations.

- //table/tr/* selects every child element of each tr that sits directly under a table.

- //div[@*] selects every div tag that has at least one attribute.
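
A minimal sketch of both wildcard forms with the standard-library xml.etree.ElementTree. The element wildcard works directly, but ElementTree does not understand the [@*] predicate, so that part is approximated by checking whether each div has a non-empty attribute dictionary. The sample markup is made up for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document: one table row with two cells,
# one div with an attribute and one div without.
doc = ET.fromstring(
    "<body>"
    "<table><tr><th>Name</th><td>Ada</td></tr></table>"
    "<div class='note'>has an attribute</div>"
    "<div>plain</div>"
    "</body>"
)

# "//table/tr/*": every child element of each tr directly under a table.
cells = doc.findall(".//table/tr/*")
cell_tags = [c.tag for c in cells]

# Approximation of "//div[@*]": keep the divs whose attribute
# dictionary is not empty, i.e. divs with at least one attribute.
divs_with_attrs = [d for d in doc.iter("div") if d.attrib]
div_texts = [d.text for d in divs_with_attrs]

print(cell_tags)  # ['th', 'td']
print(div_texts)  # ['has an attribute']
```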
