5.7.10. Scrapy项目——开源爬虫系统

  • /html/head/title : selects the <title> element, inside the <head> element of a HTML document
  • /html/head/title/text() : selects the text inside the aforementioned <title> element.
  • //td : selects all the <td> elements
  • //div[@class="mine"] : selects all div elements which contain an attribute class=”mine”

轻量级爬虫: