
I have used it when scraping data from web pages with the Scrapy framework. It's a more reliable way to extract something from web pages than regex.


don't overlook the ability to mix and match them, because each "axis" is good at its own things

  response.xpath('//div[contains(@data-foo, "foo")]').css(".some-class").re(r"[a-z][a-zA-Z]+")
The .css() flavor gets compiled down into .xpath() but there is no contest about their expressivity: https://github.com/scrapy/parsel/blob/v1.9.1/parsel/csstrans...



