html - Python scrapy, how to only get immediate children

- June 15, 2015

So I have HTML like this:

    <div class="content">
        <div class="infobox">
            <p> text </p>
            <p> more text </p>
        </div>
        <p> text again </p>
        <p> even more text </p>
    </div>

I am using the selector '.content p::text', which I thought would give me only the immediate children. I wanted to extract "text again" and "even more text", but it is also returning the text of the paragraphs inside the other div. How can I prevent this? I want only the text of the paragraphs that are immediate children of the div with class .content.

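Scrapy supports an extended set of CSS selectors as well as XPath selectors. In this case, you're using CSS selectors. A space between two selectors is the descendant combinator, so '.content p' matches every p at any depth inside .content, including the ones nested in .infobox. The CSS combinator you want is >, which denotes a direct parent/child relationship, as in: .content > p::text. Scrapy's selectors are described in the section titled "Selectors" in the documentation.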
