Giuseppe: python - Scrapy: How to make a conditional (present or absent) XPATH return values when absent? -

Tuesday, 15 February 2011

python - Scrapy: How to make a conditional (present or absent) XPATH return values when absent? -

I am trying to scrape the information of a particular product from a website. One of my desired XPATH criteria, however, does not appear on every product page. (While all products have names, prices, etc., some recommended age is not shown).

This is not a problem, however, when Scrappy writes or even gives data to a shell, then it is no longer associated with the list of start-url, nor does it allow data from some URL Respects absence Therefore, all of my data (multiple columns of different variables) do not match the new age column because it is too short and out of order. This is not the case when I focus on the products that display the age.

Is there a way to create a page without the desired XPATH, and the age returns an empty space to maintain the matching column order in my data?

Here is my XPATH selector:

  items ["age"] = hxs.select ('// li [contains (@class,' our-age ') ] / Span / text () '). Remove ()    (Some webpages do not have age and there is a complete lack of path in this way.)   
 
   xpath = '// li [contains (@class," our-age ")] / span / text ()' items [" age "] = hxs Select (xpath) .extract () or [']]    

 




Posted by



Unknown




at

02:22











Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest




No comments:







Post a Comment




Newer Post


Older Post

Home




Subscribe to:
Post Comments (Atom)


















    
About Me




Unknown



View my complete profile



Blog Archive








        ► 
      



2015

(1886)





        ► 
      



September

(203)







        ► 
      



August

(208)







        ► 
      



July

(224)







        ► 
      



June

(210)







        ► 
      



May

(230)







        ► 
      



April

(195)







        ► 
      



March

(209)







        ► 
      



February

(201)







        ► 
      



January

(206)









        ► 
      



2014

(2117)





        ► 
      



September

(239)







        ► 
      



August

(251)







        ► 
      



July

(226)







        ► 
      



June

(208)







        ► 
      



May

(229)







        ► 
      



April

(199)







        ► 
      



March

(255)







        ► 
      



February

(275)







        ► 
      



January

(235)









        ► 
      



2013

(2011)





        ► 
      



September

(199)







        ► 
      



August

(228)







        ► 
      



July

(210)







        ► 
      



June

(222)







        ► 
      



May

(217)







        ► 
      



April

(229)







        ► 
      



March

(243)







        ► 
      



February

(221)







        ► 
      



January

(242)









        ► 
      



2012

(1993)





        ► 
      



September

(227)







        ► 
      



August

(235)







        ► 
      



July

(225)







        ► 
      



June

(206)







        ► 
      



May

(221)







        ► 
      



April

(216)







        ► 
      



March

(206)







        ► 
      



February

(227)







        ► 
      



January

(230)









        ▼ 
      



2011

(1964)





        ► 
      



September

(220)







        ► 
      



August

(222)







        ► 
      



July

(219)







        ► 
      



June

(224)







        ► 
      



May

(219)







        ► 
      



April

(206)







        ► 
      



March

(216)







        ▼ 
      



February

(221)

List tables in a PostgreSQL schema -
c - Programming assignment says test program will ...
ios - load images in tableview ios6.1 -
c# - Dynamically set UserControl as Listbox DataTe...
php - mysqlworkbench giving version error on expor...
iphone - willSendRequestForAuthenticationChallenge...
c++ - Pipe erratic behaviour in linux / MSVC 2012 -
sql server - Conditional reuse of subquery -
c# - MVC4 with EF sample code with Unit of work, g...
Can Silverlight Pivotviewer handle 3 levels of sem...
javascript - how can i get rid of duplication in t...
php - I have one select list, which should send da...
asp.net mvc - JQuery Post data to MVC Action Metho...
android - Enumerate all elements in Selenium Pytho...
java - Can't run yui jar file - missing variable n...
javascript - Get Json from asp.net errors -
php - workaround for tinymce utf8 bug -
java - how can i send information from broadcast r...
oauth - Does the User ID returned by
https://www.g...
c# - Finding the intersection of a line -
java - Can I hide the file extensions in an SWT Fi...
c# - Kendo TabStrip: Passing the Parent Model to t...
java - Algorithm need assistance -
python - How to set color of text using xlwt -
jsp - Cookie sharing across application deployed o...
sql server - SUM Function For 2 Tables Produces Wr...
How do I send file name with file using sockets in...
How to remove a char from a string, and return a n...
Cannot display XML in my JavaScript dropdown menu....
unicode - UnicodeDecodeError in SQLite -
bash - How can I print out all lines of a file con...
python - Sub matrix of a list of lists (without nu...
javascript - How to check if a property contains a...
javascript - Create a setter that updates a dom el...
c# - CommandBinding in ContextMenu -
Official Google Translate API v2 (Android) -
map - Google Earth Pro Alternatives -
linq - C# Removing Duplicates from a List Containi...
splitting values to a column with case statement i...
android - Html(jquery) drag and drop not working i...
c++ - How to check if CD drive is open or closed i...
c# - Size of image increases after cropping -
regex - regular expression pattern in PHP -
c# . MySql. "INSERT". error -
c# - Is Azure TableStorageMembershipProvider reall...
floating - How do I get a float value when dividin...
java - The this keyword in classes -
c - When is it necessary to allocate dynamic memor...
javascript - Efficient way to convert units from o...
Q. Is there a way to create Foursquare user accoun...
.htaccess - Rewriting Subdomains to Subdirectory w...
If the client and server use the same certificate ...
Block elements take full width when text wraps on ...
ruby - Validate Rails model just when some method ...
java - Is setting a hashmap thread safe? -
processing - ProcessingJS fails to draw images pro...
c# - It's possible signalR client web to connect a...
jquery - Lazy Load Collection Images in Shopify -
.Net 4.0 C# When loading SHA256 key SignatureAlgor...
javascript - Ember.js computed property with Array...
php - AJAX chat - Personal alerts for users (comma...
javascript - Three.JS - Child object rotation in i...
android - AndEngine Smooth Rotation along Path -
python - Create subdictionary from dictionary if v...
c# - Connecting to web-service without metadata -
triggers - Jenkins Job - Build If SQL Condition Tr...
javascript - Canvas eyeDropper -
visual studio 2010 - Sevenzip extractFile(String f...
amazon web services - puppet exec vagrant plugin i...
Jquery vegas plugin image resize within elements -
android - Creating OnDragListener for Google Map v...
css - Why Won't My Simple Icons Appear? -
Codekit: Any way to reorder projects? -
html - Change page title with PJAX? -
ios - Creating UIScrollView Programmatically -
java - BFS graph traversal - Append new node to ad...
I need to extract numbers from a String in Java -
php - Sometimes missing textarea value when proces...
c# - Why does a generic and a non-generic ICompara...
Excel: delete entire row when other cell equals a ...
data structures - How to code 2D segment tree? -
cocoa - Autolayout constraint where height is the ...
javascript - register click on everything on body ...
c++ - Any method to calculate the distance between...
stored procedures - Fatal error: Call to a member ...
Extjs overrides - loading required file before ove...
java - How to get count of childnodes in XML file ...
oracle11g - Oracle rebuild- what exactly did our D...
asp.net - Why does this give me a YSOD? -
combobox - Set Default Value to a Combo box, when ...
Django Admin Inlines -
wpf - CheckBoxes In ListBox Make Scrolling Lag -
How to get the url parameter in Portofino 4.0.8 de...
jquery - Create a broadcast service for my website...
ipad - How do videos autoplay on iOS at youtube.com -
How to migrate SVN repo with full codebase to GIT ...
asp.net mvc - Can not parse datatable ajax request -
CSS - is it good practice to override an attribute...
How to create a new directory using java? -
jquery - Design pattern for real time data -








        ► 
      



January

(217)









        ► 
      



2010

(1952)





        ► 
      



September

(230)







        ► 
      



August

(202)







        ► 
      



July

(221)







        ► 
      



June

(207)







        ► 
      



May

(213)







        ► 
      



April

(199)







        ► 
      



March

(234)







        ► 
      



February

(244)







        ► 
      



January

(202)


















    















Powered by Blogger.