Giuseppe: regex - Tcl regexp parts of this string -

Saturday, 15 February 2014

regex - Tcl regexp parts of this string -

  [22.06.2013 23:23:41 UTC] - [&nbsp;&nbsp;&nbsp;PRE&nbsp;&nbsp;&nbsp;] - [&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<a href="?section=0DAY" title="Show only 0DAY">0DAY</a>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;] - [ <a href="?search=Aminsamples+Sexy+Melody+Vol+2+MIDI+6581">Aminsamples.Sexy.Melody.Vol.2.MIDI-6581</a> ] - <b>[2.30 MB]</b> - <b>[1 F]</b> - <span style="font-weight: bold;">[<a href="download/Aminsamples.Sexy.Melody.Vol.2.MIDI-6581.rar" title="Aminsamples.Sexy.Melody.Vol.2.MIDI-6581.rar">Download</a> ]

I want to grab the data so that it looks like this:

  [22.06.2013 23:23:41 UTC] - [PRE] - [0DAY] - [Aminsamples.Sexy.Melody.Vol.2.MIDI-6581] - [2.30 MB] - [1 F] - [Download]

But I'm not exactly sure how to do this. So far I can only manage to grab Aminsamples.Sexy.Melody.Vol.2.MIDI-6581.rar.

I want to do this within Tcl. Here is what I currently have:
  package require http

  # Fetch the page and bail out on connection problems.
  catch {set http [::http::geturl http://www.prelist.ws -timeout 15000]} error
  if {[string match "*error*" $error]} { puts "Connect error!"; return 0 }
  if {[string match "*timeout*" $error]} { puts "Timeout!"; return 0 }
  set html [::http::data $http]

  # Decode the entities used on the page.
  regsub -all "&amp;" $html {\&} html
  regsub -all "&times;" $html {*} html
  regsub -all "&nbsp;" $html {} html
  regsub -all -nocase "&#215;" $html "x" html
  regsub -all -nocase "&lt;" $html "<" html
  regsub -all -nocase "&gt;" $html ">" html
  regsub -all "<tt" $html "" html

  # Walk the page line by line and strip the markup around each title.
  foreach line [split $html "\n"] {
      if {[string match "*SHOW*" $line]} { continue }
      if {[string match "*title*" $line]} {
          regexp -nocase -- {title="(.*?)">} $line -> all
          regsub -all -nocase "title=" $line {} line
          regsub -all -nocase "download" $line {} line
          regsub -all -nocase "\"</a" $line {} line
          regsub -all -nocase "\" free" $line {} line
          regsub -all -nocase "\"" $line {} line
          regsub -all -nocase "\\\[" $line {} line
          regsub -all -nocase "<title" $line {} line
          regsub -all -nocase "\\\]</title" $line {} line
          puts "$line"
      }
  }
This can easily be done with XPath:

  #!/usr/bin/tclsh
  package require tdom

  # Read the saved page.
  set fp [open "input.txt" r]
  set html [read $fp]
  close $fp

  # Parse it as HTML and select every listing entry.
  set doc [dom parse -html $html]
  set root [$doc documentElement]
  set itemNodes [$doc selectNodes {//div[@id="list"]/tt/small}]
  foreach itemNode $itemNodes {
      puts "[$itemNode asText]"
  }

Note that you can split each entry into its fields with this method:

  foreach itemNode $itemNodes {
      # Trim the outer brackets and spaces, then pull out each bracketed field.
      set line [string trim [$itemNode asText] "\[\] "]
      set fields [regexp -inline -all -- {[^[\s][^][]*?\S(?=\s*(?:]|$))} $line]
      puts [lindex $fields 2]
  }
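If tDOM is not available, roughly the same cleanup can be sketched with regsub and string map alone. This is only a sketch under stated assumptions: the proc name cleanLine and the shortened sample line are hypothetical, only the four entities seen in the sample are decoded, and &nbsp; is deliberately mapped to a plain space so that the padding inside the brackets can be trimmed.

```tcl
# Hypothetical helper: turn one HTML listing line into "[a] - [b] - ..." text.
proc cleanLine {line} {
    # Drop every tag, e.g. <a href="...">, </a>, <b>, <span ...>.
    regsub -all {<[^>]*>} $line {} line
    # Decode the entities that occur in the sample (assumption: no others appear).
    set line [string map {&nbsp; " " &lt; < &gt; > &amp; &} $line]
    # Trim the padding just inside the brackets: "[   PRE   ]" becomes "[PRE]".
    regsub -all {\[\s+} $line {[} line
    regsub -all {\s+\]} $line {]} line
    return [string trim $line]
}

# Shortened, hypothetical sample in the same shape as the line in the question.
set sample {[22.06.2013 23:23:41 UTC] - [&nbsp;&nbsp;PRE&nbsp;&nbsp;] - [ <a href="?section=0DAY" title="Show only 0DAY">0DAY</a>&nbsp;] - <b>[2.30 MB]</b>}

set clean [cleanLine $sample]
puts $clean
# prints: [22.06.2013 23:23:41 UTC] - [PRE] - [0DAY] - [2.30 MB]

# Collect the text inside each pair of brackets as a separate field.
set fields {}
foreach {whole inner} [regexp -all -inline {\[([^]]*)\]} $clean] {
    lappend fields $inner
}
puts [lindex $fields 2]
# prints: 0DAY
```

Stripping tags with a regular expression works here because each entry sits on one line with simple markup. For anything less regular, the tDOM approach is the more robust choice.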

 




Posted by Unknown at 02:22










