Friday 15 June 2012

Clojure: read a large text file and count occurrences


I am trying to read in a large text file and count occurrences of specific errors. For example, given the following sample text:

  some bla error123 foo test error123 line junk error 55 more accessories   

I want to end up with something like this (I don't really care about the data structure, though I'm thinking of a map):

  error123 - 2
  error 55 - 1

Here is what I've tried so far:

  (read-big-file find-error "sample.txt")

Returns:

  (nil nil "error123" nil nil "error123" nil nil "error 55" nil nil)

Next I tried to remove the nil values and group similar items:

  (group-by identity (remove nil? (read-big-file find-error "sample.txt")))

Which gives:

  {"error123" ["error123" "error123"], "error 55" ["error 55"]}

This is getting closer to the desired output, though it may not be efficient. How can I get the counts from here? Also, since I'm new to Clojure and functional programming in general, I'd appreciate any suggestions on how I could improve this. Thanks!
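For reference, the `group-by` map above can be collapsed into counts directly by counting each value vector (a sketch added for illustration, not part of the original question):

```clojure
;; grouped stands in for the map produced by group-by above
(def grouped {"error123" ["error123" "error123"], "error 55" ["error 55"]})

(into {} (map (fn [[k v]] [k (count v)]) grouped))
;; => {"error123" 2, "error 55" 1}
```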

I think you are looking for the frequencies function:

  user=> (doc frequencies)
  -------------------------
  clojure.core/frequencies
  ([coll])
    Returns a map from distinct items in coll to the number of times
    they appear.
  nil
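A quick REPL check of frequencies on a literal collection shows the shape of the result:

```clojure
;; frequencies maps each distinct item to how many times it appears
(frequencies ["a" "b" "a" "a"])
;; => {"a" 3, "b" 1}
```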

So, this should do what you want:

  (frequencies (remove nil? (read-big-file find-error "sample.txt")))
  ;; => {"error123" 2, "error 55" 1}

If your text file is really big, however, I would recommend doing this inline on the line-seq to make sure you don't run out of memory. This also lets you use filter instead of map and remove:

  (defn count-lines [pred filename]
    (with-open [rdr (clojure.java.io/reader filename)]
      (frequencies (filter pred (line-seq rdr)))))

  (defn is-error-line? [line]
    (re-find #"error" line))

  (count-lines is-error-line? "sample.txt")
  ;; => {"error123" 2, "error 55" 1}
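Note that count-lines counts matching *lines*, so when several errors can appear on one line (as in the sample text), it will undercount. A variant that extracts every error token per line with re-seq instead (a sketch of my own, not part of the original answer; the regex and function name are assumptions) could look like:

```clojure
(require '[clojure.java.io :as io])

;; Write the sample line to a file so the example is self-contained
;; (the filename "sample.txt" is taken from the question).
(spit "sample.txt"
      "some bla error123 foo test error123 line junk error 55 more stuff")

(defn count-errors [filename]
  (with-open [rdr (io/reader filename)]
    (->> (line-seq rdr)
         ;; pull out every "errorNNN" / "error NN" token on each line
         (mapcat #(re-seq #"error ?\d+" %))
         frequencies)))

(count-errors "sample.txt")
;; => {"error123" 2, "error 55" 1}
```

Because line-seq is lazy and the whole pipeline runs inside with-open, this still processes the file one line at a time.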
