Giuseppe: Stanford NER: How do I create a new training set that I can use and test out? -

Thursday, 15 September 2011

Stanford NER: How do I create a new training set that I can use and test out? -

In my understanding, to create a training file, you put your words in a text file. Then after each word, add a place or tab to the tag (such as PAR, LOC, etc ...)

I have also copied one sample from a sample file to a word pad. How do I get them into a gz file, which I can input and use in a classifier?

Please guide me though. I am a newbie and quite inefficient with technology.

Your training file ( training - data.tsv ) should look like this :

  IO has been transferred to Vancouver Venue BC LOCATION Tomorrow o    Where  o  means "outside ", As is not a named unit  
 where the space between columns is a  tab .  
 You do not put them in the  ser.gz  file. The CRGs file classifier model is  created  by the training process.  
 To train a classifier driver:  
  anticipated java -cp Ner.jar edu.stanford.nlp.ie.crf.CRFClassifier- my-classifier.properties    Where  my-classifier.properties  will look like this:  
  trainFile = training-data.tsv serializeTo = my- Classification-model.ser.gz map = word = 0, answer = 1 ...

Posted by Unknown at 03:22 Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest

No comments: Post a Comment

Newer Post Older Post Home Subscribe to: Post Comments (Atom)

About Me Unknown View my complete profile Blog Archive ► 2015 (1886) ► September (203) ► August (208) ► July (224) ► June (210) ► May (230) ► April (195) ► March (209) ► February (201) ► January (206) ► 2014 (2117) ► September (239) ► August (251) ► July (226) ► June (208) ► May (229) ► April (199) ► March (255) ► February (275) ► January (235) ► 2013 (2011) ► September (199) ► August (228) ► July (210) ► June (222) ► May (217) ► April (229) ► March (243) ► February (221) ► January (242) ► 2012 (1993) ► September (227) ► August (235) ► July (225) ► June (206) ► May (221) ► April (216) ► March (206) ► February (227) ► January (230) ▼ 2011 (1964) ▼ September (220) highcharts - Dual Axis in Highstock? - ruby - Can I include/extend a module but mark all ... javascript - What is the right way to open titaniu... javascript - Auto-adjust canvas by itself in all p... javascript - How do I rotate two images [on the sa... c# - 'System.UnauthorizedAccessException' when try... android - AutoCompleteTextView dropdown style igno... algorithm - Sum of a subset of numbers - sharepoint - javascript edit html on the fly - mysql - Complex CakePHP query refactoring - javascript - jQuery matching a single class agains... html - how to do fixed nav, footer and content of ... objective c - How to pause a game in cocos2d? - ios - Table view doesn't store data - regex - Parsing Character Sets without Converting ... Matlab not processing second for-loop - Java Singleton.getInstance() returns null? - javascript - TextArea cell renderer for dojo/dgrid - Windows batch file read text file and convert all ... sql - Finding previous records after specified one... python - redis-py AttributeError: 'module' object ... winforms - Visual c++, resize a control on mousedo... ios6 - How to store high scores in a list in Cocos... java - Implementing BubbleSort at the end of Quick... django queryset with count filter - objective c - How to redirect certain NSLog output... Rails 3 Routes - prepend all url paths with set st... manually enter a subtotal value in a vb6 flexgrid - lotusscript - Lotus Notes Domino Getting Date Diff... c++ - How to check boost thread is running and Kil... javascript - .click() is called at times when ther... jquery - Disable hover state of a html tag - Update with select in mysql - ios - How to set Action for UIButton in UITableVie... actionscript 3 - AS#3 organize variables loaded fr... C++ link issue after switching to /MT - animation - Is this possible to change flash docum... excel vba - VBA favourite code management - codeigniter - gocart session expiration, i want to... Print all Unique Values in a Python Dictionary - Change language for YouTube embed - SQL Server 2008 only delete if field is unique - css - footer getting mixed up in content area. - php - How to retrieve data from drop down list sel... vb.net - DirectoryServices.AccountManagement User ... winforms - DevExpress LookupEdit, Select by item - How to retrieve column name from the table and par... Stanford NER: How do I create a new training set t... compiler construction - How to build PIC controlle... graph search api locale filter not working in any ... How to increase the memory heap size on IntelliJ I... r - Plots with good resolution for printing and sc... forms - Microsoft Access 2010 - Foreign Key as Dro... How use include method for ruby range values - asp.net - How to apply a localization to a javascr... Zend naming of views - asp.net - Unrecognized configuration section conne... javascript - Extracting both the full match, and t... responsive design - CSS: 100% height relative to t... Hibernate Search ErrorHandler: Continue Indexing - c# - TestCase with list or params - java - how to increase the performance of cmd comm... javascript - Google Chrome Omnibox API -- automati... xml - What does '@comment()' do in XPath? - perl - Count number of times an item is placed wit... java - Android - How to end a Fragment - java - Web crawling few websites and searching the... regex - Bulk replace with regular expressions in P... inheritance - Implementing Inherited Interface wit... php - Ajax based chat - infinite server side loop - coldfusion - Unable to see DataSource in the Datab... Qunit - Assert the arguments of a function are val... VB.NET and SQL Server Express application deployme... java - Temporal operations with a layout manager? ... r - How can I tell if a time point exists between ... ios - Drawing stroked UIBezierPath in drawRect not... powershell v2.0 - Microsoft.SystemCenter.VirtualMa... html - Issue with iOS Web App Splash Screen - java - Is it possible to determine the CRC of a zi... Simple cross import in python - java - Libgdx GestureListener to Handle Hold Touch... java - Swap cards in deck - c# - how to get the first touch point and the seco... Delete Specific Rows in Matlab - dictionary - Python: Accessing elements from neste... command line - browser form must launch PHP script... linux - How to check the status code of a function... java - Can a desktop application be created using ... objective c - Why aren't objects released when ref... android - Sliding menu mapview issue - google app engine - Go : How to Set same Cookie on... iphone - how to access Salesforce Attachment Body ... java - Delete Rows containing the String Excel Apa... amazon web services - Using memcache client with r... zookeeper initial discovery - c# - Search Issue in ASP.NET MVC 4 - java - Connect database with MyBatis without xml f... When working with images in MATLAB, what do myImag... java - how to trigger ajax update with radio butto... How to write a link to an Alfresco Share folder? (... ► August (222) ► July (219) ► June (224) ► May (219) ► April (206) ► March (216) ► February (221) ► January (217) ► 2010 (1952) ► September (230) ► August (202) ► July (221) ► June (207) ► May (213) ► April (199) ► March (234) ► February (244) ► January (202)

Powered by Blogger.