text - Easiest way or lightlest library to get bigrams and trigrams in Java? -


i'd rather not have fire lingpipe if possible leaves me wondering if there quick, easy ways in java extract bigrams , trigrams string of text?

thanks

always easiest way use existing library. can take on simmetrics library. can use lucene ngramtokenizer. can implement algorithm yourself. first, have find words (using stringtokenizer) in text , generate n-grams need.


Comments

Popular posts from this blog

c++ - Convert big endian to little endian when reading from a binary file -

C#: Application without a window or taskbar item (background app) that can still use Console.WriteLine() -

unicode - Are email addresses allowed to contain non-alphanumeric characters? -