NgramJ logo.Other Information > Alternative (Java) Implementations2007-03-19 09:47:52 v1.0
NGramJ, smart scanning for document properties.

Alternative (Java) Implementations


What is NGramJ?
Getting Started
How Does it Work?
Contact
How to Contribute?
Developer Information
Other Information
  Alternative (Java) Implementations
  Other Projects of Us
  What is spieleck.de?
  References
 

There seem to be two contenders in the Open Source Java sphere.

  • Nutch contains its own language guessing mechanism based on characters.
  • TCatNG is another implementation based on bytes, together with extensions.
Both projects however seem to have pulled some code from the ancient 2001 NGramJ versions. With Nutch i'm not entirely sure, but TCatNG even contains NGramJ's misspellings and strange ad hoc random number generators. On the other hand, we have drawn some of our character based language profiles from the Nutch project.

After all, if you need something additional, you might consider joining forces and work with NGramJ.

Outside the Java universe there are plenty of other implementation in whatever languages (C, Perl, Python and more).

NewsfeedRSS feed
FilefeedRSS feed
Sourceforge Logo