▲ 3 ▼ prose - text processing, part-of-speech tagging, and named-entity extraction
A Golang library for text processing, including tokenization, part-of-speech tagging, and named-entity extraction.
Register to comment or vote on this story
Looks like this is English only, but it looks like a good package for text processing, it has summaries and lots of the other transformations you might expect, though I was hoping for snake case -> titlecase. The tagging bit I don't understand what you'd use it for though - I thought it'd be for applying tags like 'spam' to text corpus, but it looks like tokenising with the addition of parts of speech. The docs could do with some work to explain how to use the various packages, and why you'd use them.