RuDriCo
From HLT@INESC-ID
Performs a morphological analysis over unrestricted text and easily integrates a Natural Language Processing System as it regroups words and their constituents according to rewriting rules. Works over real texts, such as newspapers and books. It is a simplification of former PAsMo. It does NOT change the tags according to correspondence rules; NOR splits phrases according to a list of separators.
Goals
RuDriCo rewrites the text according to the morphological features of is words and a set of given rewriting rules. The possibility of referring initial (and final) position of the sentence to use in the rules is possible.
Features
- Team / Authors
- Platforms
- Windows / Linux / Mac OS X
- Developing status
- Stable
- Last stable version: 2.1
Integration in other systems
As RuDriCo regroups the words it is mainly used to ease the integration of a morphological analyzer with the following module in the NLP chain.
Implementation issues
RuDriCo is written in C++ and the algorithm was enhanced, reducing processing time. Also XML input and output is available in order to ease communication with other modules and allow data verification.
Client/Server platforms
A distributed version of RuDriCo will be available allowing the use of the system in a client/server platform through WSDL.