Performs a morphological analysis over unrestricted text and easily integrates a Natural Language Processing System as it regroups words and their constituents according to rewriting rules. Works over real texts, such as newspapers and books. It is a simplification of former PAsMo. It does NOT change the tags according to correspondence rules; NOR splits phrases according to a list of separators.
RuDriCo rewrites the text according to the morphological features of is words and a set of given rewriting rules. The possibility of referring initial (and final) position of the sentence to use in the rules is possible.
As RuDriCo regroups the words it is mainly used to ease the integration of a morphological analyzer with the following module in the NLP chain.
RuDriCo is written in C++ and the algorithm was enhanced, reducing processing time. Also XML input and output is available in order to ease communication with other modules and allow data verification.
A distributed version of RuDriCo will be available allowing the use of the system in a client/server platform through WSDL.