Fairy tale corpus semantically organized and tagged.
This fairy tale corpus is divided in semantically related clusters. Clusters overlap, i.e., each tale can be allocated to more than one cluster.
Fairy tales are written for children and its plot and language are simpler than tales written for adults. Fairy tales are also easily read and understood. Fairy tale sentences are shorter and emotions are well defined. A fairy tale corpus can be useful for emotion extraction, semantic role extraction, meaning extraction, recommendation, text classification, among others.
The corpus is free for non-commercial use. Please contact Paula Cristina Vaz for other uses.
If you use the corpus, please cite the following article:
Download the corpus: fairy-tales-corpus-map.tar.gz