BabelNet

From Infogalactic: the planetary knowledge core
Jump to: navigation, search
BabelNet
BabelNet logo.
Stable release BabelNet 3.6 / February 2016
Operating system Virtuoso Universal Server
Type
License Attribution-NonCommercial-ShareAlike 3.0 Unported
Website babelnet.org

BabelNet is a multilingual lexicalized semantic network and ontology developed at the Linguistic Computing Laboratory in the Department of Computer Science of the Sapienza University of Rome.[1][2] BabelNet was automatically created by linking the largest multilingual Web encyclopedia, Wikipedia, to the most popular computational lexicon of the English language, WordNet. The integration is performed by means of an automatic mapping and by filling in lexical gaps in resource-poor languages with the aid of statistical machine translation. The result is an "encyclopedic dictionary" that provides concepts and named entities lexicalized in many languages and connected with large amounts of semantic relations. Additional lexicalizations and definitions are added by linking to free-license wordnets, OmegaWiki, the English Wiktionary and Wikidata. Similarly to WordNet, BabelNet groups words in different languages into sets of synonyms, called Babel synsets. For each Babel synset, BabelNet provides short definitions (called glosses) in many languages harvested from both WordNet and Wikipedia.

File:The BabelNet structure.png
BabelNet is a multilingual semantic network obtained as an integration of WordNet and Wikipedia.

Statistics of BabelNet

As of February 2016, BabelNet (version 3.6) covers 271 languages, including all European languages, most Asian languages, and even Latin. BabelNet 3.6 contains almost 14 million synsets and about 745 million word senses (regardless of their language). Each Babel synset contains 2 synonyms per language, i.e., word senses, on average. The semantic network includes all the lexico-semantic relations from WordNet (hypernymy and hyponymy, meronymy and holonymy, antonymy and synonymy, etc., totaling around 364,000 relation edges) as well as an underspecified relatedness relation from Wikipedia (totaling around 380 million relation edges).[1] Version 3.6 also associates about 11 million images with Babel synsets and provides a Lemon RDF encoding of the resource.,[3] available via a SPARQL endpoint. 1.5 million synsets are assigned domain labels.

Applications

BabelNet has been shown to enable multilingual Natural Language Processing applications. The lexicalized knowledge available in BabelNet has been shown to obtain state-of-the-art results in:

See also

References

<templatestyles src="Reflist/styles.css" />

Cite error: Invalid <references> tag; parameter "group" is allowed only.

Use <references />, or <references group="..." />

External links