Skip to Main content Skip to Navigation
Conference papers

On the evaluation of retrofitting for supervised short-text classification

Kaoutar Ghazi 1 Andon Tchechmedjiev 1 Sébastien Harispe 1 Nicolas Sutton-Charani 1 Tagny Gildas 2 
1 I3A - Informatique, Image, Intelligence Artificielle
LGI2P - Laboratoire de Génie Informatique et d'Ingénierie de Production
Abstract : Current NLP systems heavily rely on embedding techniques that are used to automatically encode relevant information about linguistic entities of interest (e.g., words, sentences) into latent spaces. These embeddings are currently the cornerstone of the best machine learning systems used in a large variety of problems such as text classification. Interestingly, state-of-the-art embeddings are commonly only computed using large corpora, and generally do not use additional knowledge expressed into established knowledge resources (e.g. WordNet). In this paper, we empirically study if retrofitting, a class of techniques used to update word vectors in a way that takes into account knowledge expressed in knowledge resources, is beneficial for short text classification. To this aim, we compared the performances of several state-of-the-art classification techniques with or without retrofitting on a selection of benchmarks. Our results show that the retrofitting approach is beneficial for some classifiers settings and only for datasets that share a similar domain to the semantic lexicon used for the retrofitting.
Complete list of metadata

Cited literature [29 references]  Display  Hide  Download
Contributor : Andon Tchechmedjiev Connect in order to contact the contributor
Submitted on : Tuesday, November 3, 2020 - 12:17:47 PM
Last modification on : Tuesday, August 2, 2022 - 3:43:44 AM
Long-term archiving on: : Thursday, February 4, 2021 - 6:28:58 PM


Files produced by the author(s)


  • HAL Id : hal-02986853, version 1


Kaoutar Ghazi, Andon Tchechmedjiev, Sébastien Harispe, Nicolas Sutton-Charani, Tagny Gildas. On the evaluation of retrofitting for supervised short-text classification. 1st International Workshop DeepOntoNLP: Deep Learning meets Ontologies and Natural Language Processing, Sep 2020, Virtual & Bozen-Bolzano, Italy. ⟨hal-02986853⟩



Record views


Files downloads