On the evaluation of retrofitting for supervised short-text classification - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

On the evaluation of retrofitting for supervised short-text classification

(1) , (1) , (1) , (1) , (2)
1
2

Résumé

Current NLP systems heavily rely on embedding techniques that are used to automatically encode relevant information about linguistic entities of interest (e.g., words, sentences) into latent spaces. These embeddings are currently the cornerstone of the best machine learning systems used in a large variety of problems such as text classification. Interestingly, state-of-the-art embeddings are commonly only computed using large corpora, and generally do not use additional knowledge expressed into established knowledge resources (e.g. WordNet). In this paper, we empirically study if retrofitting, a class of techniques used to update word vectors in a way that takes into account knowledge expressed in knowledge resources, is beneficial for short text classification. To this aim, we compared the performances of several state-of-the-art classification techniques with or without retrofitting on a selection of benchmarks. Our results show that the retrofitting approach is beneficial for some classifiers settings and only for datasets that share a similar domain to the semantic lexicon used for the retrofitting.
Fichier principal
Vignette du fichier
donlp2.pdf (301.89 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02986853 , version 1 (03-11-2020)

Identifiants

  • HAL Id : hal-02986853 , version 1

Citer

Kaoutar Ghazi, Andon Tchechmedjiev, Sébastien Harispe, Nicolas Sutton-Charani, Tagny Gildas. On the evaluation of retrofitting for supervised short-text classification. 1st International Workshop DeepOntoNLP: Deep Learning meets Ontologies and Natural Language Processing, Sep 2020, Virtual & Bozen-Bolzano, Italy. ⟨hal-02986853⟩
65 Consultations
146 Téléchargements

Partager

Gmail Facebook Twitter LinkedIn More