2015 AFrameworkfortheConstructionofM

(Camacho-Collados et al., 2015) ⇒ Jose Camacho-Collados, Mohammad Taher Pilehvar, and Roberto Navigli. (2015). “A Framework for the Construction of Monolingual and Cross-lingual Word Similarity Datasets.” In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing (ACL 2015) Volume 2: Short Papers.

Subject Headings: Rubenstein-Goodenough (RG-65) Dataset; Semantic Word Similarity Dataset; Semantic Word Similarity Benchmark Task.

Notes

Cited By

Google Scholar: ~ 59 Citations.

Quotes

Abstract

Despite being one of the most popular tasks in lexical semantics, word similarity has often been limited to the English language. Other languages, even those that are widely spoken such as Spanish, do not have a reliable word similarity evaluation framework. We put forward robust methodologies for the extension of existing English datasets to other languages, both at monolingual and cross-lingual levels. We propose an automatic standardization for the construction of cross-lingual similarity datasets, and provide an evaluation, demonstrating its reliability and robustness. Based on our procedure and taking the RG-65 word similarity dataset as a reference, we release two high-quality Spanish and Farsi (Persian) monolingual datasets, and fifteen cross-lingual datasets for six languages: English, Spanish, French, German, Portuguese, and Farsi.

References

BibTeX

@inproceedings{2015_AFrameworkfortheConstructionofM,
  author    = {Jose Camacho-Collados and
               Mohammad Taher Pilehvar and
               Roberto Navigli},
  title     = {A Framework for the Construction of Monolingual and Cross-lingual
               Word Similarity Datasets},
  booktitle = {Proceedings of the 53rd Annual Meeting of the Association for Computational
               Linguistics and the 7th International Joint Conference on Natural
               Language Processing of the Asian Federation of Natural Language Processing (ACL 2015) Volume 2: Short Papers},
  pages     = {1--7},
  publisher = {The Association for Computer Linguistics},
  year      = {2015},
  url       = {https://doi.org/10.3115/v1/p15-2001},
  doi       = {10.3115/v1/p15-2001},
}

	Author	volume	Date Value	title	type	journal	titleUrl	doi	note	year
2015 AFrameworkfortheConstructionofM	Mohammad Taher Pilehvar Jose Camacho-Collados Roberto Navigli			A Framework for the Construction of Monolingual and Cross-lingual Word Similarity Datasets						2015