2016 SemanticIndexingwithDeepLearnin

From GM-RKB
Jump to navigation Jump to search

Subject Headings: Neural Semantic Indexing Algorithm.

Notes

Cited By

Quotes

Author Keywords

Abstract

Background

Deep learning techniques, particularly convolutional neural networks (CNNs), are poised for widespread application in the research fields of information retrieval and natural language processing. However, there are very few publications addressing semantic indexing with deep learning. In particular, there are few studies of semantic indexing in biomedical literature because of several specific challenges including a vast amount of semantic labels from automatically annotating MeSH terms for MEDLINE citations and a massive collection with only the title and abstract information.

Results

In this paper, we introduce a novel CNN-based semantic indexing method for biomedical abstract document collections. First, we adaptively group word2vec categories into (coarse) subsets by clustering. Next, we construct a high-dimensional space representation with Wikipedia category extension, which contains more semantic information than bag-of-words. Thereafter, we design a hierarchical CNN indexing architecture for learning documents from a coarse- to fine-grained level with several multi-label training techniques. We believe that the low-dimensional representation of the output layer in CNNs should be more compact and effective. Finally, we perform comparative experiments for semantic indexing of biomedical abstract documents.

Conclusion

Experimental results on the MEDLINE dataset show that our model achieves superior performance than conventional models.

References

;

 AuthorvolumeDate ValuetitletypejournaltitleUrldoinoteyear
2016 SemanticIndexingwithDeepLearninYan Yan
Xu-Cheng Yin
Bo-Wen Zhang
Chun Yang
Hong-Wei Hao
Semantic Indexing with Deep Learning: A Case Study10.1186/s41044-016-0007-z2016