The Library
A neural generative model for joint learning topics and topic-specific word embeddings
Tools
Zhu, Lixing, He, Yulan and Zhou, Deyu (2020) A neural generative model for joint learning topics and topic-specific word embeddings. Transactions of the Association for Computational Linguistics, 8 . pp. 471-485. doi:10.1162/tacl_a_00326 ISSN 2307-387X.
|
PDF
WRAP-neural-generative-model-joint-learning-topics-specific-embeddings-Zhu-2020.pdf - Published Version - Requires a PDF viewer. Available under License Creative Commons Attribution 4.0. Download (584Kb) | Preview |
|
PDF
WRAP-neural-generative-model-joint-learning-topics-topic-specific-word-embeddings-Zhu-2020.pdf - Accepted Version Embargoed item. Restricted access to Repository staff only - Requires a PDF viewer. Download (898Kb) |
Official URL: https://doi.org/10.1162/tacl_a_00326
Abstract
We propose a novel generative model to explore both local and global context for joint learning topics and topic-specific word embeddings. In particular, we assume that global latent topics are shared across documents; a word is generated by a hidden semantic vector encoding its contextual semantic meaning; and its context words are generated conditional on both the hidden semantic vector and global latent topics. Topics are trained jointly with the word embeddings. The trained model maps words to topic-dependent embeddings, which naturally addresses the issue of word polysemy. Experimental results show that the proposed model outperforms the word-level embedding methods in both word similarity evaluation and word sense disambiguation. Furthermore, the model also extracts more coherent topics compared to existing neural topic models or other models for joint learning of topics and word embeddings. Finally, the model can be easily integrated with existing deep contextualized word embedding learning methods to further improve the performance of downstream tasks such as sentiment classification.
Item Type: | Journal Article | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Subjects: | Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software T Technology > TK Electrical engineering. Electronics Nuclear engineering |
|||||||||||||||
Divisions: | Faculty of Science, Engineering and Medicine > Science > Computer Science | |||||||||||||||
Library of Congress Subject Headings (LCSH): | Natural language processing (Computer science) , Neural networks (Computer science) , Embedded computer systems | |||||||||||||||
Journal or Publication Title: | Transactions of the Association for Computational Linguistics | |||||||||||||||
Publisher: | Association for Computational Linguistics | |||||||||||||||
ISSN: | 2307-387X | |||||||||||||||
Official Date: | 2020 | |||||||||||||||
Dates: |
|
|||||||||||||||
Volume: | 8 | |||||||||||||||
Page Range: | pp. 471-485 | |||||||||||||||
DOI: | 10.1162/tacl_a_00326 | |||||||||||||||
Status: | Peer Reviewed | |||||||||||||||
Publication Status: | Published | |||||||||||||||
Access rights to Published version: | Open Access (Creative Commons) | |||||||||||||||
Copyright Holders: | © 2020 Association for Computational Linguistics | |||||||||||||||
Date of first compliant deposit: | 31 July 2020 | |||||||||||||||
Date of first compliant Open Access: | 23 September 2020 | |||||||||||||||
RIOXX Funder/Project Grant: |
|
|||||||||||||||
Related URLs: |
Request changes or add full text files to a record
Repository staff actions (login required)
View Item |
Downloads
Downloads per month over past year