
The Library
Multichannel attention network for analyzing visual behavior in public speaking
Tools
Sharma, Rahul, Guha, Tanaya and Sharma, Gaurav (2018) Multichannel attention network for analyzing visual behavior in public speaking. In: IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA, 12-15 Mar 2018 pp. 476-484. ISBN 9781538648865. doi:10.1109/WACV.2018.00058
|
PDF
WRAP-multichannel-attention-network-analyzing-visual-behavior-public-speaking-Guha-2018.pdf - Accepted Version - Requires a PDF viewer. Download (4Mb) | Preview |
Official URL: http://dx.doi.org/10.1109/WACV.2018.00058
Abstract
We investigate the importance of human centered visual cues for predicting the popularity of a public lecture. We construct a large database of more than 1800 TED talk videos and leverage the corresponding (online) viewers' ratings from YouTube for a measure of popularity of the TED talks. Visual cues related to facial and physical appearance, facial expressions, and pose variations are learned using convolutional neural networks (CNN) connected to an attention-based long short-term memory (LSTM) network to predict the video popularity. The proposed overall network is end-to-end-trainable, and achieves state-of-the-art prediction accuracy indicating that the visual cues alone contain highly predictive information about the popularity of a talk. We also demonstrate qualitatively that the network learns a human-like attention mechanism, which is particularly useful for interpretability, i.e. how attention varies with time, and across different visual cues as a function of their relative importance.
Item Type: | Conference Item (Paper) | ||||||
---|---|---|---|---|---|---|---|
Subjects: | Q Science > QA Mathematics > QA76 Electronic computers. Computer science. Computer software | ||||||
Divisions: | Faculty of Science, Engineering and Medicine > Science > Computer Science | ||||||
Library of Congress Subject Headings (LCSH): | Public speaking, Visual perception -- Data processing, Neural networks (Computer science), Human-computer interaction | ||||||
Publisher: | IEEE | ||||||
ISBN: | 9781538648865 | ||||||
Book Title: | 2018 IEEE Winter Conference on Applications of Computer Vision (WACV) | ||||||
Official Date: | 7 May 2018 | ||||||
Dates: |
|
||||||
Page Range: | pp. 476-484 | ||||||
DOI: | 10.1109/WACV.2018.00058 | ||||||
Status: | Peer Reviewed | ||||||
Publication Status: | Published | ||||||
Reuse Statement (publisher, data, author rights): | © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | ||||||
Access rights to Published version: | Restricted or Subscription Access | ||||||
Date of first compliant deposit: | 11 October 2018 | ||||||
Date of first compliant Open Access: | 12 October 2018 | ||||||
Conference Paper Type: | Paper | ||||||
Title of Event: | IEEE Winter Conference on Applications of Computer Vision (WACV) | ||||||
Type of Event: | Conference | ||||||
Location of Event: | Lake Tahoe, NV, USA | ||||||
Date(s) of Event: | 12-15 Mar 2018 |
Request changes or add full text files to a record
Repository staff actions (login required)
![]() |
View Item |
Downloads
Downloads per month over past year