Cited time in webofscience Cited time in scopus

Video Face Recognition with Audio-Visual Aggregation Network

Title
Video Face Recognition with Audio-Visual Aggregation Network
Author(s)
Li, QinboWan, QingLee, Sang-HeonChoe, Yoonsuck
Issued Date
2021-12-08
Citation
28th International Conference on Neural Information Processing, ICONIP 2021, pp.150 - 161
Type
Conference Paper
ISBN
9783030922726
ISSN
0302-9743
Abstract
With the continuing improvement in deep learning methods in recent years, face recognition performance is starting to surpass human performance. However, current state-of-the-art approaches are usually trained on high-quality still images and do not work well in unconstrained video face recognition. We propose to use audio information in the video to aid in the face recognition task with mixed quality inputs. We introduce an Audio-Visual Aggregation Network (AVAN) to aggregate multiple facial and voice information to improve face recognition performance. To effectively train and evaluate our approach, we constructed an Audio-Visual Face Recognition dataset. Empirical results show that our approach significantly improves the face recognition accuracy on unconstrained videos. © 2021, Springer Nature Switzerland AG.
URI
http://hdl.handle.net/20.500.11750/46883
DOI
10.1007/978-3-030-92273-3_13
Publisher
Springer Science and Business Media Deutschland GmbH
Related Researcher
Files in This Item:

There are no files associated with this item.

Appears in Collections:
Division of Automotive Technology 2. Conference Papers

qrcode

  • twitter
  • facebook
  • mendeley

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE