Cited time in webofscience Cited time in scopus

3D facial Landmarks Detection and Head Pose Estimation using Multi-task Learning and Vision Transformer

Title
3D facial Landmarks Detection and Head Pose Estimation using Multi-task Learning and Vision Transformer
Author(s)
Kim, HyundukLee, Sang-HeonSohn, Myoung-Kyu
Issued Date
2023-03
Citation
Journal of Industrial Information Technology and Application, v.7, no.1, pp.666 - 670
Type
Article
Author Keywords
3d facial landmarks detectionhead pose estimation, multi-task learningvision transformer
ISSN
2586-0852
Abstract
In this paper, we present 3D facial landmarks detection and head pose estimation algorithms. To solve these two tasks simultaneously, we apply the multi-task learning technique. Our architecture consists of three components: a multi-head to deal with different tasks, a backbone to represent common features, and linear layers to output results. For the real-time process, we apply MobileViT as a backbone network. Moreover, we employ the PCGrad algorithm for stable convergence during training. To evaluate the performance of the proposed algorithm, we trained and tested on AFLW200-3D datasets, respectively. In the experiments, we demonstrate the experimental results for comparing the accuracy between MobileNetV3 and MobileViT.
URI
http://hdl.handle.net/20.500.11750/46118
DOI
10.22664/ISITA.2021.7.1.666
Publisher
Journal of Industrial Information Technology and Application
Related Researcher
Files in This Item:
3D facial Landmarks Detection and Head Pose Estimation using Multi_task Learning and Vision Transfor.pdf

3D facial Landmarks Detection and Head Pose Estimation using Multi_task Learning and Vision Transfor.pdf

기타 데이터 / 208.34 kB / Adobe PDF download
Appears in Collections:
Division of Automotive Technology 1. Journal Articles

qrcode

  • twitter
  • facebook
  • mendeley

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE