Detail View
3D facial Landmarks Detection and Head Pose Estimation using Multi-task Learning and Vision Transformer
WEB OF SCIENCE
SCOPUS
- Title
- 3D facial Landmarks Detection and Head Pose Estimation using Multi-task Learning and Vision Transformer
- Issued Date
- 2023-03
- Citation
- Kim, Hyunduk. (2023-03). 3D facial Landmarks Detection and Head Pose Estimation using Multi-task Learning and Vision Transformer. Journal of Industrial Information Technology and Application, 7(1), 666–670. doi: 10.22664/ISITA.2021.7.1.666
- Type
- Article
- Author Keywords
- 3d facial landmarks detection ; head pose estimation, multi-task learning ; vision transformer
- ISSN
- 2586-0852
- Abstract
-
In this paper, we present 3D facial landmarks detection and head pose estimation algorithms. To solve these two tasks simultaneously, we apply the multi-task learning technique. Our architecture consists of three components: a multi-head to deal with different tasks, a backbone to represent common features, and linear layers to output results. For the real-time process, we apply MobileViT as a backbone network. Moreover, we employ the PCGrad algorithm for stable convergence during training. To evaluate the performance of the proposed algorithm, we trained and tested on AFLW200-3D datasets, respectively. In the experiments, we demonstrate the experimental results for comparing the accuracy between MobileNetV3 and MobileViT.
더보기
- Publisher
- Journal of Industrial Information Technology and Application
File Downloads
공유
Total Views & Downloads
???jsp.display-item.statistics.view???: , ???jsp.display-item.statistics.download???:
