Paper
10 October 2023 Research and implementation of chinese lip recognition technology based on pinyin features
Author Affiliations +
Proceedings Volume 12799, Third International Conference on Advanced Algorithms and Signal Image Processing (AASIP 2023); 1279929 (2023) https://doi.org/10.1117/12.3006298
Event: 3rd International Conference on Advanced Algorithms and Signal Image Processing (AASIP 2023), 2023, Kuala Lumpur, Malaysia
Abstract
With the development of deep learning, lip recognition technology has made great progress in English, but there is a certain gap in Chinese in both data set richness and recognition accuracy. By analyzing the visual characteristics of Chinese pronunciation, this paper puts forward the pinyin sequence of picture frames in order to avoid the ambiguity of Chinese visual expression. In order to verify the validity of pinyin sequence of picture frames, a pinyin sequence recognition model combining 3D+2D convolutional neural network with Bi-ConvLSTM and a Chinese character prediction model PPTC are proposed. Using the pinyin sequence of picture frames as the medium, the picture frames are converted into Chinese sentences. Through experiments, the superiority of the model is proved, which provides a benchmark for future work.
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Yi Jiang, Wei Zeng, Nan Wang, Huan Xia, Yuedong Luo, and Jun Zheng "Research and implementation of chinese lip recognition technology based on pinyin features", Proc. SPIE 12799, Third International Conference on Advanced Algorithms and Signal Image Processing (AASIP 2023), 1279929 (10 October 2023); https://doi.org/10.1117/12.3006298
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Convolution

Convolutional neural networks

Back to Top