In Taiwan, approximately 10.5% of the disabled people are people with hearing impairments, with 44% of those facing difficulties in daily communication due to severity of their conditions.
Communication is an essential part of human interactions; however, for individuals with hearing impairments, communication barriers may be elevated due to the limited popularity of sign language and the shortage of sign language interpreters.
Our objective was to create a sign language translation system to enhance communication efficiency between the hearing-impaired and the general public.
In addition to soliciting user feedback, we established success metrics as follows:
Through interviews with hearing-impaired individuals and sign language interpreters, we identified challenges with current communication methods:
After comprehensive evaluation of machine learning methodologies, we opted for the Recurrent Neural Network (RNN), to be more specific, the Gated Recurrent Unit (GRU) model — a suitable choice for processing sequential data and real-time applications.
We successfully trained the GRU-based model and seamlessly integrated it into Microsoft's HoloLens 2 mixed reality headset. This innovation empowered users to view sign language interpretation subtitles and facial expressions simultaneously, even while on the move, enhancing the interactivity, mobility and convenience of the system.
Our project yielded a sign language translator capable of converting sign language video clips and audio into subtitles. Both the training and testing datasets exceeded the predefined accuracy benchmarks. While real-world testing accuracy could be influenced by network speed, it still achieved an accuracy of 80%.
Furthermore, the development of this system has received recognition from both industry and academia, notably clinching first place in the International ICT Innovative Services Awards 2022 and the departmental IS project competition. These achievements underscored the innovation and scalability of our system.