Comparative analysis between a respeaking captioning system and a captioning system without human intervention

authors

  • RUIZ ARROYO, ADRIAN
  • GARCIA CRESPO, ANGEL
  • FUENMAYOR GONZALEZ, FRANCISCO JAVIER
  • RODRIGUEZ GONCALVES, ROXANA DEL VALLE

publication date

  • October 2022

start page

  • 1

end page

  • 12

International Standard Serial Number (ISSN)

  • 1615-5289

Electronic International Standard Serial Number (EISSN)

  • 1615-5297

abstract

  • People living with deafness or hearing impairment have limited access to information broadcast live on television. Live closed captioning is an active area of study; to our knowledge, no system developed thus far produces high-quality captions without using scripts or human interaction. This paper presents a comparative analysis of the quality of captions generated for four Spanish news programs by two captioning systems: a semiautomatic system based on respeaking (the system currently used by a Spanish TV station) and an automatic system without human interaction, proposed and developed by the authors. The analysis measures and compares the accuracy, latency and speed of the captions generated by both systems. The captions generated by the automatic system were of higher quality in terms of accuracy, measured as Word Error Rate (WER between 3.76 and 7.29%), and latency (approximately 4 s), at a speed acceptable for accessing the information. We contribute a first study focused on the development and analysis of an automatic captioning system without human intervention, with promising quality results. These results reinforce the importance of continuing to study such automatic systems.
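
The accuracy metric named in the abstract, Word Error Rate, is conventionally defined as the number of word substitutions, deletions and insertions needed to turn the caption into the reference transcript, divided by the number of words in the reference. The following is a minimal sketch of that standard definition, not the authors' evaluation code: the function name `word_error_rate`, the lowercasing and whitespace tokenization, and the sample sentences are assumptions made for illustration, since this record does not describe the text normalization or scoring tools used in the study.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Assumption: words are compared case-insensitively after splitting on
    whitespace; punctuation is not stripped.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the word-level edit distance between the reference
    # words processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from caption
                curr[j - 1] + 1,     # insertion: extra word in caption
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example sentences, not taken from the study's data.
reference = "el gobierno aprueba la nueva ley"
caption = "el gobierno aprueba nueva ley hoy"
print(f"WER: {word_error_rate(reference, caption):.2%}")
```

In this example the caption drops one reference word and adds one extra word, so two edits against a six-word reference give a WER of about 33%. The values reported in the abstract (3.76 to 7.29%) correspond to roughly four to seven word errors per hundred reference words.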

subjects

  • Computer Science
  • Mechanical Engineering

keywords

  • automated closed captioning; ASR; automatic speech recognition; live broadcasting