Energy-Efficient Video Text-Spotting
Abstract
There is a lot of research done to increase the accuracy and reduce the latency of deep learning algorithms. But, there is very little research done to reduce the energy consumption of the deep learning models. For applications that require deploying the deep learning models on the edge devices that have low compute resources, it is important that these algorithms are energy-efficient. Efficient Video Text Spotting is the field that deals with developing deep learning models to be deployed on edge devices to detect, localize, and recognize text appearing in the frames of the videos. Previous methods followed a four-step pipeline: text detection in every frame, text recognition for the localized text region in every frame, tracking text streams, and post-processing. The two main problems with the above approach are high computational cost and low performance. This thesis focuses on the text spotting model design for an Efficient Video Text Spotting System. In this thesis, model design experiments are carried out keeping efficiency in mind. Two different real-time text spotting models were experimented i.e. ABCNet and FOTS. For ABCNet different backbones, normalization schemes, and feature pyramid variations are experimented with to attain the best accuracy and energy tradeoff. For the FOTS model, the two-step text spotting and two-stage text spotting model design are experimented. The influence of various factors such as bounding box to character count ratio, character count, blur level, bounding box count, bounding box area are experimented. From the experiments, it was observed that the two-step text spotting model design method performed better for all resolutions. Further, it was observed that the recognition performance improves with a higher bounding box to character count ratio and lower character count. The energy measurement of the two-step FOTS text spotting model on Raspberry Pi is also presented.
Citation
Sridhar, Rahul (2021). Energy-Efficient Video Text-Spotting. Master's thesis, Texas A&M University. Available electronically from https : / /hdl .handle .net /1969 .1 /195834.