
dc.contributor.advisor: Tian, Chao
dc.creator: Garg, Vineet
dc.date.accessioned: 2020-04-23T19:56:51Z
dc.date.created: 2019-05
dc.date.issued: 2019-04-16
dc.date.submitted: May 2019
dc.identifier.uri: https://hdl.handle.net/1969.1/187959
dc.description.abstract: Video object segmentation is gaining research and commercial importance, with applications ranging from checkout-free Amazon Go stores to autonomous vehicles operating on roads. Efficient operation in such use cases requires segmentation inference in real time. Although there has been significant research in image segmentation, both semantic and instance, there is still much scope for improvement in video segmentation. Video segmentation is a direct extension of image segmentation, except that there is a temporal relation between neighboring frames of a video. Exploiting this temporal relation efficiently is one of the most important challenges in video segmentation. The relation involves a lot of redundancy, and many of the prevalent state-of-the-art techniques do not exploit it. Optical flow is one approach to exploiting temporal redundancy: intermediate feature maps of previous frames are interpolated using the flow information, and the rest of the segmentation operation is then performed. However, optical flow provides motion resolution at the pixel level, and there is not enough motion between consecutive frames to warrant motion estimation at the pixel level. Instead, we can divide a frame into multiple blocks and estimate the movement of their centroids across consecutive video frames. Based on this idea, we present a motion vector approach to video semantic segmentation. We also propose an adaptive technique to select keyframes during inference. We show that our proposed algorithm can reduce the computational complexity during inference by as much as 50% with only a 2-3% drop in the accuracy metric. Our algorithm can operate at as high as 136 frames per second, indicating that it can easily handle real-time inference. [en]
dc.format.mimetype: application/pdf
dc.language.iso: en
dc.subject: Deep Learning [en]
dc.subject: Video Segmentation [en]
dc.subject: Motion Vector [en]
dc.title: A Deep Motion Vector Approach to Video Object Segmentation [en]
dc.type: Thesis [en]
thesis.degree.department: Electrical and Computer Engineering [en]
thesis.degree.discipline: Electrical Engineering [en]
thesis.degree.grantor: Texas A&M University [en]
thesis.degree.name: Master of Science [en]
thesis.degree.level: Masters [en]
dc.contributor.committeeMember: Jiang, Anxiao
dc.contributor.committeeMember: Braga-Neto, Ulisses
dc.contributor.committeeMember: Xiong, Zixiang
dc.type.material: text [en]
dc.date.updated: 2020-04-23T19:56:52Z
local.embargo.terms: 2021-05-01
local.embargo.lift: 2021-05-01
local.etdauthor.orcid: 0000-0002-2020-3185
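
The block-level motion idea described in the abstract can be illustrated with classical exhaustive block matching. The sketch below is not the thesis's method, which this record does not specify; it is a minimal illustration under stated assumptions: grayscale frames as 2-D NumPy arrays, a sum-of-absolute-differences matching cost, and hypothetical names (`block_motion_vectors`, `warp_blocks`, `block`, `search`) chosen here for the example. It estimates one motion vector per block and then reuses a previous frame's per-pixel map by copying whole blocks along those vectors.

```python
import numpy as np


def block_motion_vectors(prev, curr, block=8, search=4):
    """For each block of `curr`, find the offset (dy, dx) into `prev`
    with the smallest sum of absolute differences (SAD)."""
    h, w = curr.shape
    rows, cols = h // block, w // block
    mvs = np.zeros((rows, cols, 2), dtype=np.int64)
    prev = prev.astype(np.int64)  # avoid unsigned wrap-around in the SAD
    curr = curr.astype(np.int64)
    for r in range(rows):
        for c in range(cols):
            y, x = r * block, c * block
            target = curr[y:y + block, x:x + block]
            best = None
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    py, px = y + dy, x + dx
                    # Skip candidates that fall outside the previous frame.
                    if py < 0 or px < 0 or py + block > h or px + block > w:
                        continue
                    cand = prev[py:py + block, px:px + block]
                    sad = np.abs(cand - target).sum()
                    if best is None or sad < best:
                        best = sad
                        mvs[r, c] = (dy, dx)
    return mvs


def warp_blocks(prev_map, mvs, block=8):
    """Build a map for the current frame by copying each block from
    `prev_map` at the location its motion vector points to."""
    out = np.zeros_like(prev_map)
    rows, cols = mvs.shape[:2]
    for r in range(rows):
        for c in range(cols):
            y, x = r * block, c * block
            dy, dx = mvs[r, c]
            out[y:y + block, x:x + block] = prev_map[
                y + dy:y + dy + block, x + dx:x + dx + block
            ]
    return out
```

One vector per block means the search cost scales with the number of blocks rather than the number of pixels, which is the kind of saving the abstract argues for when inter-frame motion is small. How the thesis obtains its motion vectors and selects keyframes is not stated in this record.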

