Reducing Time for Rate-Distortion Optimization in HEVC Using Attention Mechanism

Tedla, Chandra Sekhar

The full text of this item is not available at this time because the student has placed this item under an embargo for a period of time. The Libraries are not authorized to provide a copy of this work during the embargo period, even for Texas A&M users with NetID.

Show simple item record

dc.contributor.advisor	Tian, Chao
dc.creator	Tedla, Chandra Sekhar
dc.date.accessioned	2023-09-19T18:48:51Z
dc.date.created	2023-05
dc.date.issued	2023-04-09
dc.date.submitted	May 2023
dc.identifier.uri	https://hdl.handle.net/1969.1/199017
dc.description.abstract	High-Efficient Video Coding(HEVC) is the video coding standard proposed by the ITU-T Video Coding Experts Group and the ISO/IEC Moving Picture Experts Group. HEVC, also termed H.265, can support 8K UHD videos and saves upto 50% bit-rate saving over its predecessor, Advanced Video Coding(AVC/H.264) standard. HEVC similar to AVC uses a recursive quad-tree-based algorithm on its Coding Tree Unit(CTU) to partition it into multiple Coding blocks(CB) for its encoding, where it uses a brute-force approach to find the best possible partition of CTU as a part of the Rate-Distortion Optimization(RDO). This part of the RDO search is very computation-ally expensive and time-consuming. This work proposes a deep learning-based approach using the CNN and the attention-based network to reduce the time to find the best partition of CTUs. A large database of videos is used to generate the inter-mode CTU partition data using the HEVC soft-ware for the training. CNN is used to extract the features within the frame and then the attention based network uses the temporal correlation of frames to improve the performance of the network. We found that there was a reduction of about 40% of the time taken to encode, with a minute loss of less than 1% drop in bit-rate savings for any PSNR and supports all the configurations, outperforming the proposed state-of-the-art approaches to reduce the complexity of HEVC.
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.subject	Advanced video coding (AVC/H.264)
dc.subject	High Efficient Video Coding(HEVC)
dc.subject	Joint Collaborative Team on Video Coding (JCT-VC)
dc.subject	Video Coding Experts Group(VCEG)
dc.subject	Moving Picture Experts Group(MPEG)
dc.subject	video compression
dc.subject	Convolutional Neural Network
dc.subject	Attention-based network
dc.subject	Coding Tree Unit
dc.subject	Coding Unit depth decision
dc.title	Reducing Time for Rate-Distortion Optimization in HEVC Using Attention Mechanism
dc.type	Thesis
thesis.degree.department	Electrical and Computer Engineering
thesis.degree.discipline	Electrical Engineering
thesis.degree.grantor	Texas A&M University
thesis.degree.name	Master of Science
thesis.degree.level	Masters
dc.contributor.committeeMember	Xiong, Zixiang
dc.contributor.committeeMember	Jiang, Anxiao
dc.contributor.committeeMember	Ji, Jim
dc.type.material	text
dc.date.updated	2023-09-19T18:48:52Z
local.embargo.terms	2025-05-01
local.embargo.lift	2025-05-01
local.etdauthor.orcid	0009-0002-1368-6550

Files in this item

Name:: TEDLA-THESIS-2023.pdf
Size:: 1.216Mb
Format:: PDF

View/ Open

This item appears in the following Collection(s)

Electronic Theses, Dissertations, and Records of Study (2002– )
Texas A&M University Theses, Dissertations, and Records of Study (2002– )

Show simple item record

Reducing Time for Rate-Distortion Optimization in HEVC Using Attention Mechanism

Files in this item

This item appears in the following Collection(s)

Related items