The full text of this item is not available at this time because the student has placed this item under an embargo for a period of time. The Libraries are not authorized to provide a copy of this work during the embargo period, even for Texas A&M users with NetID.
Reducing Time for Rate-Distortion Optimization in HEVC Using Attention Mechanism
dc.contributor.advisor | Tian, Chao | |
dc.creator | Tedla, Chandra Sekhar | |
dc.date.accessioned | 2023-09-19T18:48:51Z | |
dc.date.created | 2023-05 | |
dc.date.issued | 2023-04-09 | |
dc.date.submitted | May 2023 | |
dc.identifier.uri | https://hdl.handle.net/1969.1/199017 | |
dc.description.abstract | High-Efficient Video Coding(HEVC) is the video coding standard proposed by the ITU-T Video Coding Experts Group and the ISO/IEC Moving Picture Experts Group. HEVC, also termed H.265, can support 8K UHD videos and saves upto 50% bit-rate saving over its predecessor, Advanced Video Coding(AVC/H.264) standard. HEVC similar to AVC uses a recursive quad-tree-based algorithm on its Coding Tree Unit(CTU) to partition it into multiple Coding blocks(CB) for its encoding, where it uses a brute-force approach to find the best possible partition of CTU as a part of the Rate-Distortion Optimization(RDO). This part of the RDO search is very computation-ally expensive and time-consuming. This work proposes a deep learning-based approach using the CNN and the attention-based network to reduce the time to find the best partition of CTUs. A large database of videos is used to generate the inter-mode CTU partition data using the HEVC soft-ware for the training. CNN is used to extract the features within the frame and then the attention based network uses the temporal correlation of frames to improve the performance of the network. We found that there was a reduction of about 40% of the time taken to encode, with a minute loss of less than 1% drop in bit-rate savings for any PSNR and supports all the configurations, outperforming the proposed state-of-the-art approaches to reduce the complexity of HEVC. | |
dc.format.mimetype | application/pdf | |
dc.language.iso | en | |
dc.subject | Advanced video coding (AVC/H.264) | |
dc.subject | High Efficient Video Coding(HEVC) | |
dc.subject | Joint Collaborative Team on Video Coding (JCT-VC) | |
dc.subject | Video Coding Experts Group(VCEG) | |
dc.subject | Moving Picture Experts Group(MPEG) | |
dc.subject | video compression | |
dc.subject | Convolutional Neural Network | |
dc.subject | Attention-based network | |
dc.subject | Coding Tree Unit | |
dc.subject | Coding Unit depth decision | |
dc.title | Reducing Time for Rate-Distortion Optimization in HEVC Using Attention Mechanism | |
dc.type | Thesis | |
thesis.degree.department | Electrical and Computer Engineering | |
thesis.degree.discipline | Electrical Engineering | |
thesis.degree.grantor | Texas A&M University | |
thesis.degree.name | Master of Science | |
thesis.degree.level | Masters | |
dc.contributor.committeeMember | Xiong, Zixiang | |
dc.contributor.committeeMember | Jiang, Anxiao | |
dc.contributor.committeeMember | Ji, Jim | |
dc.type.material | text | |
dc.date.updated | 2023-09-19T18:48:52Z | |
local.embargo.terms | 2025-05-01 | |
local.embargo.lift | 2025-05-01 | |
local.etdauthor.orcid | 0009-0002-1368-6550 |
Files in this item
This item appears in the following Collection(s)
-
Electronic Theses, Dissertations, and Records of Study (2002– )
Texas A&M University Theses, Dissertations, and Records of Study (2002– )