The AV1 Abstract

The abstract breaks down the AOMedia Video 1 (AV1) codec, the open, royalty-free standard designed for the next generation of video streaming. This application breaks down the key technologies that give AV1 its superior compression efficiency.

Core Architecture & Concepts

AV1 builds upon a hybrid block-based coding framework, but enhances it with more flexible structures and parallelism. This section introduces the fundamental building blocks of the codec.

AV1 Encoding Pipeline

🎬

Input Frame

The process starts with a single video frame.

→

🧩

Partitioning

The frame is divided into large Superblocks (up to 128x128), which are recursively partitioned.

→

🔮

Prediction

Each block is predicted using information from the same frame (Intra) or other frames (Inter).

→

✨

Transform & Quantize

The prediction error (residual) is transformed to frequency domain and its precision is reduced.

→

🗃️

Entropy Coding

The quantized values and metadata are losslessly compressed into the final bitstream.

Advanced Prediction Techniques

Prediction is key to reducing redundancy. AV1 introduces a vast toolkit of new prediction methods to handle diverse content types, from natural video to screen captures. Select a category below to explore the specific tools.

Directional Modes

AV1 expands from VP9's 8 directional modes to 56 modes, including finer angles. This allows for much more accurate prediction of edges and textures, reducing the residual that needs to be coded.

Chroma from Luma (CfL)

A powerful chroma-only predictor. It leverages the high correlation between luma (brightness) and chroma (color) by modeling chroma pixels as a linear function of the reconstructed luma pixels in the same block.

Intra Block Copy (IBC)

Extremely effective for screen content (text, graphics). It allows a block to be predicted by copying a previously decoded block from within the same frame, essentially performing motion compensation locally.

Palette Mode

Designed for content with few colors, like animations or screen sharing. It identifies a small palette of unique colors within a block and then encodes each pixel by its palette index, achieving massive compression for such content.

Smooth & Paeth Predictors

AV1 introduces new smooth prediction modes for handling gradients and replaces VP9's TrueMotion with the Paeth predictor, which excels at predicting corners and sharp edges by selecting the best of three neighboring pixel values.

Recursive Filtering

The FILTER_INTRA modes treat a block as a 2D Markov process, using pre-designed 7-tap filters on small patches to generate a highly accurate prediction, especially for textured areas.

Efficient Coding & Quality Enhancement

After prediction, AV1 uses advanced transform, quantization, and filtering techniques to compress the residual data and clean up the final image, maximizing both efficiency and visual quality.

Transform & Quantization

Flexible Transforms: Supports both square and rectangular transform shapes (e.g., 64x16, 8x32) up to 64x64, adapting better to object shapes.
Multiple Kernels: Uses a rich set of transform kernels (DCT, ADST, Flip-ADST, Identity) to better compact energy for different signal types.
Quantization Matrices (QMS): Applies different quantization levels to different frequencies based on human perception, preserving important details while saving bits.

In-Loop Filtering Pipeline

1. Deblocking Filter (DBF)

Smooths sharp artifacts across transform block boundaries.

2. Constrained Directional Enhancement (CDEF)

A powerful deringing filter that removes "mosquito noise" around sharp edges by identifying edge directions and filtering along them.

3. Loop Restoration Filter (LRF)

The final step, using Wiener and Self-Guided filters to restore detail and reverse some of the damage caused by compression.

Performance & Codec Comparison

The combination of these advanced techniques results in significant compression gains over previous generations. The chart below shows typical bitrate savings for equivalent visual quality, according to industry benchmarks.

AV1 Bitrate Savings vs. Predecessors

Feature Showdown: AV1 vs. HEVC vs. H.264

Feature	H.264 (AVC)	H.265 (HEVC)	AV1
Max Block Size	16x16	64x64	128x128
Intra Prediction Modes	9	35	56+
Motion Compensation	Translational	Advanced	Warped, Affine, OBMC
In-Loop Filters	Deblocking	Deblocking, SAO	Deblocking, CDEF, LRF
Screen Content Tools	None	Limited	Intra Block Copy, Palette
Royalty Status	Complex Licensing	Complex Licensing	Royalty-Free