Table of Content
The same as an AV1 temporal unit. In rav1e, a Temporal Unit always starts with a temporal delimiter, followed by zero or one sequence headers, one or more frame headers and zero or more tile groups.
Same as an AV1 tile group. Zero or more of these, plus a Frame Header, make up a Frame.
Same as an AV1 frame header. These are followed by a tile group, except when the show-existing-frame feature is used, in which case there are zero tile groups.
Otherwise known as a subgop, this is a group of reordered frames. The first frames in the group will be non-shown frames, followed by a sequence of shown frames. A group may either start with a key frame or inter frame.
In the input, this is one picture of YUV data. In the bitstream, this is a Frame Header followed by zero or more Tile Groups. With reordering, there will be more frames in the output than the input, but some will not be shown.
Group Of Pictures, A sequence of groups. The first group will start with a keyframe, and the rest of the groups will start with inter frames.
One mode info block, the smallest unit of pixels that has a prediction mode coded for it. Corresponds to a 4x4 block of luma pixels and a 4x4, 2x4, or 2x2 unit of chroma pixels (depending on chroma subsampling).
Operations on raw pixel values of the image. Distortion measurements in pixel domain are the most accurate as they most closely represent the video output.
Otherwise known as frequency domain, it refers to coefficient values after they have gone through the forward transform and before the inverse transform. Computing distortion in transform domain is faster than pixel domain as the inverse transform can be skipped, but it is less accurate as a result.