Move "scanned" metadata as video-only and add `begin_stream_from_header` #591

NicolasHug · 2025-03-24T11:32:18Z

This PR does a bunch of related things:

We add a new begin_stream_seconds_from_header field to audio and video metadata. It is derived from the AVStream's start_time field. This field is useful e.g. in our tutorial example because it is now clear that the stream doesn't exactly starts at 0.
We move the begin_stream_seconds_from_content and end_stream_seconds_from_content fields as video-only: these fields are derived from a scan, so there's no point having them for audio.
As a consequence of that, we move the duration_seconds metadata as video-only, because it is derived from begin_stream_seconds_from_content and end_stream_seconds_from_content. This means that audio metadata now only contains duration_seconds_from_header.
We now allow negative start_seconds in the call to get_samples_played_in_range: the input check was based on begin_stream_seconds_from_content so that didn't make much sense. We still have tests that make sure we provide good error messages in the incorrect cases.

NicolasHug · 2025-03-24T11:54:08Z

src/torchcodec/decoders/_audio_decoder.py

@@ -13,7 +13,7 @@
 from torchcodec.decoders import _core as core
 from torchcodec.decoders._decoder_utils import (
    create_decoder,
-    get_and_validate_stream_metadata,
+    ERROR_REPORTING_INSTRUCTIONS,


Changes below: we previously had a common util for audio and video that extracted and validated the metadata: get_and_validate_stream_metadata(). Since we moved a bunch of fields as video-only, this util wasn't generic enough anymore to justify its existence, hence a few edits here and in the video_decoder.py file.

scotts · 2025-03-24T13:33:41Z

src/torchcodec/decoders/_core/VideoDecoderOps.cpp

+  if (streamMetadata.beginStreamFromHeader.has_value()) {
+    map["beginStreamFromHeader"] =
+        std::to_string(*streamMetadata.beginStreamFromHeader);
+  }


This reminded me to create Issue #593.

NicolasHug added 2 commits March 24, 2025 10:50

Add begin_stream_seconds_from_header field to metadata

d3d643b

Move from_content fields as video only and update audio bound checks

8969bcf

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Mar 24, 2025

Lint

6904ab1

NicolasHug changed the title ~~Rework metadata~~ Move "scanned" metadata as video-only and add begin_stream_from_header Mar 24, 2025

NicolasHug commented Mar 24, 2025

View reviewed changes

Fix repr

c601207

scotts reviewed Mar 24, 2025

View reviewed changes

scotts approved these changes Mar 24, 2025

View reviewed changes

NicolasHug merged commit abc9b10 into pytorch:main Mar 24, 2025
46 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move "scanned" metadata as video-only and add `begin_stream_from_header` #591

Move "scanned" metadata as video-only and add `begin_stream_from_header` #591

NicolasHug commented Mar 24, 2025 •

edited

Loading

NicolasHug Mar 24, 2025

scotts Mar 24, 2025

Move "scanned" metadata as video-only and add begin_stream_from_header #591

Move "scanned" metadata as video-only and add begin_stream_from_header #591

Conversation

NicolasHug commented Mar 24, 2025 • edited Loading

NicolasHug Mar 24, 2025

Choose a reason for hiding this comment

scotts Mar 24, 2025

Choose a reason for hiding this comment

Move "scanned" metadata as video-only and add `begin_stream_from_header` #591

Move "scanned" metadata as video-only and add `begin_stream_from_header` #591

NicolasHug commented Mar 24, 2025 •

edited

Loading