video_pipeο
- lightning_pose.data.dali.video_pipe(filenames: list[str] | str | list[list[str]], resize_dims: list[int] | None = None, random_shuffle: bool = False, sequence_length: int = 16, pad_sequences: bool = True, initial_fill: int = 16, normalization_mean: list[float] = [0.485, 0.456, 0.406], normalization_std: list[float] = [0.229, 0.224, 0.225], name: str = 'reader', step: int = 1, pad_last_batch: bool = False, imgaug: str = 'default', skip_vfr_check: bool = True) tuple[source]ο
Generic video reader pipeline that loads videos, resizes, augments, and normalizes.
- Parameters:
filenames β list of absolute paths of video files to feed through pipeline
resize_dims β [height, width] to resize raw frames
random_shuffle β True to grab random batches of frames from videos; False to sequential read
seed β random seed when random_shuffle is True
sequence_length β number of frames to load per sequence
pad_sequences β allows creation of incomplete sequences if there is an insufficient number of frames at the very end of the video
initial_fill β size of the buffer that is used for random shuffling
normalization_mean β mean values in (0, 1) to subtract from each channel
normalization_std β standard deviation values to subtract from each channel
name β pipeline name, used to string together DataNode elements
step β number of frames to advance on each read
pad_last_batch
imgaug β string identifying which imgaug pipeline to use; βdefaultβ, βdlcβ, βdlc-top-downβ
skip_vfr_check β donβt check for variable frame rates, can throw errors with small diffs
- Returns:
pipeline object to be fed to DALIGenericIterator data augmentation matrix (returned so that geometric transforms can be undone) size of video frames, used for bbox