You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
While working on VMS (UI wrapper around Finetrainers) I realized that I ended up using a similar format to WebDataset, except I upload multiple .zip files containing .mp4/.txt pairs, instead of .tar shards.
@jbilcke Thanks for the recommendation! I recently rewrote a majority of the codebase to allow for this. This file lists all the supported dataset formats:
I haven't tested on a large scale run with a big webdataset yet, but I did verify it to be working with in a smaller setting so please let me know if it works when you give it a try.
I have a simple test dataset which I use for verifying loading in the fast-tests. You could try with it for a quick look:
Feature request / 功能建议
I propose for Finetrainers to support WebDataset as a dataset format.
Motivation / 动机
While working on VMS (UI wrapper around Finetrainers) I realized that I ended up using a similar format to WebDataset, except I upload multiple .zip files containing .mp4/.txt pairs, instead of .tar shards.
Unrelated to my project, I also notice some interest about using WebDataset in Finetrainers
Your contribution / 您的贡献
I've started refactoring my project to support WebDataset
The text was updated successfully, but these errors were encountered: