Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about finetuning with different resolution and frame nums #220

Open
zqh0253 opened this issue Jan 15, 2025 · 1 comment
Open

Question about finetuning with different resolution and frame nums #220

zqh0253 opened this issue Jan 15, 2025 · 1 comment

Comments

@zqh0253
Copy link

zqh0253 commented Jan 15, 2025

Thank you for providing such an excellent codebase. I'm curious if this repository supports fine-tuning the model (e.g., CogVideox) with different resolutions and frame numbers compared to the default setting, such as 256 x 256 x 6 (H x W x T). If so, what is the best practice for doing this?

@neph1
Copy link

neph1 commented Jan 15, 2025

I've done some experimentation with this with LTX-V. Multiple resolutions add flexibility to the lora, ie results get better when using different resolutions during inference (which is expected, I guess).
LTX doesn't seem to like lower framerates. I've tried to train it at as low as 12 fps, but seem to be getting smeared results as it interpolates to 24 fps. Although it's possible to offset it somewhat by raising the fps during inference (40fps, 60fps or more). I'd be happy to be corrected if anyone has better results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants