
DepthAnything/Video-Depth-Anything - GitHub
Jan 21, 2025 · This work presents Video Depth Anything based on Depth Anything V2, which can be applied to arbitrarily long videos without compromising quality, consistency, or generalization ability. …
【EMNLP 2024 】Video-LLaVA: Learning United Visual ... - GitHub
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection If you like our project, please give us a star ⭐ on GitHub for latest update. 💡 I also have other video-language …
Video-R1: Reinforcing Video Reasoning in MLLMs - GitHub
Feb 23, 2025 · Our Video-R1-7B obtain strong performance on several video reasoning benchmarks. For example, Video-R1-7B attains a 35.8% accuracy on video spatial reasoning benchmark VSI …
GitHub - DAMO-NLP-SG/Video-LLaMA: [EMNLP 2023 Demo] Video …
Jun 3, 2024 · Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding This is the repo for the Video-LLaMA project, which is working on empowering large …
GitHub - k4yt3x/video2x: A machine learning-based video super ...
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018. - k4yt3x/video2x
GitHub - MME-Benchmarks/Video-MME: [CVPR 2025] Video-MME: The …
We introduce Video-MME, the first-ever full-spectrum, M ulti- M odal E valuation benchmark of MLLMs in Video analysis. It is designed to comprehensively assess the capabilities of MLLMs in processing …
Wan: Open and Advanced Large-Scale Video Generative Models
Feb 25, 2025 · Wan: Open and Advanced Large-Scale Video Generative Models In this repository, we present Wan2.1, a comprehensive and open suite of video foundation models that pushes the …
hao-ai-lab/FastVideo - GitHub
A unified inference and post-training framework for accelerated video generation. - hao-ai-lab/FastVideo
VideoLLM-online: Online Video Large Language Model for Streaming …
Online Video Streaming: Unlike previous models that serve as offline mode (querying/responding to a full video), our model supports online interaction within a video stream. It can proactively update …
GitHub - Lightricks/LTX-Video: Official repository for LTX-Video
LTX-Video is the first DiT-based video generation model that contains all core capabilities of modern video generation in one model: synchronized audio and video, high fidelity, multiple performance …