Skip to content

vLAR-group/PhysInOne

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

13 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

PhysInOne: Visual Physics Learning and Reasoning in One Suite

CVPR 2026

arXiv Project Page Dataset License

PhysInOne Teaser

Overview

We present PhysInOne, the largest dataset addressing the critical scarcity of physically-grounded training data for AI systems.

Scale and Diversity

  • 2 million videos generated from 153,810 dynamic 3D scenes
  • Covers 71 fundamental physical phenomena in everyday environments, spanning four major domains: Mechanics, Optics, Fluid Dynamics, Magnetism
  • Includes 2,231 common objects tailored to daily physical interactions
  • Enriched with 623 materials across five categories: plastic, metal, wood, stone, and fabric
  • Features 528 diverse 3D backgrounds to ensure realism and environmental variety

Scene Characteristics

  • Each scene involves 1–3 physical phenomena, reflecting real-world activities
  • Supports complex multi-object interactions, with increasing scene complexity
  • Average number of objects per scene: 3.9 (single-physics), 6.3 (double-physics), 7.8 (triple-physics)
  • Each scene is captured from 13 viewpoints: 12 static cameras and 1 moving camera

Rich Annotations

  • 3D geometry
  • Semantic labels
  • Object motion and dynamics
  • Physical properties
  • Natural-language scene descriptions

Supported Applications

  • Physics-aware video generation
  • Short- and long-term future frame prediction
  • Physical property estimation
  • Motion transfer
  • And more...

πŸš€ Release Timetable

Component Progress Status Notes
SubSet β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ100% Released
Rendered Data - Train β–ˆβ–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘ 5%(5277/122,988) In progress Last updated: May 21
Rendered Data - Test β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘ 0%(0/15411) In progress
Rendered Data - Val β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘ 1%(103/15411) In progress
3D Assets β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘ 0% Not released Expected around June
Leaderboard β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘ 0% Ongoing Link will be added when available
PMF β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ100% Released
Data processing β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘β–‘ 0% Not released Expected around June

Links

Resource Link
πŸ“„ Paper arXiv
🌐 Project Page vlar-group.github.io/PhysInOne
πŸ€— Dataset Hugging Face

πŸ’» Code

PMF Metric

The PMF (Physical Motion Fidelity) evaluates video similarity in the frequency domain using 3D FFT-based energy distributions. It is designed for physics-aware video generation, future prediction, and motion transfer tasks in the PhysInOne benchmark.

Install via pip (Recommended)

# Step 1: Install PyTorch first (choose your variant)
# CPU only:
pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cpu
# CUDA 12.6:
pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu126

# Step 2: Install pmf from this repo
pip install git+https://github.com/vLAR-group/PhysInOne.git#subdirectory=pmf

Demo

#!/usr/bin/env python
"""Test PMF metric with random tensors."""

import torch
from pmf import compute_pmf

def main():
    torch.manual_seed(42)
    B, C, T, H, W = 1, 3, 16, 128, 128
    video_pred = torch.randn(B, C, T, H, W)
    video_gt = torch.randn(B, C, T, H, W)

    score = compute_pmf(video_pred, video_gt, device='cpu') 
    # If you want to use gpu, set device='cuda'
    # score = compute_pmf(video_pred, video_gt, device='cuda') 
    if isinstance(score, torch.Tensor):
        score = score.item()
        
    print(f"PMF similarity score: {score:.4f}")

if __name__ == "__main__":
    main()

🚧 Coming Soon 🚧

Data processing code will be released soon. Stay tuned!

Citation

If you find this work useful, please cite:

@misc{zhou2026physinonevisualphysicslearning,
      title={PhysInOne: Visual Physics Learning and Reasoning in One Suite}, 
      author={Siyuan Zhou and Hejun Wang and Hu Cheng and Jinxi Li and Dongsheng Wang and Junwei Jiang and Yixiao Jin and Jiayue Huang and Shiwei Mao and Shangjia Liu and Yafei Yang and Hongkang Song and Shenxing Wei and Zihui Zhang and Peng Huang and Shijie Liu and Zhengli Hao and Hao Li and Yitian Li and Wenqi Zhou and Zhihan Zhao and Zongqi He and Hongtao Wen and Shouwang Huang and Peng Yun and Bowen Cheng and Pok Kazaf Fu and Wai Kit Lai and Jiahao Chen and Kaiyuan Wang and Zhixuan Sun and Ziqi Li and Haochen Hu and Di Zhang and Chun Ho Yuen and Bing Wang and Zhihua Wang and Chuhang Zou and Bo Yang},
      year={2026},
      eprint={2604.09415},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2604.09415}, 
}

License

This project is licensed under the CC BY-NC-SA 4.0 license.

Acknowledgements

We would like to express our sincere gratitude to all contributors who participated in human evaluations and data collection efforts.

Releases

No releases published

Packages

 
 
 

Contributors

Languages