PhysInOne: Visual Physics Learning and Reasoning in One Suite

CVPR 2026

Overview

We present PhysInOne, the largest dataset addressing the critical scarcity of physically-grounded training data for AI systems.

Scale and Diversity

2 million videos generated from 153,810 dynamic 3D scenes
Covers 71 fundamental physical phenomena in everyday environments, spanning four major domains: Mechanics, Optics, Fluid Dynamics, Magnetism
Includes 2,231 common objects tailored to daily physical interactions
Enriched with 623 materials across five categories: plastic, metal, wood, stone, and fabric
Features 528 diverse 3D backgrounds to ensure realism and environmental variety

Scene Characteristics

Each scene involves 1–3 physical phenomena, reflecting real-world activities
Supports complex multi-object interactions, with increasing scene complexity
Average number of objects per scene: 3.9 (single-physics), 6.3 (double-physics), 7.8 (triple-physics)
Each scene is captured from 13 viewpoints: 12 static cameras and 1 moving camera

Rich Annotations

3D geometry
Semantic labels
Object motion and dynamics
Physical properties
Natural-language scene descriptions

Supported Applications

Physics-aware video generation
Short- and long-term future frame prediction
Physical property estimation
Motion transfer
And more...

🚀 Release Timetable

Component	Progress	Status	Notes
SubSet	`██████████`100%	Released
Rendered Data - Train	`█░░░░░░░░░` 5%(5277/122,988)	In progress	Last updated: May 21
Rendered Data - Test	`░░░░░░░░░░` 0%(0/15411)	In progress
Rendered Data - Val	`░░░░░░░░░░` 1%(103/15411)	In progress
3D Assets	`░░░░░░░░░░` 0%	Not released	Expected around June
Leaderboard	`░░░░░░░░░░` 0%	Ongoing	Link will be added when available
PMF	`██████████`100%	Released
Data processing	`░░░░░░░░░░` 0%	Not released	Expected around June

Links

Resource	Link
📄 Paper	arXiv
🌐 Project Page	vlar-group.github.io/PhysInOne
🤗 Dataset	Hugging Face

💻 Code

PMF Metric

The PMF (Physical Motion Fidelity) evaluates video similarity in the frequency domain using 3D FFT-based energy distributions. It is designed for physics-aware video generation, future prediction, and motion transfer tasks in the PhysInOne benchmark.

Install via pip (Recommended)

# Step 1: Install PyTorch first (choose your variant)
# CPU only:
pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cpu
# CUDA 12.6:
pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu126

# Step 2: Install pmf from this repo
pip install git+https://github.com/vLAR-group/PhysInOne.git#subdirectory=pmf

Demo

#!/usr/bin/env python
"""Test PMF metric with random tensors."""

import torch
from pmf import compute_pmf

def main():
    torch.manual_seed(42)
    B, C, T, H, W = 1, 3, 16, 128, 128
    video_pred = torch.randn(B, C, T, H, W)
    video_gt = torch.randn(B, C, T, H, W)

    score = compute_pmf(video_pred, video_gt, device='cpu') 
    # If you want to use gpu, set device='cuda'
    # score = compute_pmf(video_pred, video_gt, device='cuda') 
    if isinstance(score, torch.Tensor):
        score = score.item()
        
    print(f"PMF similarity score: {score:.4f}")

if __name__ == "__main__":
    main()

🚧 Coming Soon 🚧

Data processing code will be released soon. Stay tuned!

Citation

If you find this work useful, please cite:

@misc{zhou2026physinonevisualphysicslearning,
      title={PhysInOne: Visual Physics Learning and Reasoning in One Suite}, 
      author={Siyuan Zhou and Hejun Wang and Hu Cheng and Jinxi Li and Dongsheng Wang and Junwei Jiang and Yixiao Jin and Jiayue Huang and Shiwei Mao and Shangjia Liu and Yafei Yang and Hongkang Song and Shenxing Wei and Zihui Zhang and Peng Huang and Shijie Liu and Zhengli Hao and Hao Li and Yitian Li and Wenqi Zhou and Zhihan Zhao and Zongqi He and Hongtao Wen and Shouwang Huang and Peng Yun and Bowen Cheng and Pok Kazaf Fu and Wai Kit Lai and Jiahao Chen and Kaiyuan Wang and Zhixuan Sun and Ziqi Li and Haochen Hu and Di Zhang and Chun Ho Yuen and Bing Wang and Zhihua Wang and Chuhang Zou and Bo Yang},
      year={2026},
      eprint={2604.09415},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2604.09415}, 
}

License

This project is licensed under the CC BY-NC-SA 4.0 license.

Acknowledgements

We would like to express our sincere gratitude to all contributors who participated in human evaluations and data collection efforts.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
pmf		pmf
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
teaser.png		teaser.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PhysInOne: Visual Physics Learning and Reasoning in One Suite

Overview

Scale and Diversity

Scene Characteristics

Rich Annotations

Supported Applications

🚀 Release Timetable

Links

💻 Code

PMF Metric

Install via pip (Recommended)

Demo

Citation

License

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

PhysInOne: Visual Physics Learning and Reasoning in One Suite

Overview

Scale and Diversity

Scene Characteristics

Rich Annotations

Supported Applications

🚀 Release Timetable

Links

💻 Code

PMF Metric

Install via pip (Recommended)

Demo

Citation

License

Acknowledgements

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages