add DPA-ADAPT toolkit for downstream property adaptation by zhaiwenxi · Pull Request #5572 · deepmodeling/deepmd-kit

zhaiwenxi · 2026-06-22T11:27:26Z

Summary

This PR adds DPA-ADAPT, a toolkit for adapting pretrained DPA models to downstream atomistic property prediction tasks.

The new package provides a scikit-learn-style Python API and standalone CLI for fine-tuning, descriptor extraction, prediction, evaluation, cross-validation, and data preparation, without requiring users to manually write DeePMD-kit training input files.

Main changes

- Add the top-level dpa_adapt Python package.
- Add standalone CLI entry points:
  - dpa-adapt
  - dpaad
- Support multiple adaptation strategies:
  - frozen_sklearn: frozen DPA descriptors with scikit-learn regressors
  - frozen_head: train a property head on top of a frozen DPA backbone
  - finetune: end-to-end DPA fine-tuning
  - mft: multi-task fine-tuning with auxiliary energy/force training
- Add data utilities for:
  - DeepMD/npy loading and validation
  - label attachment
  - descriptor caching
  - train/test split and cross-validation
  - SMILES/formula-based conversion workflows
  - optional frame parameters via fparam.npy
- Add prediction and evaluation helpers with MAE, RMSE, and R2 reporting.
- Add documentation under doc/dpa_adapt/.
- Add a runnable QM9 HOMO-LUMO gap example under examples/dpa_adapt/.
- Add dpa-adapt optional dependencies in pyproject.toml.
- Add dedicated lightweight CI for source/tests/dpa_adapt/.
Co-authored-by: zirenjin <zirenjin@umich.edu>
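
For reference, the three metrics named in the list above follow their standard definitions. The sketch below is a plain-Python illustration only; the function name `regression_metrics` is hypothetical and this is not the helper the PR adds.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MAE, RMSE, and R2 for two equal-length sequences of floats.

    Hypothetical illustration; not part of dpa_adapt.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    ss_res = sum(e * e for e in errors)
    rmse = math.sqrt(ss_res / n)
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    # R2 is undefined when all labels are identical; return NaN in that case.
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return {"mae": mae, "rmse": rmse, "r2": r2}
```

With these definitions, R2 is negative whenever the predictions have a larger squared error than always predicting the mean label.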

…utput parsing
- DPAFineTuner: extract _FrozenSklearnPipeline helper; keep public API unchanged
- MFTFineTuner: defer _read_fitting_net_from_ckpt to first access
- DPATrainer._parse_test_output: single anchored regex per metric, auto-detect format

…perty metrics
- _load_labels: accept str | list[str], stack columns for multi-property
- build_sklearn_head: n_outputs param, wrap RF/Ridge with MultiOutputRegressor
- evaluate: per-property mae/rmse/r2 dict when target_key is a list
- freeze/DPAPredictor: store and load target_key as-is (str or list)
- CLI: --target-key homo,lumo parsed via _maybe_split_list
- 6 new tests covering fit, evaluate, freeze/load round-trip

The old _load_descriptor_model, _validate_type_map, _remap_atom_types, _extract_features_cached, and _extract_features method bodies were left in place alongside the new thin delegators, causing CodeQL 'variable defined multiple times' warnings. Removed the old bodies; kept _extract_features_cached on DPAFineTuner directly so that test patches on DPAFineTuner._extract_features are honoured through the cache wrapper.

… method
- Replace try/except ImportError in _unwrap_multioutput with direct import (sklearn is always available when dpa_tools is loaded)
- Remove _FrozenSklearnPipeline.extract_features_cached (dead code; the caching wrapper lives on DPAFineTuner so test patches work)

github-advanced-security

CodeQL found more than 20 potential problems in the proposed changes. Check the Files changed tab for more details.

codecov · 2026-06-22T12:49:17Z

Codecov Report

❌ Patch coverage is 66.66667% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 82.14%. Comparing base (03682bf) to head (7111f67).

Files with missing lines	Patch %	Lines
deepmd/__about__.py	0.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #5572      +/-   ##
==========================================
- Coverage   82.14%   82.14%   -0.01%     
==========================================
  Files         900      901       +1     
  Lines      104139   104139              
  Branches     4471     4473       +2     
==========================================
- Hits        85550    85547       -3     
- Misses      17178    17181       +3     
  Partials     1411     1411

njzjz-bot · 2026-06-22T15:32:00Z

Linking the property-workflow issues that this PR covers under the updated DPA-ADAPT command surface:

[Feature Request] Add DPA-ADAPT high-level property workflow CLI/Python interface #5376 — umbrella high-level property workflow CLI/Python interface
Add standalone DPA-ADAPT CLI namespace (dpa-adapt/dpaad) and shared command plumbing #5401 — standalone dpa-adapt / dpaad CLI namespace and shared plumbing
Add dpa-adapt predict MVP for high-level property inference #5402 — dpa-adapt predict high-level property inference
Add dpa-adapt extract-descriptors MVP for representation extraction #5403 — dpa-adapt extract-descriptors representation/descriptor extraction
Add DPA-ADAPT Python API for property prediction and descriptor extraction #5404 — DPA-ADAPT Python API for prediction and descriptor extraction
Add dpa-adapt fit and DPAFineTuner high-level property training API #5405 — dpa-adapt fit / DPAFineTuner high-level property training API

Command-name update: this PR implements the workflow as standalone dpa-adapt / dpaad commands (fit, predict, extract-descriptors, etc.) rather than the earlier dp property ... / dp --pt property train sketches.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)

njzjz-bot

Thanks for putting this together. I think this needs another revision before merge: there are a few correctness issues in the DPA-ADAPT code path, and the example/test material should be trimmed and made portable. I left inline comments on the specific blockers.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)

njzjz-bot · 2026-06-22T15:53:02Z

+        self._sklearn._model = self._model
+        if self._device is not None:
+            self._sklearn._device = self._device
+        self._sklearn._checkpoint_type_map = self._checkpoint_type_map


This sync overwrites the pipeline's checkpoint type_map with the parent object's initial []. _FrozenSklearnPipeline.load_descriptor_model() sets self._checkpoint_type_map from the checkpoint, but the parent DPAFineTuner._checkpoint_type_map is never updated, so the next _ensure_sklearn() call clears it again. That disables unsupported-element validation and local-to-checkpoint atom-type remapping; for non-prefix type maps, descriptors can be computed with wrong atom-type indices. Please either sync the loaded value back to the parent or avoid overwriting the pipeline value after it is loaded.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
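
To make the consequence in the comment above concrete, here is a minimal sketch of local-to-checkpoint atom-type remapping. The function names `build_type_remap` and `remap_atom_types` are hypothetical and do not come from the PR; the point is only that an empty checkpoint type map leaves nothing to validate or remap against, so a loud failure is safer than a silent pass-through.

```python
def build_type_remap(local_type_map, checkpoint_type_map):
    """Map local atom-type indices to checkpoint indices by element symbol.

    Hypothetical helper for illustration; not the PR's implementation.
    """
    if not checkpoint_type_map:
        # An empty checkpoint map means validation cannot run; fail loudly
        # instead of silently passing local indices through.
        raise ValueError("checkpoint type_map is empty; load the model first")
    index_of = {elem: i for i, elem in enumerate(checkpoint_type_map)}
    unsupported = [e for e in local_type_map if e not in index_of]
    if unsupported:
        raise ValueError(f"elements not in checkpoint type_map: {unsupported}")
    return [index_of[e] for e in local_type_map]


def remap_atom_types(atom_types, remap):
    """Translate per-atom local type indices into checkpoint type indices."""
    return [remap[t] for t in atom_types]
```

For a non-prefix case such as a local map `["O", "H"]` against a checkpoint map `["H", "C", "N", "O"]`, the remap is `[3, 0]`, so local atom types `[0, 1, 1]` become `[3, 0, 0]`. Skipping this step would feed index 0 (H in the checkpoint) for every oxygen atom.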

njzjz-bot · 2026-06-22T15:53:02Z

+def _per_system_cache_path(system) -> Path:
+    """Return the cache path for a single system's descriptors."""
+    fp = _system_fingerprint(system)
+    return _cache_dir() / f"{fp}.npy"


The per-system descriptor cache key only depends on the input system fingerprint, but ensure_per_system_cache() also takes pretrained, model_branch, and pooling. A cache file generated with one checkpoint/branch/pooling will be silently reused for another, which can train/evaluate on stale descriptors. Please include the resolved checkpoint identity/mtime, branch, and pooling in the per-system key; the bulk _cache_key() above should also include model_branch.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
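
One possible shape for the cache key requested above is sketched below. It assumes the checkpoint is a local file and that `model_branch` and `pooling` have stable string forms; the function name `descriptor_cache_key` is hypothetical and the fingerprint is taken as an already-computed string.

```python
import hashlib
import os
from pathlib import Path


def descriptor_cache_key(system_fingerprint, pretrained, model_branch, pooling):
    """Hash everything that can change the descriptors, not only the input data.

    Hypothetical helper for illustration; not the PR's implementation.
    """
    ckpt = Path(pretrained).resolve()
    stat = os.stat(ckpt)
    parts = [
        system_fingerprint,
        str(ckpt),               # which checkpoint file
        str(stat.st_mtime_ns),   # changes when the file is rewritten
        str(stat.st_size),
        str(model_branch),
        str(pooling),
    ]
    # A NUL separator keeps ("ab", "c") distinct from ("a", "bc").
    return hashlib.sha256("\0".join(parts).encode("utf-8")).hexdigest()
```

The cache file name would then be derived from this digest instead of the system fingerprint alone, so switching checkpoint, branch, or pooling produces a cache miss rather than stale descriptors.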

njzjz-bot · 2026-06-22T15:53:02Z

+                stderr=subprocess.STDOUT,
+                text=True,
+                bufsize=1,
+                cwd=self.output_dir,


Running dp with cwd=self.output_dir breaks the default relative output_dir. Just above, input_json is built as ./dpa_output/mft_input.json (or similar); after changing cwd into ./dpa_output, the command now looks for ./dpa_output/dpa_output/mft_input.json. Relative train/aux paths embedded in the generated config have the same issue. Please use absolute paths in the config/command or run from the original working directory.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
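
The path doubling described above can be reproduced with `pathlib` alone. The sketch below also shows one of the two suggested fixes, resolving paths to absolute form before the subprocess changes directory; the helper name `absolute_run_paths` is hypothetical.

```python
from pathlib import Path


def absolute_run_paths(output_dir, input_name="mft_input.json"):
    """Resolve output_dir once, before any subprocess changes directory.

    Hypothetical helper for illustration; not the PR's implementation.
    """
    out = Path(output_dir).resolve()
    return out, out / input_name


# What goes wrong with a relative path: the child process, started with
# cwd=./dpa_output, re-interprets the same relative path against its own cwd.
relative_input = Path("dpa_output") / "mft_input.json"
seen_by_child = Path("dpa_output") / relative_input
print(seen_by_child.as_posix())  # dpa_output/dpa_output/mft_input.json
```

Any training or auxiliary data paths written into the generated config would need the same treatment, since they are resolved by the child process too.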

njzjz-bot · 2026-06-22T15:53:03Z

+# ADAPT example
+
+This directory contains a small ready-to-run example for `dpa_adapt`.
+The example uses 50 pre-processed QM9 molecules to fine-tune and evaluate a


This example currently commits 50 preprocessed QM9 systems (252 files, about 1.5 MB) under examples/dpa_adapt/data. That feels too large and noisy for a repository example, especially since prepare_data.py can regenerate data. Please reduce the checked-in dataset to the minimal number of tiny systems needed to demonstrate the commands (or keep only generated-on-demand data), and leave larger QM9 regeneration to the script/docs.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)

njzjz-bot · 2026-06-22T15:53:03Z

+import numpy as np
+
+# ── paths ──────────────────────────────────────────────────────────────────
+DEMO_DIR = Path("/home/ziren/aisi-intern/deepmd-kit/examples/dpa_adapt/data")


This test is not portable: it hard-codes a local /home/ziren/... checkout and, below, a local pretrained checkpoint path. It will fail for anyone else running the repository tests locally and should not be merged as-is. Please move this under source/tests/dpa_adapt/ and build paths from the repository root / tmp_path, with any real checkpoint-dependent coverage skipped or mocked.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
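
A portable alternative to the hard-coded path could locate the repository root at run time. This is a minimal sketch, assuming `pyproject.toml` sits at the repository root; the helper name `find_repo_root` is hypothetical.

```python
from pathlib import Path


def find_repo_root(start, marker="pyproject.toml"):
    """Walk upward from start until a directory containing marker is found.

    Hypothetical helper for illustration; not the PR's implementation.
    """
    start = Path(start).resolve()
    for candidate in [start, *start.parents]:
        if (candidate / marker).is_file():
            return candidate
    raise FileNotFoundError(f"no {marker} found above {start}")
```

In a test module this would be used as `find_repo_root(Path(__file__).parent) / "examples" / "dpa_adapt" / "data"`, with outputs written under pytest's `tmp_path` fixture and checkpoint-dependent tests skipped when no checkpoint is available.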

njzjz-bot · 2026-06-22T15:53:03Z

 - **implements the Deep Potential series models**, which have been successfully applied to finite and extended systems, including organic molecules, metals, semiconductors, insulators, etc.
 - **implements MPI and GPU supports**, making it highly efficient for high-performance parallel and distributed computing.
 - **highly modularized**, easy to adapt to different descriptors for deep learning-based potential energy models.
+- **fine-tunes pre-trained DPA models through a scikit-learn-style Python API**, via [`dpa_adapt`](dpa_adapt/README.md) — construct a `DPAFineTuner`, then `fit` and `predict` to adapt a large pre-trained model to your own property dataset, with no input files to write.


This link points to dpa_adapt/README.md, but this PR does not add that file (the README lives under doc/dpa_adapt/README.md). As written, the top-level README will contain a broken link. Please either add the package README or link to the documentation path that actually exists.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)

njzjz-bot · 2026-06-22T16:18:10Z

+
+        # TODO: replace with dedicated DescriptorExtractor class after refactor.
+        # For now, DPAFineTuner is reused purely as a descriptor feature extractor.
+        self._extractor = DPAFineTuner(


The frozen-model predictor loads the saved type_map into self._type_map, but the descriptor extractor is constructed without that map. _extract_and_condition() validates against self._type_map, then _extract_features() uses the extractor's own empty/default type map state. For data without type_map.raw, this can compute descriptors with the wrong checkpoint atom-type indices. Please pass/sync the saved type map into the extractor before validation/extraction.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)

njzjz-bot · 2026-06-22T16:18:13Z

+            t.train_data if isinstance(t.train_data, list) else [t.train_data]
+        )
+
+        training = {


MFTFineTuner.fit(..., valid_data=...) stores valid_data, but the generated MFT config never emits a validation_data block for either branch. As a result, callers who provide validation data silently train without validation. Please either wire valid_data into the config or reject it explicitly.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
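
As a rough sketch of the first option (wiring validation data into the config), a per-branch data block could be built as follows. The key names `training_data`, `validation_data`, and `systems` follow the DeePMD-kit training input convention as I understand it; the layout of the generated MFT config is not shown in this thread, so the function `build_branch_data` and its placement in the config are assumptions.

```python
def build_branch_data(train_systems, valid_systems=None):
    """Build one branch's data block, emitting validation data when provided.

    Hypothetical helper for illustration; not the PR's implementation.
    """
    block = {"training_data": {"systems": list(train_systems)}}
    if valid_systems:
        # Only emit the block when the caller actually passed validation data,
        # so "no validation" remains an explicit choice rather than an accident.
        block["validation_data"] = {"systems": list(valid_systems)}
    return block
```

The alternative the comment mentions, rejecting `valid_data` with an explicit error, would be equally acceptable; what matters is that the argument is never silently dropped.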

njzjz-bot · 2026-06-22T16:18:15Z

+        )
+        # Paper default 0.5/0.5; aux_prob (default 0.5) controls the split, the
+        # downstream share is the complement. Legacy keeps downstream at 1.0.
+        downstream_prob = (1.0 - t.aux_prob) if is_property else 1.0


aux_prob is not range-validated before using 1.0 - t.aux_prob. Values outside [0, 1] produce negative model sampling probabilities (for example aux_prob=1.2 gives downstream -0.2), which will fail later or train with invalid branch weights. Please validate this in the tuner constructor before building the DeepMD input.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
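
The range check asked for above is small. A minimal sketch, with the hypothetical name `validate_aux_prob`, that could run in the tuner constructor:

```python
def validate_aux_prob(aux_prob):
    """Return aux_prob as a float, rejecting values outside [0, 1].

    Hypothetical helper for illustration; not the PR's implementation.
    """
    value = float(aux_prob)
    # The chained comparison is also False for NaN, so NaN is rejected too.
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"aux_prob must be in [0, 1], got {aux_prob!r}")
    return value
```

With this in place, `aux_prob=1.2` fails at construction time instead of producing a downstream probability of -0.2 later.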

njzjz-bot · 2026-06-22T16:18:17Z

+            )
+            return str(latest)
+
+        if self.fparam_dim > 0:


When fparam_dim > 0, this validates only the training systems. Validation systems can still be missing set.*/fparam.npy or have a different fparam width, so dp --pt train will fail later or validate with inconsistent feature dimensions. Please validate valid_systems with the same fparam_dim before writing/running the config.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)

njzjz-bot · 2026-06-22T16:18:23Z

+        self._condition_manager = None
+        if self.fparam_dim > 0:
+            conditions = _read_fparam_from_systems(systems)
+            if conditions is not None:


For frozen sklearn training, requested fparams are silently ignored if _read_fparam_from_systems() returns None. If fparam_dim > 0, missing fparam data should be a hard error, not a fallback to a model without conditions. This also needs to ensure all systems have fparams with the expected width, otherwise a partial read can concatenate condition rows against the wrong descriptor rows.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
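
Both fparam comments above ask for the same kind of strict check, applied to every system (training and validation alike). The sketch below works on array shapes only, so it stays independent of how the arrays are loaded; the function name `check_fparam_shapes` and the shape-dictionary input are assumptions made for illustration.

```python
def check_fparam_shapes(shapes_by_system, fparam_dim):
    """Require every system to carry fparam rows of width fparam_dim.

    shapes_by_system maps a system path to the (n_frames, width) shape of its
    fparam array, or to None when no fparam.npy was found.
    Hypothetical helper for illustration; not the PR's implementation.
    """
    if fparam_dim <= 0:
        return  # fparams were not requested, nothing to check
    missing = [s for s, shape in shapes_by_system.items() if shape is None]
    if missing:
        raise ValueError(
            f"fparam_dim={fparam_dim} but fparam.npy is missing for: {missing}"
        )
    wrong = {
        s: shape
        for s, shape in shapes_by_system.items()
        if len(shape) != 2 or shape[1] != fparam_dim
    }
    if wrong:
        raise ValueError(
            f"expected fparam width {fparam_dim}, got shapes: {wrong}"
        )
```

Running this before any descriptors and conditions are concatenated prevents a partial read from pairing condition rows with the wrong descriptor rows.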

njzjz-bot · 2026-06-22T16:18:31Z

+                    "fmt is not supported for mft evaluate(); "
+                    "provide deepmd/npy system directories."
+                )
+            result = self._ensure_mft().predict(data)


The public wrapper's MFT evaluate() always calls MFTFineTuner.predict(), but predict() explicitly rejects downstream_task_type='ener'. MFTFineTuner.evaluate() already supports the energy-mode path, so legacy energy-mode MFT evaluation is unreachable through DPAFineTuner.evaluate(). Please dispatch to MFTFineTuner.evaluate() for energy mode.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)

njzjz-bot · 2026-06-22T16:18:36Z

+                "freeze() was called before fit(). Train the model with fit() first."
+            )
+
+        bundle = {


After frozen_head, finetune, or mft training, _fitted is set to True, so this freeze() path is allowed even though no sklearn predictor/target metadata was fit. The resulting bundle has predictor=None (and default task metadata) and can be loaded by DPAPredictor only to fail or behave nonsensically. Please restrict this freeze format to the sklearn strategy, or implement separate serialization for the other strategies.

Authored by OpenClaw (model: custom-chat-jinzhezeng-group/gpt-5.5)
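
A guard along the lines of the first suggestion above might look like this. The strategy name `frozen_sklearn` comes from the PR description; the function name `check_freezable` and its arguments are hypothetical stand-ins for the tuner's internal state.

```python
def check_freezable(strategy, fitted, predictor):
    """Allow the sklearn bundle format only when an sklearn predictor exists.

    Hypothetical helper for illustration; not the PR's implementation.
    """
    if not fitted:
        raise RuntimeError(
            "freeze() was called before fit(). Train the model with fit() first."
        )
    if strategy != "frozen_sklearn" or predictor is None:
        # Other strategies set the fitted flag but never fit an sklearn head,
        # so this bundle format would be written with predictor=None.
        raise NotImplementedError(
            f"freeze() is only supported for strategy='frozen_sklearn', "
            f"got {strategy!r}"
        )
```

This turns a bundle that would fail at load time into an error at the point where the user can still act on it.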

njzjz

I tested this PR locally on 7111f678f1df3d35679a2b7f49fbe3b686ceda41 with srun --gres=gpu:1 on an RTX 5090. After installing -e .[dpa-adapt] and replacing the CPU torch wheel with torch 2.12.1+cu129, CUDA was visible from the venv (torch.cuda.is_available() == True).

What passed:

python -m pytest source/tests/dpa_adapt/ -v --ignore=source/tests/dpa_adapt/test_trainer_dim_case_embd.py: 293 passed, 12 skipped.
python -m pytest source/tests/dpa_adapt/test_backend_contract.py -v: 7 passed with CUDA torch.
srun --gres=gpu:1 ... python examples/dpa_adapt/scripts/run_evaluate_frozen_sklearn.py: completed, MAE 1.1801 eV, RMSE 1.4642 eV, R2 -0.5223.
srun --gres=gpu:1 ... python examples/dpa_adapt/scripts/run_evaluate_frozen_head.py: completed numerically, but exposed the issue in the inline comment: the spawned dp --pt train came from /home/jzzeng/miniconda3/bin/dp instead of the active venv's dp.

Requesting changes because the dp subprocess resolution can silently run a different DeePMD-kit/torch environment from the one importing dpa_adapt, so the training/evaluation paths are not reliable in common symlinked-venv setups.

njzjz · 2026-06-22T17:01:15Z

+    from pathlib import Path as _Path
+
+    exe_name = "dp.exe" if _os.name == "nt" else "dp"
+    candidate = _Path(_sys.executable).resolve().parent / exe_name


This escapes the active virtualenv when sys.executable is a symlink. In my local venv, sys.executable is /home/jzzeng/codes/deepmd-kit/venv/bin/python, but Path(sys.executable).resolve().parent becomes /home/jzzeng/miniconda3/bin, so resolve_dp_command() returns /home/jzzeng/miniconda3/bin/dp even though shutil.which('dp') points at /home/jzzeng/codes/deepmd-kit/venv/bin/dp. The frozen_head example then printed Running: /home/jzzeng/miniconda3/bin/dp --pt train ..., i.e. it trained with a different DeePMD-kit/torch install (deepmd-kit 3.2.0b1.dev42, torch 2.10.0+cu128) than the PR venv (deepmd-kit 3.2.0b1.dev203, torch 2.12.1+cu129).

Please do not dereference the interpreter symlink here. Use the scripts directory for the active environment, e.g. Path(sys.executable).parent / exe_name or sysconfig.get_path('scripts'), before falling back to shutil.which('dp').

f"{sys.executable} -m deepmd" has the same effect.
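
The lookup order suggested in this comment can be sketched as follows. `resolve_dp_command` is the function named in the review, but the body here is only an illustration of the proposed order; returning an argv list and the final `-m deepmd` fallback are assumptions based on the remarks above, not the PR's actual code.

```python
import os
import shutil
import sys
import sysconfig
from pathlib import Path


def resolve_dp_command():
    """Return an argv prefix that runs dp from the active environment.

    Illustrative sketch only; the real function may differ in return type.
    """
    exe_name = "dp.exe" if os.name == "nt" else "dp"
    candidates = [
        # No .resolve() here: following the interpreter symlink can leave
        # the virtualenv and land in the base installation's bin directory.
        Path(sys.executable).parent / exe_name,
        Path(sysconfig.get_path("scripts")) / exe_name,
    ]
    for candidate in candidates:
        if candidate.is_file():
            return [str(candidate)]
    found = shutil.which("dp")
    if found:
        return [found]
    # Last resort: run the package as a module with the current interpreter.
    return [sys.executable, "-m", "deepmd"]
```

A caller would then build the training command as `resolve_dp_command() + ["--pt", "train", ...]`, which keeps the subprocess in the same environment that imported dpa_adapt.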

zhaiwenxi and others added 30 commits May 27, 2026 16:08

feat: add DeePMD property tools

30351e9

[pre-commit.ci] auto fixes from pre-commit.com hooks

e9fe00f

for more information, see https://pre-commit.ci

[pre-commit.ci] auto fixes from pre-commit.com hooks

db05969

for more information, see https://pre-commit.ci

Merge pull request #1 from zhaiwenxi/add-property-tools

311a620

feat: add DeePMD property tools

Add SMILES coordinate generation for property tools

05479d4

[pre-commit.ci] auto fixes from pre-commit.com hooks

4445f1d

for more information, see https://pre-commit.ci

Merge branch 'deepmodeling:master' into master

9be45cd

Merge pull request #2 from zhaiwenxi/add-property-tools

d5df6fa

Add property tools

feat: add dpa_tools as self-contained subpackage (PR 1)

52033d7

feat: add dp dpa CLI subcommand group (Branch A)

3e0c3f9

feat: centralize deepmd API calls into _backend.py chokepoint (Branch B)

ffe609c

Merge branch-b-backend (_backend.py chokepoint)

beb7b42

fix: use yield fixture for contract test hook cleanup (prevents state…

ab024dc

… leak)

docs: add dpa_tools Python and CLI API reference

da3f26f

Merge pull request #3 from zirenjin/master

bb3c971

dpa_tools merge

feat: merge property_tools SMILES pipeline into dpa_tools

57f61bd

feat: auto-detect format in dp dpa data convert, unify SMILES+structu…

f61f0c2

…re paths

chore: remove deepmd_property_tools, migrate tests+data to dpa_tools

392a1a5

chore: rename DATA/ → demo/

871d600

docs: update README — add SMILES pipeline, auto_convert, demo data

8a8ec93

refactor: fold mft into fit --strategy mft, batch-convert into conver…

ae78fea

…t, unify --target-key

docs: update README for refactored CLI and API (mft→fit, batch-conver…

fbfb5a0

…t→convert)

feat: auto-download built-in pretrained models via resolve_pretrained…

5bb1b53

…_path

fix: update property_tools_tests CI after migration to dpa_tools

217868c

The workflow still referenced the deleted deepmd_property_tools/ directory. Updated paths trigger to deepmd/dpa_tools/** and test command to source/tests/dpa_tools/. Added torch to lightweight dependencies.

fix: pin numpy<2.2 in lightweight CI for Python 3.10 compat

3b1ed2c

numpy 2.3+ requires Python>=3.11, but the property_tools_tests workflow runs on Python 3.10. Pin numpy>=1.21,<2.2 to keep the lightweight dependency install working on older Python.

Merge pull request #4 from zirenjin/master

93b2c5d

refactor: unify dpa_tools CLI/API and merge deepmd_property_tools

zhaiwenxi and others added 7 commits June 19, 2026 19:52

Merge pull request #35 from zhaiwenxi/fix-utf8-test-detail-output

b9417f2

Fix unicode headers in dp test detail output

Merge branch 'master' into master

96a7bb7

Signed-off-by: zhaiwenxi <144502730+zhaiwenxi@users.noreply.github.com>

Merge pull request #31 from zirenjin/master

c9b59d7

fix: guard _sklearn._device assignment against None

ci: align build wheel workflow with upstream

4127b14

Merge pull request #36 from zhaiwenxi/sync-build-wheel-upstream

b186cdb

ci: align build wheel workflow with upstream

Merge branch 'deepmodeling:master' into master

cee0193

Merge branch 'deepmodeling:master' into master

5bd3e07

dosubot Bot added the new feature label Jun 22, 2026

github-actions Bot added Python Docs Examples labels Jun 22, 2026

[pre-commit.ci] auto fixes from pre-commit.com hooks

7111f67

for more information, see https://pre-commit.ci

github-advanced-security AI found potential problems Jun 22, 2026

njzjz-bot suggested changes Jun 22, 2026

njzjz-bot reviewed Jun 22, 2026

njzjz requested changes Jun 22, 2026
