We conduct real-world experiments in which target-domain data is collected with (a) a WidowX robot and source-domain data with (b) an Airbot, 100 trajectories each. We build three manipulation tasks: (1) picking up a red cup on a silver pan (Cup); (2) picking up a duck on a green plate (Duck); (3) moving a pot from right to left (Pot).

Figure 4: Comparison between the target and source domains. Top: the target and source domains differ substantially in embodiment and viewpoint; the top right shows snapshots from the base and wrist camera views during data collection in the target and source domains for the Cup, Duck, and Pot tasks. Bottom: experimental results. The average success rate on real-robot tasks with and without distractors is computed over 3 seeds.

Results on Real Robots

Figure: Real robot experimental results. Success rate is averaged over 10 episodes and 3 seeds.
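The averaging described in the caption can be sketched as follows. This is a minimal illustration, not the authors' evaluation script: the function name `average_success_rate`, the array layout (seeds by episodes), and the placeholder outcomes are all assumptions introduced here, and the values are not experimental results.

```python
import numpy as np


def average_success_rate(outcomes):
    """Average binary success outcomes over episodes, then over seeds.

    `outcomes` is assumed to have shape (num_seeds, num_episodes),
    with 1 for a successful episode and 0 for a failure.
    Returns (mean over seeds, standard deviation across seeds).
    """
    outcomes = np.asarray(outcomes, dtype=float)
    per_seed = outcomes.mean(axis=1)  # success rate of each seed
    return float(per_seed.mean()), float(per_seed.std())


# Hypothetical placeholder outcomes for 3 seeds x 10 episodes (not real data).
placeholder_outcomes = [
    [1, 1, 1, 1, 1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
]
mean_rate, seed_std = average_success_rate(placeholder_outcomes)
print(f"success rate: {mean_rate:.2f} (std across seeds: {seed_std:.2f})")
```

Averaging per seed first keeps the spread across seeds visible, which a single pooled mean over all 30 episodes would hide.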

Target + Edited Source (xTED)

Red cup on silver pan

Red cup on silver pan with distraction

Duck on green plate

Duck on green plate with distraction

Move pot

Move pot with distraction

Target

Red cup on silver pan

Red cup on silver pan with distraction

Duck on green plate

Duck on green plate with distraction

Move pot

Move pot with distraction

Target + Source

Red cup on silver pan

Red cup on silver pan with distraction

Duck on green plate

Duck on green plate with distraction

Move pot

Move pot with distraction

Replaying Edited and Original Source Trajectory Pairs

We select two pairs of edited and original source trajectories (indices [10] and [100] in the datasets) for each domain gap in the HalfCheetah environment.

Gravity Videos

1. HalfCheetah Original

1. HalfCheetah Edited

2. HalfCheetah Original

2. HalfCheetah Edited

Dynamics Error Distribution for Edited Source Data

Dynamics error distribution plots. The 'Tgt' results are evaluated on the mixture of the HalfCheetah MR, ME, and M (medium-replay, medium-expert, and medium) datasets from D4RL; the 'Src (Edited)' results are evaluated on the mixture of source data edited by diffusion models trained on all of the HalfCheetah MR, ME, and M datasets.
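The page does not state how the dynamics error is computed. One plausible reading, sketched below purely as an assumption, is that a dynamics model predicts the next state for each transition and the per-transition MSE and MAE against the recorded next state are then summarized as mean ± std. The function name `dynamics_errors` and its inputs are hypothetical.

```python
import numpy as np


def dynamics_errors(pred_next_states, true_next_states):
    """Summarize per-transition dynamics errors as mean and std.

    Both inputs are assumed to have shape (num_transitions, state_dim).
    ASSUMPTION: predictions come from some dynamics model; the source
    page does not specify which model or which error definition is used.
    """
    pred = np.asarray(pred_next_states, dtype=float)
    true = np.asarray(true_next_states, dtype=float)
    diff = pred - true
    mse = np.mean(diff ** 2, axis=1)     # one squared-error value per transition
    mae = np.mean(np.abs(diff), axis=1)  # one absolute-error value per transition
    return {
        "mse_mean": float(mse.mean()),
        "mse_std": float(mse.std()),
        "mae_mean": float(mae.mean()),
        "mae_std": float(mae.std()),
    }


# Toy inputs for illustration only (not data from the experiments).
stats = dynamics_errors([[0.0, 0.0], [2.0, 2.0]], [[0.0, 0.0], [0.0, 0.0]])
print(stats)
```

Keeping the per-transition errors as arrays before summarizing also gives the values needed to draw a distribution plot of the kind described above.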

Figure 1

Figure 2

Figure 3

Domain Gap	Data	MSE (Mean ± Std)	MAE (Mean ± Std)
Gravity	Tgt	0.58 ± 1.75	0.33 ± 0.21
Gravity	Src (Edited)	1.02 ± 1.81	0.45 ± 0.23
Gravity	Src	4.62 ± 4.53	1.01 ± 0.44
Thigh Size	Tgt	0.58 ± 1.75	0.33 ± 0.21
Thigh Size	Src (Edited)	1.18 ± 2.20	0.49 ± 0.29
Thigh Size	Src	3.88 ± 3.35	1.02 ± 0.46
Friction	Tgt	0.58 ± 1.75	0.33 ± 0.21
Friction	Src (Edited)	1.61 ± 2.20	0.61 ± 0.32
Friction	Src	5.54 ± 3.36	1.18 ± 0.37

Table 1: Dynamics errors (mean ± std) of the source data, edited source data, and target data.
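To make the gap between edited and unedited source data easier to read off, the short sketch below computes the relative reduction in mean MSE from 'Src' to 'Src (Edited)' using the mean values in Table 1. The dictionary layout and the helper `relative_reduction` are illustrative names introduced here, not part of the original work.

```python
# Mean MSE values copied from Table 1 (standard deviations omitted).
TABLE_1_MSE_MEAN = {
    "Gravity": {"Tgt": 0.58, "Src (Edited)": 1.02, "Src": 4.62},
    "Thigh Size": {"Tgt": 0.58, "Src (Edited)": 1.18, "Src": 3.88},
    "Friction": {"Tgt": 0.58, "Src (Edited)": 1.61, "Src": 5.54},
}


def relative_reduction(src_error, edited_error):
    """Fraction by which editing lowers the mean error relative to raw source data."""
    return (src_error - edited_error) / src_error


reductions = {
    gap: relative_reduction(row["Src"], row["Src (Edited)"])
    for gap, row in TABLE_1_MSE_MEAN.items()
}
for gap, value in reductions.items():
    print(f"{gap}: mean MSE reduced by {value:.1%}")
```

On these means, editing lowers the source-data MSE by roughly 70% to 78% across the three domain gaps, while the edited data still sits above the target data's 0.58.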

BibTeX


@inproceedings{anonymous2025xted,
  title={xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing},
  author={Authors, Anonymous},
  booktitle={Under Review}
}

xTED

Cross-Domain Adaptation via Diffusion-Based Trajectory Editing

Replaying Edited and Original Source Trajectory Pairs

Friction Videos

1. HalfCheetah Original

1. HalfCheetah Edited

2. HalfCheetah Original

2. HalfCheetah Edited

Thigh Videos

1. HalfCheetah Original

1. HalfCheetah Edited

2. HalfCheetah Original

2. HalfCheetah Edited
