Data Generation

Data Generation#

This tutorial shows how to generate synthetic expert demonstration datasets using EmbodiChain’s built-in environment rollout and dataset manager. You will learn how to configure LeRobot recording in a gym config file (.json, .yaml, or .yml), how run_env.py builds an environment from configuration files, and how completed episodes are automatically saved to disk.

Overview#

EmbodiChain provides a built-in data generation workflow for imitation-learning and manipulation tasks:

Gym Configuration: Describes the scene, robot, sensors, randomization events, observations, dataset recorder, and rollout settings.
Action Configuration: Describes the task-specific expert action graph for tasks that use the action bank.
Environment Rollout: Builds the environment directly from configuration files and executes offline generation.
Expert Policy: Each task provides create_demo_action_list() or another scripted policy entry to generate expert actions.
Dataset Manager: Records observation-action pairs during env.step().
LeRobotRecorder: Converts completed episodes into LeRobot-compatible datasets, with optional video export.

What This Tutorial Records#

This page documents the full path from task configuration to saved dataset:

Prepare a task gym config (e.g. gym_config.json or gym_config.yaml).
Prepare an action config if the task uses the action bank (same supported extensions).
Launch the environment rollout with run-env.
Let the dataset manager automatically save completed episodes.

Example Task#

As a concrete example, this tutorial uses a real action-bank task shipped in the repository:

embodichain_tasks/configs/gym/pour_water/gym_config.json defines the simulation scene and dataset recording behavior (YAML equivalents such as embodichain_tasks/configs/gym/cobotmagic.yaml are also supported).
embodichain_tasks/configs/gym/pour_water/action_config.json defines the action-bank graph used to solve the task.

The Code#

The tutorial corresponds to the run_env.py script in embodichain/lab/scripts.

Code for run_env.py

# ----------------------------------------------------------------------------
# Copyright (c) 2021-2026 DexForce Technology Co., Ltd.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
# ----------------------------------------------------------------------------

import gymnasium
import numpy as np
import argparse
import os
import torch
import tqdm

from embodichain.lab.gym.utils.gym_utils import (
    add_env_launcher_args_to_parser,
    build_env_cfg_from_args,
)
from embodichain.lab.gym.utils.registration import (
    discover_task_packages,
    execute_init_hooks,
)
from embodichain.utils.logger import log_warning, log_info, log_error


def generate_and_execute_action_list(env, idx, debug_mode, **kwargs):

    action_list = env.get_wrapper_attr("create_demo_action_list")(
        action_sentence=idx, **kwargs
    )

    if action_list is None or len(action_list) == 0:
        log_warning("Action is invalid. Skip to next generation.")
        return False

    for action in tqdm.tqdm(
        action_list, desc=f"Executing action list #{idx}", unit="step"
    ):
        # Step the environment with the current action
        # The environment will automatically detect truncation based on action_length
        obs, reward, terminated, truncated, info = env.step(action)

    # TODO: We may assume in export demonstration rollout, there is no truncation from the env.
    # but truncation is useful to improve the generation efficiency.

    return True


def generate_function(
    env,
    num_traj,
    time_id: int = 0,
    save_path: str = "",
    save_video: bool = False,
    debug_mode: bool = False,
    **kwargs,
):
    """Generate and execute a sequence of actions in the environment.

    This function resets the environment, generates and executes action trajectories,
    collects data, and optionally saves videos of the episodes. It supports both online
    and offline data generation modes.

    Args:
        env: The environment instance.
        num_traj (int): Number of trajectories to generate per episode.
        time_id (int, optional): Identifier for the current time step or episode.
        save_path (str, optional): Path to save generated videos.
        save_video (bool, optional): Whether to save episode videos.
        debug_mode (bool, optional): Enable debug mode for visualization and logging.
        **kwargs: Additional keyword arguments for data generation.

    Returns:
        bool: True if data generation is successful, False otherwise.
    """

    valid = True
    _, _ = env.reset()
    while True:
        ret = []
        for trajectory_idx in range(num_traj):
            valid = generate_and_execute_action_list(
                env, trajectory_idx, debug_mode, **kwargs
            )

            if not valid:
                # Failed execution: reset without saving invalid data
                _, _ = env.reset(options={"save_data": False})
                break

        if valid:
            break
        else:
            log_warning("Reset valid flag to True.")
            valid = True

    return True


def main(args, env, gym_config):
    if getattr(args, "preview", False):
        log_info(
            "Preview mode enabled. Launching environment preview...", color="green"
        )
        preview(env)

    log_info("Start offline data generation.", color="green")
    # TODO: Support multiple trajectories per episode generation.
    num_traj = 1
    for i in range(gym_config.get("max_episodes", 1)):
        generate_function(
            env,
            num_traj,
            i,
            save_path=getattr(args, "save_path", ""),
            save_video=getattr(args, "save_video", False),
            debug_mode=getattr(args, "debug_mode", False),
            regenerate=getattr(args, "regenerate", False),
        )

    # Final reset.
    _, _ = env.reset()


def preview(env: gymnasium.Env) -> None:
    """
    Run the following code to create a demonstration and perform env steps.

    ```
    # Demo version of environment rollout
    for i in range(10):
        qpos = env.robot.get_qpos()

        obs, reward, terminated, truncated, info = env.step(qpos)

    # reset the environment
    env.reset()
    ```

    Run the following code to preview the sensor observations.

    ```
    env.preview_sensor_data("camera")
    ```
    """
    _, _ = env.reset()

    end = False
    while end is False:
        print("Press `p` to enter embed mode to interact with the environment.")
        print("Press `q` to quit the simulation.")
        txt = input()
        if txt == "p":
            try:
                from IPython import embed
            except ImportError:
                log_error(
                    "IPython is not installed. Preview mode requires IPython to be "
                    "available. Please install it with `pip install ipython` and try again."
                )
                continue

            embed()
        elif txt == "q":
            end = True

    exit(0)


def cli():
    """Command-line interface for environment runner.

    Parses CLI arguments, builds the environment config, and launches
    the data generation or preview workflow.
    """
    np.set_printoptions(5, suppress=True)
    torch.set_printoptions(precision=5, sci_mode=False)

    parser = argparse.ArgumentParser()

    add_env_launcher_args_to_parser(parser)

    args = parser.parse_args()

    # Step 1: Discover all task packages via entry_points
    discover_task_packages()

    # Step 2: Execute init hooks (register managers, asset resolvers, etc.)
    execute_init_hooks()

    env_cfg, gym_config, action_config = build_env_cfg_from_args(args)

    env = gymnasium.make(id=gym_config["id"], cfg=env_cfg, **action_config)

    main(args, env, gym_config)


if __name__ == "__main__":
    cli()

The Code Explained#

The rollout script builds the environment from configuration, generates expert trajectories, executes them step by step, and relies on the dataset manager to auto-save valid episodes.

Step 1: Prepare the Task Configuration#

The first input to the pipeline is the task gym config file. In the example below, the same file contains rollout settings, scene randomization, observations, dataset recording, and robot or sensor definitions.

The rollout settings include the episode count:

    "id": "PourWater-v3",
    "max_episodes": 5,
    "max_episode_steps": 300,

The dataset-related part looks like this:

                "quat": [0.15304635, 0.69034543, -0.69034543, -0.15304635]
            }
        },
        {
            "sensor_type": "Camera",
            "uid": "cam_left_wrist",
            "width": 640,
            "height": 480,
            "intrinsics": [488.1665344238281, 488.1665344238281, 322.7323303222656, 213.17434692382812],
            "extrinsics": {
                "parent": "left_link6",
                "pos": [-0.08, 0.0, 0.04],
                "quat": [0.15304635, 0.69034543, -0.69034543, -0.15304635]
            }
        }
    ],
    "light": {
        "direct": [
            {
                "uid": "light_1",
                "light_type": "point",

Important parameters are:

max_episodes: Number of rollout episodes generated by run_env.py.
max_episode_steps: Maximum number of environment steps per episode.
dataset.lerobot.params.robot_meta: Robot metadata such as robot type and control frequency.
dataset.lerobot.params.instruction: Task language instruction stored together with the dataset.
dataset.lerobot.params.extra: Additional metadata such as scene type and task description.
dataset.lerobot.params.use_videos: Whether camera observations should be stored as videos.
env.control_parts: Controlled robot parts in the environment.

In the current implementation, LeRobotRecorder stores robot state and action features following LeRobot official format: observation.state for joint positions, action for applied actions, and observation.images.{sensor_name} for camera images.

Step 2: Prepare the Action Configuration#

For tasks that use the action bank, the second input is action_config.json. This file defines the expert action graph consumed by create_demo_action_list(). In the example below, the file is organized around scope, node, edge, and sync.

Action bank structure in the example task Pour_Water

Scope Configuration

    "scope": {
        "right_arm": {
            "type": "DiGraph",
            "dim": [
                6
            ],
            "init": {
                "method": "current_qpos",
                "init_node_name": "right_arm_init_qpos"
            },
            "dtype": "float32"
        },
        "left_arm": {
            "type": "DiGraph",
            "dim": [
                6
            ],
            "init": {
                "method": "current_qpos",
                "init_node_name": "left_arm_init_qpos"
            },
            "dtype": "float32"
        },
        "left_eef": {
            "type": "DiGraph",
            "dim": [
                1
            ],
            "init": {
                "method": "given_qpos",
                "kwargs": {
                    "given_qpos": [
                        1
                    ]
                },
                "init_node_name": ""
            },
            "dtype": "float32"
        },
        "right_eef": {
            "type": "DiGraph",
            "dim": [
                1
            ],
            "init": {
                "method": "given_qpos",
                "kwargs": {
                    "given_qpos": [
                        1
                    ]
                },
                "init_node_name": ""
            },
            "dtype": "float32"
        }
    },

Node Configuration

                "bottle_grasp": {
                    "name": "generate_affordances_from_src",
                    "kwargs": {
                        "affordance_infos": [
                            {
                                "src_key": "bottle_pose",
                                "dst_key": "bottle_grasp_pose",
                                "valid_funcs_name_kwargs_proc": [
                                    {
                                        "name": "no_validation",
                                        "kwargs": {},
                                        "pass_processes": [
                                            {
                                                "name": "get_rotation_replaced_pose",
                                                "kwargs": {
                                                    "rotation_value": "env.affordance_datas['right_arm_aim_qpos'][0]",
                                                    "rot_axis": "z",
                                                    "mode": "intrinsic"
                                                }
                                            },
                                            {
                                                "name": "get_frame_changed_pose",
                                                "kwargs": {
                                                    "frame_change_matrix": "env.affordance_datas['bottle_pose']",
                                                    "mode": "intrinsic",
                                                    "inverse": true
                                                }
                                            },
                                            {
                                                "name": "get_frame_changed_pose",
                                                "kwargs": {
                                                    "frame_change_matrix": "env.affordance_datas['bottle_grasp_pose']",
                                                    "mode": "intrinsic"
                                                }
                                            }
                                        ]
                                    }
                                ]
                            },
                            {
                                "src_key": "bottle_grasp_pose",
                                "dst_key": "bottle_pre1_pose",
                                "valid_funcs_name_kwargs_proc": [
                                    {
                                        "name": "no_validation",
                                        "kwargs": {},
                                        "pass_processes": [
                                            {
                                                "name": "get_offset_pose", 
                                                "kwargs": {
                                                    "offset_value": -0.05,
                                                    "direction": "z",
                                                    "mode": "intrinsic"
                                                }
                                            }                                  
                                        ]
                                    }
                                ]
                            },
                            {
                                "src_key": "bottle_pre1_pose",
                                "dst_key": "bottle_pre2_pose",
                                "valid_funcs_name_kwargs_proc": [
                                    {
                                        "name": "no_validation",
                                        "kwargs": {},
                                        "pass_processes": [
                                            {
                                                "name": "get_offset_pose", 
                                                "kwargs": {
                                                    "offset_value": -0.05,
                                                    "direction": "z",
                                                    "mode": "intrinsic"
                                                }
                                            }                                  
                                        ]
                                    }
                                ]
                            }
                        ]
                    }
                }

Edge Configuration

                "up_to_move": {
                    "src": "bottle_up_qpos",
                    "sink": "pour_water_start_qpos",
                    "duration": 20,
                    "name": "plan_trajectory",
                    "kwargs": {
                        "agent_uid": "right_arm",
                        "keypose_names": [
                            "bottle_up_qpos",
                            "pour_water_start_qpos"
                        ]
                    }
                }
            },
            {
                "move_to_rotation": {
                    "src": "pour_water_start_qpos",
                    "sink": "bottle_rotation_qpos",
                    "duration": 24,
                    "name": "plan_trajectory",
                    "kwargs": {
                        "agent_uid": "right_arm",
                        "keypose_names": [
                            "pour_water_start_qpos",
                            "bottle_rotation_qpos"
                        ]
                    }
                }

Synchronization

    "sync": {
        "rclose0": {
            "depend_tasks": [
                "pre1_to_grasp"
            ]
        },
        "grasp_to_up": {
            "depend_tasks": [
                "rclose0"
            ]
        },
        "ropen0": {
            "depend_tasks": [
                "pre_place_back_to_place"
            ]
        },
        "place_back_to_init": {
            "depend_tasks": [
                "ropen0"
            ]
        },
        "left_arm_go_back": {
            "depend_tasks": [
                "ropen0"
            ]
        }
    },

This structure defines the expert rollout as follows:

Scope: Defines controllable sub-graphs such as right_arm, left_arm, right_eef, and left_eef.
Node: Defines key poses, targets computed from object affordances, and IK-generated joint targets.
Edge: Defines executable transitions between nodes, including duration and execution function.
Sync: Defines execution order rules between independently configured sub-actions.

Note: Action bank is not the only way to generate demonstrations. Depending on the task design, trajectories can also be produced by other scripted generation methods.

Step 3: Launch the Environment Rollout#

The rollout script parses command-line arguments, loads the gym and action config files, converts them into environment configuration objects, creates the environment instance, and then runs offline rollout for max_episodes episodes:

def cli():
    """Command-line interface for environment runner.

    Parses CLI arguments, builds the environment config, and launches
    the data generation or preview workflow.
    """
    np.set_printoptions(5, suppress=True)
    torch.set_printoptions(precision=5, sci_mode=False)

    parser = argparse.ArgumentParser()

    add_env_launcher_args_to_parser(parser)

    args = parser.parse_args()

    # Step 1: Discover all task packages via entry_points
    discover_task_packages()

    # Step 2: Execute init hooks (register managers, asset resolvers, etc.)
    execute_init_hooks()

    env_cfg, gym_config, action_config = build_env_cfg_from_args(args)

    env = gymnasium.make(id=gym_config["id"], cfg=env_cfg, **action_config)

    main(args, env, gym_config)

Each rollout internally calls create_demo_action_list(), validates the returned sequence, executes actions with env.step(action), and discards invalid rollouts by resetting with save_data=False.

The recommended CLI entrypoint is:

python -m embodichain run-env \
    --gym_config embodichain_tasks/configs/gym/pour_water/gym_config.json \
    --action_config embodichain_tasks/configs/gym/pour_water/action_config.json \
    --headless

For interactive inspection, you can use preview mode: replace --headless with --preview. When --preview is enabled, the script opens the environment in an interactive debugging mode. This mode is for inspection and does not save datasets.

Useful CLI arguments:

–gym_config: Path to the task config file (.json, .yaml, or .yml).
–action_config: Path to the action-bank config file (.json, .yaml, or .yml).
–num_envs: Number of environments to run in parallel.
–device: Simulation device, such as cpu or cuda.
–headless: Run without GUI for faster generation.
–enable_rt: Enable ray tracing for higher-quality visual observations.
–preview: Launch the environment in interactive preview mode.
–filter_dataset_saving: Disable dataset saving for debugging.

For the complete CLI argument list, see CLI Reference.

Outputs#

After successful execution, completed episodes are saved under the configured dataset root. A LeRobot dataset typically contains:

If no explicit save path is provided and EMBODICHAIN_DATASET_ROOT is not set, LeRobotRecorder uses ~/.cache/embodichain_datasets as the default dataset root.

data/: Recorded action and state data.
videos/: Camera observations saved as videos when use_videos=True.
meta/: Dataset metadata such as task information and robot description.

Dataset folders are automatically numbered, which makes it easy to run repeated generations without overwriting previous results.

In a practical workflow, the output of this stage is the synthesized dataset itself. Later training scripts typically consume these saved LeRobot episodes instead of regenerating trajectories each time.

Best Practices#

Keep the config pair together: Version gym and action configs together for action-bank tasks (either JSON or YAML).
Use valid scripted policies: Make sure create_demo_action_list() returns executable trajectories for the current scene.
Use ``–headless`` for throughput: Disable the GUI when generating large datasets.
Use ``–preview`` and ``–filter_dataset_saving`` for debugging: Inspect task logic without writing datasets.
Discard invalid rollouts: Keep the default validation logic so failed trajectories are not saved.