pytorchrl.envs.mujoco package
Submodules
pytorchrl.envs.mujoco.mujoco_env_factory module
- pytorchrl.envs.mujoco.mujoco_env_factory.mujoco_test_env_factory(env_id, index_col_worker, index_grad_worker, index_env=0, seed=0, frame_skip=0, frame_stack=1, reward_delay=1)[source]
Create test MuJoCo environment.
- Parameters
env_id (str) – Environment name.
index_col_worker (int) – Index of the collection worker running this environment.
index_grad_worker (int) – Index of the gradient worker running the collection worker running this environment.
index_env (int) – Index of this environment withing the vector of environments.
seed (int) – Environment random seed.
frame_skip (int) – Return only every frame_skip-th observation.
frame_stack (int) – Observations composed of last frame_stack frames stacked.
reward_delay (int) – Only return accumulated reward every reward_delay steps to simulate sparse reward environment.
- Returns
env – Test environment.
- Return type
gym.Env
- pytorchrl.envs.mujoco.mujoco_env_factory.mujoco_train_env_factory(env_id, index_col_worker, index_grad_worker, index_env=0, seed=0, frame_skip=0, frame_stack=1, reward_delay=1)[source]
Create train MuJoCo environment.
- Parameters
env_id (str) – Environment name.
index_col_worker (int) – Index of the collection worker running this environment.
index_grad_worker (int) – Index of the gradient worker running the collection worker running this environment.
index_env (int) – Index of this environment withing the vector of environments.
seed (int) – Environment random seed.
frame_skip (int) – Return only every frame_skip-th observation.
frame_stack (int) – Observations composed of last frame_stack frames stacked.
reward_delay (int) – Only return accumulated reward every reward_delay steps to simulate sparse reward environment.
- Returns
env – Train environment.
- Return type
gym.Env