
Augmented Random Search (ARS)

ARS is a random search method for training linear policies for continuous control problems, based on the paper "Simple random search provides a competitive approach to reinforcement learning."
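
At its core, ARS perturbs the weights of the linear policy along random directions, compares the rewards obtained along each direction and its negation, and then steps along a reward-weighted average of the best directions. Below is a minimal sketch of one update step in the notation of the paper (observation whitening, the "V2" variant, is omitted); the rollout callable and the default values are illustrative placeholders, not this repository's API:

import numpy as np

def ars_update(theta, rollout, step_size=0.02, n_directions=16,
               deltas_used=8, delta_std=0.03):
    # theta: weight matrix of the linear policy, shape (action_dim, obs_dim).
    # rollout: hypothetical callable mapping a weight matrix to the total
    # reward of one episode.
    deltas = [np.random.randn(*theta.shape) for _ in range(n_directions)]
    # Evaluate the policy at theta +/- delta_std * delta for every direction.
    r_pos = np.array([rollout(theta + delta_std * d) for d in deltas])
    r_neg = np.array([rollout(theta - delta_std * d) for d in deltas])
    # Keep only the deltas_used best directions, ranked by max(r+, r-).
    top = np.argsort(-np.maximum(r_pos, r_neg))[:deltas_used]
    # Normalize the step by the standard deviation of the rewards used.
    sigma_r = np.concatenate([r_pos[top], r_neg[top]]).std()
    step = sum((r_pos[k] - r_neg[k]) * deltas[k] for k in top)
    return theta + step_size / (deltas_used * sigma_r) * step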

Prerequisites for running ARS

Our ARS implementation relies on Python 3, OpenAI Gym version 0.9.3, mujoco-py 0.5.7, MuJoCo Pro version 1.31, and the Ray library for parallel computing.

To install OpenAI Gym and MuJoCo dependencies follow the instructions here: https://github.com/openai/gym
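
With MuJoCo Pro 1.31 already installed and licensed, the pinned Python packages can usually be installed directly with pip (availability of these exact versions is an assumption):

pip install gym==0.9.3 mujoco-py==0.5.7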

To install Ray execute:

pip install ray

For more information on Ray see http://ray.readthedocs.io/en/latest/.

Running ARS

First start Ray by executing a command of the following form:

ray start --head --redis-port=6379 --num-workers=18

This command starts multiple Python processes on one machine for parallel computations with Ray. Set "--num-workers=X" to parallelize ARS across X CPUs. For parallelizing ARS on a cluster, follow the instructions here: http://ray.readthedocs.io/en/latest/using-ray-on-a-large-cluster.html.
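
A driver script then attaches to this head node by pointing ray.init at the same Redis address. A minimal sketch using the Ray 0.x API (the address below assumes the head was started on the same machine with the command above):

import ray

# Connect this Python process to the already-running Ray head.
# "redis_address" is the Ray 0.x keyword; newer Ray versions use "address".
ray.init(redis_address="127.0.0.1:6379")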

We recommend using single-threaded linear algebra computations by setting:

export MKL_NUM_THREADS=1

To train a policy for HalfCheetah-v1, execute the following command:

python code/ars.py

All arguments passed into ARS are optional and can be modified to train other environments, use different hyperparameters, or use different random seeds. For example, to train a policy for Humanoid-v1, execute the following command:

python code/ars.py --env_name Humanoid-v1 --n_directions 230 --deltas_used 230 --step_size 0.02 --delta_std 0.0075 --n_workers 48 --shift 5
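
In the notation of the paper, --n_directions is the number N of sampled perturbation directions per iteration, --deltas_used is the number b of top-performing directions kept for the update, --step_size is the learning rate, and --delta_std is the scale of the perturbations. The --shift value is subtracted from each per-step reward; setting it to 5 offsets Humanoid-v1's constant survival bonus so the policy cannot profit from merely staying alive.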

Rendering Trained Policy

To render a trained policy, execute a command of the following form:

python code/run_policy.py trained_policies/env_name/policy_directory_path/policy_file_name.npz env_name --render

For example, to render Humanoid-v1 with a galloping gait execute:

python code/run_policy.py trained_policies/Humanoid-v1/policy_reward_11600/lin_policy_plus.npz Humanoid-v1 --render 
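
Rendering simply replays the saved linear policy in the environment. Below is a rough sketch of that loop, assuming the .npz archive stores the weight matrix together with the observation mean and standard deviation used for whitening (this file layout is an assumption, not the repository's documented format):

import gym
import numpy as np

# Load the saved policy; the layout [weights, obs_mean, obs_std] is assumed.
data = np.load("trained_policies/Humanoid-v1/policy_reward_11600/lin_policy_plus.npz")
M, mean, std = data[data.files[0]]

env = gym.make("Humanoid-v1")
obs = env.reset()
done = False
while not done:
    action = M.dot((obs - mean) / std)  # whitened linear policy
    obs, reward, done, _ = env.step(action)
    env.render()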
