metarl.np.exploration_policies.exploration_policy module
Exploration Policy API used by off-policy algorithms.
-
class ExplorationPolicy(policy)
Bases: abc.ABC
Policy that wraps another policy to add action noise.
Parameters: policy (metarl.Policy) – Policy to wrap.
-
get_action(observation)
Return an action with noise.
Parameters: observation (np.ndarray) – Observation from the environment.
Returns: np.ndarray: An action with noise. dict: Arbitrary policy state information (agent_info).
Return type: tuple of (np.ndarray, dict)
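The `get_action` contract above can be illustrated with a minimal sketch. Nothing here imports metarl: the `ExplorationPolicy` base below is a local stand-in that only mirrors the documented interface, and `ZeroPolicy`, `GaussianNoisePolicy`, the `self.policy` attribute, and the parameters `sigma`, `low`, `high`, and `seed` are all hypothetical names chosen for illustration. Gaussian noise is just one possible choice of action noise.

```python
import abc

import numpy as np


class ExplorationPolicy(abc.ABC):
    """Local stand-in mirroring the documented interface (not the metarl class)."""

    def __init__(self, policy):
        # Assumption: the wrapped policy is kept on an attribute named `policy`.
        self.policy = policy

    @abc.abstractmethod
    def get_action(self, observation):
        """Return a tuple (action with noise, agent_info)."""


class ZeroPolicy:
    """Hypothetical wrapped policy: always proposes a zero action."""

    def __init__(self, action_dim):
        self.action_dim = action_dim

    def get_action(self, observation):
        return np.zeros(self.action_dim), {}


class GaussianNoisePolicy(ExplorationPolicy):
    """Hypothetical subclass: adds zero-mean Gaussian noise, then clips."""

    def __init__(self, policy, sigma=0.1, low=-1.0, high=1.0, seed=0):
        super().__init__(policy)
        self.sigma = sigma
        self.low = low
        self.high = high
        self.rng = np.random.default_rng(seed)

    def get_action(self, observation):
        # Ask the wrapped policy first, then perturb its action.
        action, agent_info = self.policy.get_action(observation)
        noise = self.rng.normal(0.0, self.sigma, size=action.shape)
        return np.clip(action + noise, self.low, self.high), agent_info


explorer = GaussianNoisePolicy(ZeroPolicy(action_dim=2), sigma=0.1)
action, agent_info = explorer.get_action(np.array([0.5, -0.5, 0.0]))
```

The wrapper leaves `agent_info` untouched and only perturbs the action, so code that consumes the wrapped policy's output can keep unpacking the same two-element tuple.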
-
get_actions(observations)
Return actions with noise.
Parameters: observations (np.ndarray) – Observations from the environment.
Returns: np.ndarray: Actions with noise. List[dict]: Arbitrary policy state information (agent_info).
Return type: tuple of (np.ndarray, List[dict])
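For the batched call, a rough sketch of the same idea follows. It assumes that observations are stacked along the first axis and that the wrapped policy returns one action row and one info dict per observation; `LinearPolicy`, `BatchedNoisePolicy`, `sigma`, and `seed` are hypothetical names, not part of metarl.

```python
import numpy as np


class LinearPolicy:
    """Hypothetical wrapped policy: actions = observations @ weights."""

    def __init__(self, weights):
        self.weights = weights

    def get_actions(self, observations):
        actions = observations @ self.weights
        agent_infos = [{} for _ in range(len(observations))]
        return actions, agent_infos


class BatchedNoisePolicy:
    """Hypothetical wrapper following the documented get_actions contract."""

    def __init__(self, policy, sigma=0.2, seed=0):
        self.policy = policy
        self.sigma = sigma
        self.rng = np.random.default_rng(seed)

    def get_actions(self, observations):
        actions, agent_infos = self.policy.get_actions(observations)
        # Draw independent noise for every action in the batch.
        noise = self.rng.normal(0.0, self.sigma, size=actions.shape)
        return actions + noise, agent_infos


observations = np.ones((4, 3))      # batch of 4 observations, 3 features each
weights = np.full((3, 2), 0.5)      # maps 3 features to 2 action dimensions
explorer = BatchedNoisePolicy(LinearPolicy(weights))
actions, agent_infos = explorer.get_actions(observations)
```

Sampling the noise with `size=actions.shape` gives each row its own perturbation, rather than shifting the whole batch by one shared noise vector.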
-
get_param_values()
Get parameter values.
Returns: Values of each parameter.
Return type: list or dict
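One plausible way for a wrapper to satisfy `get_param_values` is to forward the call to the wrapped policy, since the wrapper itself adds only noise. That delegation is an assumption and is not stated in this reference; `TinyPolicy` and `NoiseWrapper` below are hypothetical stand-ins.

```python
import numpy as np


class TinyPolicy:
    """Hypothetical wrapped policy holding two named parameter arrays."""

    def __init__(self):
        self._params = {"weights": np.zeros((3, 2)), "bias": np.zeros(2)}

    def get_param_values(self):
        # Return copies so callers cannot mutate the policy by accident.
        return {name: value.copy() for name, value in self._params.items()}


class NoiseWrapper:
    """Hypothetical wrapper: forwards parameter access to the wrapped policy."""

    def __init__(self, policy):
        self.policy = policy

    def get_param_values(self):
        # Assumption: the wrapper has no trainable parameters of its own.
        return self.policy.get_param_values()


params = NoiseWrapper(TinyPolicy()).get_param_values()
```

Here the result is a dict keyed by parameter name, which matches the documented "list or dict" return type.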
-