metarl.np.algos.meta_rl_algorithm module¶
Interface of Meta-RL ALgorithms.
-
class
MetaRLAlgorithm[source]¶ Bases:
metarl.np.algos.rl_algorithm.RLAlgorithm,abc.ABCBase class for Meta-RL Algorithms.
-
adapt_policy(exploration_policy, exploration_trajectories)[source]¶ Produce a policy adapted for a task.
Parameters: - exploration_policy (metarl.Policy) – A policy which was returned from get_exploration_policy(), and which generated exploration_trajectories by interacting with an environment. The caller may not use this object after passing it into this method.
- exploration_trajectories (metarl.TrajectoryBatch) – Trajectories to adapt to, generated by exploration_policy exploring the environment.
Returns: - A policy adapted to the task represented by the
exploration_trajectories.
Return type: metarl.Policy
-