xpag.agents.flax_agents.td3.td3.FlaxTD3

class FlaxTD3(observation_dim, action_dim, params=None)

Bases: Agent

JAX implementation of TD3 (https://arxiv.org/abs/1802.09477), without delayed policy updates. This version assumes that every action dimension is bounded between -1 and 1.
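Because the agent assumes actions in [-1, 1], an environment with different action bounds needs a rescaling step between the agent and the environment. The following is a minimal NumPy sketch of such a step, not part of xpag: the helper names `to_env_action` and `to_agent_action` and the bounds `low` and `high` are hypothetical and chosen only for illustration.

```python
import numpy as np


def to_env_action(agent_action, low, high):
    """Map an action in [-1, 1] to the environment's [low, high] range."""
    # Clip first so a slightly out-of-range action cannot exceed the bounds.
    agent_action = np.clip(agent_action, -1.0, 1.0)
    return low + 0.5 * (agent_action + 1.0) * (high - low)


def to_agent_action(env_action, low, high):
    """Inverse map: an environment action in [low, high] back to [-1, 1]."""
    return 2.0 * (env_action - low) / (high - low) - 1.0


# Hypothetical bounds for a 2-dimensional action space.
low = np.array([0.0, -2.0])
high = np.array([10.0, 2.0])

# Convert an agent action to environment units, then back again.
env_action = to_env_action(np.array([-1.0, 0.5]), low, high)
round_trip = to_agent_action(env_action, low, high)
```

The mapping is affine in each dimension, so it preserves the ordering of actions, and the inverse can be applied to actions stored in environment units (for example in a replay buffer) before they are passed to the agent for training.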

Methods

load

save

select_action

train_on_batch

value

write_config