xpag.agents.rljax_agents.algorithm.tqc.TQC#
- class TQC(num_agent_steps, observation_dim, action_dim, seed, max_grad_norm=None, gamma=0.99, nstep=1, num_critics=5, buffer_size=1000000, use_per=False, batch_size=256, start_steps=10000, update_interval=1, tau=0.005, fn_actor=None, fn_critic=None, lr_actor=0.0003, lr_critic=0.0003, lr_alpha=0.0003, units_actor=(256, 256), units_critic=(512, 512, 512), log_std_min=-20.0, log_std_max=2.0, d2rl=False, num_quantiles=25, num_quantiles_to_drop=0)#
Bases:
SACMethods
calculate_value- rtype:
Array
exploreget_key_listget_maskis_updateload_paramssave_paramsselect_actionstepupdateAttributes
kwargs_actorkwargs_criticname