Reported in #143
ACER assumes that all the parameters of a distribution (as defined by get_params_of_distribution) require grad, so that the algorithm can compute the gradient with respect to them.
pfrl/pfrl/agents/acer.py, lines 172 to 180 in 44bf2e4:

```python
def get_params_of_distribution(distrib):
    if isinstance(distrib, torch.distributions.Independent):
        return get_params_of_distribution(distrib.base_dist)
    elif isinstance(distrib, torch.distributions.Categorical):
        return (distrib._param,)
    elif isinstance(distrib, torch.distributions.Normal):
        return distrib.loc, distrib.scale
    else:
        raise NotImplementedError("{} is not supported by ACER".format(type(distrib)))
```

pfrl/pfrl/agents/acer.py, lines 218 to 221 in 44bf2e4:

```python
distrib_params = get_params_of_distribution(distrib)
for param in distrib_params:
    assert param.shape[0] == 1
    assert param.requires_grad
```
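For reference, a minimal sketch (not pfrl code; the tensor shapes are made up) of a distribution that satisfies this assumption, reusing the get_params_of_distribution function quoted above: both loc and scale come from tensors in the autograd graph, so the check passes.

```python
import torch

# Hypothetical example: a batched Normal whose loc and scale both require grad.
loc = torch.zeros(1, 3, requires_grad=True)
log_scale = torch.zeros(1, 3, requires_grad=True)
distrib = torch.distributions.Independent(
    torch.distributions.Normal(loc=loc, scale=log_scale.exp()), 1
)

for param in get_params_of_distribution(distrib):
    assert param.shape[0] == 1
    assert param.requires_grad  # holds: both loc and scale are in the graph
```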
However, when GaussianHeadWithFixedCovariance (pfrl/pfrl/policies/gaussian_policy.py, line 96 in 44bf2e4) is used, the scale parameter of the torch.distributions.Normal distribution does not require grad, resulting in an assertion error.
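Below is a minimal reproduction sketch. FixedCovarianceHead is a hypothetical stand-in that only mimics the relevant behavior of GaussianHeadWithFixedCovariance (the scale is a constant tensor rather than a learnable parameter), not the actual pfrl implementation.

```python
import torch
from torch import nn

class FixedCovarianceHead(nn.Module):
    """Hypothetical stand-in: returns a Normal whose scale is a constant."""

    def __init__(self, scale=0.1):
        super().__init__()
        self.scale = scale

    def forward(self, loc):
        # The scale tensor is created outside the autograd graph.
        return torch.distributions.Independent(
            torch.distributions.Normal(loc=loc, scale=torch.full_like(loc, self.scale)),
            1,
        )

loc = torch.zeros(1, 3, requires_grad=True)
distrib = FixedCovarianceHead()(loc)
print(distrib.base_dist.loc.requires_grad)    # True
print(distrib.base_dist.scale.requires_grad)  # False -> `assert param.requires_grad` fails
```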