Clipping is currently applied after the optimizer step. The gradient-clipping code controlled by the `max-grad-norm` CLI parameter should run between the `backward()` and `step()` calls.
Node Prediction Code
```python
self.optimizer.zero_grad()

loss.backward()
rt_profiler.record('train_backward')

self.optimizer.step()
rt_profiler.record('train_step')

if max_grad_norm is not None:
    th.nn.utils.clip_grad_norm_(model.parameters(), max_grad_norm, grad_norm_type)
```
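For reference, a minimal standalone sketch of the corrected ordering: clip gradients after `backward()` but before `step()`, so the optimizer consumes the clipped gradients. The toy model, loss, and hyperparameter values here are illustrative stand-ins, not GraphStorm's actual trainer code.

```python
import torch as th

# Illustrative stand-ins for the trainer's model/optimizer and CLI values.
model = th.nn.Linear(4, 2)
optimizer = th.optim.SGD(model.parameters(), lr=0.1)
max_grad_norm, grad_norm_type = 1.0, 2.0

x = th.randn(8, 4)
loss = model(x).sum()

optimizer.zero_grad()
loss.backward()
if max_grad_norm is not None:
    # Clip BEFORE the optimizer step, so step() sees the clipped gradients.
    th.nn.utils.clip_grad_norm_(model.parameters(), max_grad_norm, grad_norm_type)
optimizer.step()
```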
graphstorm/python/graphstorm/trainer/np_trainer.py, lines 214 to 221 in f3a0636