order in which optimizer and lr_scheduler have to be called switched see: https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate