Use a black-box optimizer to pretrain

We will check that the training loss decreases, but we won’t train until convergence.
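
As a rough illustration of this kind of check, here is a minimal sketch that runs a black-box optimizer for a fixed number of steps and compares the first and last loss values. It assumes a simple (1+1) random search as a stand-in for whichever optimizer is actually used, and the toy linear model, the data, and the names `loss` and `random_search` are all hypothetical.

```python
import random


def loss(w, data):
    """Mean squared error of the toy linear model y = w[0] * x + w[1]."""
    return sum((w[0] * x + w[1] - y) ** 2 for x, y in data) / len(data)


def random_search(loss_fn, w0, steps=200, sigma=0.1, seed=0):
    """(1+1) random search: a stand-in black-box optimizer (assumption).

    Only loss values are used, never gradients. A Gaussian perturbation of
    the current parameters is kept only if it lowers the loss.
    """
    rng = random.Random(seed)
    w, best = list(w0), loss_fn(w0)
    history = [best]  # best loss seen so far, recorded once per step
    for _ in range(steps):
        candidate = [wi + rng.gauss(0.0, sigma) for wi in w]
        candidate_loss = loss_fn(candidate)
        if candidate_loss < best:
            w, best = candidate, candidate_loss
        history.append(best)
    return w, history


# Hypothetical toy data drawn from y = 2x - 1.
data = [(x / 10.0, 2.0 * (x / 10.0) - 1.0) for x in range(-10, 11)]

# Run a small, fixed budget of steps: enough to see the loss go down,
# not enough to claim convergence.
w, history = random_search(lambda params: loss(params, data), [0.0, 0.0])
print(f"loss: {history[0]:.4f} -> {history[-1]:.4f}")
```

Because this particular sketch only accepts improving candidates, the recorded loss can never increase. With a different black-box optimizer the curve may be noisier, so comparing an average over the first and last few steps may be a more robust check.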