General Motors (GM) interview question

Derive policy gradient algorithm on the board