Jye Sawtell-Rickson
1 min read · May 22, 2020

Can confirm this! I've tried training a couple of times with the above, and convergence seems significantly slower than advertised, with an average reward of around 50 at 50K episodes.

