Can confirm this!

Jye Sawtell-Rickson
May 22, 2020
Can confirm this! I've tried training a couple of times with the above, and convergence seems significantly slower than advertised: the average reward is only around 50 after 50K episodes.