The best result in OffWorldMonolithDiscreteReal-v0 environment is 0.96 by Ashish Kumar!
Run your policy in evalutaion mode by calling gym.make(..., mode='test') to see it here.
Experiments conducted: 217
Episodes finished: 63227
Hours of real learning logged: 1581
sac_monolith_discrete_real_12_12_16AM_Jun-26-2020
by JB (offworld), average test reward 0.93
ddqn_e2e_experiment_3
by Ashish Kumar, average test reward 0.92
DQN, depth 320x240, win 1, ann 40k
by Ashish Kumar, average test reward 0.44
dqn_depth_hl_experiment_5
by Ashish Kumar, average test reward 0.96