Challenges Entered
Sample-efficient reinforcement learning in Minecraft
Latest submissions
Robots that learn to interact with the environment autonomously
Latest submissions
failed | 25312
failed | 25311
graded | 25310
A new benchmark for Artificial Intelligence (AI) research in Reinforcement Learning
Latest submissions
graded | 2943
graded | 2336
graded | 2171
NeurIPS 2019 - Robot open-Ended Autonomous Lear...
Have you ever successfully run 10M steps without resetting env?
About 5 years ago
I've been running the intrinsic phase locally without resetting the environment, since env reset is not allowed on the evaluation server. However, I found that my script dies without any error.
I observed memory usage increasing even when running RandomPolicy, so I assume there is a memory issue in the environment as the number of steps in one episode grows.
I also suspect this is what makes the evaluation process stall or even time out. (My RandomPolicy submission has been stuck around 2M steps for a few days now.)
Is anyone else facing a similar situation?
Or is this just my problem?
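One way to check whether memory really grows with the step count is to sample it at fixed intervals during a long run. The following is a minimal sketch using Python's standard `tracemalloc` module. `measure_memory_growth` and `leaky_step` are hypothetical names, and `leaky_step` is only a stand-in for a real environment step, not the real_robots API. Note that `tracemalloc` only sees allocations made through Python's allocator, so memory held by native code would not show up here.

```python
import tracemalloc


def measure_memory_growth(step_fn, n_steps, report_every):
    """Call step_fn n_steps times and record traced memory (bytes) at intervals."""
    tracemalloc.start()
    samples = []
    try:
        for step in range(1, n_steps + 1):
            step_fn(step)
            if step % report_every == 0:
                current, _peak = tracemalloc.get_traced_memory()
                samples.append((step, current))
    finally:
        tracemalloc.stop()
    return samples


if __name__ == "__main__":
    # Hypothetical stand-in for one environment step that keeps every observation.
    kept = []

    def leaky_step(step):
        kept.append(bytearray(1024))  # 1 KiB retained per step

    for step, used in measure_memory_growth(leaky_step, 10_000, 2_000):
        print(f"step {step}: {used / 1024:.0f} KiB traced")
```

If the traced memory keeps climbing at a steady rate per step, something in the Python code is retaining objects. If it stays flat while the process still grows, the growth is more likely outside of Python.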
Intrinsic phase timeout
About 5 years ago
Hi, I am using the latest real_robots package (0.1.16) and found that my evaluation has been stuck around 2M steps for a few hours. I made no changes to the policy in the starter kit (RandomPolicy), just to see how long it takes to evaluate a no-learning agent. I assume something in the environment is still slowing down the evaluation.
Could you investigate a little more?
Unity Obstacle Tower Challenge
Announcement: Debug your submissions
Over 5 years ago
Hi, I used debug mode to test my submission, then tried to turn it off to get an actual result. However, the submission still seems to run in debug mode even after I updated aicrowd.json, pushed to the repo, and created a tag.
Does it take time to reflect the debug flag?
What should I do?
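Before creating a tag, it can help to confirm what the file in the repository actually says. Below is a minimal sketch that reads the flag from aicrowd.json. It assumes the flag is stored under a top-level key named `debug`, which is an assumption and not confirmed in this thread. `read_debug_flag` is a hypothetical helper name.

```python
import json
from pathlib import Path


def read_debug_flag(repo_dir):
    """Return the value of the assumed "debug" key in aicrowd.json, or None if absent."""
    config_path = Path(repo_dir) / "aicrowd.json"
    config = json.loads(config_path.read_text(encoding="utf-8"))
    return config.get("debug")


if __name__ == "__main__":
    # Run from the root of the submission repository.
    print("debug flag:", read_debug_flag("."))
```

This only checks the working copy. Running `git show <tag>:aicrowd.json` shows the version of the file that the tag actually points to.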
Have you ever successfully run 10M steps without resetting env?
About 5 years ago
Ah, you're right. It was definitely because I was storing observations in my local code. (I should have noticed before asking this question.)
But I'm still wondering why my evaluation is stuck. The code in my GitLab repository at this point doesn't store anything (it is RandomPolicy as is). It seems that no submission has finished successfully yet (I see no entries for round 2).
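For anyone hitting the same local memory growth from stored observations, one possible fix is to cap how many are kept. This is a minimal sketch built on `collections.deque` with `maxlen`. `BoundedObservationBuffer` is a hypothetical class and not part of the starter kit, and the capacity is an arbitrary choice.

```python
from collections import deque


class BoundedObservationBuffer:
    """Keeps only the most recent `capacity` observations."""

    def __init__(self, capacity):
        self._items = deque(maxlen=capacity)

    def add(self, observation):
        # When the buffer is full, the oldest observation is dropped automatically.
        self._items.append(observation)

    def __len__(self):
        return len(self._items)

    def latest(self, n):
        """Return up to n of the most recent observations, oldest first."""
        if n <= 0:
            return []
        return list(self._items)[-n:]


if __name__ == "__main__":
    buffer = BoundedObservationBuffer(capacity=1000)
    for step in range(10_000):
        buffer.add({"step": step})  # placeholder for a real observation
    print(len(buffer))
```

With a fixed capacity, memory use from stored observations stays bounded no matter how many steps a single episode runs.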