Challenges Entered
IIT-M, Reinforcement Learning, Value Iteration
Latest submissions
graded | 126128
graded | 126047
submitted | 126035
Reinforcement Learning, IIT-M, assignment 1
Latest submissions
graded | 126680
graded | 126656
failed | 126651
Latest submissions
graded | 132291
graded | 132094
graded | 132092
Latest submissions
graded | 143163
graded | 143162
graded | 143160
Latest submissions
graded | 158652
graded | 158651
graded | 158650
Train your RL agents
Latest submissions
IITM RL Final Project-b5d2e6
Mismatch in limits of observation space for CARTPOLE and MOUNTAINCAR
Over 3 years ago
Hi, when using "env.observation_space.high" and "env.observation_space.low" for the two environments mentioned above, every element is reported as +\infty and -\infty respectively.
Does this mean that we should hardcode the limits (from the given table) for the two environments, or will this issue be corrected? Please let us know!
In addition, when sampling the next state through env.step(), the value of theta_dot exceeds 5.
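If the limits do have to be hardcoded, one possible workaround is to clip each observation to hand-specified bounds before discretizing it, so that values outside the table (such as a large theta_dot) still map to a valid bin. The sketch below is only an illustration: the function name discretize and the example bounds and bin counts are hypothetical and are not taken from the assignment table.

```python
import numpy as np

def discretize(obs, low, high, bins):
    """Map a continuous observation to a tuple of bin indices.

    low, high and bins are hand-specified (hypothetical) arrays with one
    entry per observation dimension; they replace the infinite bounds
    reported by the environment.
    """
    low = np.asarray(low, dtype=float)
    high = np.asarray(high, dtype=float)
    bins = np.asarray(bins, dtype=int)
    # Clip first so out-of-range values fall into the edge bins.
    clipped = np.clip(np.asarray(obs, dtype=float), low, high)
    ratios = (clipped - low) / (high - low)
    idx = (ratios * bins).astype(int)
    # A value exactly at the upper bound would index one past the last bin.
    idx = np.minimum(idx, bins - 1)
    return tuple(int(i) for i in idx)

# Placeholder bounds for a two-dimensional observation (not the real table).
low = [-1.0, -5.0]
high = [1.0, 5.0]
bins = [4, 10]

print(discretize([0.0, 7.0], low, high, bins))  # (2, 9): 7.0 is clipped to 5.0
```

Clipping keeps the state index valid, but it also hides how far the true value lies outside the stated range, so it may be worth logging such cases while debugging.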
IIT-M RL-ASSIGNMENT-2-TAXI
Unable to submit after 20th April
Over 3 years ago
Hi, it looks like many of us are unable to make a submission after the 20th. The error says "Error in notifying rails about the uploaded file". Please look into it, as the deadline has been pushed to the 24th.
RL-Taxi
Any questions about this assignment? Ask them here!
Over 3 years ago
Yes, I understand. However, I was trying to narrow down where the "inference failed" error was coming from. It appears when I call the function dp_solve with a modified class object.
Any questions about this assignment? Ask them here!
Over 3 years ago
Hi, there is an issue with the "inference failed" error. If I call the function dp_solve(env2) on another variation of the env object (for example, after changing the policy), it results in an "inference failed" error. Please look into this so it can be fixed. Also, if possible, please increase the number of submissions allowed per day, because most of my submissions were used up on debugging.
RL-VI
Reducing tolerance for value grid, please resubmit your code
Over 3 years ago
I'm having the same issue. Please let us know what the scoring criteria are.
Any questions about the assignment? Ask them here!
Over 3 years ago
OK, will make sure. Also, when you said earlier that the tolerance was 1e-8, did you mean the matrix norm ( \| J_{i+1} - J_i \| ) or the max abs ( \max_{s \in S} |J_{i+1}(s) - J_i(s)| )? Thank you!
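The two criteria in this question can give different stopping decisions for the same pair of value grids. The following sketch compares them on a toy example; the helper names sup_norm_gap and frobenius_gap, the 3x3 grid, and the 1.5e-8 threshold are illustrative assumptions, and "matrix norm" is taken here to mean the Frobenius norm, which may not be what the grader uses.

```python
import numpy as np

def sup_norm_gap(J_next, J_prev):
    """Largest absolute change over all states: max_s |J_next(s) - J_prev(s)|."""
    return float(np.max(np.abs(J_next - J_prev)))

def frobenius_gap(J_next, J_prev):
    """Frobenius norm of the difference (NumPy's default norm for 2-D arrays)."""
    return float(np.linalg.norm(J_next - J_prev))

# Toy value grids: every state changes by exactly 1e-8.
J_prev = np.zeros((3, 3))
J_next = np.full((3, 3), 1e-8)

tol = 1.5e-8  # hypothetical threshold chosen to separate the two criteria
print(sup_norm_gap(J_next, J_prev) <= tol)   # True: the largest change is 1e-8
print(frobenius_gap(J_next, J_prev) <= tol)  # False: the norm is about 3e-8
```

Because the Frobenius norm sums squared changes over every state, it grows with grid size, so the same tolerance is stricter under that criterion than under the max-abs one.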
Any questions about the assignment? Ask them here!
Over 3 years ago
Hi, the env params loaded in the notebook don't seem to match the grid world 1 figure. The coordinates in the notebook are transposed: for instance, env.brown_out = (9,7) (i.e. 10th row, 8th column visually) should instead be (7,9) to match figure 1.
Although this doesn't affect the code, it may be an issue in Task 5, where we need to compare and contrast these 2 grid worlds. Please clarify this discrepancy, thank you!
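To make the transposition concrete, here is a small sketch of how a (row, column) pair and its swapped version land on different cells of a NumPy grid. The 10x10 grid size and the mark helper are hypothetical and only serve the illustration; brown_out is used as a plain variable, not the notebook's env attribute.

```python
import numpy as np

def mark(grid, cell, value=1):
    """Set grid[row, col] for a (row, col) pair, using NumPy's row-first indexing."""
    row, col = cell
    grid[row, col] = value
    return grid

# Hypothetical 10x10 grid world.
brown_out = (9, 7)  # value reported in the notebook
grid = mark(np.zeros((10, 10), dtype=int), brown_out)

print(grid[9, 7])    # 1: 10th row, 8th column
print(grid[7, 9])    # 0: the cell the figure would suggest is untouched
print(grid.T[7, 9])  # 1: the transposed grid matches the figure's convention
```

If the figure uses (x, y) while the notebook uses (row, column), transposing one of the two grids before comparing them would reconcile the conventions.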
Notebooks
- Solution for submission 132291: A detailed solution for submission 132291 submitted for challenge IIT-M RL-ASSIGNMENT-2-GRIDWORLD. Mizhaan · Over 3 years ago
- Solution for submission 132094: A detailed solution for submission 132094 submitted for challenge IIT-M RL-ASSIGNMENT-2-GRIDWORLD. Mizhaan · Over 3 years ago
- Solution for submission 132092: A detailed solution for submission 132092 submitted for challenge IIT-M RL-ASSIGNMENT-2-TAXI. Mizhaan · Over 3 years ago
- Solution for submission 131091: A detailed solution for submission 131091 submitted for challenge IIT-M RL-ASSIGNMENT-2-GRIDWORLD. Mizhaan · Over 3 years ago
- Solution for submission 131781: A detailed solution for submission 131781 submitted for challenge IIT-M RL-ASSIGNMENT-2-GRIDWORLD. Mizhaan · Over 3 years ago
- Solution for submission 131082: A detailed solution for submission 131082 submitted for challenge IIT-M RL-ASSIGNMENT-2-GRIDWORLD. Mizhaan · Over 3 years ago
- Solution for submission 129422: A detailed solution for submission 129422 submitted for challenge IIT-M RL-ASSIGNMENT-2-TAXI. Mizhaan · Over 3 years ago
- Solution for submission 129406: A detailed solution for submission 129406 submitted for challenge IIT-M RL-ASSIGNMENT-2-GRIDWORLD. Mizhaan · Over 3 years ago
- Solution for submission 129423: A detailed solution for submission 129423 submitted for challenge IIT-M RL-ASSIGNMENT-2-TAXI. Mizhaan · Over 3 years ago
- Solution for submission 129403: A detailed solution for submission 129403 submitted for challenge IIT-M RL-ASSIGNMENT-2-TAXI. Mizhaan · Over 3 years ago
Submission limit
Over 3 years ago
Another reason why the submission limit should be increased (maybe to 10-20) is errors during submission, since "inference failed" errors still count as a submission.