Skip to content

Reinforcement Learning Training Intermediate In Progress

Activity 5 of 5  ยท  Intermediate

โฑ Up to 30 minutes ๐Ÿง  RL parameters ๐Ÿ“ˆ Live reward graph Route: /activities/reinforcement-learning

This activity is not yet available

Reinforcement Learning Training is still being built. The page is accessible but the activity is not fully functional yet. Full documentation will be added once the activity is ready.


What This Activity Will Cover

When complete, this activity will let you:

  • Configure a reinforcement learning agent with your own learning rate, reward weight, and episode count
  • Launch a training job and watch the drone learn to fly episode by episode
  • Read a live reward graph to track whether the agent is improving
  • Hit a high score and appear on the leaderboard

How RL Works (Background)

Even though the activity is not ready yet, here is the concept so you understand what is coming.

The drone tries an action, receives a reward score from the environment, and updates its strategy based on that score. Over many repetitions it learns which actions lead to higher rewards.

Agent takes action
        |
        v
Environment gives reward (positive = good, negative = bad)
        |
        v
Agent updates its strategy
        |
        v
Repeat for the next episode

The goal is to choose training parameters that help the agent learn quickly and reliably.


Parameters You Will Configure

Parameter What it controls
Learning Rate How much the agent updates its strategy after each episode. Too high = unstable. Too low = slow. Recommended starting value: 0.001
Reward Weight Scales how strongly the agent responds to the reward signal. Recommended starting value: 1.0
Max Episodes How many episodes to run before training stops automatically. A standard run uses 100 episodes

Badges You Can Earn

Badge Requirement
๐Ÿง  First RL Success Complete any RL Training run
๐ŸŽ“ RL Master Score 950 or higher

Full step-by-step instructions and screenshots will be added here once the activity is complete.


Back to All Activities