Physical Intelligence: Science and Systems — University of Pennsylvania
This project explores end-to-end autonomous drone racing using Proximal Policy Optimization (PPO) in NVIDIA Isaac Sim. The goal: train a quadrotor agent to navigate through complex racing gates at speed while avoiding obstacles, without any hand-crafted trajectory planning.
Key design decisions:
The result demonstrates that deep reinforcement learning can produce high-speed, obstacle-aware flight policies without relying on privileged state information.