Shipping autonomous systems from simulation to the real world — production fleet deployment (200+ AMRs), real-vehicle autonomy, and sim-to-real RL for humanoid robots. MEng in Autonomy & Robotics at UIUC.
sunnydeshpande9900@gmail.comReal-world deployments, novel algorithms, and sim-to-real systems across humanoids, AVs, and mobile robots. Featured projects showcase the deepest work.

Replaced separate EfficientNet + BERT encoders with a frozen unified CLIP backbone (128-dim adapter projection) in the HumanVLA teacher–student pipeline, resolving catastrophic forgetting observed when fine-tuning visual encoders. Behavior Cloning + DAgger distillation across 615 episodes in 4 HITR room types; 62.1% success on unseen tasks (+1.9% over baseline, −4.4% placement error).

Modular ROS 2 autonomy stack on the UIUC Polaris GEM e4: Ouster OS1-128 LiDAR → DBSCAN clustering + EMA tracking fused with OAK-D LR RGB-D via YOLOv11 detection (weighted 0.8/0.2 distance, 0.3/0.7 bearing). Forward-simulated TTC drives a 3-state safety FSM; Stanley lateral + PID longitudinal control with PACMod2 hard-brake override. 91% field success.

Two-tier PPO hierarchy for dual Unitree G1 humanoids in IsaacLab: low-level [256,256] MLP drives 29-DOF joint-position-delta actions through a 7-phase standing→omnidirectional curriculum (30M steps, 256 envs); high-level [64,64] navigation policy converges agents from 10 m to 0.5 m with 95% success. 25+ shaped reward terms enforce gait symmetry, contact penalties, and energy regularization.

Open-vocabulary 6D tracking extending FoundationPose with Moondream2 VLM scene analysis, SAM-3 text-prompted segmentation, and hierarchical mesh acquisition (GT → Objaverse-XL retrieval → TripoSR generation). Composite scoring (IoU + Depth + Silhouette) selects best proxy; achieves 100% ADD-S AUC on texture-rich objects and supports language-driven dynamic target switching mid-task on YCB-Video.

Full-stack embodied AI on physical Booster K1 Humanoid: Gemini VLM semantic goal planning with ROS2 Nav2 autonomous navigation and Snap AR Spectacles AR collaboration.

Led 7-person team. Kalman-filtered admittance controller maintaining <150 N brush-to-glass contact. Full ROS2 stack on Jetson Orin Nano with SMACH FSM. Validated in live Changi Airport deployment.

Novel PP variant incorporating bicycle-model lateral error decomposition (cross-track slippage + heading error) into the control law. DIAPP adds a dynamics correction term c_d·e_lat,total to the steering output; RDIAPP extends this with RPP-style curvature-regulated velocity. At 70 km/h on an S-track, RDIAPP achieves 1.79 m max CTE vs. PP's 30.26 m (diverges); on a U-track, 2.49 m RMS CTE vs. 31.6 m for PP.
Autonomous lane-following on Jetson Xavier + ZED2: dual-filter perception (Gaussian-weighted adaptive grayscale + HLS color space) with bitwise AND fusion targeting 8% pixel density, vanishing-point radial-scan histogram rejecting racetrack markings, and sliding-window 2nd-order polynomial fit. Custom proportionate lateral controller with distance-weighted steering (f(c_y) = C/(c_y − h_y + ε)). Completed full 12-min autonomous laps across variable lighting conditions.

Autonomous unknown-maze exploration on TurtleBot3 (LattePanda): frontier-based exploration with DFS cluster grouping and centroid-based goal selection, A* costmap planner with B-spline path smoothing, Pure Pursuit trajectory follower, and fine-tuned YOLOv11n with a novel spatial deduplication tracker projecting bounding-box centroids into the map frame via LiDAR range lookup for unique instance counting.

Software Lead. Cascaded PID for 6-DOF underwater stability with IMU/depth fusion via state-space control. Real-time CV for autonomous target acquisition. SAUVC 2023 Finalist.

Systematic evaluation of 7 monocular and stereo VO algorithms (DSO, SVO, CNN-SVO, DF-VO, TartanVO, ORB-SLAM3, DROID-SLAM + our heuristic variant) across 3 geographically-diverse rain datasets (Oxford RobotCar, 4Seasons Munich, Singapore heavy rain) using Absolute Trajectory Error. DF-VO performed best for monocular <500 m; our DROID-SLAM + CGRP + Heuristic approach achieved the lowest stereo ATE for long-range localization. Published at IEEE CASE 2023.

CPG-based motion controller for soft batoid robot with sinusoidal propulsion and strain gauge obstacle detection.

CNN-RNN model predicting Cat 1 Lightning Risk from weather station features: rainfall, wind speed, temperature, humidity, wind direction.

Contextual transformer embeddings + sequential modeling. ~96% accuracy outperforming classical NLP baselines.

Hidden Markov Models with first- and second-order Viterbi decoding for entity recognition and sentiment tagging.

Extracted and benchmarked 9 nonlinear feature families (MMSE, RQA, DFA, entropies, FD, Hjorth, Hurst, LLE, LZC) from 7,500 focal/non-focal EEG signals (Bern-Barcelona database). LS-SVM (polynomial-3) with 10-fold CV achieved 87.93% accuracy, 89.97% sensitivity. MMSE ranked highest by t-test significance. Published in FGCS, Elsevier.

Low-cost electromagnetic leak detector with Bluetooth connectivity using EM conduction principles for non-invasive detection.

Custom PCB with motor control, dynamic air-cooling, slope-based actuation, and anti-roll basket mechanism.

Low-cost quadcopter with off-the-shelf components. Integrated propulsion, power distribution, and flight electronics for stable teleoperation.

Angsana seed-inspired autorotating craft with single-flap motor trajectory control and aerodynamic calibration.

8-bar lifting mechanism with novel intake for cube frame collection. Won Best Design Award at Asia-Pacific Robotics Championship.
Production deployments, real-vehicle autonomy, and simulation at scale — from 200+ AMR fleets to port-side AV trials.


Full-stack robotics — from perception and planning to learning and deployment.
Spoken Languages: English (Fluent) · Hindi (Fluent) · Marathi (Fluent)
A focused academic path through engineering, robotics, and CS.




Peer-reviewed contributions to robotics and biomedical AI.
Comprehensive evaluation of 7 VO algorithms (DSO, SVO, CNN-SVO, DF-VO, TartanVO, ORB-SLAM3, DROID-SLAM) on monocular and stereo setups across Oxford RobotCar, 4Seasons (Munich), and Singapore heavy-rain datasets. Proposed DROID-SLAM + CGRP + Heuristic variant achieving lowest stereo ATE for long-range rain localization; DF-VO identified as best monocular approach for <500 m.
Download PaperFirst systematic comparison of 9 nonlinear feature families (52 features, all p < 0.01) for focal vs. non-focal EEG classification on the full 7,500-signal Bern-Barcelona database. LS-SVM (polynomial-3, 10-fold CV) achieved 87.93% accuracy / 89.97% sensitivity. MMSE identified as top-ranked discriminator; proposed recurrence, bispectrum, and cumulant plots for visual class separation. DOI: 10.1016/j.future.2018.08.044
Download PaperTailored resumes highlighting relevant experience for each role.