Overview
In times of disaster, every second counts, and reaching survivors in hazardous terrain poses significant challenges. Imagine a coordinated team of agile, four-legged robots working together to navigate treacherous environments like dense forests or mines. These quadrupedal robots autonomously perform simultaneous localization and mapping (SLAM), creating detailed real-time maps of their surroundings. By employing a decentralized collaborative system, the robots can share and merge their individual maps into a comprehensive understanding of the complete area without relying on a central system. This approach enhances the robustness and speed of search operations, since the failure of a single unit does not compromise the entire mission. Quadrupeds inherently handle uneven terrain well, and by harnessing SLAM to explore unmapped areas with LiDAR and visual-inertial sensor data, these robotic swarms represent a leap forward in disaster response, offering hope and assistance when it is needed most. This is, of course, only a first iteration and needs considerable work before it is deployable onsite.

The Main Idea

Unitree-GO2
The System Architecture of GO2

Hardware Setup: The laptop is connected to the Unitree network over Ethernet, with CycloneDDS as the ROS middleware. This combination gave the lowest data-transfer latency. Due to time constraints I did not configure the Jetson for the GO2. I implemented high-level controls that interact with the Unitree GO2 ROS2 SDK. Github Link
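To give a flavour of the high-level control layer, here is a minimal sketch of a node that publishes velocity commands. The /cmd_vel topic name is an assumption for illustration only; the actual commands go through the Unitree GO2 ROS2 SDK.

```cpp
// Minimal sketch of a velocity-command publisher. The "/cmd_vel" topic is a
// placeholder assumption, not necessarily the SDK's actual interface.
#include <chrono>
#include "rclcpp/rclcpp.hpp"
#include "geometry_msgs/msg/twist.hpp"

int main(int argc, char ** argv)
{
  rclcpp::init(argc, argv);
  auto node = rclcpp::Node::make_shared("go2_high_level_cmd");
  auto pub = node->create_publisher<geometry_msgs::msg::Twist>("/cmd_vel", 10);

  // Publish a slow forward walk at 10 Hz; the high-level layer turns this into gaits.
  auto timer = node->create_wall_timer(std::chrono::milliseconds(100), [pub]() {
    geometry_msgs::msg::Twist cmd;
    cmd.linear.x = 0.2;   // m/s forward
    cmd.angular.z = 0.0;  // rad/s yaw rate
    pub->publish(cmd);
  });

  rclcpp::spin(node);
  rclcpp::shutdown();
  return 0;
}
```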
Manual Navigation
Implemented a manual navigation package in ROS2 Jazzy and C++ with the following features:
- Integrated with the high-level control package to operate on the real robot.
- Transform publishers.
- Robot state publishers.
- RTAB-Map packages generating the odom TF and occupancy grid using the GO2's 4D LiDAR.
- RViz visualizations with a modified URDF.
- Nav-to-pose to enable high-level control with the GO2's APIs (see the sketch after this list).
- Nav2-based manual goal subscription.
- Footprint-based obstacle avoidance.
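As an example of the nav-to-pose integration, here is a minimal sketch of sending a single goal to Nav2's NavigateToPose action; the frame and action names are the Nav2 defaults and may differ from the actual package.

```cpp
// Minimal sketch: send a single NavigateToPose goal to Nav2.
// Frame/action names are the Nav2 defaults and may differ in the actual package.
#include <chrono>
#include "rclcpp/rclcpp.hpp"
#include "rclcpp_action/rclcpp_action.hpp"
#include "nav2_msgs/action/navigate_to_pose.hpp"

using NavigateToPose = nav2_msgs::action::NavigateToPose;

int main(int argc, char ** argv)
{
  rclcpp::init(argc, argv);
  auto node = rclcpp::Node::make_shared("manual_goal_sender");
  auto client = rclcpp_action::create_client<NavigateToPose>(node, "navigate_to_pose");

  if (!client->wait_for_action_server(std::chrono::seconds(5))) {
    RCLCPP_ERROR(node->get_logger(), "Nav2 action server not available");
    return 1;
  }

  NavigateToPose::Goal goal;
  goal.pose.header.frame_id = "map";
  goal.pose.header.stamp = node->now();
  goal.pose.pose.position.x = 1.0;      // goal 1 m ahead in the map frame
  goal.pose.pose.orientation.w = 1.0;   // facing along +x

  // Wait until the goal is accepted (result handling omitted for brevity).
  auto goal_handle_future = client->async_send_goal(goal);
  rclcpp::spin_until_future_complete(node, goal_handle_future);

  rclcpp::shutdown();
  return 0;
}
```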
Publishing the Nav goals from RViz
LiDAR-SLAM
Simultaneous Localization and Mapping (SLAM) is a fundamental technology in autonomous systems, which enables devices to perform real-time mapping while determining their position within an environment. LiDAR (Light Detection and Ranging) SLAM utilizes laser sensor technology to generate a highly accurate 3D map of the surrounding environment. By emitting laser pulses and calculating the Time of Flight (ToF), it can measure distances and map complex areas with precision.
The Unitree GO2 has a built-in 4D LiDAR that publishes point cloud data, encoding the ToF as intensity values. I am using the ROS2 RTAB-Map package, feeding it the odometry from the Unitree topic and the filtered point cloud, to generate the map. I use RTAB-Map's ICP (Iterative Closest Point) algorithm to improve the mapping accuracy. The GO2 also publishes its different sensor data at highly mismatched rates, so ensure that all packages use a synchronization or queuing mechanism.
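To illustrate that synchronization point, here is a minimal sketch that pairs the odometry and point cloud streams with an approximate-time policy; the topic names are assumptions, not necessarily the ones the packages use.

```cpp
// Minimal sketch: approximately synchronize odometry and point cloud messages
// before feeding them to the mapping pipeline. Topic names are assumptions.
#include <memory>
#include "rclcpp/rclcpp.hpp"
#include "message_filters/subscriber.h"
#include "message_filters/synchronizer.h"
#include "message_filters/sync_policies/approximate_time.h"
#include "nav_msgs/msg/odometry.hpp"
#include "sensor_msgs/msg/point_cloud2.hpp"

using SyncPolicy = message_filters::sync_policies::ApproximateTime<
  nav_msgs::msg::Odometry, sensor_msgs::msg::PointCloud2>;

class OdomCloudSync : public rclcpp::Node
{
public:
  OdomCloudSync()
  : Node("odom_cloud_sync"),
    odom_sub_(this, "/odom"),
    cloud_sub_(this, "/filtered_points"),
    sync_(SyncPolicy(10), odom_sub_, cloud_sub_)
  {
    sync_.registerCallback(
      std::bind(&OdomCloudSync::callback, this,
                std::placeholders::_1, std::placeholders::_2));
  }

private:
  void callback(
    const nav_msgs::msg::Odometry::ConstSharedPtr & odom,
    const sensor_msgs::msg::PointCloud2::ConstSharedPtr & cloud)
  {
    // Here the paired messages would be forwarded to the mapping stack.
    RCLCPP_INFO(get_logger(), "Paired odom (%.3f) with cloud (%.3f)",
                rclcpp::Time(odom->header.stamp).seconds(),
                rclcpp::Time(cloud->header.stamp).seconds());
  }

  message_filters::Subscriber<nav_msgs::msg::Odometry> odom_sub_;
  message_filters::Subscriber<sensor_msgs::msg::PointCloud2> cloud_sub_;
  message_filters::Synchronizer<SyncPolicy> sync_;
};

int main(int argc, char ** argv)
{
  rclcpp::init(argc, argv);
  rclcpp::spin(std::make_shared<OdomCloudSync>());
  rclcpp::shutdown();
  return 0;
}
```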
Point-Cloud-Filtering
When the raw point cloud data was passed to the RTAB-Map package, the generated occupancy grid did not contain any free space. So I had to incorporate certain standard filtering techniques, such as deskewing (compensating for distortion caused by motion) and a voxel filter (downsampling the scan data). I also used the Point Cloud Library to cluster high-density points and project them as obstacles, with everything else as free space, so that a better occupancy grid is formed.
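As an illustration of the voxel-downsampling and clustering steps, here is a simplified sketch using PCL; the leaf size and cluster thresholds are placeholder values, not the ones tuned for the GO2.

```cpp
// Sketch of the filtering steps: voxel downsampling followed by Euclidean
// clustering, keeping clustered points as obstacle candidates.
// Leaf size and cluster thresholds are illustrative, not tuned values.
#include <vector>
#include <pcl/point_cloud.h>
#include <pcl/point_types.h>
#include <pcl/filters/voxel_grid.h>
#include <pcl/segmentation/extract_clusters.h>
#include <pcl/search/kdtree.h>

pcl::PointCloud<pcl::PointXYZI>::Ptr
filterScan(const pcl::PointCloud<pcl::PointXYZI>::Ptr & raw)
{
  // 1. Downsample with a voxel grid to reduce density and noise.
  pcl::PointCloud<pcl::PointXYZI>::Ptr downsampled(new pcl::PointCloud<pcl::PointXYZI>());
  pcl::VoxelGrid<pcl::PointXYZI> voxel;
  voxel.setInputCloud(raw);
  voxel.setLeafSize(0.05f, 0.05f, 0.05f);   // 5 cm voxels
  voxel.filter(*downsampled);

  // 2. Cluster the remaining points; dense clusters are treated as obstacles.
  pcl::search::KdTree<pcl::PointXYZI>::Ptr tree(new pcl::search::KdTree<pcl::PointXYZI>());
  tree->setInputCloud(downsampled);

  std::vector<pcl::PointIndices> clusters;
  pcl::EuclideanClusterExtraction<pcl::PointXYZI> ec;
  ec.setClusterTolerance(0.10);   // points within 10 cm join a cluster
  ec.setMinClusterSize(20);       // discard sparse clutter
  ec.setSearchMethod(tree);
  ec.setInputCloud(downsampled);
  ec.extract(clusters);

  // 3. Keep only clustered points; everything else is projected as free space.
  pcl::PointCloud<pcl::PointXYZI>::Ptr obstacles(new pcl::PointCloud<pcl::PointXYZI>());
  for (const auto & cluster : clusters) {
    for (int idx : cluster.indices) {
      obstacles->push_back((*downsampled)[idx]);
    }
  }
  return obstacles;
}
```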
Before and After Filtering:

Before

After
Autonomous Navigation
Implemented an autonomous navigation package in ROS2 Jazzy and C++ with the following features:
- Integrated with the manual navigation package to operate on the real robot.
- Nearest-frontier-based exploration.
- State-machine-based goal assignment (see the sketch after this list).
- Services to start exploration and to emergency stop.
- Markers for visualization.
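For context, here is a stripped-down sketch of what the goal-assignment state machine looks like conceptually; the state names and transition conditions are illustrative, not the exact ones in the package.

```cpp
// Stripped-down sketch of the exploration state machine. State names and
// transition conditions are illustrative, not the exact ones in the package.
enum class ExploreState { Idle, SelectingFrontier, NavigatingToGoal, Done, EStopped };

ExploreState step(ExploreState state, bool start_requested, bool estop_requested,
                  bool frontier_found, bool goal_reached)
{
  if (estop_requested) {
    return ExploreState::EStopped;   // the emergency-stop service overrides everything
  }
  switch (state) {
    case ExploreState::Idle:
      return start_requested ? ExploreState::SelectingFrontier : ExploreState::Idle;
    case ExploreState::SelectingFrontier:
      // No frontiers left means the reachable map is fully explored.
      return frontier_found ? ExploreState::NavigatingToGoal : ExploreState::Done;
    case ExploreState::NavigatingToGoal:
      return goal_reached ? ExploreState::SelectingFrontier : ExploreState::NavigatingToGoal;
    case ExploreState::Done:
    case ExploreState::EStopped:
    default:
      return state;                  // terminal until reset
  }
}
```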

Unitree-GO1
The System Architecture of GO1

Hardware Setup:
A Jetson Orin Nano is powered from the Unitree power port through a buck converter (24 V -> 12 V). A Zed 2i camera and a display screen are connected to and powered by the Jetson, and the Jetson is connected to the same WiFi network as the Unitree GO1 (password: 00000000).

Front View

Side View
Manual Navigation Demo
Autonomous Navigation Demo
Obstacle Avoidance Demo
Visual-SLAM
Visual SLAM uses camera sensors and computer vision algorithms to map environments and track a device’s movement in real-time. By identifying and tracking key visual features across multiple frames, Visual SLAM estimates the camera’s movement and builds a 3D map.
I am using a Zed 2i camera mounted on and powered by the GO1, connected to the Jetson Orin Nano. The Zed 2i has a ROS2 SDK and built-in image processing and SLAM algorithms. I am using its Visual-Inertial Odometry (VIO) for generating the occupancy grid. The 3D map it generates needs to be projected to 2D for Nav2 to use it; I am using the map filtering node written by Aditya Nair specifically for the Zed 2i.
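The projection itself boils down to flattening the points within a height band onto a 2D grid. The sketch below illustrates that idea only; it is not Aditya Nair's node, and the resolution and height band are placeholder values.

```cpp
// Illustrative sketch of projecting a 3D point cloud onto a 2D occupancy grid
// for Nav2. This is NOT the actual Zed 2i map-filtering node; the resolution
// and the height band used to call a cell "occupied" are placeholder values.
#include "sensor_msgs/msg/point_cloud2.hpp"
#include "sensor_msgs/point_cloud2_iterator.hpp"
#include "nav_msgs/msg/occupancy_grid.hpp"

nav_msgs::msg::OccupancyGrid projectToGrid(
  const sensor_msgs::msg::PointCloud2 & cloud,
  double resolution = 0.05, double size_m = 20.0,
  double min_z = 0.05, double max_z = 0.8)
{
  nav_msgs::msg::OccupancyGrid grid;
  grid.header = cloud.header;
  grid.info.resolution = resolution;
  grid.info.width  = static_cast<uint32_t>(size_m / resolution);
  grid.info.height = static_cast<uint32_t>(size_m / resolution);
  grid.info.origin.position.x = -size_m / 2.0;   // grid centred on the sensor frame
  grid.info.origin.position.y = -size_m / 2.0;
  grid.info.origin.orientation.w = 1.0;
  grid.data.assign(grid.info.width * grid.info.height, 0);   // start as free

  sensor_msgs::PointCloud2ConstIterator<float> it_x(cloud, "x"), it_y(cloud, "y"), it_z(cloud, "z");
  for (; it_x != it_x.end(); ++it_x, ++it_y, ++it_z) {
    if (*it_z < min_z || *it_z > max_z) continue;             // ignore floor and ceiling
    int cx = static_cast<int>((*it_x - grid.info.origin.position.x) / resolution);
    int cy = static_cast<int>((*it_y - grid.info.origin.position.y) / resolution);
    if (cx < 0 || cy < 0 || cx >= static_cast<int>(grid.info.width) ||
        cy >= static_cast<int>(grid.info.height)) continue;
    grid.data[cy * grid.info.width + cx] = 100;               // mark cell occupied
  }
  return grid;
}
```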
Stereo vision based Point Cloud generation by Zed:
Position Tracking (video from Zed Official site):

LiDAR vs Visual
This is just my opinion, mainly based on observations made while working on this project.
- Robust in Challenging Conditions: LiDAR performs well in adverse conditions like low light, making it highly adaptable for autonomous systems operating in GPS-denied areas; Visual SLAM relies on well-lit environments.
- High Processing Demand: LiDAR generates massive amounts of data, requiring powerful hardware and more advanced data processing.
- Feature-Rich Environments Required: Visual SLAM performs better in environments rich in features (e.g., edges, corners), which can be a challenge in sparse or plain areas.
- Ease of Debugging: Visual SLAM works on camera images, which are much easier to make sense of than the intensity heat-map produced by the LiDAR.
Frontier-based exploration
I have used the same technique for both quadrupeds' exploration: the occupancy grid and the current odometry are used to detect "frontiers" (spaces in the map that are yet to be explored), and one of them is optimally chosen as the next goal. A lot can be modified in how it is chosen. Currently the GO1 has a tendency to choose the peripheral frontier, and is thus better suited for round areas than for corridor-like environments, while the GO2 has a tendency to go for consecutive goals, which reduces the breadth of area covered.
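Here is a boiled-down version of the frontier detection and nearest-goal selection on a nav_msgs/OccupancyGrid; the scoring is deliberately simplified compared to the actual packages.

```cpp
// Boiled-down frontier detection and nearest-goal selection on a
// nav_msgs/OccupancyGrid. Simplified compared to the actual exploration packages.
#include <cmath>
#include <limits>
#include <optional>
#include <utility>
#include "nav_msgs/msg/occupancy_grid.hpp"

// A cell is a frontier if it is known-free and touches at least one unknown cell.
static bool isFrontier(const nav_msgs::msg::OccupancyGrid & map, int x, int y)
{
  const int w = map.info.width, h = map.info.height;
  if (map.data[y * w + x] != 0) return false;        // must be free (0)
  const int dx[4] = {1, -1, 0, 0}, dy[4] = {0, 0, 1, -1};
  for (int k = 0; k < 4; ++k) {
    int nx = x + dx[k], ny = y + dy[k];
    if (nx < 0 || ny < 0 || nx >= w || ny >= h) continue;
    if (map.data[ny * w + nx] == -1) return true;    // neighbour is unknown
  }
  return false;
}

// Return the frontier cell (in world coordinates) closest to the robot, if any.
std::optional<std::pair<double, double>> nearestFrontier(
  const nav_msgs::msg::OccupancyGrid & map, double robot_x, double robot_y)
{
  const double res = map.info.resolution;
  const double ox = map.info.origin.position.x, oy = map.info.origin.position.y;
  std::optional<std::pair<double, double>> best;
  double best_dist = std::numeric_limits<double>::max();

  for (int y = 0; y < static_cast<int>(map.info.height); ++y) {
    for (int x = 0; x < static_cast<int>(map.info.width); ++x) {
      if (!isFrontier(map, x, y)) continue;
      double wx = ox + (x + 0.5) * res, wy = oy + (y + 0.5) * res;
      double d = std::hypot(wx - robot_x, wy - robot_y);
      if (d < best_dist) { best_dist = d; best = std::make_pair(wx, wy); }
    }
  }
  return best;   // std::nullopt means the reachable map is fully explored
}
```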
As observed on the GO1 display screen while exploring:
Map-Merging
The final step is to merge the occupancy grids generated by all the individual quadrupeds in a cluster. The initial relative positions of both agents are known, and I also have the IMU data of the motion series, so the individual maps can be stitched into a combined map.
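Since the initial relative poses are known, the core of the merge is transforming each local grid into a common frame and combining cells conservatively. Here is a minimal sketch assuming a pure translation offset and a shared resolution; the real merge also has to handle relative rotation and drift.

```cpp
// Minimal sketch of merging two occupancy grids whose relative offset is known.
// Assumes a pure translation between the agents' map frames and a shared
// resolution; the real merge must also handle rotation and IMU-based drift.
#include <algorithm>
#include "nav_msgs/msg/occupancy_grid.hpp"

// Merge `other` into `base`, where (dx, dy) is other's map origin expressed in
// base's map frame. Cells combine conservatively: occupied wins, unknown yields.
void mergeInto(nav_msgs::msg::OccupancyGrid & base,
               const nav_msgs::msg::OccupancyGrid & other,
               double dx, double dy)
{
  const double res = base.info.resolution;
  for (int y = 0; y < static_cast<int>(other.info.height); ++y) {
    for (int x = 0; x < static_cast<int>(other.info.width); ++x) {
      int8_t val = other.data[y * other.info.width + x];
      if (val == -1) continue;                 // unknown cells add no information

      // World coordinates of this cell in base's map frame.
      double wx = other.info.origin.position.x + dx + (x + 0.5) * res;
      double wy = other.info.origin.position.y + dy + (y + 0.5) * res;

      int bx = static_cast<int>((wx - base.info.origin.position.x) / res);
      int by = static_cast<int>((wy - base.info.origin.position.y) / res);
      if (bx < 0 || by < 0 || bx >= static_cast<int>(base.info.width) ||
          by >= static_cast<int>(base.info.height)) continue;

      int8_t & cell = base.data[by * base.info.width + bx];
      cell = std::max(cell, val);              // occupied (100) overrides free (0)
    }
  }
}
```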
I have it working in simulation based on the research.

I am currently trying to get it to work with the real GOs.
Acknowledgements
Unitree & ROS2 contributors, Prof. Paul Umbanhowar for guiding me through the entire project lifecycle, Davin Landry for assisting in the Maker's Space, all the work done by Aditya Nair, Nick Morales, Marno Nel, Katie Hughes, and David Dorf which gave me a foundation to build on, the robofriends repo contributors, the 2025 Cohort, and most importantly the Open Source Community!!
And of course to Scooby and Oatmeal! It was lovely working with you both.
If any of you reading this work on the project and add segments to the codebase, I would love to know about it. Most issues and operational details are still fresh in my mind, so feel free to add issues and raise PRs. Detailed setup instructions and running procedures are in the README file of each package.