Stereo Visual Odometry (SVO) is an advanced, modular computer vision project that accurately estimates camera motion through 3D space by processing sequential stereo images. This implementation uses the ROS framework to perform stereo visual odometry in real-time, taking advantage of stereo camera calibration, depth calculation, and odometry visualization in a unified pipeline.
To achieve precise motion estimation, the pipeline begins with stereo camera calibration, calculating the intrinsic matrix and publishing it as a ROS topic. This calibration step provides a foundation for accurate depth perception, which is then calculated from the stereo images in the ROS bag and published as a dedicated ROS topic. Using the depth data, SVO is performed on the image stream, generating odometry estimations that are visualized in RViz alongside the ground truth odometry from the ROS bag.
This project also makes extensive use of OpenCV for feature detection, matching, and depth calculation, ensuring a streamlined integration of image processing with the ROS framework. By combining efficient algorithms with a scalable, ROS-compatible design, it provides a reliable foundation for real-time visual odometry in dynamic environments, well suited to autonomous navigation and SLAM.
The pipeline pairs stereo visual odometry with real-time depth estimation to give a robot the spatial understanding needed for stereo-based perception. The work is implemented as ROS nodes handling:
- Camera Calibration: Computes intrinsic camera parameters via an asymmetric square grid pattern.
- Depth Estimation: Real-time computation of depth maps from stereo image feeds, critical for robotic depth perception.
- Stereo Visual Odometry (SVO): Position and orientation estimation based on feature matching in stereo images, essential for localization and navigation.
Using an asymmetric square grid pattern and OpenCV's SimpleBlobDetector, the calibration node computes the camera's intrinsic matrix, which is crucial for precise depth estimation and odometry. A sketch of this workflow follows the list below.
- Asymmetric Square Grid Pattern: The calibration target is detected in each view using OpenCV's SimpleBlobDetector.
- Intrinsic Matrix Calculation: The intrinsic parameters of the camera, such as focal length and optical center, are computed and published as a ROS topic.
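A minimal sketch of how such a detect-and-calibrate loop can be written with OpenCV is shown below. The grid dimensions, spacing, and function name are illustrative assumptions, not values taken from this repository:

```python
import cv2
import numpy as np

# Hypothetical target geometry; the real grid size and spacing are
# properties of the calibration target used with this project.
PATTERN_SIZE = (4, 11)  # features per row, number of rows
SPACING = 0.02          # distance between neighboring features, in meters

def calibrate(gray_images):
    detector = cv2.SimpleBlobDetector_create()
    # Ideal 3D grid coordinates for one view of an asymmetric pattern
    objp = np.array([[(2 * c + r % 2) * SPACING, r * SPACING, 0]
                     for r in range(PATTERN_SIZE[1])
                     for c in range(PATTERN_SIZE[0])], dtype=np.float32)
    obj_pts, img_pts = [], []
    for gray in gray_images:
        # SimpleBlobDetector locates the features; findCirclesGrid orders
        # them according to the asymmetric grid layout
        found, centers = cv2.findCirclesGrid(
            gray, PATTERN_SIZE, flags=cv2.CALIB_CB_ASYMMETRIC_GRID,
            blobDetector=detector)
        if found:
            obj_pts.append(objp)
            img_pts.append(centers)
    # K is the 3x3 intrinsic matrix (fx, fy, cx, cy); dist holds distortion terms
    _, K, dist, _, _ = cv2.calibrateCamera(
        obj_pts, img_pts, gray_images[0].shape[::-1], None, None)
    return K, dist
```

The resulting matrix K is what the node publishes for the downstream depth and odometry stages.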
Depth maps are generated from the disparity between the stereo images. A robust StereoSGBM configuration is used for improved accuracy in noisy environments; a sketch of the computation follows the list below.
- Disparity Calculation: Disparity between the left and right stereo images is calculated using StereoSGBM (Semi-Global Block Matching).
- Depth Map Generation: The disparity map is converted into a depth map, which provides the distance of objects from the camera.
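As a rough illustration, a disparity-to-depth computation along these lines might look as follows, assuming rectified grayscale stereo pairs (the SGBM parameters are illustrative defaults, not the tuned values used in this project):

```python
import cv2
import numpy as np

def compute_depth(left_gray, right_gray, fx, baseline):
    block = 5
    sgbm = cv2.StereoSGBM_create(
        minDisparity=0,
        numDisparities=64,      # search range; must be divisible by 16
        blockSize=block,
        P1=8 * block * block,   # penalty for small disparity changes
        P2=32 * block * block)  # penalty for large disparity changes
    # StereoSGBM returns fixed-point disparities scaled by 16
    disparity = sgbm.compute(left_gray, right_gray).astype(np.float32) / 16.0
    disparity[disparity <= 0] = np.nan  # mask pixels with no valid match
    # Depth from similar triangles: Z = fx * B / d
    return fx * baseline / disparity
```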
The SVO node implements a feature-tracking-based stereo visual odometry pipeline. It estimates the robot's 3D position and orientation through feature detection, matching, and triangulation, which are critical for continuous localization.
- Feature Detection: Key points in the images are detected using the ORB (Oriented FAST and Rotated BRIEF) algorithm. ORB is chosen for its efficiency and robustness in detecting and describing key points, making it suitable for real-time applications.

  Method:
  - Convert the image to grayscale.
  - Use the ORB detector to find key points and compute their descriptors.

  ```python
  def detect_features(self, img):
      # Detect ORB key points and compute their binary descriptors
      orb = cv2.ORB_create()
      kp, des = orb.detectAndCompute(img, None)
      return kp, des
  ```
- Feature Matching: Features between consecutive frames are matched using the BFMatcher (Brute-Force Matcher) with Hamming distance and cross-checking. BFMatcher is selected for its simplicity and effectiveness in matching binary descriptors such as those produced by ORB.

  Method:
  - Create a BFMatcher object with Hamming distance and cross-check enabled.
  - Match the descriptors from the current frame to the previous frame.
  - Sort the matches by distance and extract the matched points.

  ```python
  def track_features(self, pt1, pt2, des1, des2):
      # Cross-checking keeps only mutual best matches between the frames
      bf = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
      matches = sorted(bf.match(des1, des2), key=lambda x: x.distance)
      # Collect the matched point coordinates from both frames
      pts1 = np.float32([pt1[m.queryIdx] for m in matches]).reshape(-1, 1, 2)
      pts2 = np.float32([pt2[m.trainIdx] for m in matches]).reshape(-1, 1, 2)
      return pts1, pts2
  ```
- Triangulation: The 3D positions of matched features are estimated using triangulation. This step is crucial for determining the spatial coordinates of the features in the scene.

  Method:
  - Use the matched key points from the left and right images.
  - Apply the triangulation method to compute the 3D coordinates.

  ```python
  def compute_odometry(self, pts1, pts2):
      # Projection matrices for the rectified stereo pair; the right
      # camera is offset along x by the stereo baseline
      in1 = np.array([[self.fx, 0, self.cx, 0],
                      [0, self.fy, self.cy, 0],
                      [0, 0, 1, 0]])
      in2 = np.array([[self.fx, 0, self.cx, self.baseline],
                      [0, self.fy, self.cy, 0],
                      [0, 0, 1, 0]])
      # triangulatePoints expects 2xN arrays of image points
      t = cv2.triangulatePoints(in1, in2,
                                pts1.reshape(-1, 2).T,
                                pts2.reshape(-1, 2).T)
      wpoints = t[:3] / t[3]  # Convert from homogeneous to Euclidean coordinates
  ```
- Pose Estimation: The camera's position and orientation are estimated using the PnP (Perspective-n-Point) algorithm with RANSAC. PnP is used for its ability to estimate the pose of the camera from 3D-2D point correspondences.

  Method:
  - Use the 3D points obtained from the triangulation and their corresponding 2D image points.
  - Apply the PnP algorithm to estimate the camera pose.
  - Compute the new transformation matrix and extract the position and orientation.

  ```python
  def compute_odometry(self, pts1, pts2):
      # ... triangulation as shown above ...
      # Calibrated intrinsics; the images are assumed rectified, so no
      # distortion coefficients are passed
      K = np.array([[self.fx, 0, self.cx],
                    [0, self.fy, self.cy],
                    [0, 0, 1]])
      # Estimate the camera pose from 3D-2D correspondences with RANSAC
      _, rvec, t, _ = cv2.solvePnPRansac(wpoints.T, pts1, K, np.zeros((4, 1)))
      Rot, _ = cv2.Rodrigues(rvec)
      # Build the 4x4 homogeneous transform and chain it onto the previous pose
      T = np.vstack((np.hstack((Rot, t)), [0, 0, 0, 1]))
      new = np.matmul(self.prev, T)
      position = new[:3, 3]
      orientation = cv2.Rodrigues(new[:3, :3])[0]
      return position, orientation
  ```
The nodes communicate through ROS publishers and subscribers, keeping data flowing through the pipeline in real time; a minimal sketch of this wiring follows the list below.
- ROS Publishers and Subscribers: Used for real-time data handling and communication between nodes.
- RViz Visualization: Depth maps and odometry estimations are visualized in RViz.
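The sketch below shows what this publisher/subscriber wiring can look like in rospy; the topic names here are hypothetical placeholders, while the actual names are defined in this repository's nodes and launch files:

```python
import rospy
from nav_msgs.msg import Odometry
from sensor_msgs.msg import Image

class SvoNode:
    def __init__(self):
        rospy.init_node('svo')
        # Publish odometry estimates for RViz and downstream consumers
        self.odom_pub = rospy.Publisher('/svo/odometry', Odometry, queue_size=10)
        # Subscribe to the incoming camera stream
        rospy.Subscriber('/camera/left/image_raw', Image, self.on_image)

    def on_image(self, msg):
        odom = Odometry()
        odom.header.stamp = msg.header.stamp
        odom.header.frame_id = 'odom'
        # ... fill in pose and orientation from the SVO pipeline ...
        self.odom_pub.publish(odom)

if __name__ == '__main__':
    SvoNode()
    rospy.spin()
```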
These contributions collectively provide a robust and efficient pipeline for stereo visual odometry, enabling accurate 3D localization and navigation in dynamic environments.
- Ensure ROS Noetic is Installed: Follow the ROS Noetic installation guide to install ROS Noetic on your system.
- Create a ROS Workspace:

  ```bash
  mkdir -p ~/catkin_ws/src
  cd ~/catkin_ws/
  catkin_make
  source devel/setup.bash
  ```
- Clone the Repository:

  ```bash
  cd ~/catkin_ws/src
  git clone https://github.com/yourusername/stereo_visual_odometry.git
  ```
- Install Dependencies using rosdep:

  ```bash
  cd ~/catkin_ws
  rosdep install --from-paths src --ignore-src -r -y
  ```
- Install Additional Packages: Ensure that OpenCV and the required Python libraries are installed:

  ```bash
  sudo apt install python3-pip
  pip3 install numpy opencv-python rospkg
  ```
- Build the ROS Workspace:

  ```bash
  cd ~/catkin_ws
  catkin_make
  source devel/setup.bash
  ```
- Launch the Calibration Node:

  ```bash
  roslaunch stereo_visual_odometry calibrate.launch
  ```
- Launch the Depth Estimation Node:

  ```bash
  roslaunch stereo_visual_odometry depth_estimation.launch
  ```
- Launch the SVO Node:

  ```bash
  roslaunch stereo_visual_odometry svo.launch
  ```
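Since the pipeline consumes stereo images and ground-truth odometry from a ROS bag (see the overview above), you will also need to play the bag back alongside the nodes. A typical invocation, with a hypothetical bag file name:

```bash
rosbag play your_dataset.bag
```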
```
stereo_visual_odometry
├── CMakeLists.txt
├── package.xml
├── calib
│   └── ...              # calibration files and data
├── launch
│   ├── calibrate.launch
│   ├── depth_estimation.launch
│   └── svo.launch
├── rviz
│   └── ...              # RViz configuration files
└── src
    ├── calibrate.py
    ├── get_depth.py
    └── svo.py
```
- Ensure you have the appropriate stereo datasets for testing and validation of the SVO pipeline.


