Camera spatial velocity computation through interaction matrix #3641
Conversation
To my mind:
- This should go into the `tracking` module.
- We should use a sparse set of point correspondences, not a dense one, and definitely not one tied to a specific optical flow algorithm.
- The problem as formulated now can be solved by a combination of `findEssentialMat` + `decomposeEssentialMat` or `recoverPose`, plus the Rt-to-twist decomposition available in the `DualQuat` class from 5.x, even without using additional Z coordinates. That route is preferable since the existing functions are quite robust and fast, thanks to RANSAC.
- This function would make more sense as a solution to a different problem if it is changed to take pixel velocities as input instead of images or points.
- The function should be documented.
- The function should be covered by at least one test. You can transform your sample code into a test.
Moved.
Removed the flow calculation from within the class.
I suppose this doesn't provide information about the scale of the scene? With the depth input here, the velocity comes out in absolute units right away.
I've refactored the code to explicitly take in the pixel velocities.
Done
Done. Also made the interaction matrix computation a public method, so one could use it for visual servoing applications, like driving a camera with specific pixel velocities, or toward a pixel location.
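The visual-servoing use mentioned here could look roughly like the classic image-based control law v = -λ L⁺ e (a hypothetical sketch, not this PR's API; `interaction_matrix` is an illustrative helper using the standard visual-servoing form, and the point layout and gain are made up):

```python
import numpy as np

def interaction_matrix(x, y, Z):
    """Interaction matrix of one normalized image point (x, y) at depth Z,
    mapping the camera twist (vx, vy, vz, wx, wy, wz) to the point's
    image-plane velocity."""
    return np.array([
        [-1.0 / Z, 0.0, x / Z, x * y, -(1.0 + x * x), y],
        [0.0, -1.0 / Z, y / Z, 1.0 + y * y, -x * y, -x],
    ])

# Four feature points at the corners of a square, all at depth 3.
current = np.array([[0.1, 0.1], [-0.1, 0.1], [0.1, -0.1], [-0.1, -0.1]])
desired = current * 0.5          # pull the features toward the image center
depths = np.full(4, 3.0)

err = (current - desired).ravel()
L = np.vstack([interaction_matrix(x, y, z)
               for (x, y), z in zip(current, depths)])

# Commanded camera twist that drives the feature error toward zero.
gain = 0.5
twist_cmd = -gain * np.linalg.pinv(L) @ err
```

For this symmetric target the commanded twist is dominated by a backward translation along the optical axis, which shrinks the feature square as intended.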
I think you're right, we need Z values to get an absolute scene scale. This means that we do need that functionality.
However, there are things to fix before it's merged:
Addressed all of the comments.
@savuor Could you rerun the CI? I've removed some dead comments in the tests and fixed a mismatched docs parameter. Should pass all the checks now : )
LGTM |
Feature
The code computes the camera spatial velocity given two images, per-pixel depths, the camera matrix, and the time interval between the images. This is enabled by the so-called interaction matrix (usually used in visual servoing applications), which relates image-plane pixel velocities to the camera's spatial velocity in 3D, i.e., the twist: the linear velocity and angular rate of the camera. The inverse problem can be solved by sampling pixels and their velocities and solving a least-squares problem for the twist. The relationship is shown in the picture below.
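The inverse problem described above can be sketched in a few lines of NumPy (a minimal illustration on synthetic data, not the PR's actual API; `interaction_matrix` is a hypothetical helper using the standard visual-servoing form, and the points, depths, and twist values are made up):

```python
import numpy as np

def interaction_matrix(x, y, Z):
    """Interaction matrix of one normalized image point (x, y) at depth Z,
    mapping the camera twist (vx, vy, vz, wx, wy, wz) to the point's
    image-plane velocity."""
    return np.array([
        [-1.0 / Z, 0.0, x / Z, x * y, -(1.0 + x * x), y],
        [0.0, -1.0 / Z, y / Z, 1.0 + y * y, -x * y, -x],
    ])

# Synthetic data: random points and depths, a known twist, and the
# image-plane velocities that twist would induce.
rng = np.random.default_rng(1)
pts = rng.uniform(-0.5, 0.5, size=(20, 2))
depths = rng.uniform(2.0, 5.0, size=20)
twist_true = np.array([0.1, -0.05, 0.2, 0.01, 0.02, -0.03])

L = np.vstack([interaction_matrix(x, y, z)
               for (x, y), z in zip(pts, depths)])
vel = L @ twist_true                  # stacked point velocities

# Inverse problem: recover the camera twist by linear least squares.
twist_est, *_ = np.linalg.lstsq(L, vel, rcond=None)
```

With noise-free velocities and known depths, the least-squares solve recovers the twist in absolute units, which is what the depth input buys over essential-matrix methods.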
The code does not include a proper example and is not tested, but if there is interest I could contribute a more appealing example and use case for camera velocity computation. However, I attach below a dummy example with random data simply to make sure that it runs as is. I have used this before to aid UAV localization and thought someone else might benefit from it being integrated into `opencv`.
References
Pull Request Readiness Checklist
See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request
Patch to opencv_extra has the same branch name.