1 Data Capture and Overview

Figure 1: You will capture your cellphone images to reconstruct camera pose and 3D points.

In this assignment, you will use your own cellphone images (more than 5) to reconstruct 3D camera poses and points with full bundle adjustment. Make sure there is enough baseline (translation) between images for a well-conditioned fundamental matrix, while retaining a sufficient number of correspondences between images. Avoid a scene dominated by a planar surface, i.e., the images need to contain many 3D objects, as shown in Figure 1.

You will write a full structure from motion pipeline, including matching, camera pose estimation using the fundamental matrix, PnP, triangulation, and bundle adjustment. Every nonlinear optimization is initialized with the estimate from a linear least squares solution. The pipeline is described in Algorithm 1.

Algorithm 1 Structure from Motion

[Mx, My] = GetMatches(I₁, …, I_N)
Normalize the coordinates in Mx and My, i.e., x = K⁻¹x.
Select two images, I_i1 and I_i2, for the initial pair reconstruction.
[R, C, X] = CameraPoseEstimation([Mx(:,i1) My(:,i1)], [Mx(:,i2) My(:,i2)])
𝒫 = {P_i1, P_i2} where P_i1 = [I₃ 0₃] and P_i2 = R [I₃ −C]
ℛ = {i1, i2}
while |ℛ| < N do
    i = GetBestFrame(Mx, My, ℛ)
    [R_i, C_i] = PnP_RANSAC([Mx(:,i) My(:,i)], X)
    [R_i, C_i] = PnP_Nonlinear(R_i, C_i, [Mx(:,i) My(:,i)], X)
    P_i = R_i [I₃ −C_i]
    for f = 1 : |ℛ| do
        U = FindUnreconstructedPoints(X, ℛ_f, i, Mx, My)
        for j = 1 : |U| do
            u = [Mx(U_j, i), My(U_j, i)] and v = [Mx(U_j, ℛ_f), My(U_j, ℛ_f)]
            x = LinearTriangulation(u, P_i, v, P_ℛf)
            x = NonlinearTriangulation(x, u, R_i, C_i, v, R_ℛf, C_ℛf)
            X = X ∪ x
        end for
    end for
    𝒫 = 𝒫 ∪ P_i and ℛ = ℛ ∪ i
    [𝒫, X] = BundleAdjustment(𝒫, X, ℛ, Mx, My)
end while
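Algorithm 1 does not spell out how GetBestFrame chooses the next image. One plausible criterion, assumed here rather than taken from the assignment, is to pick the unregistered image that observes the largest number of already reconstructed points, since that gives PnP the most 2D-3D correspondences. The sketch below uses NumPy; the name `get_best_frame` and the `reconstructed` mask are illustrative.

```python
import numpy as np

def get_best_frame(Mx, reconstructed, registered):
    """Pick the unregistered image that sees the most reconstructed points.

    Mx            : (F, N) x-coordinates, -1 where feature f is unseen in image n
    reconstructed : (F,) boolean mask, True if feature f already has a 3D point
    registered    : iterable of image indices already in the reconstruction
    """
    visible = Mx != -1                                   # (F, N) visibility mask
    counts = (visible & reconstructed[:, None]).sum(axis=0)
    counts[list(registered)] = -1                        # never re-select a registered image
    return int(np.argmax(counts))
```

If the measurements have already been normalized by K⁻¹, the -1 marker may no longer be reliable, so keeping a separate boolean visibility mask is a safer design.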

2 Matching

Given a set of images, I₁, …, I_N, where N is the number of images, you will find matches across all images, similar to HW #4. Pick a reference image, I_ref, and match it with the other images using SIFT features from VLFeat, i.e., I_ref ↔ I₁, …, I_ref ↔ I_N (no need to match I_ref ↔ I_ref).

Your matches should be outlier free, i.e., bidirectional kNN matches that pass the ratio test and are inliers of the fundamental matrix based RANSAC. Based on the matches, you will build the measurement matrices, Mx and My:

[Mx, My] = GetMatches(I₁, …, I_N)

Mx: F × N matrix storing the x coordinates of correspondences

My: F × N matrix storing the y coordinates of correspondences

The f-th feature point in image I_i corresponds to a point in image I_j. The x and y coordinates of the correspondence are stored at the (f,i) and (f,j) elements of Mx and My, respectively. If the f-th feature point does not correspond to any point in image I_k, set the (f,k) element to -1 to indicate no match, as shown in Figure 2.
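The layout of Mx and My can be illustrated with a small NumPy sketch. The input format is an assumption made for this example: `matches` maps an image index to a list of `(f, x, y)` tuples, meaning reference feature `f` was matched to pixel `(x, y)` in that image. Neither this format nor the function name comes from the assignment.

```python
import numpy as np

def build_measurement_matrices(ref_keypoints, matches, ref_index, n_images):
    """Fill Mx and My (F x N) from matches against a reference image.

    ref_keypoints : (F, 2) keypoint coordinates in the reference image
    matches       : dict {image index: [(f, x, y), ...]} (hypothetical format)
    """
    F = len(ref_keypoints)
    Mx = -np.ones((F, n_images))          # -1 marks "no match"
    My = -np.ones((F, n_images))
    Mx[:, ref_index] = ref_keypoints[:, 0]
    My[:, ref_index] = ref_keypoints[:, 1]
    for n, pairs in matches.items():
        for f, x, y in pairs:
            Mx[f, n] = x                  # row f tracks the same feature in every image
            My[f, n] = y
    return Mx, My
```

Each row then describes one feature track, and each column holds all measurements of one image.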

Important: For better numerical stability, you can transform the measurements to normalized coordinates by multiplying by K⁻¹, i.e., x = K⁻¹x, where x is a 2D measured point in homogeneous coordinates. You can then run structure from motion in normalized coordinates by factoring out K. When visualizing projections in the image, the coordinates need to be transformed back to the original coordinates by multiplying by K.
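As a minimal sketch of this normalization, assuming K is a standard 3 × 3 upper-triangular intrinsic matrix and points are given as n × 2 arrays (the function names are illustrative):

```python
import numpy as np

def normalize_points(K, pts):
    """Map pixel coordinates (n, 2) to normalized coordinates via K^{-1}."""
    homog = np.column_stack([pts, np.ones(len(pts))])    # (n, 3) homogeneous points
    normalized = np.linalg.solve(K, homog.T).T           # K^{-1} x without forming the inverse
    return normalized[:, :2] / normalized[:, 2:3]

def denormalize_points(K, pts):
    """Map normalized coordinates (n, 2) back to pixels via K, e.g. for visualization."""
    homog = np.column_stack([pts, np.ones(len(pts))])
    pixels = (K @ homog.T).T
    return pixels[:, :2] / pixels[:, 2:3]
```

Entries marked -1 (no match) should be masked out before calling these functions, otherwise the marker itself gets transformed.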

Row f of Mx (F × N): [ … x_{f,i} … x_{f,j} … -1 … ]
Row f of My (F × N): [ … y_{f,i} … y_{f,j} … -1 … ]
(x_{f,i}, y_{f,i}) ↔ (x_{f,j}, y_{f,j}), with -1 in column k where there is no match

Figure 2: The f-th feature point in image I_i corresponds to a point in image I_j. The x and y coordinates of the correspondence are stored at the (f,i) and (f,j) elements of Mx and My, respectively. If the f-th feature point does not correspond to any point in image I_k, the (f,k) element is set to -1 to indicate no match.

3 Camera Pose Estimation

You will write camera pose estimation code that takes correspondences between two images, I_i1 and I_i2, where i1 and i2 are the manually selected indices of the initial images to reconstruct.

[R, C, X] = CameraPoseEstimation(u₁, u₂)

R and C: the relative transformation of the i2 image
u₁ and u₂: 2D-2D correspondences

As studied in HW #4, you will compute:

Fundamental matrix via RANSAC on the correspondences [Mx(:,i1) My(:,i1)] and [Mx(:,i2) My(:,i2)]
Essential matrix from the fundamental matrix
Four configurations of camera poses given the essential matrix
Disambiguation via cheirality (using 3D point linear triangulation): X = LinearTriangulation(u, P_i, v, P_j)
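The last two steps can be sketched as follows, assuming normalized coordinates (K = I), the first camera at P₁ = [I 0], and the second camera written as P₂ = R [I −C]. The helper names are illustrative, and the DLT triangulation shown is one common linear formulation rather than necessarily the one expected by the assignment.

```python
import numpy as np

def skew(v):
    """Cross-product matrix [v]x."""
    return np.array([[0, -v[2], v[1]], [v[2], 0, -v[0]], [-v[1], v[0], 0]])

def linear_triangulation(u, P1, v, P2):
    """DLT triangulation. u, v: (n, 2) normalized points; P1, P2: 3x4 matrices."""
    X = np.zeros((len(u), 3))
    for k in range(len(u)):
        A = np.vstack([u[k, 0] * P1[2] - P1[0],
                       u[k, 1] * P1[2] - P1[1],
                       v[k, 0] * P2[2] - P2[0],
                       v[k, 1] * P2[2] - P2[1]])
        Xh = np.linalg.svd(A)[2][-1]       # null vector = last right singular vector
        X[k] = Xh[:3] / Xh[3]
    return X

def pose_candidates(E):
    """Four (R, C) configurations consistent with an essential matrix."""
    U, _, Vt = np.linalg.svd(E)
    W = np.array([[0.0, -1, 0], [1, 0, 0], [0, 0, 1]])
    out = []
    for R in (U @ W @ Vt, U @ W.T @ Vt):
        if np.linalg.det(R) < 0:           # enforce a proper rotation
            R = -R
        for t in (U[:, 2], -U[:, 2]):
            out.append((R, -R.T @ t))      # camera center C = -R^T t
    return out

def disambiguate(E, u, v):
    """Keep the configuration with the most points in front of both cameras."""
    P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
    best = None
    for R, C in pose_candidates(E):
        P2 = R @ np.hstack([np.eye(3), -C[:, None]])
        X = linear_triangulation(u, P1, v, P2)
        in_front = (X[:, 2] > 0) & ((X - C) @ R[2] > 0)   # cheirality for both views
        if best is None or in_front.sum() > best[0]:
            best = (in_front.sum(), R, C, X)
    return best[1], best[2], best[3]
```

Because the essential matrix only fixes the translation direction, the recovered camera center has unit norm and the reconstruction is defined up to that scale.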

Write-up:

Figure 3: Camera pose estimation.

Visualize inlier matches as shown in Figure 3(a).
Visualize camera pose and 3D reconstructed points in 3D as shown in Figure 3(b).

4 Nonlinear 3D Point Refinement

You will write a nonlinear triangulation code. Given the linear estimate for the point triangulation, X, you will refine the 3D point X to minimize geometric error (reprojection error) via iterative nonlinear least squares estimation,

$$\Delta X = \left( \frac{\partial f(X)}{\partial X}^T \frac{\partial f(X)}{\partial X} \right)^{-1} \frac{\partial f(X)}{\partial X}^T \left( b - f(X) \right). \quad (1)$$

Write-up:

Derive the point Jacobian, i.e., ∂f(X)_j/∂X, and write the following code.

df_dX = JacobianX(K, R, C, X)
Write a code to refine the 3D point by minimizing the reprojection error and visualize reprojection error reduction similar to Figure 5.

X = NonlinearTriangulation(X, u₁, R₁, C₁, u₂, R₂, C₂)

Algorithm 2 Nonlinear Point Refinement

b = [u₁^T u₂^T]^T
for j = 1 : nIters do
    Build the point Jacobian, ∂f(X)/∂X.
    Compute f(X).
    ΔX = ((∂f(X)/∂X)^T (∂f(X)/∂X))⁻¹ (∂f(X)/∂X)^T (b − f(X))
    X = X + ΔX
end for
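The iteration in Algorithm 2 can be sketched in NumPy as below. The sketch assumes the projection [u v w]^T = K R (X − C) with f = [u/w, v/w]^T, and generalizes the two-view signature to a list of cameras. The function names and the list-based interface are illustrative choices, not the required MATLAB interface.

```python
import numpy as np

def project(K, R, C, X):
    """Project a 3D point X into a camera (K, R, C)."""
    u, v, w = K @ R @ (X - C)
    return np.array([u / w, v / w])

def jacobian_x(K, R, C, X):
    """2x3 Jacobian of the projection with respect to X (quotient rule)."""
    u, v, w = K @ R @ (X - C)
    M = K @ R                                  # d[u, v, w]/dX
    return np.vstack([(w * M[0] - u * M[2]) / w**2,
                      (w * M[1] - v * M[2]) / w**2])

def nonlinear_triangulation(X, cameras, measurements, n_iters=10):
    """Gauss-Newton refinement of one 3D point.

    cameras      : list of (K, R, C) tuples
    measurements : list of 2-vectors, one per camera
    """
    X = np.asarray(X, dtype=float).copy()
    b = np.concatenate(measurements)
    for _ in range(n_iters):
        J = np.vstack([jacobian_x(K, R, C, X) for K, R, C in cameras])
        f = np.concatenate([project(K, R, C, X) for K, R, C in cameras])
        dX = np.linalg.solve(J.T @ J, J.T @ (b - f))   # Equation (1)
        X = X + dX
    return X
```

Recording the norm of b − f(X) at each iteration gives the reprojection error curve asked for in the write-up.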

5 Camera Registration

You will register an additional image, I_j, using 2D-3D correspondences.

Write-up:

(1) (3D-2D correspondences) Given 3D triangulated points, find 2D-3D matches, X ↔ u.
(2) (Perspective-n-Point algorithm) Write a code that computes the 3D camera pose from 3D-2D correspondences:

[R, C] = LinearPnP(u, X)

X: n × 3 matrix containing n 3D reconstructed points
u: n × 2 matrix containing n 2D points in the additional image I₃
R and C: rotation and translation for the additional image.

Hint: After the linear solve, rectify the rotation matrix such that det(R) = 1 and scale C according to the rectification.

(3) (RANSAC PnP) Write a RANSAC algorithm for the camera pose registration (PnP) given n matches using the following pseudo code:

Algorithm 3 PnP RANSAC

nInliers ← 0
for i = 1 : M do
    Choose 6 correspondences, X_r and u_r, randomly from X and u.
    [R_r, t_r] = LinearPnP(u_r, X_r)
    Compute the number of inliers, n_r, with respect to R_r, t_r.
    if n_r > nInliers then
        nInliers ← n_r
        R = R_r and t = t_r
    end if
end for
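A possible NumPy sketch of the linear PnP solve and the RANSAC loop in Algorithm 3 follows. It assumes normalized image coordinates, returns the camera center C rather than t (with t = −RC), and counts inliers with a reprojection error threshold. The threshold, iteration count, and function names are illustrative values rather than ones given by the assignment.

```python
import numpy as np

def linear_pnp(u, X):
    """Linear PnP. u: (n, 2) normalized points, X: (n, 3) 3D points, n >= 6."""
    Xh = np.column_stack([X, np.ones(len(X))])
    zeros = np.zeros_like(Xh)
    A = np.vstack([np.hstack([Xh, zeros, -u[:, :1] * Xh]),
                   np.hstack([zeros, Xh, -u[:, 1:2] * Xh])])
    P = np.linalg.svd(A)[2][-1].reshape(3, 4)     # null vector of A, up to scale
    U, D, Vt = np.linalg.svd(P[:, :3])
    R, t = U @ Vt, P[:, 3] / D.mean()             # rectify R and rescale t
    if np.linalg.det(R) < 0:                      # enforce det(R) = 1
        R, t = -R, -t
    return R, -R.T @ t                            # camera center C = -R^T t

def reprojection_error(R, C, u, X):
    """Per-point distance between measurements and projected 3D points."""
    Xc = (X - C) @ R.T
    return np.linalg.norm(Xc[:, :2] / Xc[:, 2:3] - u, axis=1)

def pnp_ransac(u, X, n_iters=200, threshold=1e-2, seed=0):
    """Keep the pose from 6 random correspondences with the most inliers."""
    rng = np.random.default_rng(seed)
    best_R, best_C, best_count = None, None, 0
    for _ in range(n_iters):
        idx = rng.choice(len(X), size=6, replace=False)
        R, C = linear_pnp(u[idx], X[idx])
        count = int((reprojection_error(R, C, u, X) < threshold).sum())
        if count > best_count:
            best_R, best_C, best_count = R, C, count
    return best_R, best_C, best_count
```

The threshold is expressed in normalized coordinates, so a value tuned in pixels would need to be divided by the focal length.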

Visualize 3D registered pose as shown in Figure 4.

(a) Front view (b) Top view

Figure 4: Additional image registration.

(4) (Reprojection) Visualize measurement and reprojection to verify the solution.

6 Nonlinear Camera Refinement

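Given the initial estimate R_i and t_i, you will refine the camera pose to minimize geometric error (reprojection error) via iterative nonlinear least squares estimation,

$$\Delta p = \left( \frac{\partial f(p)}{\partial p}^T \frac{\partial f(p)}{\partial p} \right)^{-1} \frac{\partial f(p)}{\partial p}^T \left( b - f(p) \right), \quad f(p) = \begin{bmatrix} u_1/w_1 \\ v_1/w_1 \\ \vdots \\ u_n/w_n \\ v_n/w_n \end{bmatrix}, \quad b = \begin{bmatrix} x_1 \\ y_1 \\ \vdots \\ x_n \\ y_n \end{bmatrix} \quad (2)$$

where p = [C^T q^T]^T. C ∈ R³ is the camera optical center and q ∈ S³ is the quaternion representation of the camera rotation.

It is possible to minimize the overshooting by adding damping, λ, as follows:

$$\Delta p = \left( \frac{\partial f(p)}{\partial p}^T \frac{\partial f(p)}{\partial p} + \lambda I \right)^{-1} \frac{\partial f(p)}{\partial p}^T \left( b - f(p) \right), \quad (3)$$

where λ is the damping parameter. You can try λ ∈ [0, 10].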

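Note that the conversion between quaternion and rotation matrix is given as follows:

$$R = \begin{bmatrix} 1 - 2q_y^2 - 2q_z^2 & 2q_xq_y - 2q_zq_w & 2q_xq_z + 2q_yq_w \\ 2q_xq_y + 2q_zq_w & 1 - 2q_x^2 - 2q_z^2 & 2q_yq_z - 2q_xq_w \\ 2q_xq_z - 2q_yq_w & 2q_yq_z + 2q_xq_w & 1 - 2q_x^2 - 2q_y^2 \end{bmatrix}, \quad q = \begin{bmatrix} q_w \\ q_x \\ q_y \\ q_z \end{bmatrix} = \begin{bmatrix} q_w \\ (R_{32} - R_{23})/(4q_w) \\ (R_{13} - R_{31})/(4q_w) \\ (R_{21} - R_{12})/(4q_w) \end{bmatrix}, \quad (4)$$

where $q_w = \frac{1}{2}\sqrt{1 + R_{11} + R_{22} + R_{33}}$.

Write-up: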

Derive the quaternion Jacobian to rotation using Equation (4), i.e., ∂R/∂q, and write the following code. Note: ignore the normalization ‖q‖ = 1.

dR_dq = JacobianQ(q)

Derive the rotation Jacobian to projection using Equation (2), i.e., ∂f(p)_j/∂R, and write the following code. Note: use the vectorized form of the rotation matrix.

df_dR = JacobianR(R, C, X)

Derive the expression of ∂f(p)_j/∂q using the chain rule.
Derive the camera center Jacobian to projection using Equation (2), i.e., ∂f(p)_j/∂C, and write the following code.

df_dC = JacobianC(R, C, X)

Write a code to refine the camera pose by minimizing the reprojection error and visualize the reprojection error reduction as shown in Figure 5:

[R, C] = PnP_Nonlinear(R, C, u, X)
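The structure of these Jacobians can be outlined as follows. This is a sketch under the assumption of normalized coordinates, where the j-th point projects as $[u_j\ v_j\ w_j]^T = R(X_j - C)$, and with the rotation matrix vectorized into 9 elements. It shows how the pieces and their dimensions fit together rather than the full element-wise derivation.

```latex
% Quotient rule applied to f(p)_j = [u_j / w_j, v_j / w_j]^T
\frac{\partial f(p)_j}{\partial C}
  = \frac{1}{w_j^2}
    \begin{bmatrix}
      w_j \frac{\partial u_j}{\partial C} - u_j \frac{\partial w_j}{\partial C} \\
      w_j \frac{\partial v_j}{\partial C} - v_j \frac{\partial w_j}{\partial C}
    \end{bmatrix}
  \in \mathbb{R}^{2 \times 3},
\qquad
\frac{\partial}{\partial C} \begin{bmatrix} u_j \\ v_j \\ w_j \end{bmatrix} = -R

% Chain rule through the vectorized rotation matrix
\frac{\partial f(p)_j}{\partial q}
  = \underbrace{\frac{\partial f(p)_j}{\partial R}}_{2 \times 9}
    \underbrace{\frac{\partial R}{\partial q}}_{9 \times 4}
  \in \mathbb{R}^{2 \times 4}

% Camera pose Jacobian for the j-th point, with p = [C^T q^T]^T
\frac{\partial f(p)_j}{\partial p}
  = \begin{bmatrix}
      \frac{\partial f(p)_j}{\partial C} & \frac{\partial f(p)_j}{\partial q}
    \end{bmatrix}
  \in \mathbb{R}^{2 \times 7}
```

Stacking the 2 × 7 blocks of all n points gives the 2n × 7 Jacobian used in Equations (2) and (3).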

Algorithm 4 Nonlinear Camera Pose Refinement
p = [C^T q^T]^T
for j = 1 : nIters do
    C = p_1:3, q = p_4:7, R = Quaternion2Rotation(q)
    Build the camera pose Jacobian for all points, ∂f(p)/∂p.
    Compute f(p).
    Δp = ((∂f(p)/∂p)^T (∂f(p)/∂p) + λI)⁻¹ (∂f(p)/∂p)^T (b − f(p)) using Equation (3).
    p = p + Δp
    Normalize the quaternion scale, p_4:7 = p_4:7 / ‖p_4:7‖.
end for
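The damped iteration of Algorithm 4 can be sketched in NumPy as below. Two assumptions are made for brevity: the quaternion is ordered scalar-first, q = [q_w, q_x, q_y, q_z], and the Jacobian is approximated by central differences as a stand-in for the analytic JacobianQ, JacobianR, and JacobianC that the write-up asks for. The function names are illustrative.

```python
import numpy as np

def quaternion_to_rotation(q):
    """q = [qw, qx, qy, qz] -> 3x3 matrix (a rotation when q has unit norm)."""
    w, x, y, z = q
    return np.array([
        [1 - 2*y*y - 2*z*z, 2*x*y - 2*z*w,     2*x*z + 2*y*w],
        [2*x*y + 2*z*w,     1 - 2*x*x - 2*z*z, 2*y*z - 2*x*w],
        [2*x*z - 2*y*w,     2*y*z + 2*x*w,     1 - 2*x*x - 2*y*y]])

def project_all(p, X):
    """f(p): stacked normalized projections of X (n, 3) for p = [C, q]."""
    C, q = p[:3], p[3:]
    Xc = (X - C) @ quaternion_to_rotation(q).T
    return (Xc[:, :2] / Xc[:, 2:3]).ravel()

def numerical_jacobian(p, X, eps=1e-6):
    """Central-difference approximation of df/dp, shape (2n, 7)."""
    J = np.zeros((2 * len(X), 7))
    for k in range(7):
        dp = np.zeros(7)
        dp[k] = eps
        J[:, k] = (project_all(p + dp, X) - project_all(p - dp, X)) / (2 * eps)
    return J

def pnp_nonlinear(p, u, X, n_iters=20, lam=1e-3):
    """Damped Gauss-Newton on p = [C, q]; returns p and the error history."""
    b = u.ravel()
    errors = []
    for _ in range(n_iters):
        r = b - project_all(p, X)
        errors.append(np.linalg.norm(r))
        J = numerical_jacobian(p, X)
        p = p + np.linalg.solve(J.T @ J + lam * np.eye(7), J.T @ r)  # Equation (3)
        p[3:] /= np.linalg.norm(p[3:])     # re-normalize the quaternion
    return p, errors
```

The returned error history can be plotted directly to show the reprojection error reduction.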

Figure 5: Nonlinear refinement reduces the reprojection error (0.19 → 0.11).

7 Bundle Adjustment

You will write a nonlinear refinement code that simultaneously optimizes camera poses and 3D points using the sparse nature of the Jacobian matrix.

[P, X] = BundleAdjustment(P, X, R, Mx, My)

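For example, consider 3 camera poses and 2 points. The Jacobian matrix can be written as follows:

$$J = \begin{bmatrix} J_p & J_X \end{bmatrix} = \left[ \begin{array}{ccc|cc}
\frac{\partial f_{11}}{\partial p_1} & 0_{2\times7} & 0_{2\times7} & \frac{\partial f_{11}}{\partial X_1} & 0_{2\times3} \\
0_{2\times7} & \frac{\partial f_{12}}{\partial p_2} & 0_{2\times7} & \frac{\partial f_{12}}{\partial X_1} & 0_{2\times3} \\
0_{2\times7} & 0_{2\times7} & \frac{\partial f_{13}}{\partial p_3} & \frac{\partial f_{13}}{\partial X_1} & 0_{2\times3} \\
\frac{\partial f_{21}}{\partial p_1} & 0_{2\times7} & 0_{2\times7} & 0_{2\times3} & \frac{\partial f_{21}}{\partial X_2} \\
0_{2\times7} & \frac{\partial f_{22}}{\partial p_2} & 0_{2\times7} & 0_{2\times3} & \frac{\partial f_{22}}{\partial X_2} \\
0_{2\times7} & 0_{2\times7} & \frac{\partial f_{23}}{\partial p_3} & 0_{2\times3} & \frac{\partial f_{23}}{\partial X_2}
\end{array} \right] \quad (5)$$

where f_{ij} is the projection of the i-th point onto the j-th camera, J_p and J_X are the Jacobians for camera and point, respectively, and λ ∈ [0, 10] is the damping parameter.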

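The normal equation, $J^T J \Delta x = J^T (b - f(x))$, can be decomposed into:

$$\begin{bmatrix} A & B \\ B^T & D \end{bmatrix} \begin{bmatrix} \Delta\hat{p} \\ \Delta\hat{X} \end{bmatrix} = \begin{bmatrix} e_p \\ e_X \end{bmatrix}, \quad (6)$$

where

$$A = J_p^T J_p + \lambda I, \quad B = J_p^T J_X, \quad D = J_X^T J_X + \lambda I, \quad e_p = J_p^T (b - f(x)), \quad e_X = J_X^T (b - f(x)),$$

where $\hat{p} = [p_1^T \cdots p_I^T]^T$ and $\hat{X} = [X_1^T \cdots X_M^T]^T$, and I and M are the number of images and points, respectively.

The decomposed normal equation in Equation (6) allows us to efficiently compute the inverse of $J^T J$ using the Schur complement of D:

$$\Delta\hat{p} = (A - B D^{-1} B^T)^{-1} (e_p - B D^{-1} e_X),$$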

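where D is a block diagonal matrix whose inverse can be efficiently computed by inverting the small block matrices:

$$D = \begin{bmatrix} d_1 & & \\ & \ddots & \\ & & d_M \end{bmatrix}, \quad D^{-1} = \begin{bmatrix} d_1^{-1} & & \\ & \ddots & \\ & & d_M^{-1} \end{bmatrix} \quad (7)$$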
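The Schur complement solve can be checked on a small dense example. The sketch below assumes 3 × 3 point blocks and recovers the point update by back-substitution from the second block row of Equation (6), ΔX = D⁻¹(e_X − BᵀΔp). The function name `schur_solve` is illustrative.

```python
import numpy as np

def schur_solve(A, B, D_blocks, e_p, e_X):
    """Solve [[A, B], [B^T, D]] [dp; dX] = [e_p; e_X] with D = blkdiag(D_blocks)."""
    m = len(D_blocks)
    D_inv = np.zeros((3 * m, 3 * m))
    for k, d in enumerate(D_blocks):
        D_inv[3*k:3*k+3, 3*k:3*k+3] = np.linalg.inv(d)   # invert each 3x3 block
    S = A - B @ D_inv @ B.T                               # Schur complement of D
    dp = np.linalg.solve(S, e_p - B @ D_inv @ e_X)        # camera update
    dX = D_inv @ (e_X - B.T @ dp)                         # point update by back-substitution
    return dp, dX
```

For realistic problem sizes D⁻¹ would be kept in a sparse or per-block form; the dense matrix here only keeps the example short.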

The bundle adjustment algorithm is summarized in Algorithm 5. Note that not all points are visible from all cameras. You need to reason about visibility, i.e., if a point is not visible from a camera, the corresponding Jacobian rows and measurement are omitted from J and b, respectively.

Algorithm 5 Bundle Adjustment

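p̂ = [p_1^T ⋯ p_I^T]^T and X̂ = [X_1^T ⋯ X_M^T]^T
for iter = 1 : nIters do
    Empty J_p, J_X, b, f, D_inv.
    for i = 1 : M do
        d = 0_3×3
        for j = 1 : I do
            if the i-th point is visible from the j-th image then
                J1 = 0_2×7I and J2 = 0_2×3M
                J1(:, 7(j−1)+1 : 7j) = ∂f(p_j, X_i)/∂p_j
                J2(:, 3(i−1)+1 : 3i) = ∂f(p_j, X_i)/∂X_i
                J_p = [J_p^T J1^T]^T and J_X = [J_X^T J2^T]^T
                d = d + (∂f(p_j, X_i)/∂X_i)^T (∂f(p_j, X_i)/∂X_i)
                b = [b^T u_ij^T]^T, where u_ij is the measurement of the i-th point in the j-th image
                f = [f^T x_ij^T]^T, where x_ij is the projection of the i-th point onto the j-th image
            end if
        end for
        d = d + λI
        D_inv = blkdiag(D_inv, d⁻¹)
    end for
    e_p = J_p^T (b − f)
    e_X = J_X^T (b − f)
    A = J_p^T J_p + λI, B = J_p^T J_X, D⁻¹ = D_inv
    Δp̂ = (A − B D⁻¹ B^T)⁻¹ (e_p − B D⁻¹ e_X)
    p̂ = p̂ + Δp̂
    Normalize quaternions.
    ΔX̂ = D⁻¹ (e_X − B^T Δp̂)
    X̂ = X̂ + ΔX̂
end for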

Write-up: You will first start with two images and 10 3D points to test your bundle adjustment program.

Derive J_p and J_X.
Run Algorithm 5 and visualize the reprojection error similar to Figure 5.

8 Putting All Things Together

Write-up: You will run with all images and 3D points based on Algorithm 1.

Visualize 3D camera pose and points as shown in Figure 6.
Visualize reprojection for all images.

Figure 6: You will reconstruct all images and 3D points using structure from motion.
