As discussed earlier in this chapter, to uniquely determine a position and perform a translation, a 3 by 1 position vectorp∈R3is the best way to do so without any complication, and it is a member of the real additive group in R3. To unify the representation for both position and orientation, it now only depends on how to determine an orientation and perform a rotation.
In general, to define a way to unify the representation for both position and orientation, and also for both translation and rotation simultaneously, the following conditions must hold:
1. It is unique and one-to-one correspondence at every position and orienta- tion without any condition or constraint imposed;
2. It can uniquely represent and perform both translation and rotation si- multaneously and form either an additive or a multiplicative group over a number field or ring;
3. With a certain step of differentiation, it can also uniquely and simultane- ously determine velocities of both translation and rotation;
4. The major computations involved in every transformation and differenti- ation are manageable without complication.
The first condition indicates that a single unified definition is unique and cannot correspond to two distinct positions or orientations, and in turn, one single position or orientation cannot have two different values. The second one implies that when performing a simultaneous translation/rotation, the representing form must be closed and invertible, and should also satisfy dis- tributivity and associativity for successive translations/rotations. The third one further requires a unique way of finding a unified tangent space to eval- uate the rate of 6 d.o.f. rigid motion. Of course, we wish the unified 6 d.o.f.
velocity could be an exact differential 1-form, which would be more ideal for our future robotic control systems design, especially in the robotic trajectory- tracking control applications. However, we are not imposing such a restrictive condition on the unification here, because at the level of modeling robotic and digital human kinematics, the exact 1-form may not necessarily be required.
The homogeneous transformation, defined in equation (3.2) or (3.4), clearly meets all the requirements of unification, because it is a member of the 6- dimensional multiplicative Lie group SE(3) over the real field. The 3 by 3 dual rotation matrix, as a special orthogonal dual matrix given in (2.23), can also be an ideal unification to represent both position and orientation and also uniquely and simultaneously perform both translation and rotation without exception, because all such dual rotation matrices constitutes a Lie group SO(3, d) over the dual ring. In particular, its exclusive advantage in numerical computations promotes itself to be one of the most promising approaches to uniquely representing 6 d.o.f. rigid motion, provided that every dual-number operation, including all the basic expansions, such as the exponential, sine and cosine given in equations (2.16), (2.17) and (2.18), can be implemented in a computer program as built-in functions, like the built-in complex number operations in MATLABT M. More advantages inherent in the dual rotation matrices will be further discussed in the next chapter.
As mentioned earlier in this chapter, we wish to directly define a 6 by 1 vector ξ ∈ R6 that augments a 3 by 1 position vector p ∈ R3 and the aforementioned 3 by 1 “orientation vector”ρ=φk∈R3to represent 6 d.o.f.
rigid motion. Namely, letξ= p ρ
, and we wish its time-derivative ˙ξ= p˙
˙ ρ
would also be able to represent a total velocity of 6 d.o.f. rigid motion. In the position part, ˙p=v is obviously a linear velocity without complication.
However, the time-derivative ofρhas to be determined by
˙
ρ= ˙φk+φk˙ =ω+φk.˙
This indicates that the unit vector k is no longer a fixed rotation axis so that ˙k= 0. On the other hand, the uniqueness in definition and the closure in successive rotation are the other two major concerns on the orientation part defined byρ=φk. In other words, such a conformal orientation vector definition ρ = φk may be able to represent an orientation, but may not perform a rotation correctly.
Recall that each pair ofφandkin the orientation vector ρis determined by a corresponding rotation matrix R that uniquely represents a frame ori- entation, i.e., R10 = exp(φ1K01) = exp(S(ρ1)). If we perform a rotation for this frame #1 byR21 in order to change the original orientationR10 of frame
#1 to a new one R20 =R10R12 for frame #2, then the correspondingφ2 and k2for this orientation change can be found fromR21through thek−φproce- dure. Note that the new unit vectork2 is now referred to frame #1, instead of the base frame #0. Thus, we have to use R10k2 =k2(0) to find the new unit axis projected onto the base, which is equivalent to its skew-symmetric matrix K1(0)2 = R10K12(R10)T = R10K12R01 based on (3.13), where the index inside the parenthesis (0) at each subscript denotes the frame of projection.
If the rotation from frame #1 to frame #2 is represented by the orientation vector ρ2 = φ2k2(0), in order to unify it with its counterpart, the position vector that uses addition to perform a translation, then the resulting orienta- tion of frame #2 is supposed to beρ=ρ1+ρ2, or its skew-symmetric matrix S(ρ) =φ1K01+φ2K1(0)2 , where the two rotation axes have to be projected onto the common reference before being added together. A fundamental question now arises and must be justified before continuing:
R20= exp(S(ρ)) = exp(φ1K01+φ2K1(0)2 ) =R10R21= exp(φ1K01)ãexp(φ2K12)?
To simplify the question, let us just test the truth of the general form:
exp(K1+K2) = exp(K1)ãexp(K2) ? (3.15) where each ofKifori= 1,2 is a skew-symmetric matrix.
According to the well-known Baker-Campbell-Hausdorff (BCH) formula in mathematics [8, 9], for any two square matricesAandB,
exp(A)ãexp(B) = exp(A+B+1
2[A, B] + 1
12[A,[A, B]]− 1
12[B,[A, B]]− ã ã ã), where each [A, B] =AB−BA is the Lie bracket defined as a commutator in Lie algebra, see Chapter 2. Therefore, equation (3.15) is true if and only ifK1 andK2 are commutable in multiplication, i.e.,K1K2 =K2K1. Since each Ki is skew-symmetric, we have (K1K2)T =K2TK1T = K2K1, but not K1K2 = K2K1, unless the product K1K2 is symmetric. In fact, based on equation (2.5) in Chapter 2,
[K1, K2] =K1K2−K2K1= (k1×k2)×=S(k1×k2). (3.16) This means that [K1, K2] = 0 if and only if the two vectors k1 k2, which becomes a trivial case, because the two successive rotations are performed about the same axis. Therefore, the definition of an orientation vector using ρ=φkunder regular vector addition to perform a rotation is fundamentally incorrect.
As a numerical example, consider an orientation of frame #1 referred to frame #0 is given byR10 and a rotation from frame #1 to frame #2 is given byR12, and the resultant orientation isR20=R01R21. The corresponding orien- tation vectors areρ1, ρ2 andρ3, respectively, in the following MATLABT M working window:
>> R01 =
0.0000 0.5000 0.8660 0.0000 -0.8660 0.5000
1.0000 0 -0.0000
R12 =
0.1294 0.8660 0.4830 0.2241 -0.5000 0.8365
0.9659 0 -0.2588
R02 =
0.9486 -0.2500 0.1941 0.2888 0.4330 -0.8539 0.1294 0.8660 0.4830
rho_1 = -1.9269 -0.5163 -1.9269
rho_2 = -1.8198 -1.0507 -1.3964
rho_3 = 1.0717 0.0403 0.3358
>> rho_1 + rho_2
ans = -3.7467 -1.5670 -3.3233
>> rho_1 + R01*rho_2 ans =
-3.6615 -0.3046 -3.7467
>>
It can be clearly seen thatρ1+ρ2=ρ3. Even if the projection ofρ2=φ2k2
is taken care byR10ρ2before adding it toρ1, the sumρ1+R10ρ2=ρ3. There- fore, the 3 by 1 conformal orientation vector can only be used to represent an instantaneous orientation for a frame given by a rotation matrix, but it cannot be utilized to perform a rotation. The following chart summarizes this fact:
R02 = R10 × R21
↓ ↓ ↓
ρ3 = ρ1 + ρ2
In history, there were other similar definitions attempted for orientation vectors than the conformal definitionρ=φk[10, 11]. All such attempts thus far may just improve their computational complexity in their conversions and may also uniquely represent an instantaneous orientation, but none of them could fix the infrastructural defect in performing a rotation. In general, we may define them in such a way that
ρ=γ(φ)k
with a periodic and odd scalar functionγ(φ). Typical definitions are listed below:
ρ(1)= sinφk; ρ(2)= sinφ
2k; and ρ(3)= tanφ 2k.
Based on all the equations related to both forward and inverse conversions between the rotation matrixR∈SO(3) and k−φin (2.8), (2.9), (2.11) and (2.13), each of the skew-symmetric matrices S(ρ(i)) = ρ(i)× of the above orientation vector definitions can be determined in terms ofRas follows:
S(ρ(1)) =1
2(R−RT); S(ρ(2)) = R−RT 2
1 + tr(R); and S(ρ(3)) = R−RT 1 + tr(R).
In comparison with the conformal definition:
S(ρ) =φK= φ
2 sinφ(R−RT) under φ= cos−1 tr(R)−1 2
, it becomes quite interesting that the first three definitions can be determined directly in terms of only a given rotation matrixRwithout need to go through any inverse conversion in (2.9) and (2.11). Whereas the conformal definition ρ=φkrequiresφ, and it has to rely upon equation (2.9). However, conversely to find the corresponding rotation matrixR in terms of a givenρ(i), no one can directly solve the above equation for eachS(ρ(i)) to find the equivalent R. Instead, we have to rely on both the inverse and forward conversions given in (2.9), (2.11) and (2.8). For instance, let us look at the third one ρ(3) = tanφ2k. If the numerical value of ρ(3) is given, first find its norm to determine tanφ2 = ρ(3) so that φ= 2 arctan(ρ(3)). Second, the unit vector k is just its normalized vector, i.e., k = ρρ(3)(3). Once we have φand k, apply the forward conversion (2.8) to reach the corresponding rotation matrixR.
All the attempts to define an orientation vector form have either a sin- gularity issue or a limited range of φ. When R is symmetric but not the identity, φ=±π and tr(R) = −1 so that the denominators of bothS(ρ(2)) and S(ρ(3)) vanish. However, we can directly determineρ(2) = sinφ2k=±k and use equation (2.13) to find k, but ρ(3) will go to infinity. The first one ρ(1)has the simplest conversion over the others, but it only offers half of the range for the rotation angleφ.
In summary, unlike the rotation matrix R∈SO(3), all such attempts to produce an orientation vector are conditional and will bring inconvenience in computer programming, plus none of them can uniquely perform rotations.
The only advantage is that the velocity of each definition is obviously using its time-derivative ˙ρ so that such an orientation velocity guarantees to be an exact differential 1-form, which may benefit to the future robotic control systems design.
Any kind of Euler angles may also possibly be adopted to define an orien- tation vector:
ρ=
⎛
⎝φ θ ψ
⎞
⎠
for either the Z-Y-Z or Roll-Pitch-Yaw Euler angle definitions, which may perform a rotation by adding another Euler angles-based vector Δρ = (Δφ Δθ Δψ)T. However, the resultant orientation vector, for instance, in the first Z-Y-Z case,
ρ1=ρ+Δρ=
⎛
⎝φ+Δφ θ+Δθ ψ+Δψ
⎞
⎠
may not be consistent to the resultant rotation matrix defined in (3.6).
Namely,
R1=R(z, φ+Δφ)R(y, θ+Δθ)R(z, ψ+Δψ) is not equal to the product
R(z, φ)R(y, θ)R(z, ψ)R(z, Δφ)R(y, Δθ)R(z, Δψ)
at all due to matrix multiplication not being commutable. Although you may not need to care if the results are consistent and just treat the sumρ1 as a new orientation after a rotationΔρ, it still has a non-unique issue in their inverse solutions, see the preceding discussion on the Euler angles in this chapter.
The well-knownquaternionmethod can also be employed to uniquely rep- resent orientation of a frame as well as to perform rotation effectively [12].
According to group theory, a quaternionqis defined as a “complex number”
of two complex numbers. Letc1=q0+j3q3andc2=q2+j3q1be two complex numbers, wherej32=−1. Then, a quaternionqis created by
q=c1+j2c2=q0+j3q3+j2q2+j2j3q1=q0+j1q1+j2q2+j3q3, where j1,j2 and j3 are three independent complex units with the following properties:
j12=j22=j32=j1j2j3=−1, and
j1j2=−j2j1=j3, j2j3=−j3j2=j1, j3j1=−j1j3=j2.
It has been shown that each quaternion consists of four independent com- ponents (q0 q1 q2 q3), and all such quaternions associated with the above multiplication rules constitute a non-commutative field, called the quaternion field. In other words, they are the members of both additive and multiplica- tive groups.
Regarding the addition between two quaternions, let qa = qa0 +j1qa1 + j2q2a+j3q3a andqb=qb0+j1q1b+j2qb2+j3q3b be two quaternions. Then,
qa+qb= (q0a+q0b) +j1(q1a+ab1) +j2(q2a+qb2) +j3(q3a+qb3).
The multiplication between qa and qb follows the distributive law and the result is called the Hamilton product:
qaãqb= (q0aqb0−q1aqb1−q2aq2b−qa3q3b) +j1(q0aqb1+q1aqb0+q2aqb3−q3aqb2)+
+j2(q0aqb2−q1aqb3+q2aqb0+q3aqb1) +j3(qa0q3b+q1aq2b−q2aq1b+q3aqb0).
Similar to the complex number, the conjugate of q is q∗ = q0−j1q1− j2q2−j3q3 such that
qãq∗=q2=q20+q21+q22+q32.
Actually, a quaternion can be linearly mapped to a 2 by 2 complex matrix with all the operations of algebraic field preserved, i.e.,
Γ(q) =Γ(q0+j1q1+j2q2+j3q3) = q0+jq1 q2+jq3
−q2+jq3 q0−jq1
with j2=−1.
One can easily verify that the above Hamilton product can also be found by Γ(qaãqb) = q0a+jqa1 qa2+jqa3
−q2a+jqa3 qa0−jqa1
q0b+jq1b q2b+jqb3
−q2b+jqb3 q0b−jqb1
. With the perfect algebraic structure owned by the quaternions, it is common to interpret a quaternion as a combination of a scalar and a 3- dimensional vector:q= (q0, v) with the vectorv= (q1q2q3)T. Based on the rules of complex units multiplication, for two quaternions:qa= (q0a, va) and qb= (qb0, vb),
qaãqb= (q0aqb0−vaTvb, qa0vb+q0bva+vaìvb). (3.17) Now, let the four components of a quaternion be further specified by the four Euler’s parameters:
q0= cosφ
2, qi=kisinφ
2, for i= 1,2,3,
where (k1k2k3)T =kis a unit vector, about which a frame is rotated byφ.
In other words, the quaternion is specified to be q= cosφ
2 + (j1k1+j2k2+j3k3) sinφ
2 = cosφ
2 + ˆksinφ
2, (3.18) where ˆk = j1k1 +j2k2+j3k3 is the unit vector k in the form of a pure imaginary quaternion. Equation (3.18) can be rewritten as
q= cosφ 2, sinφ
2k
.
Since ˆk2= (j1k1+j2k2+j3k3)2=−1, equation (3.18) can also be interpreted as a generalized Euler formula in complex analysis:
eˆkφ2 = cosφ
2 + ˆksinφ 2.
Such a specification makes the quaternion be able to uniquely perform a rotation as well as to uniquely represent a frame orientation. We can easily prove that (3.18) is a unity quaternion if k is a unit vector. Namely, since q∗= cosφ2 −ˆksinφ2 =
cosφ2, −sinφ2k
,
qãq∗= cos2φ
2 + sin2φ 2, 0
= 1.
Under the unity quaternion definition in (3.18), for two successive rota- tions, based on (3.17), the resultant orientation is
q=qaqb= cosφ1
2 , sinφ1
2 k1
cosφ2
2 , sinφ2
2 k2
= cosφ1
2 cosφ2
2 −sinφ1
2 sinφ2
2 kT1k2, cosφ1
2 sinφ2
2 k2+ cosφ2
2 sinφ1
2 k1+ sinφ1
2 sinφ2
2 (k1×k2)
. Ifk1=k2=k, i.e., the two rotations are about the same unit axis, then,
q=qaqb= cosφ1+φ2
2 , sinφ1+φ2
2 k
= cosφ1+φ2
2 + ˆksinφ1+φ2
2 .
For a 3D vector v, we may consider it as a special quaternion without the scalar part: qv = (0, v). Conventionally, to perform a rotation on this vector, the new vectorvnew = Rv for some rotation matrix R ∈ SO(3). If we ask the quaternion field to perform the same rotation, using the k−φ procedure to find the corresponding rotating angleφand the unit vectork, and then to define a quaternionq=
cosφ2, sinφ2k
. However, the resulting new vector vnew is the vector part, i.e., the imaginary part of the following new quaternion:
qnew= (0, vnew) =qãqvãq−1=qãqvãq∗,
instead of justqãqv, whereq∗ is the conjugate of qand q−1 =q∗/q=q∗ for the unity quaternionq.
Based on (3.17), the product qãqv= cosφ
2, sinφ 2k
(0, v) = −sinφ
2kTv, cosφ
2v+ sinφ 2(k×v)
. Then, the scalar (real) part ofqãqvãq∗=qãqvã
cosφ2, −sinφ2k
is, indeed, equal to 0, while its imaginary part, based on (3.17) again, becomes
vnew= sin2φ
2(kTv)k+ cos2φ
2v+ 2 sinφ 2cosφ
2(k×v) + sin2φ
2k×(k×v).
Now, since 2 sinφ2cosφ2 = sinφ, and according to the triple cross-product equation (2.1) in Chapter 2, k×(k×v) = (kTv)k−(kTk)v = (kTv)k−v so that (kTv)k = k×(k×v) +v. Substituting the results into the above imaginary part of the new quaternionqnewand noticing thatk×v=Kvand
k×(k×v) =K2v with K =S(k), the skew-symmetric matrix of the unit vectork, we obtain
vnew= 2 sin2φ
2K2v+v+ sinφKv=v+ sinφKv+ (1−cosφ)K2v.
In contrast, based on the rotation matrix expansion in equation (2.8) of Chapter 2, the new vector after a rotation about the unit axiskby the angle φfor a vectorv should be
vnew=Rv= (I+ sinφK+ (1−cosφ)K2)v.
The above answer evidently exposes how many more computations are in- curred by a quaternion to handle a vector rotation in comparison with the rotation matrix.
Therefore, the quaternion method is not quite as easy as the conventional rotation matrix approach for performing a 3D rotation and representing a frame orientation, although its infrastructure is a perfect algebraic field. In addition to the computational inconvenience, as evidently seen from the above introduction, a position vector has three coordinate elements, while a quater- nion consists of four components in nature, which lacks advantage for the unification of representing 6 d.o.f. rigid motion.