=== Pseudoinverse ===
The singular value decomposition can be used for computing the pseudoinverse of a matrix. The pseudoinverse of the matrix \mathbf M with singular value decomposition \mathbf M = \mathbf U \boldsymbol \Sigma \mathbf V^\ast is \mathbf M^+ = \mathbf V \boldsymbol \Sigma^+ \mathbf U^\ast, where \boldsymbol \Sigma^+ is the pseudoinverse of \boldsymbol \Sigma, which is formed by replacing every non-zero diagonal entry by its
reciprocal and transposing the resulting matrix. The pseudoinverse is one way to solve
linear least squares problems.
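As an illustration, the following sketch (assuming NumPy, with arbitrary example data) forms \boldsymbol \Sigma^+ and \mathbf M^+ from the SVD exactly as described above, checks the result against the library routine numpy.linalg.pinv, and applies it to a least-squares problem.

<syntaxhighlight lang="python">
import numpy as np

# Illustrative matrix and right-hand side.
M = np.array([[1.0, 2.0],
              [3.0, 4.0],
              [5.0, 6.0]])
b = np.array([1.0, 0.0, 1.0])

# Thin SVD: M = U @ diag(s) @ Vh, where Vh is the conjugate transpose of V.
U, s, Vh = np.linalg.svd(M, full_matrices=False)

# Sigma^+: invert the non-zero singular values (the transpose is implicit
# because the thin factors already have matching shapes).
tol = max(M.shape) * np.finfo(float).eps * s.max()
s_pinv = np.array([1.0 / x if x > tol else 0.0 for x in s])

# M^+ = V Sigma^+ U^*
M_pinv = Vh.conj().T @ np.diag(s_pinv) @ U.conj().T

assert np.allclose(M_pinv, np.linalg.pinv(M))   # matches the built-in routine

# Least-squares solution of M x ~= b via the pseudoinverse.
x = M_pinv @ b
print(x)
</syntaxhighlight>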
=== Solving homogeneous linear equations ===
A set of homogeneous linear equations can be written as \mathbf A \mathbf x = \mathbf 0 for a matrix \mathbf A, vector \mathbf x, and zero vector \mathbf 0. A typical situation is that \mathbf A is known and a non-zero \mathbf x is to be determined which satisfies the equation. Such an \mathbf x belongs to \mathbf A's null space and is sometimes called a (right) null vector of \mathbf A. The vector \mathbf x can be characterized as a right-singular vector corresponding to a singular value of \mathbf A that is zero. This observation means that if \mathbf A is a square matrix and has no vanishing singular value, the equation has no non-zero \mathbf x as a solution. It also means that if there are several vanishing singular values, any linear combination of the corresponding right-singular vectors is a valid solution. Analogously to the definition of a (right) null vector, a non-zero \mathbf x satisfying \mathbf x^\ast \mathbf A = \mathbf 0^\ast, with \mathbf x^\ast denoting the conjugate transpose of \mathbf x, is called a left null vector of \mathbf A.
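For instance, the null space of a rank-deficient matrix can be read off from the right-singular vectors whose singular values vanish numerically; a minimal NumPy sketch (with an illustrative matrix and a deliberately generous tolerance):

<syntaxhighlight lang="python">
import numpy as np

# Rank-deficient matrix: the third column is the sum of the first two.
A = np.array([[1.0, 2.0, 3.0],
              [4.0, 5.0, 9.0],
              [7.0, 8.0, 15.0]])

U, s, Vh = np.linalg.svd(A)

# Rows of Vh are right-singular vectors; those whose singular values are
# numerically zero are (right) null vectors of A.
tol = 1e-10 * s.max()                 # generous tolerance for illustration
null_vectors = Vh[s < tol]

for x in null_vectors:
    print(np.allclose(A @ x, 0.0))    # A x = 0 up to rounding error
</syntaxhighlight>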
=== Total least squares minimization ===
A total least squares problem seeks the vector \mathbf x that minimizes the 2-norm of a vector \mathbf A \mathbf x under the constraint \| \mathbf x \| = 1. The solution turns out to be the right-singular vector of \mathbf A corresponding to the smallest singular value.
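A minimal NumPy sketch of this recipe (with randomly generated data for illustration): the minimizer is the last right-singular vector, and the attained minimum equals the smallest singular value.

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((50, 3))      # illustrative data matrix

# Minimize ||A x||_2 subject to ||x|| = 1: take the right-singular vector
# belonging to the smallest singular value.
U, s, Vh = np.linalg.svd(A, full_matrices=False)
x = Vh[-1]                            # rows of Vh are right-singular vectors

print(np.linalg.norm(x))                          # 1.0 by construction
print(np.isclose(np.linalg.norm(A @ x), s[-1]))   # minimum equals sigma_min
</syntaxhighlight>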
=== Range, null space and rank ===
Another application of the SVD is that it provides an explicit representation of the range and null space of a matrix \mathbf M. The right-singular vectors corresponding to vanishing singular values of \mathbf M span the null space of \mathbf M, and the left-singular vectors corresponding to the non-zero singular values of \mathbf M span the range of \mathbf M. For example, in the above example the null space is spanned by the last row of \mathbf V^\ast and the range is spanned by the first three columns of \mathbf U. As a consequence, the rank of \mathbf M equals the number of non-zero singular values, which is the same as the number of non-zero diagonal elements in \mathbf \Sigma. In
numerical linear algebra the singular values can be used to determine the
effective rank of a matrix, as
rounding error may lead to small but non-zero singular values in a rank deficient matrix. Singular values beyond a significant gap are assumed to be numerically equivalent to zero.
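The following sketch (assuming NumPy; the noise level is arbitrary) illustrates how a gap in the singular-value spectrum is used to decide the effective rank; numpy.linalg.matrix_rank applies the same criterion.

<syntaxhighlight lang="python">
import numpy as np

# A matrix of exact rank 2, perturbed by tiny rounding-like noise.
rng = np.random.default_rng(1)
M = rng.standard_normal((6, 2)) @ rng.standard_normal((2, 5))
M += 1e-12 * rng.standard_normal(M.shape)

s = np.linalg.svd(M, compute_uv=False)
print(s)    # two O(1) values, then a sharp gap down to about 1e-12

# Effective rank: count singular values above a threshold placed in the gap.
threshold = 1e-9
print(int(np.sum(s > threshold)))                  # 2
print(np.linalg.matrix_rank(M, tol=threshold))     # same criterion, built in
</syntaxhighlight>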
=== Low-rank matrix approximation ===
Some practical applications need to solve the problem of approximating a matrix \mathbf M with another matrix \tilde{\mathbf{M}}, said to be truncated, which has a specific rank r. In the case that the approximation is based on minimizing the Frobenius norm of the difference between \mathbf M and \tilde{\mathbf M} under the constraint that \operatorname{rank}\bigl(\tilde{\mathbf{M}}\bigr) = r, it turns out that the solution is given by the SVD of \mathbf M, namely \tilde{\mathbf{M}} = \mathbf{U} \tilde{\mathbf \Sigma} \mathbf{V}^*, where \tilde{\mathbf \Sigma} is the same matrix as \mathbf \Sigma except that it contains only the r largest singular values (the other singular values are replaced by zero). This is known as the
Eckart–Young theorem, as it was proved by those two authors in 1936.
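A short NumPy sketch of the truncation (illustrative data); by the Eckart–Young theorem, the Frobenius-norm error of the rank-r truncation equals the square root of the sum of the squared discarded singular values.

<syntaxhighlight lang="python">
import numpy as np

def truncate_svd(M, r):
    """Best rank-r approximation of M in the Frobenius (and spectral) norm."""
    U, s, Vh = np.linalg.svd(M, full_matrices=False)
    return U[:, :r] @ np.diag(s[:r]) @ Vh[:r, :]

rng = np.random.default_rng(2)
M = rng.standard_normal((8, 6))
r = 2
M_r = truncate_svd(M, r)

s = np.linalg.svd(M, compute_uv=False)
print(np.linalg.norm(M - M_r, "fro"))    # achieved error
print(np.sqrt(np.sum(s[r:] ** 2)))       # Eckart-Young optimum: same value
</syntaxhighlight>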
=== Image compression ===
One practical consequence of the low-rank approximation given by SVD is that a greyscale image, represented as an m \times n matrix \mathbf{A}, can be efficiently represented by keeping the first k singular values and corresponding vectors. The truncated decomposition \mathbf{A}_k = \sum_{j=1}^k \sigma_j\mathbf{u}_j \mathbf{v}_j^T gives an image with the best 2-norm error out of all rank-k approximations. Thus, the task becomes finding an approximation that balances retaining perceptual fidelity with the number of vectors required to reconstruct the image. Storing \mathbf{A}_k requires only k(n + m + 1) floating-point numbers compared to nm integers. This same idea extends to color images by applying the operation to each channel or stacking the channels into one matrix. Since the singular values of most natural images decay quickly, most of their variance is often captured by a small k. For a 1528 × 1225 greyscale image, a relative error of 0.7% can be achieved with as few as k = 100. In practice, however, computing the SVD can be too computationally expensive and the resulting compression is typically less storage efficient than a specialized algorithm such as
JPEG.
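A sketch of the storage trade-off (assuming NumPy and a smooth synthetic stand-in for a photograph, since no real image data is reproduced here): the truncated reconstruction, its relative error, and the ratio k(n + m + 1)/(nm) can be computed directly.

<syntaxhighlight lang="python">
import numpy as np

# Smooth synthetic "image": its singular values decay quickly,
# roughly mimicking a natural greyscale photograph.
m, n, k = 480, 640, 10
y = np.linspace(0.0, 1.0, m)[:, None]
x = np.linspace(0.0, 1.0, n)[None, :]
image = np.sin(8 * y) * np.cos(5 * x) + y * x
image += 0.01 * np.random.default_rng(3).standard_normal((m, n))

U, s, Vh = np.linalg.svd(image, full_matrices=False)
image_k = U[:, :k] @ np.diag(s[:k]) @ Vh[:k, :]   # keep the first k terms

relative_error = np.linalg.norm(image - image_k) / np.linalg.norm(image)
storage_full = m * n                 # one number per pixel
storage_k = k * (m + n + 1)          # k left vectors, k right vectors, k values
print(relative_error, storage_k / storage_full)
</syntaxhighlight>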
=== Separable models ===
The SVD can be thought of as decomposing a matrix into a weighted, ordered sum of separable matrices. By separable, we mean that a matrix \mathbf A can be written as an outer product of two vectors \mathbf A = \mathbf u \otimes \mathbf v or, in coordinates, A_{ij} = u_i v_j. Specifically, the matrix \mathbf M can be decomposed as \mathbf{M} = \sum_i \mathbf{A}_i = \sum_i \sigma_i \mathbf U_i \otimes \mathbf V_i. Here \mathbf U_i and \mathbf V_i are the i-th columns of the corresponding SVD matrices, \sigma_i are the ordered singular values, and each \mathbf A_i is separable. The SVD can be used to find the decomposition of an image processing filter into separable horizontal and vertical filters. Note that the number of non-zero \sigma_i is exactly the rank of the matrix. Separable models often arise in biological systems, and the SVD factorization is useful to analyze such systems. For example, some visual area V1 simple cells' receptive fields can be well described by a Gabor filter in the space domain multiplied by a modulation function in the time domain. Thus, given a linear filter evaluated through, for example, reverse correlation, one can rearrange the two spatial dimensions into one dimension, thus yielding a two-dimensional filter (space, time) which can be decomposed through SVD. The first column of \mathbf U in the SVD factorization is then a Gabor while the first column of \mathbf V represents the time modulation (or vice versa). One may then define an index of separability \alpha = \frac{\sigma_1^2}{\sum_i \sigma_i^2}, which is the fraction of the power in the matrix \mathbf M which is accounted for by the first separable matrix in the decomposition.
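As a sketch (assuming NumPy, with a synthetic space–time filter standing in for a measured receptive field), the separability index \alpha and the best separable approximation follow directly from the SVD:

<syntaxhighlight lang="python">
import numpy as np

# Nearly separable space-time filter: a spatial profile times a temporal
# modulation, plus a small non-separable perturbation.
rng = np.random.default_rng(4)
space = np.exp(-np.linspace(-2.0, 2.0, 40) ** 2)   # stand-in spatial profile
time = np.sin(np.linspace(0.0, 4.0 * np.pi, 60))   # stand-in temporal modulation
F = np.outer(space, time) + 0.05 * rng.standard_normal((40, 60))

U, s, Vh = np.linalg.svd(F, full_matrices=False)

# Index of separability: fraction of power captured by the first rank-one term.
alpha = s[0] ** 2 / np.sum(s ** 2)
print(alpha)                          # close to 1 for a nearly separable filter

# Best separable approximation: first left and right singular vectors.
F_sep = s[0] * np.outer(U[:, 0], Vh[0])
</syntaxhighlight>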
=== Nearest orthogonal matrix ===
It is possible to use the SVD of a square matrix \mathbf M to determine the orthogonal matrix \mathbf O closest to \mathbf M. The closeness of fit is measured by the Frobenius norm of \mathbf O - \mathbf M. The solution is the product \mathbf U \mathbf V^\ast. This intuitively makes sense because an orthogonal matrix would have the decomposition \mathbf U \mathbf I \mathbf V^\ast, where \mathbf I is the identity matrix, so that if \mathbf M = \mathbf U \boldsymbol \Sigma \mathbf V^\ast, then the product \mathbf O = \mathbf U \mathbf V^\ast amounts to replacing the singular values with ones. Equivalently, the solution is the unitary matrix \mathbf R = \mathbf U \mathbf V^\ast of the polar decomposition \mathbf M = \mathbf R \mathbf P = \mathbf P' \mathbf R in either order of stretch and rotation, as described above. A similar problem, with interesting applications in shape analysis, is the orthogonal Procrustes problem, which consists of finding an orthogonal matrix \boldsymbol \Omega which most closely maps \mathbf A to \mathbf B. Specifically, \mathbf Q = \underset{\boldsymbol \Omega}{\operatorname{argmin}} \|\mathbf{A}\boldsymbol{\Omega} - \mathbf{B}\|_F \quad\text{subject to}\quad \boldsymbol{\Omega}^\operatorname{T}\boldsymbol{\Omega} = \mathbf{I}, where \| \cdot \|_F denotes the Frobenius norm. This problem is equivalent to finding the nearest orthogonal matrix to a given matrix \mathbf M = \mathbf A^\operatorname{T} \mathbf B.
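Both constructions amount to a single SVD; a minimal NumPy sketch with arbitrary test matrices:

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(5)

# Nearest orthogonal matrix: replace the singular values with ones, O = U V*.
M = rng.standard_normal((4, 4))
U, _, Vh = np.linalg.svd(M)
O = U @ Vh
print(np.allclose(O.T @ O, np.eye(4)))        # O is orthogonal

# Orthogonal Procrustes: the Omega minimizing ||A Omega - B||_F is the
# nearest orthogonal matrix to A^T B.
A = rng.standard_normal((10, 3))
B = rng.standard_normal((10, 3))
U2, _, Vh2 = np.linalg.svd(A.T @ B)
Omega = U2 @ Vh2
print(np.allclose(Omega.T @ Omega, np.eye(3)))
</syntaxhighlight>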
=== The Kabsch algorithm ===
The
Kabsch algorithm (called
Wahba's problem in other fields) uses SVD to compute the optimal rotation (with respect to least-squares minimization) that will align a set of points with a corresponding set of points. It is used, among other applications, to compare the structures of molecules.
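A minimal sketch of the Kabsch rotation (assuming NumPy and point sets that have already been centred; the determinant correction ensures a proper rotation rather than a reflection):

<syntaxhighlight lang="python">
import numpy as np

def kabsch_rotation(P, Q):
    """Rotation R minimizing sum ||R p_i - q_i||^2 for centred (N, 3) point sets."""
    H = P.T @ Q                                  # cross-covariance matrix
    U, s, Vh = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vh.T @ U.T))       # fix a possible reflection
    return Vh.T @ np.diag([1.0, 1.0, d]) @ U.T

# Usage: recover a known rotation from noiseless corresponding points.
rng = np.random.default_rng(6)
P = rng.standard_normal((20, 3))
P -= P.mean(axis=0)
angle = 0.7
R_true = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                   [np.sin(angle),  np.cos(angle), 0.0],
                   [0.0,            0.0,           1.0]])
Q = P @ R_true.T                                 # q_i = R_true p_i
print(np.allclose(kabsch_rotation(P, Q), R_true))
</syntaxhighlight>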
=== Principal Component Analysis ===
The SVD can be used to construct the principal components in principal component analysis as follows: Let \mathbf{X} \in \mathbb{R}^{N \times p} be a data matrix where each of the N rows is a (feature-wise) mean-centered observation, each of dimension p. The SVD of \mathbf{X} is \mathbf{X} = \mathbf{U} \boldsymbol{\Sigma} \mathbf{V}^\ast. The matrix \mathbf{U} \boldsymbol{\Sigma} contains the scores of the rows of \mathbf{X} (i.e. each observation), and \mathbf{V} is the matrix whose columns are the principal component loading vectors.
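A minimal NumPy sketch of this construction (random data for illustration), cross-checked against the eigendecomposition of the sample covariance matrix:

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(7)
X = rng.standard_normal((100, 5))
X -= X.mean(axis=0)                     # feature-wise mean centring

U, s, Vh = np.linalg.svd(X, full_matrices=False)

scores = U * s                          # U Sigma: principal component scores
loadings = Vh.T                         # columns: principal component loadings

# The scaled squared singular values are the variances along the components,
# i.e. the eigenvalues of the sample covariance matrix.
cov = X.T @ X / (X.shape[0] - 1)
eigvals = np.sort(np.linalg.eigvalsh(cov))[::-1]
print(np.allclose(eigvals, s ** 2 / (X.shape[0] - 1)))
</syntaxhighlight>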
=== Signal processing ===
The SVD and pseudoinverse have been successfully applied to
signal processing,
image processing and
big data (e.g., in genomic signal processing).
=== Other examples ===
The SVD is also applied extensively to the study of linear
inverse problems and is useful in the analysis of regularization methods such as that of
Tikhonov. It is widely used in statistics, where it is related to
principal component analysis and to
correspondence analysis, and in
signal processing and
pattern recognition. It is also used in output-only
modal analysis, where the non-scaled
mode shapes can be determined from the singular vectors. Yet another usage is
latent semantic indexing in natural-language text processing. In general numerical computation involving linear or linearized systems, there is a single quantity that characterizes the regularity or singularity of a problem, namely the system's "condition number" \kappa := \sigma_\text{max} / \sigma_\text{min}. It often controls the error rate or convergence rate of a given computational scheme on such systems. The SVD also plays a crucial role in the field of
quantum information, in a form often referred to as the
Schmidt decomposition. Through it, states of two quantum systems are naturally decomposed, providing a necessary and sufficient condition for them to be
entangled: if the rank of the \mathbf \Sigma matrix is larger than one. One application of SVD to rather large matrices is in
numerical weather prediction, where
Lanczos methods are used to estimate the few perturbations to the central numerical weather prediction that grow most rapidly under the linearized dynamics over a given initial forward time period; i.e., the singular vectors corresponding to the largest singular values of the linearized propagator for the global weather over that time interval. The output singular vectors in this case are entire weather systems. These perturbations are then run through the full nonlinear model to generate an
ensemble forecast, giving a handle on some of the uncertainty that should be allowed for around the current central prediction. SVD has also been applied to reduced order modelling. The aim of reduced order modelling is to reduce the number of degrees of freedom in a
complex system which is to be modeled. SVD was coupled with
radial basis functions to interpolate solutions to three-dimensional unsteady flow problems. SVD has also been used to improve gravitational waveform modeling for the ground-based gravitational-wave interferometer aLIGO; it can increase the accuracy and speed of waveform generation to support gravitational-wave searches and to update two different waveform models. Singular value decomposition is used in
recommender systems to predict people's item ratings. Distributed algorithms have been developed for the purpose of calculating the SVD on clusters of commodity machines. Low-rank SVD has been applied for hotspot detection from spatiotemporal data with application to disease
outbreak detection. A combination of SVD and
higher-order SVD also has been applied for real time event detection from complex data streams (multivariate data with space and time dimensions) in
disease surveillance. In
astrodynamics, the SVD and its variants are used as an option to determine suitable maneuver directions for transfer trajectory design and
orbital station-keeping. The SVD can be used to measure the similarity between real-valued matrices. By measuring the angles between the singular vectors, the inherent two-dimensional structure of matrices is accounted for. This method was shown to outperform
cosine similarity and
Frobenius norm in most cases, including brain activity measurements from
neuroscience experiments.

== Proof of existence ==