LU decomposition

Let be a square matrix. An LU factorization refers to expression of into product of two factors – a lower triangular matrix and an upper triangular matrix such that . Sometimes factorization is impossible without prior reordering of to prevent division by zero or uncontrolled growth of rounding errors hence alternative expression becomes , where in formal notation permutation matrices factors and indicate permutation of rows (or columns) of . In theory (or ) are obtained by permutations of rows (or columns) of the identity matrix, in practice the corresponding permutations are applied directly to rows (or columns) of . Matrix of side has n^{2} coefficients while two triangle matrices combined contain coefficients, therefore coefficients of matrices are not independent. Usual convention is to set unitriangular, i.e. with all main diagonal elements equal one. However, setting instead matrix unitriangular reduces to the same procedure after transpose of matrix product (cf. properties of matrix transposition): B = A^\textsf{T} = (LU)^\textsf{T} =U^\textsf{T}L^\textsf{T}. After transposition, is lower triangle while is upper unitriangular factor of . This demonstrates also, that operations on rows (e.g. pivoting) are equivalent to those on columns of a transposed matrix, and in general choice of row or column algorithm offers no advantage. In the lower triangular matrix all elements above the main diagonal are zero, in the upper triangular matrix, all the elements below the diagonal are zero. For example, for a matrix , its LU decomposition looks like this: \begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix} = \begin{bmatrix} \ell_{11} & 0 & 0 \\ \ell_{21} & \ell_{22} & 0 \\ \ell_{31} & \ell_{32} & \ell_{33} \end{bmatrix} \begin{bmatrix} u_{11} & u_{12} & u_{13} \\ 0 & u_{22} & u_{23} \\ 0 & 0 & u_{33} \end{bmatrix}. Without a proper ordering or permutations in the matrix, the factorization may fail to materialize. For example, it is easy to verify (by expanding the matrix multiplication) that a_{11} = \ell_{11} u_{11}. If a_{11} = 0, then at least one of \ell_{11} and u_{11} has to be zero, which implies that either or is singular. This is impossible if is nonsingular (invertible). In terms of operations, zeroing/elimination of remaining elements of first column of involves division of a_{21}, a_{31} with a_{11}, impossible if it is 0. This is a procedural problem. It can be removed by simply reordering the rows of so that the first element of the permuted matrix is nonzero. The same problem in subsequent factorization steps can be removed the same way. For numerical stability against rounding errors/division by small numbers it is important to select a_{11} of large absolute value (cf. pivoting). LU Through recursion The above example of matrices demonstrates that matrix product of top row and leftmost columns of involved matrices plays special role for to succeed. Let us mark consecutive versions of matrices with (0),\;(1),\dots and then let us write matrix product A\equiv A^{(0)}=L^{(0)}U^{(0)} in such way that these rows and columns are separated from the rest. In doing so we shall use block matrix notation, such that e.g. a\equiv a_{11} is an ordinary number, {\bf w}^\textsf{T} \equiv(a_{12}, a_{13})^\textsf{T} is a row vector and {\bf v}=(a_{21},a_{31}) is a column vector and A' is sub-matrix of matrix A^{(0)} without top row and leftmost column. Then we can replace A^{(0)}=L^{(0)}U^{(0)} with a block matrix product. Namely it turns out that one can multiply matrix blocks in such way as if they were ordinary numbers, i.e. row times column, except that now their components are sub-matrices, sometimes reduced to scalars or vectors. Thus u{\bf l} denotes a vector obtained from {\bf l} after multiplication of each component by a number {\bf lu}^\textsf{T} is an outer product of vectors {{nowrap|{\bf l, u},}} i.e. a matrix which first column is {{nowrap|u_{12}{\bf l},}} next is u_{13}{\bf l} and so on for all components of {\bf u} and L^{(1)}U^{(1)} is a product of sub-matrices of L^{(0)},\;U^{(0)} \begin{align} \left( \begin{array}{c|c} a & {\bf w}^\textsf{T} \\ \hline \\[-0.5em] {\bf v} & \quad A' \quad \\[-0.5em] \\ \end{array} \right) &= \left( \begin{array}{c|c} {\rm 1} & {\bf 0}^\textsf{T} \\ \hline \\[-0.5em] {\bf l} & \quad L^{(1)} \quad \\[-0.5em] \\ \end{array} \right)\; \left( \begin{array}{c|c} u & {\bf u}^\textsf{T} \\ \hline \\[-0.5em] {\bf 0} & \quad U^{(1)} \\[-0.5em] \\ \end{array} \right) \\ &=\left( \begin{array}{c|c} u & {\bf u}^\textsf{T} \\ \hline \\[-0.5em] u{\bf l} & \quad {\bf lu}^\textsf{T} + L^{(1)}U^{(1)} \\[-0.5em] \\ \end{array} \right) \end{align} From equality of first and last matrices follow final {\bf l}={(1/a)}{\bf v} while matrix A' becomes updated/replaced with A^{(1)}\equiv L^{(1)}U^{(1)}= {{nowrap|A'-{\bf lu}^\textsf{T}.}} Now comes the crucial observation: nothing prevents us to treat A^{(1)} the same way as we did with {{nowrap|A^{(0)},}} repeatedly. If dimension of A is , after such steps all columns \bf v form sub-diagonal part of triangle matrix L and all pivots a combined with rows {\bf w}^\textsf{T} form upper triangle matrix as required. In the above example so only two steps suffice. The above procedure demonstrates that at no step the top diagonal pivot element a of consecutive sub-matrices can be zero. To avoid it columns or rows may be swapped so that a becomes nonzero. Such procedure involving permutation is called LUP, decomposition with pivoting. Permutation of columns corresponds to matrix product AQ^{(0)} where Q^{(0)} is a permutation matrix, i.e. the identity matrix I after the same column permutation. After all steps such LUP decomposition applies to {{nowrap|AQ^{(0)}\cdots Q^{(n-1)}\equiv AQ=LU.}} Present computation scheme and similar in Cormen et al. are examples of ''''. They demonstrate two general properties of LU factorization: Recurrence algorithms are not overly costly in terms of algebraic operations yet they suffer from practical disadvantage due to need to update and store most elements of at each step. It will be seen that by reordering calculations it is possible to dispose with storage of intermediate values. LU factorization with partial pivoting It turns out that a proper permutation of rows (or columns) to select column (or row) absolute maximal pivot is sufficient for numerically stable LU factorization, except for known pathological cases. It is called '''''' (LUP): PA = LU, \quad (AQ=LU), where and are again lower and upper triangular matrices, and and are corresponding permutation matrices, which, when correspondingly left- and right-multiplied to , reorder the rows and columns of . It turns out that all square matrices can be factorized in this form, and the factorization is numerically stable in practice. This makes LUP decomposition a useful technique in practice. A variant called '''''' at each step involves search of maximum element the way rook moves on a chessboard, along column, row, column again and so on till reaching a pivot maximal in both its row and column. It can be proven that for large matrices of random elements its cost of operations at each step is similarly to partial pivoting proportional to the length of matrix side unlike its square for full pivoting. LU factorization with full pivoting An '''''' involves both row and column permutations to find absolute maximum element in the whole submatrix: PAQ = LU, where , , and are defined as before, and is a permutation matrix that reorders the columns of . Lower-diagonal-upper (LDU) decomposition A '''''' (LDU) is a decomposition of the form A = LDU, where is a diagonal matrix, and and are unitriangular matrices, meaning that all the entries on the diagonals of and are one. Rectangular matrices Above we required that be a square matrix, but these decompositions can all be generalized to rectangular matrices as well. In that case, and are square matrices both of which have the same number of rows as , and has exactly the same dimensions as . 'Upper triangular' should be interpreted as having only zero entries below the main diagonal, which starts at the upper left corner. Similarly, the more precise term for is that it is the row echelon form of the matrix . == Example ==