Gröbner bases and linear algebra - Zuzana Kúkelová, Algebraic Methods in Computer Vision

and not by parentheses. However, when working with these sets of polynomials or monomials, we need some fixed ordering of their elements, although it can usually be an arbitrary ordering. In such cases we will use the ordering in which the elements of such sets are given on the input. For example, for the set $F = \{2x^2 + y^2 + 3y - 12,\; x^2 - y^2 + x + 3y - 4\}$ we will denote the first polynomial $f_1 = 2x^2 + y^2 + 3y - 12$ and the second polynomial $f_2 = x^2 - y^2 + x + 3y - 4$. When a result depends on the ordering, we will explicitly write that the set is ordered and we will use parentheses.

4.2 Gröbner bases and linear algebra

Before describing the first class of methods for solving systems of polynomial equations, i.e. methods based on Gröbner bases and on computations in quotient rings $\mathbb{C}[x_1, \dots, x_n]/I$, we first show how polynomials and Gröbner bases are connected with linear algebra.

Let us consider a set $F = \{f_1, \dots, f_m \mid f_i \in \mathbb{C}[x_1, \dots, x_n]\}$ of $m$ polynomials in $n$ variables over the field $\mathbb{C}$ of complex numbers, and let $T_\succ(F)$ denote the set of monomials in $F$ ordered with respect to the ordering $\succ$. Let $s$ be the number of distinct monomials in $F$, i.e. $|T_\succ(F)| = s$.

Now we define a linear mapping

$$\psi_{T_\succ(F)} : \mathbb{C}[x_1, \dots, x_n] \to \mathbb{C}^s, \tag{4.22}$$

such that for the polynomial $f = \sum_{i=1}^{s} c_i x^{\alpha(i)} \in F$ with monomials $x^{\alpha(i)}$ ordered with respect to the ordering $\succ$

$$\psi_{T_\succ(F)}(f) = (c_1, \dots, c_s). \tag{4.23}$$

This means that we can interpret vectors of $\mathbb{C}^s$ as polynomials and vice versa. We will call the mapping $\psi$ the vector representation of polynomials.

Similarly we can define a matrix representation of a set of $m$ polynomials $F$ as

$$\Psi_{T_\succ(F)} : \left(\mathbb{C}[x_1, \dots, x_n]\right)^m \to M_{m \times s}(\mathbb{C}), \tag{4.24}$$

such that

$$\Psi_{T_\succ(F)}(f_1, \dots, f_m) = \begin{pmatrix} \psi_{T_\succ(F)}(f_1) \\ \vdots \\ \psi_{T_\succ(F)}(f_m) \end{pmatrix}. \tag{4.25}$$

Here $M_{m \times s}(\mathbb{C})$ stands for the vector space of all matrices of size $m \times s$ with elements from $\mathbb{C}$. If it is clear with respect to which ordering the representation is created and which set of monomials is used, the subscripts are omitted, i.e. we write only $\Psi_{T(F)}$ or even only $\Psi$.
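The two representations can be illustrated with a minimal Python sketch. It is not part of the original text: the dictionary encoding of polynomials (exponent tuple to coefficient), the helper names `grevlex_key`, `ordered_monomials`, `vector_repr` and `matrix_repr`, and the choice of the graded reverse lexicographic ordering for $\succ$ are all assumptions made for illustration. The input is the set $F = \{2x^2 + y^2 + 3y - 12,\; x^2 - y^2 + x + 3y - 4\}$ used earlier.

```python
def grevlex_key(mono):
    """Sort key for graded reverse lexicographic order (larger key = larger monomial)."""
    return (sum(mono), tuple(-e for e in reversed(mono)))

def ordered_monomials(polys):
    """T(F): the distinct monomials occurring in the polynomials, largest first."""
    monos = {m for p in polys for m in p}
    return sorted(monos, key=grevlex_key, reverse=True)

def vector_repr(p, monos):
    """psi(p): coefficients of p listed in the order given by monos."""
    return [p.get(m, 0) for m in monos]

def matrix_repr(polys):
    """Psi(F): one row psi(f_i) per polynomial, over the shared monomial list."""
    monos = ordered_monomials(polys)
    return monos, [vector_repr(p, monos) for p in polys]

# Polynomials as {(deg_x, deg_y): coefficient}.
f1 = {(2, 0): 2, (0, 2): 1, (0, 1): 3, (0, 0): -12}          # 2x^2 + y^2 + 3y - 12
f2 = {(2, 0): 1, (0, 2): -1, (1, 0): 1, (0, 1): 3, (0, 0): -4}  # x^2 - y^2 + x + 3y - 4

monos, M = matrix_repr([f1, f2])
print(monos)  # columns: x^2, y^2, x, y, 1
print(M)      # one row per polynomial
```

Each row of `M` has a zero wherever the corresponding polynomial lacks a monomial of $T_\succ(F)$, which is what makes the rows comparable as vectors of $\mathbb{C}^s$.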

These vector and matrix representations of polynomials allow us to use standard concepts of linear algebra and vector spaces when working with polynomials. Since many efficient linear algebra algorithms are available, such representations can significantly simplify and speed up computations with polynomials.

To show where linear algebra may be useful, we describe here how several polynomials can be reduced at the same time using Gaussian elimination, i.e. how to do polynomial division [40] of several polynomials by Gaussian elimination.

Let us first consider polynomial division of a single polynomial, i.e. let us consider a polynomial $f$ which we want to reduce by polynomials $R = (r_1, \dots, r_k)$. We want to emulate the polynomial division of $f$ by $R$, i.e. to compute $f^R$, using Gaussian elimination of a single matrix $M$. This means that we need to create this matrix $M$ as a matrix representation $\Psi$ (4.25) of the polynomial $f$ and suitable monomial multiples of the polynomials $r_1, \dots, r_k$. The main question is which monomial multiples of $r_1, \dots, r_k$ we should add to this matrix. Before describing what these monomial multiples look like for general polynomials $f$ and $r_1, \dots, r_k$, we show how the matrix $M$ looks for a simple example.

Example 3. (Matrix polynomial division) Let us consider the polynomial $f = x^2 + y + 3$, which we want to reduce by the polynomials $R = (x + y + 2,\; y^2 + 3y)$ using the graded reverse lexicographic ordering with $x > y$. The polynomial division algorithm performs these steps:

1. [...] $r_2, r_1\}$ w.r.t. the graded reverse lexicographic ordering $x > y$ of monomials, where $r_1$ and $r_2$ are polynomials from $R$. In this case the ordered set of monomials contained in $F$ is $T_\succ(F) = (x^2, xy, y^2, x, y, 1)$ and the matrix representation $\Psi_{T_\succ(F)}(F)$ has the form

$M = \Psi_{T_\succ(F)}(F) =$

After Gaussian elimination of this matrix $M$ we obtain the following matrix

$\widetilde{M} =$

As can be seen, the last row of the matrix $\widetilde{M}$ corresponds to the polynomial $-2y - 7$, which is, up to a non-zero multiple (in this case $-1$), equal to the remainder $2y + 7$ of $f$ on division by $R$.

This means that Gaussian elimination of this matrix performs polynomial division. The fact that in this case we obtain the remainder only up to a non-zero multiple is not so important, because for most applications we only need to generate some non-zero multiples of remainder polynomials, or only to know whether the remainder is zero or not.
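The division steps and the two matrices of this example can be re-derived from the stated $f$, $R$ and monomial list $(x^2, xy, y^2, x, y, 1)$. The following block is such a re-derivation, given as a sketch under two assumptions that are not taken from the original display: that the represented set consists of $f$ together with the multiples $x r_1$, $y r_1$, $r_2$, $r_1$ used by the division, and that the rows are ordered with $f$ last, so that only the last row changes during elimination.

```latex
% Division of f = x^2 + y + 3 by R = (r_1, r_2), with r_1 = x + y + 2, r_2 = y^2 + 3y
\begin{align*}
h_1 &= f   - x\,r_1 = -xy - 2x + y + 3, \\
h_2 &= h_1 + y\,r_1 = y^2 - 2x + 3y + 3, \\
h_3 &= h_2 - r_2    = -2x + 3, \\
h_4 &= h_3 + 2\,r_1 = 2y + 7 \quad \text{(no leading term of } R \text{ divides } 2y\text{)}.
\end{align*}

% Rows (assumed order): x r_1, y r_1, r_2, r_1, f.  Columns: x^2, xy, y^2, x, y, 1.
\[
M = \begin{pmatrix}
1 & 1 & 0 & 2 & 0 & 0 \\
0 & 1 & 1 & 0 & 2 & 0 \\
0 & 0 & 1 & 0 & 3 & 0 \\
0 & 0 & 0 & 1 & 1 & 2 \\
1 & 0 & 0 & 0 & 1 & 3
\end{pmatrix},
\qquad
\widetilde{M} = \begin{pmatrix}
1 & 1 & 0 & 2 & 0 & 0 \\
0 & 1 & 1 & 0 & 2 & 0 \\
0 & 0 & 1 & 0 & 3 & 0 \\
0 & 0 & 0 & 1 & 1 & 2 \\
0 & 0 & 0 & 0 & -2 & -7
\end{pmatrix}
\]

% Last row of \widetilde{M}: x r_1 - y r_1 + r_2 - 2 r_1 - f = -2y - 7 = -(2y + 7).
```

A different row order or a different scaling during elimination changes only the non-zero factor in front of $2y + 7$.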

Let us now show how the matrix $M$ can be constructed for general polynomials $f$ and $r_1, \dots, r_k$. The algorithmic description of the division algorithm can be found in [40]. Here it is sufficient to describe its main idea. In the $j$-th step of the division algorithm, the intermediate result $h_j$, also called the dividend, is reduced using the divisor $r_i$ with the smallest possible $i$ such that the leading term of $r_i$ divides the leading term of $h_j$, i.e. $\mathrm{LT}(r_i) \mid \mathrm{LT}(h_j)$. If such an $r_i$ exists, the new dividend has the form $h_{j+1} = h_j - q_{i,j} r_i$, where $q_{i,j} = \frac{\mathrm{LT}(h_j)}{\mathrm{LT}(r_i)}$.
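The reduction step just described can be sketched in Python as follows. This is an illustrative sketch and not the algorithm listing from [40]: the dictionary encoding of polynomials, the helper names, and the handling of an irreducible leading term (it is moved to the remainder, as in the textbook multivariate division algorithm) are assumptions. The graded reverse lexicographic ordering is hard-coded, and the input is the data of Example 3.

```python
from fractions import Fraction

def grevlex_key(mono):
    """Sort key for graded reverse lexicographic order (larger key = larger monomial)."""
    return (sum(mono), tuple(-e for e in reversed(mono)))

def leading(p):
    """Return (leading monomial, leading coefficient) of a non-zero polynomial."""
    m = max(p, key=grevlex_key)
    return m, p[m]

def divides(a, b):
    """True if monomial a divides monomial b."""
    return all(x <= y for x, y in zip(a, b))

def sub_multiple(h, c, mono, r):
    """Return h - c * x^mono * r, dropping terms whose coefficient becomes zero."""
    out = dict(h)
    for m, coef in r.items():
        key = tuple(a + b for a, b in zip(m, mono))
        out[key] = out.get(key, 0) - c * coef
        if out[key] == 0:
            del out[key]
    return out

def remainder(f, divisors):
    """Remainder of f on division by the ordered list of divisors."""
    h, rem = dict(f), {}
    while h:
        lm, lc = leading(h)
        for r in divisors:                      # smallest index i first
            rm, rc = leading(r)
            if divides(rm, lm):
                q_mono = tuple(a - b for a, b in zip(lm, rm))
                h = sub_multiple(h, Fraction(lc) / rc, q_mono, r)  # h_{j+1} = h_j - q r_i
                break
        else:                                   # no LT(r_i) divides LT(h_j)
            rem[lm] = lc
            del h[lm]
    return rem

# Example 3: f = x^2 + y + 3, r1 = x + y + 2, r2 = y^2 + 3y, keys are (deg_x, deg_y).
f = {(2, 0): 1, (0, 1): 1, (0, 0): 3}
r1 = {(1, 0): 1, (0, 1): 1, (0, 0): 2}
r2 = {(0, 2): 1, (0, 1): 3}
print(remainder(f, [r1, r2]))  # coefficients of the remainder 2y + 7
```

Because the divisors are tried in list order, swapping `r1` and `r2` may give a different remainder for general inputs.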

To emulate this using the matrix $M$, we need to add to this matrix a row corresponding to the polynomial $q_{i,j} r_i$ for each intermediate dividend $h_j$ and used divisor $r_i$. To "add a row corresponding to a polynomial $p$ to $M$" means that we add $p$ to the set of polynomials $F$ that is represented by the matrix $M = \Psi_{T_\succ(F)}(F)$ (4.24).

Now the question is: which divisor $r_i$ will be used in the $j$-th step of this algorithm? To predict this we do not need to know the exact form of $h_j$, where $h_j = h_{j-1} - q_{l,j-1} r_l$. It is sufficient to know which monomials appear in the polynomial $h_j$. The polynomial $h_j$ contains at most the monomials of $h_{j-1}$ and $q_{l,j-1} r_l$, except for the leading monomial $\mathrm{LM}(h_{j-1})$, which cancels. This means that it is sufficient to assume that the polynomial $h_{j-1} - q_{l,j-1} r_l$ contains all these monomials; with such an assumption we will surely not miss any possible divisor.

This results in the following algorithm for constructing the matrix $M$ for emulating polynomial division of the polynomial $f$ by the polynomials $R = (r_1, \dots, r_k)$:

Algorithm 2 Matrix polynomial division

Input: $f$, $R = (r_1, \dots, r_k)$, monomial ordering $\succ$

Output: Matrix $M$ in the matrix representation of polynomials $F$, i.e. $M = \Psi_{T_\succ(F)}(F)$, such that the remainder $f^R \in F$

1: Set $F := \{f\}$ and let $D = T_\succ(F)$, where $T_\succ(F)$ is the ordered set of monomials of $F$

2: while $D$ is non-empty do

3: Let $x^\alpha$ be the maximal monomial in $D$ w.r.t. $\succ$

4: $D := D \setminus \{x^\alpha\}$

5: if $x^\alpha$ can be reduced by some $r_i \in R$ then

6: Let $r_i \in R$ be the divisor with the smallest possible $i$ such that $\mathrm{LT}(r_i)$ divides $x^\alpha$ and let $q := \frac{x^\alpha}{\mathrm{LT}(r_i)}$

7: Set $F := F \cup \{q r_i\}$ and $M = \Psi_{T_\succ(F)}(F)$

8: Set $D := D \cup \{\text{all monomials of } q r_i \text{ except } x^\alpha\}$ ordered w.r.t. $\succ$

9: end if

10: end while
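A possible Python rendering of Algorithm 2 is sketched below. It is an illustration under assumptions, not the original implementation: polynomials are dictionaries from exponent tuples to coefficients, the ordering $\succ$ is fixed to graded reverse lexicographic, the multiplier $q$ is taken as a pure monomial (the coefficient of the leading term is ignored, which only rescales the added row), and the function name `build_division_matrix` is hypothetical.

```python
def grevlex_key(mono):
    """Sort key for graded reverse lexicographic order (larger key = larger monomial)."""
    return (sum(mono), tuple(-e for e in reversed(mono)))

def leading_monomial(p):
    return max(p, key=grevlex_key)

def build_division_matrix(f, divisors):
    """Collect f and the monomial multiples q*r_i needed to emulate division of f."""
    F = [dict(f)]                 # step 1: F := {f}
    D = set(f)                    # monomials still to be processed
    while D:                      # step 2
        x_alpha = max(D, key=grevlex_key)   # step 3
        D.remove(x_alpha)                   # step 4
        for r in divisors:        # steps 5-6: first divisor whose LM divides x_alpha
            lm = leading_monomial(r)
            if all(a <= b for a, b in zip(lm, x_alpha)):
                q = tuple(b - a for a, b in zip(lm, x_alpha))
                qr = {tuple(s + t for s, t in zip(m, q)): c for m, c in r.items()}
                F.append(qr)                            # step 7
                D.update(m for m in qr if m != x_alpha)  # step 8
                break
    monos = sorted({m for p in F for m in p}, key=grevlex_key, reverse=True)
    M = [[p.get(m, 0) for m in monos] for p in F]
    return F, monos, M

# Example 3: f = x^2 + y + 3, r1 = x + y + 2, r2 = y^2 + 3y, keys are (deg_x, deg_y).
f = {(2, 0): 1, (0, 1): 1, (0, 0): 3}
r1 = {(1, 0): 1, (0, 1): 1, (0, 0): 2}
r2 = {(0, 2): 1, (0, 1): 3}

F, monos, M = build_division_matrix(f, [r1, r2])
for row in M:
    print(row)   # rows: f, x*r1, y*r1, r2, r1 over columns x^2, xy, y^2, x, y, 1
```

Every monomial added to `D` is strictly smaller than the monomial just removed, so no monomial is processed twice and the loop stops once the finitely many smaller monomials are exhausted.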

It can be proved that this algorithm terminates after a finite number of steps and that the resulting matrix $M = \Psi_{T_\succ(F)}(F)$ contains all polynomials necessary for emulating polynomial division by its Gaussian elimination. After Gaussian elimination of $M$, the final reduced polynomial $f^R$ will either be zero, i.e. will correspond to a zero row of the reduced $M$, or will have a leading term that cannot be reduced by any leading term of $r_1, \dots, r_k$. In the latter case, this reduced polynomial will correspond to a row of the reduced $M$ with a pivot that was not a pivot in $M$. Recall that the result of polynomial division, i.e. the polynomial $f^R$, will for general polynomials $R = (r_1, \dots, r_k)$ depend on the ordering of the polynomials $r_1, \dots, r_k$ in $R$.

Of course, the matrix $M$ constructed using the proposed algorithm might also contain some unnecessary rows. This is because we have assumed that no terms cancel in $h_j - q r_i$, which might not always be true. However, this is not a problem, since for polynomials with general coefficients only a small number of unnecessary rows is usually generated.

For the polynomials from Example 3, the presented algorithm directly generates the matrix (4.27) without any unnecessary rows.
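To see the elimination at work on Example 3, the following sketch applies forward Gaussian elimination with exact rational arithmetic to the rows $f$, $x r_1$, $y r_1$, $r_2$, $r_1$ over the columns $(x^2, xy, y^2, x, y, 1)$. The row order (with $f$ first) and the function name `row_echelon` are assumptions for illustration; with this row order the last row comes out as $y + 7/2$, i.e. the remainder $2y + 7$ scaled by $1/2$.

```python
from fractions import Fraction

def row_echelon(rows):
    """Forward Gaussian elimination (row echelon form) over the rationals."""
    A = [[Fraction(v) for v in row] for row in rows]
    n_rows, n_cols = len(A), len(A[0])
    pivot_row = 0
    for col in range(n_cols):
        # Find a row at or below pivot_row with a non-zero entry in this column.
        src = next((i for i in range(pivot_row, n_rows) if A[i][col] != 0), None)
        if src is None:
            continue
        A[pivot_row], A[src] = A[src], A[pivot_row]
        for i in range(pivot_row + 1, n_rows):
            factor = A[i][col] / A[pivot_row][col]
            if factor != 0:
                A[i] = [a - factor * b for a, b in zip(A[i], A[pivot_row])]
        pivot_row += 1
        if pivot_row == n_rows:
            break
    return A

# Columns: x^2, xy, y^2, x, y, 1.  Rows: f, x*r1, y*r1, r2, r1 (Example 3).
M = [
    [1, 0, 0, 0, 1, 3],
    [1, 1, 0, 2, 0, 0],
    [0, 1, 1, 0, 2, 0],
    [0, 0, 1, 0, 3, 0],
    [0, 0, 0, 1, 1, 2],
]

E = row_echelon(M)
last = E[-1]
print([str(v) for v in last])  # coefficients of y + 7/2
```

The pivot of the last row sits in the column of $y$, which is not the leading monomial of any input row, matching the description of a row "with a pivot that was not a pivot in $M$".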

This algorithm can be easily extended to emulate the polynomial division of several polynomials by Gaussian elimination of a single matrix. Let us consider that we want to reduce polynomials $f_1, \dots, f_m$ by $R = (r_1, \dots, r_k)$. The algorithm for constructing a matrix emulating polynomial division of these polynomials first sets $F = \{f_1, \dots, f_m\}$ and then continues as Algorithm 2 for a single polynomial.

A similar algorithm is used in the F4 algorithm [49] to perform division of S-polynomials when constructing a Gröbner basis.

Note that using Gauss-Jordan elimination instead of Gaussian elimination, i.e. computing the reduced row-echelon form of $M$, corresponds to fully reducing $f_1, \dots, f_m$ using polynomial division. This means that not only the leading terms of $f_1, \dots, f_m$ cannot be further reduced by $R$, but no terms of $f_1, \dots, f_m$ can be further reduced.

Here we have shown how several polynomials can be reduced at the same time by using Gaussian elimination of a single matrix. Polynomial division is the main part of all algorithms for finding Gröbner bases, and therefore a good implementation of it may speed them up. This is also the main idea of the F4 algorithm for computing Gröbner bases [49]. F4 speeds up the reduction step by exchanging multiple polynomial divisions of S-polynomials for Gaussian elimination of a single matrix created similarly to the one described above.

In Section 4.4 we will show that the matrix representation of polynomials and Gauss-Jordan elimination can also be used for efficient generation of polynomials from an ideal. Moreover, in [33, 31] it was shown that the matrix representation and efficient matrix algorithms like QR or LU decomposition can be used for improving the numerical stability of Gröbner basis computations.

4.3 Standard Gröbner basis methods for solving systems of
