1911 Encyclopædia Britannica/Algebraic Forms

1911 Encyclopædia Britannica, Volume 1

Algebraic Forms by Percy Alexander MacMahon


ALGEBRAIC FORMS. The subject-matter of algebraic forms is to a large extent connected with the linear transformation of algebraical polynomials which involve two or more variables. The theories of determinants and of symmetric functions and of the algebra of differential operations have an important bearing upon this comparatively new branch of mathematics. They are the chief instruments of research, and have themselves much benefited by being so employed. When a homogeneous polynomial is transformed by general linear substitutions as hereafter explained, and is then expressed in the original form with new coefficients affecting the new variables, certain functions of the new coefficients and variables are numerical multiples of the same functions of the original coefficients and variables. The investigation of the properties of these functions, as well for a single form as for a simultaneous set of forms, and as well for one as for many series of variables, is included in the theory of invariants. As far back as 1773 Joseph Louis Lagrange, and later Carl Friedrich Gauss, had met with simple cases of such functions; George Boole, in 1841 (Camb. Math. Journ. iii. pp. 1-20), made important steps, but it was not till 1845 that Arthur Cayley (Coll. Math. Papers, i. pp. 80-94, 95-112) showed by his calculus of hyper-determinants that an infinite series of such functions might be obtained systematically. The subject was carried on over a long series of years by himself, J. J. Sylvester, G. Salmon, L. O. Hesse, S. H. Aronhold, C. Hermite, Francesco Brioschi, R. F. A. Clebsch, P. Gordan, &c. The year 1868 saw a considerable enlargement of the field of operations. This arose from the study by Felix Klein and Sophus Lie of a new theory of groups of substitutions; it was shown that there exists an invariant theory connected with every group of linear substitutions. 
The invariant theory then existing was classified by them as appertaining to “finite continuous groups.” Other “Galois” groups were defined whose substitution coefficients have fixed numerical values, and are particularly associated with the theory of equations. Arithmetical groups, connected with the theory of quadratic forms and other branches of the theory of numbers, which are termed “discontinuous,” and infinite groups connected with differential forms and equations, came into existence, and also particular linear and higher transformations connected with analysis and geometry. The effect of this was to co-ordinate many branches of mathematics and greatly to increase the number of workers. The subject of transformation in general has been treated by Sophus Lie in the classical work Theorie der Transformationsgruppen. The present article is merely concerned with algebraical linear transformation. Two methods of treatment have been carried on in parallel lines, the unsymbolic and the symbolic; both of these originated with Cayley, but he with Sylvester and the English school have in the main confined themselves to the former, whilst Aronhold, Clebsch, Gordan, and the continental schools have principally restricted themselves to the latter. The two methods have been conducted so as to be in constant touch, though the nature of the results obtained by the one differs much from those which flow naturally from the other. Each has been singularly successful in discovering new lines of advance and in encouraging the other to renewed efforts. P. Gordan first proved that for any system of forms there exists a finite number of covariants, in terms of which all others are expressible as rational and integral functions. This enabled David Hilbert to produce a very simple unsymbolic proof of the same theorem. So the theory of the forms appertaining to a binary form of unrestricted order was first worked out by Cayley and P. A. MacMahon by unsymbolic methods, and later G. E. Stroh, from a knowledge of the results, was able to verify and extend the results by the symbolic method. The partition method of treating symmetrical algebra is one which has been singularly successful in indicating new paths of advance in the theory of invariants; the important theorem of expressibility is, directly we exclude unity from the partitions, a theorem concerning the expressibility of covariants, and involves the theory of the reducible forms and of the syzygies. The theory brought forward has not yet found a place in any systematic treatise in any language, so that it has been judged proper to give a fairly complete account of it.^[1]

I. The Theory of Determinants.^[1]

Let there be given $n^{2}$ quantities

${\begin{matrix}a_{11}&a_{12}&a_{13}&\ldots &a_{1n}\\a_{21}&a_{22}&a_{23}&\ldots &a_{2n}\\a_{31}&a_{32}&a_{33}&\ldots &a_{3n}\\.&.&.&\ldots &.\\a_{n1}&a_{n2}&a_{n3}&\ldots &a_{nn}\end{matrix}}$

and form from them a product of $n$ quantities

${\begin{matrix}a_{1\alpha }&a_{2\beta }&a_{3\gamma }&\ldots &a_{n\nu },\end{matrix}}$

where the first suffixes are the natural numbers $1,2,3,\ldots n$ taken in order, and $\alpha ,\beta ,\gamma ,\ldots \nu$ is some permutation of these $n$ numbers. This permutation by a transposition of two numbers, say $\alpha ,\beta ,$ becomes $\beta ,\alpha ,\gamma ,\ldots \nu ,$ and by successively transposing pairs of letters the permutation can be reduced to the form $1,2,3,\ldots n.$ Let $k$ such transpositions be necessary; then the expression

$\Sigma (-)^{k}a_{1\alpha }a_{2\beta }a_{3\gamma }\ldots a_{n\nu },$

the summation being for all permutations of the $n$ numbers, is called the determinant of the $n^{2}$ quantities. The quantities $a_{1\alpha }a_{2\beta }\ldots$ are called the elements of the determinant; the term $(-)^{k}a_{1\alpha }a_{2\beta }a_{3\gamma }\ldots a_{n\nu }$ is called a member of the determinant, and there are evidently $n!$ members corresponding to the $n!$ permutations of the $n$ numbers $1,2,3,\ldots n.$ The determinant is usually written

$\Delta ={\begin{vmatrix}a_{11}&a_{12}&a_{13}&\ldots &a_{1n}\\a_{21}&a_{22}&a_{23}&\ldots &a_{2n}\\a_{31}&a_{32}&a_{33}&\ldots &a_{3n}\\.&.&.&\ldots &.\\a_{n1}&a_{n2}&a_{n3}&\ldots &a_{nn}\end{vmatrix}}$

the square array being termed the matrix of the determinant. A matrix has in many parts of mathematics a signification apart from its evaluation as a determinant. A theory of matrices has been constructed by Cayley in connexion particularly with the theory of linear transformation. The matrix consists of $n$ rows and $n$ columns. Each row as well as each column supplies one and only one element to each member of the determinant. Consideration of the definition of the determinant shows that the value is unaltered when the suffixes in each element are transposed.
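The definition can be checked mechanically. The following Python sketch is an illustration added for the reader, not part of the original treatment: it runs through the $n!$ permutations of the second suffixes, counts the transpositions needed to restore the natural order, and sums the signed members. The function names are hypothetical, and rows and columns are numbered from 0 rather than from 1.

```python
from itertools import permutations


def transposition_count(perm):
    """Count transpositions used to bring `perm` to natural order (the k of the text)."""
    p = list(perm)
    k = 0
    for i in range(len(p)):
        while p[i] != i:
            j = p[i]
            p[i], p[j] = p[j], p[i]  # one transposition puts the value j in place j
            k += 1
    return k


def det_by_permutations(a):
    """Sum of (-1)^k * a[0][p0] * a[1][p1] * ... over all permutations p."""
    total = 0
    for perm in permutations(range(len(a))):
        member = (-1) ** transposition_count(perm)
        for row, col in enumerate(perm):
            member *= a[row][col]
        total += member
    return total


example = [[1, 0, 3], [2, 1, 6], [0, -5, 3]]
transposed = [list(col) for col in zip(*example)]
print(det_by_permutations(example), det_by_permutations(transposed))  # prints: 3 3
```

The second value illustrates the remark that transposing the suffixes of every element leaves the value unaltered.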

Theorem.—If the determinant is transformed so as to read by columns as it formerly did by rows its value is unchanged. The leading member of the determinant is $a_{11}a_{22}a_{33}\ldots a_{nn},$ and corresponds to the principal diagonal of the matrix.

We write frequently

$\Delta =\Sigma \pm a_{11}a_{22}a_{33}\ldots a_{nn}=(a_{11}a_{22}a_{33}\ldots a_{nn}).$

If the first two columns of the determinant be transposed the expression for the determinant becomes $\Sigma (-)^{k}a_{1\beta }a_{2\alpha }a_{3\gamma }...a_{n\nu }$ , viz. $\alpha$ and $\beta$ are transposed, and it is clear that the number of transpositions necessary to convert the permutation $\beta \alpha \gamma ...\nu$ of the second suffixes to the natural order is changed by unity. Hence the transposition of columns merely changes the sign of the determinant. Similarly it is shown that the transposition of any two columns or of any two rows merely changes the sign of the determinant.

Theorem.—Interchange of any two rows or of any two columns merely changes the sign of the determinant.

Corollary.—If any two rows or any two columns of a determinant be identical the value of the determinant is zero.

Minors of a Determinant.—From the value of $\Delta$ we may separate those members which contain a particular element $a_{ik}$ as a factor, and write the portion $a_{ik}{\text{A}}_{ik}$ ; ${\text{A}}_{ik}$ , the cofactor of $a_{ik}$ , is called a minor of order $n-1$ of the determinant.

Now $a_{11}{\text{A}}_{11}=\Sigma \pm a_{11}a_{22}a_{33}...a_{nn}$ , wherein $a_{11}$ is not to be changed, but the second suffixes in the product $a_{22}a_{33}...a_{nn}$ assume all permutations, the number of transpositions necessary determining the sign to be affixed to the member.

Hence $a_{11}{\text{A}}_{11}=a_{11}\Sigma \pm a_{22}a_{33}...a_{nn}$ , where the cofactor of $a_{11}$ is clearly the determinant obtained by erasing the first row and the first column.

Hence ${\text{A}}_{11}={\begin{vmatrix}a_{22}&a_{23}&...&a_{2n}\\a_{32}&a_{33}&...&a_{3n}\\.&.&...&.\\a_{n2}&a_{n3}&...&a_{nn}\end{vmatrix}}$

Similarly ${\text{A}}_{ik}$ , the cofactor of $a_{ik}$ , is shown to be the product of $(-)^{i+k}$ and the determinant obtained by erasing from $\Delta$ the i^th row and k^th column. No member of a determinant can involve more than one element from the first row. Hence we have the development

$\Delta =a_{11}{\text{A}}_{11}+a_{12}{\text{A}}_{12}+a_{13}{\text{A}}_{13}+...+a_{1n}{\text{A}}_{1n}$ ,

proceeding according to the elements of the first row and the corresponding minors.

Similarly we have a development proceeding according to the elements contained in any row or in any column, viz.

$\left.{\begin{matrix}\Delta =a_{i1}{\text{A}}_{i1}+a_{i2}{\text{A}}_{i2}+a_{i3}{\text{A}}_{i3}+...+a_{in}{\text{A}}_{in}\\\Delta =a_{1k}{\text{A}}_{1k}+a_{2k}{\text{A}}_{2k}+a_{3k}{\text{A}}_{3k}+...+a_{nk}{\text{A}}_{nk}\end{matrix}}\right\}\quad ({\text{A}}).$

This theory enables the evaluation of a determinant by successive reduction of the orders of the determinants involved.

Ex. gr. ${\begin{aligned}{\begin{vmatrix}1&0&3\\2&1&6\\0&-5&3\end{vmatrix}}&=1{\begin{vmatrix}1&6\\-5&3\end{vmatrix}}-0{\begin{vmatrix}2&6\\0&3\end{vmatrix}}+3{\begin{vmatrix}2&1\\0&-5\end{vmatrix}}\\&=1\left|3\right|-6\left|-5\right|+3\cdot 2\left|-5\right|-3\cdot 1\left|0\right|\\&=3+30-30-0=3.\end{aligned}}$

Since the determinant

${\begin{vmatrix}a_{21}&a_{22}&a_{23}&...&a_{2n}\\a_{21}&a_{22}&a_{23}&...&a_{2n}\\a_{31}&a_{32}&a_{33}&...&a_{3n}\\.&.&.&...&.\\a_{n1}&a_{n2}&a_{n3}&...&a_{nn}\end{vmatrix}}$ , having two identical rows,

vanishes identically; we have by development according to the elements of the first row

$a_{21}{\text{A}}_{11}+a_{22}{\text{A}}_{12}+a_{23}{\text{A}}_{13}+...+a_{2n}{\text{A}}_{1n}=0$ ;

and, in general, since

$a_{i1}{\text{A}}_{i1}+a_{i2}{\text{A}}_{i2}+a_{i3}{\text{A}}_{i3}+...+a_{in}{\text{A}}_{in}=\Delta$ ,

if we suppose the i^th and k^th rows identical

$a_{k1}{\text{A}}_{i1}+a_{k2}{\text{A}}_{i2}+a_{k3}{\text{A}}_{i3}+...+a_{kn}{\text{A}}_{in}=0\qquad (k\gtrless i)$ ;

and proceeding by columns instead of rows,

$a_{1i}{\text{A}}_{1k}+a_{2i}{\text{A}}_{2k}+a_{3i}{\text{A}}_{3k}+...+a_{ni}{\text{A}}_{nk}=0\qquad (k\gtrless i)$

identical relations always satisfied by these minors.
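The development by a row and the vanishing relations just stated lend themselves to a short numerical check. The Python sketch below is an added illustration with hypothetical function names and 0-based indices: it forms the cofactors by erasing a row and a column, develops the determinant by its first row, and then sums the elements of one row against the cofactors of another.

```python
def minor_matrix(a, i, k):
    """Erase row i and column k (0-based) from the square array a."""
    return [row[:k] + row[k + 1:] for r, row in enumerate(a) if r != i]


def det(a):
    """Determinant by development along the first row."""
    if len(a) == 1:
        return a[0][0]
    return sum(a[0][k] * cofactor(a, 0, k) for k in range(len(a)))


def cofactor(a, i, k):
    """Signed minor A_ik: (-1)^(i+k) times the determinant left after erasure."""
    return (-1) ** (i + k) * det(minor_matrix(a, i, k))


def row_times_cofactors(a, elements_row, cofactor_row):
    """Sum over s of a[elements_row][s] * A[cofactor_row][s]."""
    return sum(a[elements_row][s] * cofactor(a, cofactor_row, s)
               for s in range(len(a)))


example = [[1, 0, 3], [2, 1, 6], [0, -5, 3]]
print(det(example))                        # 3
print(row_times_cofactors(example, 1, 1))  # 3, development by the second row
print(row_times_cofactors(example, 1, 0))  # 0, second-row elements with first-row cofactors
```

Because $(-)^{i+k}$ depends only on the parity of $i+k$, numbering from 0 instead of 1 does not alter any sign.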

If in the first relation of $(A)$ we write $a_{is}=b_{is}+c_{is}+d_{is}+...$ we find that $\Sigma a_{is}{\text{A}}_{is}=\Sigma b_{is}{\text{A}}_{is}+\Sigma c_{is}{\text{A}}_{is}+\Sigma d_{is}{\text{A}}_{is}+...$ so that $\Delta$ breaks up into a sum of determinants, and we also obtain a theorem for the addition of determinants which have $n-1$ rows in common. If we multiply the elements of the second row by an arbitrary magnitude $\lambda$ , and add to the corresponding elements of the first row, $\Delta$ becomes $\Sigma a_{1s}{\text{A}}_{1s}+\lambda \Sigma a_{2s}{\text{A}}_{1s}=\Delta$ , showing that the value of the determinant is unchanged. In general we can prove in the same way the—

Theorem.—The value of a determinant is unchanged if we add to the elements of any row or column the corresponding elements of the other rows or other columns respectively each multiplied by an arbitrary magnitude, such magnitude remaining constant in respect of the elements in a particular row or a particular column.

Observation.—Every factor common to all the elements of a row or of a column is obviously a factor of the determinant, and may be taken outside the determinant brackets.

Ex. gr. ${\begin{aligned}{\begin{vmatrix}\alpha ^{2}&\beta ^{2}&\gamma ^{2}\\\alpha &\beta &\gamma \\1&1&1\end{vmatrix}}&={\begin{vmatrix}\alpha ^{2}&\beta ^{2}-\alpha ^{2}&\gamma ^{2}-\alpha ^{2}\\\alpha &\beta -\alpha &\gamma -\alpha \\1&0&0\end{vmatrix}}={\begin{vmatrix}\beta ^{2}-\alpha ^{2}&\gamma ^{2}-\alpha ^{2}\\\beta -\alpha &\gamma -\alpha \end{vmatrix}}\\&=(\beta -\alpha )(\gamma -\alpha ){\begin{vmatrix}\beta +\alpha &\gamma +\alpha \\1&1\end{vmatrix}}=(\beta -\alpha )(\gamma -\alpha ){\begin{vmatrix}\beta -\gamma &\gamma +\alpha \\0&1\end{vmatrix}}\\&=(\beta -\alpha )(\gamma -\alpha )(\beta -\gamma ).\end{aligned}}$
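The theorem on adding multiples of rows, and the factorization obtained in the example, can be tried on sample numbers. The following Python sketch is an added illustration; the helper names and the sample values of $\alpha ,\beta ,\gamma$ are arbitrary assumptions.

```python
def det(a):
    """Determinant by development along the first row (1 for the empty array)."""
    if not a:
        return 1
    return sum((-1) ** k * a[0][k] * det([row[:k] + row[k + 1:] for row in a[1:]])
               for k in range(len(a)))


def add_multiple_of_row(a, source, target, factor):
    """Return a copy of a with factor * row[source] added to row[target]."""
    result = [row[:] for row in a]
    result[target] = [t + factor * s for t, s in zip(a[target], a[source])]
    return result


alpha, beta, gamma = 2, 3, 5  # arbitrary sample values
v = [[alpha ** 2, beta ** 2, gamma ** 2], [alpha, beta, gamma], [1, 1, 1]]
shifted = add_multiple_of_row(v, source=1, target=0, factor=7)
product = (beta - alpha) * (gamma - alpha) * (beta - gamma)
print(det(v), det(shifted), product)  # prints: -6 -6 -6
```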

The minor ${\text{A}}_{ik}$ is ${\frac {\partial \Delta }{\partial a_{ik}}}$ , and is itself a determinant of order $n-1$ . We may therefore differentiate again in regard to any element $a_{rs}$ where $r\gtrless i$ , $s\gtrless k$ ; we will thus obtain a minor of ${\text{A}}_{ik}$ , which is a minor also of $\Delta$ of order $n-2$ . It will be ${\text{A}}_{ik \atop rs}={\frac {\partial {\text{A}}_{ik}}{\partial a_{rs}}}={\frac {\partial ^{2}\Delta }{\partial a_{ik}\partial a_{rs}}}$ and will be obtained by erasing from the determinant ${\text{A}}_{ik}$ the row and column containing the element $a_{rs}$ ; this was originally the r^th row and the s^th column of $\Delta$ ; the r^th row of $\Delta$ is the r^th or (r−1)^th row of ${\text{A}}_{ik}$ according as $r<i$ or $r>i$ , and the s^th column of $\Delta$ is the s^th or (s−1)^th column of ${\text{A}}_{ik}$ according as $s<k$ or $s>k$ . Hence, if $T_{ri}$ denote the number of transpositions necessary to bring the succession $ri$ into ascending order of magnitude, the sign to be attached to the determinant arrived at by erasing the i^th and r^th rows and the k^th and s^th columns from $\Delta$ in order to produce ${\text{A}}_{ik \atop rs}$ will be $-1$ raised to the power of $T_{ri}+T_{sk}+i+k+r+s$ .

Similarly proceeding to the minors of order $n-3$ , we find that ${\text{A}}_{ik \atop {rs \atop tu}}={\frac {\partial }{\partial a_{tu}}}{\text{A}}_{ik \atop rs}={\frac {\partial ^{2}}{\partial a_{rs}\partial a_{tu}}}{\text{A}}_{ik}={\frac {\partial ^{3}}{\partial a_{ik}\partial a_{rs}\partial a_{tu}}}\Delta$ is obtained from $\Delta$ by erasing the i^th, r^th, t^th rows and the k^th, s^th, u^th columns, and multiplying the resulting determinant by $-1$ raised to the power $T_{tri}+T_{usk}+i+k+r+s+t+u$ ; and the general law is clear.

Corresponding Minors.—In obtaining the minor ${\text{A}}_{ik \atop rs}$ in the form of a determinant we erased certain rows and columns, and we would have erased in an exactly similar manner had we been forming the determinant associated with ${\text{A}}_{is \atop rk}$ , since the deleting lines intersect in two pairs of points. In the latter case the sign is determined by $-1$ raised to the same power as before, with the exception that $T_{ks}$ replaces $T_{sk}$ ; but if one of these numbers be even the other must be uneven; hence

${\text{A}}_{ik \atop rs}=-{\text{A}}_{is \atop rk}$ .

Moreover

$a_{ik}a_{rs}{\text{A}}_{ik \atop rs}+a_{is}a_{rk}{\text{A}}_{is \atop rk}={\begin{vmatrix}a_{ik}&a_{is}\\a_{rk}&a_{rs}\end{vmatrix}}{\text{A}}_{ik \atop rs}$ ,

where the determinant factor is given by the four points in which the deleting lines intersect. This determinant and that associated with ${\text{A}}_{ik \atop rs}$ are termed corresponding determinants. Similarly $p$ lines of deletion intersecting in $p^{2}$ points yield corresponding determinants of orders $p$ and $n-p$ respectively. Recalling the formula

$\Delta =a_{11}{\text{A}}_{11}+a_{12}{\text{A}}_{12}+a_{13}{\text{A}}_{13}+...+a_{1n}{\text{A}}_{1n}$ ,

it will be seen that $a_{1k}$ and ${\text{A}}_{1k}$ involve corresponding determinants. Since ${\text{A}}_{1k}$ is a determinant we similarly obtain

${\text{A}}_{1k}=a_{21}{\text{A}}_{1k \atop 21}+...+a_{2,k-1}{\text{A}}_{1,k \atop 2,k-1}+...+a_{2,n}{\text{A}}_{1,k \atop 2,n}$ ,

and thence

$\Delta =\Sigma _{i,k}a_{1i}a_{2k}{\text{A}}_{1i \atop 2k}\quad i\gtrless k$ ;

and as before

$\Delta =\sum _{i,k}{\begin{vmatrix}a_{1i}&a_{2i}\\a_{1k}&a_{2k}\end{vmatrix}}{\text{A}}_{1i \atop 2k}\quad i>k$ ,

an important expansion of $\Delta$ .

Similarly

$\Delta =\sum _{i,k,r}{\begin{vmatrix}a_{1i}&a_{2i}&a_{3i}\\a_{1k}&a_{2k}&a_{3k}\\a_{1r}&a_{2r}&a_{3r}\end{vmatrix}}{\text{A}}_{1i \atop {2k \atop 3r}}\quad i>k>r$ ,

and the general theorem is manifest, and yields a development in a sum of products of corresponding determinants. If the j^th column be identical with the i^th the determinant $\Delta$ vanishes identically; hence if $j$ be not equal to $i$ , $k$ or $r$ ,

$0=\sum {\begin{vmatrix}a_{1j}&a_{2j}&a_{3j}\\a_{1k}&a_{2k}&a_{3k}\\a_{1r}&a_{2r}&a_{3r}\end{vmatrix}}{\text{A}}_{1i \atop {2k \atop 3r}}$ .

Similarly, by putting one or more of the deleted rows or columns equal to rows or columns which are not deleted, we obtain, with Laplace, a number of identities between products of determinants of complementary orders.
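The development in products of corresponding determinants can be illustrated for the first two rows. The Python sketch below is an added example, not the author's notation: it takes each pair of columns $i<k$ (numbered from 0), multiplies the second-order determinant standing in the first two rows by the determinant of the remaining rows and columns, and attaches the sign $(-1)^{i+k+1}$, which is the sign of the member $a_{1i}a_{2k}$ restated for 0-based suffixes. The function names are hypothetical.

```python
from itertools import combinations


def det(a):
    """Determinant by development along the first row (1 for the empty array)."""
    if not a:
        return 1
    return sum((-1) ** k * a[0][k] * det([row[:k] + row[k + 1:] for row in a[1:]])
               for k in range(len(a)))


def expand_by_first_two_rows(a):
    """Sum of 2x2 minors from rows 0 and 1 times their signed complementary minors."""
    n = len(a)
    total = 0
    for i, k in combinations(range(n), 2):  # column pairs with i < k
        two_by_two = a[0][i] * a[1][k] - a[0][k] * a[1][i]
        rest = [[row[c] for c in range(n) if c not in (i, k)] for row in a[2:]]
        total += (-1) ** (i + k + 1) * two_by_two * det(rest)
    return total


m = [[2, 0, 1, 3], [1, 1, 0, 2], [0, 4, 1, 1], [3, 1, 2, 0]]
print(expand_by_first_two_rows(m) == det(m))  # True
```

Taking each pair of columns once, with $i<k$, corresponds to the restriction $i>k$ in the text with the two suffixes exchanged.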

Multiplication.—From the theorem given above for the expansion of a determinant as a sum of products of pairs of corresponding determinants it will be plain that the product of $\Delta =(a_{11},a_{22},...a_{nn})$ and $D=(b_{11},b_{22},...b_{nn})$ may be written as a determinant of order $2n$ , viz.

${\begin{vmatrix}a_{11}&a_{21}&a_{31}&...&a_{n1}&-1&0&0&...&0\\a_{12}&a_{22}&a_{32}&...&a_{n2}&0&-1&0&...&0\\a_{13}&a_{23}&a_{33}&...&a_{n3}&0&0&-1&...&0\\.&.&.&...&.&.&.&.&...&.\\a_{1n}&a_{2n}&a_{3n}&...&a_{nn}&0&0&0&...&-1\\0&0&0&...&0&b_{11}&b_{12}&b_{13}&...&b_{1n}\\0&0&0&...&0&b_{21}&b_{22}&b_{23}&...&b_{2n}\\0&0&0&...&0&b_{31}&b_{32}&b_{33}&...&b_{3n}\\.&.&.&...&.&.&.&.&...&.\\0&0&0&...&0&b_{n1}&b_{n2}&b_{n3}&...&b_{nn}\\\end{vmatrix}}~{\begin{matrix}={\begin{vmatrix}{\text{A}}&{\text{B}}\\{\text{C}}&{\text{D}}\end{vmatrix}}\\{\text{for brevity.}}\end{matrix}}$

Multiply the 1^st, 2^nd ... n^th rows by $b_{11},b_{12},...b_{1n}$ respectively, and add to the (n+1)^th row; by $b_{21},b_{22}\dots b_{2n}$ , and add to the (n+2)^th row; by $b_{31},b_{32}\dots b_{3n}$ and add to the (n+3)^rd row, &c. C then becomes

${\begin{vmatrix}a_{11}b_{11}+a_{12}b_{12}+\dots +a_{1n}b_{1n},&a_{21}b_{11}+a_{22}b_{12}+\dots +a_{2n}b_{1n},&\dots &a_{n1}b_{11}+a_{n2}b_{12}+\dots +a_{nn}b_{1n}\\a_{11}b_{21}+a_{12}b_{22}+\dots +a_{1n}b_{2n},&a_{21}b_{21}+a_{22}b_{22}+\dots +a_{2n}b_{2n},&\dots &a_{n1}b_{21}+a_{n2}b_{22}+\dots +a_{nn}b_{2n}\\a_{11}b_{31}+a_{12}b_{32}+\dots +a_{1n}b_{3n},&a_{21}b_{31}+a_{22}b_{32}+\dots +a_{2n}b_{3n},&\dots &a_{n1}b_{31}+a_{n2}b_{32}+\dots +a_{nn}b_{3n}\\.~~.~~.&.~~.~~.&&.~~.~~.\\a_{11}b_{n1}+a_{12}b_{n2}+\dots +a_{1n}b_{nn},&a_{21}b_{n1}+a_{22}b_{n2}+\dots +a_{2n}b_{nn},&\dots &a_{n1}b_{n1}+a_{n2}b_{n2}+\dots +a_{nn}b_{nn}\\\end{vmatrix}}$

and all the elements of D become zero. Now by the expansion theorem the determinant becomes

$(-)^{1+2+3+\ldots +2n}{\mbox{B.C}}=(-1)^{n(2n+1)+n}{\mbox{C}}={\mbox{C}}.$

We thus obtain for the product a determinant of order $n$ . We may say that, in the resulting determinant, the element in the i ^th row and k^th column is obtained by multiplying the elements in the k^th row of the first determinant severally by the elements in the i ^th row of the second, and has the expression

$a_{k1}b_{i1}+a_{k2}b_{i2}+a_{k3}b_{i3}+\dots +a_{kn}b_{in}$ ,

and we obtain other expressions by transforming either or both determinants so as to read by columns as they formerly did by rows.
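The rule for the elements of the product is easily tried on numbers. In the Python sketch below, an added illustration with hypothetical names and 0-based suffixes, the element in row $i$ and column $k$ of the product is formed from row $k$ of the first array and row $i$ of the second, exactly as stated above, and the determinant of the result is compared with the product of the two determinants.

```python
def det(a):
    """Determinant by development along the first row (1 for the empty array)."""
    if not a:
        return 1
    return sum((-1) ** k * a[0][k] * det([row[:k] + row[k + 1:] for row in a[1:]])
               for k in range(len(a)))


def product_by_rows(a, b):
    """Element (i, k) is a[k][0]*b[i][0] + a[k][1]*b[i][1] + ... ."""
    n = len(a)
    return [[sum(a[k][j] * b[i][j] for j in range(n)) for k in range(n)]
            for i in range(n)]


a = [[1, 0, 3], [2, 1, 6], [0, -5, 3]]
b = [[2, 1, 0], [0, 1, 4], [1, 0, 1]]
c = product_by_rows(a, b)
print(det(a), det(b), det(c))  # prints: 3 6 18
```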

Remark.—In particular the square of a determinant is a determinant of the same order $(b_{11}b_{22}b_{33}\dots b_{nn})$ such that $b_{ik}=b_{ki}$ ; it is for this reason termed symmetrical.

The Adjoint or Reciprocal Determinant arises from $\Delta =(a_{11}a_{22}a_{33}\dots a_{nn})$ by substituting for each element $a_{ik}$ the corresponding minor ${\mbox{A}}_{ik}$ so as to form ${\mbox{D}}=({\mbox{A}}_{11}{\mbox{A}}_{22}{\mbox{A}}_{33}\dots {\mbox{A}}_{nn})$ . If we form the product $\Delta .{\mbox{D}}$ by the theorem for the multiplication of determinants we find that the element in the i^th row and k^th column of the product is

$a_{k1}{\mbox{A}}_{i1}+a_{k2}{\mbox{A}}_{i2}+\dots +a_{kn}{\mbox{A}}_{in}$ ,

the value of which is zero when $k$ is different from $i$ , whilst it has the value $\Delta$ when $k=i$ . Hence the product determinant has the principal diagonal elements each equal to $\Delta$ and the remaining elements zero. Its value is therefore $\Delta ^{n}$ and we have the identity

${\mbox{D}}.\Delta =\Delta ^{n}$ or ${\mbox{D}}=\Delta ^{n-1}$ .

It can now be proved that the first minor of the adjoint determinant, say ${\mbox{B}}_{rs}$ is equal to $\Delta ^{n-2}a_{rs}$ .

From the equations

${\begin{aligned}a_{11}x_{1}+a_{12}x_{2}+a_{13}x_{3}+\dots &=\xi _{1},\\a_{21}x_{1}+a_{22}x_{2}+a_{23}x_{3}+\dots &=\xi _{2},\\a_{31}x_{1}+a_{32}x_{2}+a_{33}x_{3}+\dots &=\xi _{3},\\.\qquad .\qquad .\qquad &\dots \end{aligned}}$

we derive

${\begin{aligned}\Delta x_{1}&={\mbox{A}}_{11}\xi _{1}+{\mbox{A}}_{21}\xi _{2}+{\mbox{A}}_{31}\xi _{3}+\dots ,\\\Delta x_{2}&={\mbox{A}}_{12}\xi _{1}+{\mbox{A}}_{22}\xi _{2}+{\mbox{A}}_{32}\xi _{3}+\dots ,\\\Delta x_{3}&={\mbox{A}}_{13}\xi _{1}+{\mbox{A}}_{23}\xi _{2}+{\mbox{A}}_{33}\xi _{3}+\dots ,\\&\;\;.\qquad .\qquad .\qquad \dots \end{aligned}}$

and thence

${\begin{aligned}\Delta ^{n-1}\xi _{1}&={\mbox{B}}_{11}\Delta x_{1}+{\mbox{B}}_{12}\Delta x_{2}+{\mbox{B}}_{13}\Delta x_{3}+\dots ,\\\Delta ^{n-1}\xi _{2}&={\mbox{B}}_{21}\Delta x_{1}+{\mbox{B}}_{22}\Delta x_{2}+{\mbox{B}}_{23}\Delta x_{3}+\dots ,\\\Delta ^{n-1}\xi _{3}&={\mbox{B}}_{31}\Delta x_{1}+{\mbox{B}}_{32}\Delta x_{2}+{\mbox{B}}_{33}\Delta x_{3}+\dots ,\\&\;\;.\qquad .\qquad .\qquad \dots \end{aligned}}$

and comparison of the first and third systems yields

${\mbox{B}}_{rs}=\Delta ^{n-2}a_{rs}$ .

In general it can be proved that any minor of order $p$ of the adjoint is equal to the complementary of the corresponding minor of the original multiplied by the (p – 1)^th power of the original determinant.

Theorem.—The adjoint determinant is the (n – 1)^th power of the original determinant. The adjoint determinant will be seen subsequently to present itself in the theory of linear equations and in the theory of linear transformation.
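Both statements about the adjoint, that its value is $\Delta ^{n-1}$ and that its first minors are $\Delta ^{n-2}a_{rs}$ , can be tested on a small array. The Python sketch below is an added illustration under the assumption of 0-based suffixes; the function names are hypothetical.

```python
def minor_matrix(a, i, k):
    """Erase row i and column k (0-based) from the square array a."""
    return [row[:k] + row[k + 1:] for r, row in enumerate(a) if r != i]


def det(a):
    """Determinant by development along the first row (1 for the empty array)."""
    if not a:
        return 1
    return sum((-1) ** k * a[0][k] * det(minor_matrix(a, 0, k))
               for k in range(len(a)))


def cofactor(a, i, k):
    """Signed minor A_ik."""
    return (-1) ** (i + k) * det(minor_matrix(a, i, k))


def adjoint(a):
    """Array whose (i, k) element is the minor A_ik of a."""
    n = len(a)
    return [[cofactor(a, i, k) for k in range(n)] for i in range(n)]


a = [[1, 0, 3], [2, 1, 6], [0, -5, 3]]
n, delta = len(a), det(a)
adj = adjoint(a)
print(det(adj) == delta ** (n - 1))  # True
print(all(cofactor(adj, r, s) == delta ** (n - 2) * a[r][s]
          for r in range(n) for s in range(n)))  # True
```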

Determinants of Special Forms.—It was observed above that the square of a determinant when expressed as a determinant of the same order is such that its elements have the property expressed by $a_{ik}=a_{ki}$ . Such determinants are called symmetrical. It is easy to see that the adjoint determinant is also symmetrical, viz. such that ${\mbox{A}}_{ik}={\mbox{A}}_{ki}$ , for the determinant got by suppressing the i ^th row and k^th column differs only by an interchange of rows and columns from that got by suppressing the k^th row and i ^th column. If any symmetrical determinant vanish and be bordered as shown below

${\begin{vmatrix}a_{11}&a_{12}&a_{13}&\lambda _{1}\\a_{12}&a_{22}&a_{23}&\lambda _{2}\\a_{13}&a_{23}&a_{33}&\lambda _{3}\\\lambda _{1}&\lambda _{2}&\lambda _{3}&.\end{vmatrix}}$

it is a perfect square when considered as a function of $\lambda _{1},\lambda _{2},\lambda _{3}$ . For since ${\mbox{A}}_{11}{\mbox{A}}_{22}-{\mbox{A}}_{12}^{2}=\Delta a_{33}$ , with similar relations, we have a number of relations similar to ${\mbox{A}}_{11}{\mbox{A}}_{22}={\mbox{A}}_{12}^{2}$ , and either ${\mbox{A}}_{rs}=+{\sqrt {{\mbox{A}}_{rr}{\mbox{A}}_{ss}}}$ or $-{\sqrt {{\mbox{A}}_{rr}{\mbox{A}}_{ss}}}$ for all different values of $r$ and $s$ . Now the determinant has the value

$-\{\lambda _{1}^{2}{\mbox{A}}_{11}+\lambda _{2}^{2}{\mbox{A}}_{22}+\lambda _{3}^{2}{\mbox{A}}_{33}+2\lambda _{2}\lambda _{3}{\mbox{A}}_{23}+2\lambda _{3}\lambda _{1}{\mbox{A}}_{31}+2\lambda _{1}\lambda _{2}{\mbox{A}}_{12}\}$

$=-\Sigma \lambda _{r}^{2}{\mbox{A}}_{rr}-2\Sigma \lambda _{r}\lambda _{s}{\mbox{A}}_{rs}$ in general, and hence by substitution

$\pm \{\lambda _{1}{\sqrt {{\mbox{A}}_{11}}}+\lambda _{2}{\sqrt {{\mbox{A}}_{22}}}+\dots +\lambda _{n}{\sqrt {{\mbox{A}}_{nn}}}\}^{2}.$

A skew symmetric determinant has $a_{rr}=0$ and $a_{rs}=-a_{sr}$ for all values of $r$ and $s$ . Such a determinant when of uneven degree vanishes, for if we multiply each row by $-1$ we multiply the determinant by $(-1)^{n}=-1$ , and the effect of this is otherwise merely to transpose the determinant so that it reads by rows as it formerly did by columns, an operation which we know leaves the determinant unaltered. Hence $\Delta =-\Delta$ or $\Delta =0$ . When a skew symmetric determinant is of even degree it is a perfect square. This theorem is due to Cayley, and reference may be made to Salmon’s Higher Algebra, 4th ed. Art. 39. In the case of the determinant of order 4 the square root is

$a_{12}a_{34}-a_{13}a_{24}+a_{14}a_{23}$ .
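The two properties of skew symmetric determinants may be verified on sample numbers. The Python sketch below is an added illustration; it reads the order-4 square root as an expression in the six elements above the leading diagonal (an assumption about the intended symbols), and the helper names and numerical values are arbitrary.

```python
def det(a):
    """Determinant by development along the first row (1 for the empty array)."""
    if not a:
        return 1
    return sum((-1) ** k * a[0][k] * det([row[:k] + row[k + 1:] for row in a[1:]])
               for k in range(len(a)))


def skew_symmetric_4(a12, a13, a14, a23, a24, a34):
    """Order-4 skew symmetric array built from its six independent elements."""
    return [[0, a12, a13, a14],
            [-a12, 0, a23, a24],
            [-a13, -a23, 0, a34],
            [-a14, -a24, -a34, 0]]


def square_root_4(a12, a13, a14, a23, a24, a34):
    """The expression a12*a34 - a13*a24 + a14*a23."""
    return a12 * a34 - a13 * a24 + a14 * a23


values = (1, 2, 3, 4, 5, 6)  # arbitrary sample elements
print(det(skew_symmetric_4(*values)), square_root_4(*values) ** 2)  # prints: 64 64

odd = [[0, 1, 2], [-1, 0, 4], [-2, -4, 0]]
print(det(odd))  # 0, as for every skew symmetric determinant of uneven order
```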

A skew determinant is one which is skew symmetric in all respects, except that the elements of the leading diagonal are not all zero. Such a determinant is of importance in the theory of orthogonal substitution. In the theory of surfaces we transform from one set of three rectangular axes to another by the substitutions

${\mbox{X}}=ax\ +by\ +cz,$

${\mbox{Y}}=a'x+b'y+c'z,$

${\mbox{Z}}=a''x+b''y+c''z,$

where ${\mbox{X}}^{2}+{\mbox{Y}}^{2}+{\mbox{Z}}^{2}=x^{2}+y^{2}+z^{2}$ . This relation implies six equations between the coefficients, so that only three of them are independent. Further we find

$x=a{\mbox{X}}+a'{\mbox{Y}}+a''{\mbox{Z}},$

$y=b{\mbox{X}}+b'{\mbox{Y}}+b''{\mbox{Z}},$

$z=c{\mbox{X}}+c'{\mbox{Y}}+c''{\mbox{Z}},$

and the problem is to express the nine coefficients in terms of three independent quantities.

In general in space of $n$ dimensions we have $n$ substitutions similar to

$X_{1}=a_{11}x_{1}+a_{12}x_{2}+\dots +a_{1n}x_{n}$ ,

and we have to express the $n^{2}$ coefficients in terms of ${\tfrac {1}{2}}n(n-1)$ independent quantities; which must be possible, because

${\begin{aligned}x_{1}&=b_{11}\xi _{1}+b_{12}\xi _{2}+b_{13}\xi _{3}+\dots ,\\x_{2}&=b_{21}\xi _{1}+b_{22}\xi _{2}+b_{23}\xi _{3}+\dots ,\\&\;\;.\qquad .\qquad .\qquad .\\X_{1}&=b_{11}\xi _{1}+b_{21}\xi _{2}+b_{31}\xi _{3}+\dots ,\\X_{2}&=b_{12}\xi _{1}+b_{22}\xi _{2}+b_{32}\xi _{3}+\dots ,\\&\;\;.\qquad .\qquad .\qquad .\end{aligned}}$

where $b_{rr}=1$ and $b_{rs}=-b_{sr}$ for all values of $r$ and $s$ . There are then ${\tfrac {1}{2}}n(n-1)$ quantities $b_{rs}$ . Let the determinant of the b’s be $\Delta _{b}$ and $B_{rs}$ , the minor corresponding to $b_{rs}$ . We can eliminate the quantities $\xi _{1},\xi _{2},\dots \xi _{n}$ and obtain $n$ relations

$\Delta _{b}{\text{X}}_{1}=(2{\text{B}}_{11}-\Delta _{b})x_{1}$ $+2{\text{B}}_{21}x_{2}+2{\text{B}}_{31}x_{3}+\dots ,$

$\Delta _{b}{\text{X}}_{2}=$ $2{\text{B}}_{12}x_{1}+(2{\text{B}}_{22}-\Delta _{b})x_{2}+2{\text{B}}_{32}x_{3}+\dots ,$

$.$ $.$ $.$ $.$ $.$ $.$ $.$

and from these another equivalent set

$\Delta _{b}x_{1}=(2{\text{B}}_{11}-\Delta _{b}){\text{X}}_{1}$ $+2{\text{B}}_{12}{\text{X}}_{2}+2{\text{B}}_{13}{\text{X}}_{3}+\dots ,$

$\Delta _{b}x_{2}=$ $2{\text{B}}_{21}{\text{X}}_{1}+(2{\text{B}}_{22}-\Delta _{b}){\text{X}}_{2}+2{\text{B}}_{23}{\text{X}}_{3}+\dots ,$

$.$ $.$ $.$ $.$ $.$ $.$ $.$

and now writing

${\frac {2{\text{B}}_{ii}-\Delta _{b}}{\Delta _{b}}}=a_{ii},\qquad {\frac {2{\text{B}}_{ik}}{\Delta _{b}}}=a_{ik},$

we have a transformation which is orthogonal, because $\Sigma X^{2}=\Sigma x^{2}$ and the elements $a_{ii}$ , $a_{ik}$ are functions of the ${\tfrac {1}{2}}n(n-1)$ independent quantities $b$ . We may therefore form an orthogonal transformation in association with every skew determinant which has its leading diagonal elements unity, for the ${\tfrac {1}{2}}n(n-1)$ quantities $b$ are clearly arbitrary.

For the second order we may take

$\Delta _{b}={\begin{vmatrix}1,&\lambda \\-\lambda ,&1\end{vmatrix}}=1+\lambda ^{2}$ ,

and the adjoint determinant is the same; hence

$(1+\lambda ^{2})x_{1}=(1-\lambda ^{2}){\text{X}}_{1}+$ $2\lambda {\text{X}}_{2},$

$(1+\lambda ^{2})x_{2}$ $=-2\lambda {\text{X}}_{1}+(1-\lambda ^{2}){\text{X}}_{2}.$

Similarly, for the order 3, we take

$\Delta _{b}={\begin{vmatrix}1&\nu &-\mu \\-\nu &1&\lambda \\\mu &-\lambda &1\end{vmatrix}}=1+\lambda ^{2}+\mu ^{2}+\nu ^{2},$

and the adjoint is

${\begin{vmatrix}1+\lambda ^{2}&\nu +\lambda \mu &-\mu +\lambda \nu \\-\nu +\lambda \mu &1+\mu ^{2}&\lambda +\mu \nu \\\mu +\lambda \nu &-\lambda +\mu \nu &1+\nu ^{2}\end{vmatrix}}$ ,

leading to the orthogonal substitution

$\Delta _{b}x_{1}=$	$(1+\lambda ^{2}-\mu ^{2}-\nu ^{2}){\text{X}}_{1}$	$+2(\nu +\lambda \mu ){\text{X}}_{2}$	$+2(-\mu +\lambda \nu ){\text{X}}_{3}$
$\Delta _{b}x_{2}=$	$2(\lambda \mu -\nu ){\text{X}}_{1}$	$+(1+\mu ^{2}-\lambda ^{2}-\nu ^{2}){\text{X}}_{2}$	$+2(\mu \nu +\lambda ){\text{X}}_{3}$
$\Delta _{b}x_{3}=$	$2(\lambda \nu +\mu ){\text{X}}_{1}$	$+2(\mu \nu -\lambda ){\text{X}}_{2}$	$+(1+\nu ^{2}-\lambda ^{2}-\mu ^{2}){\text{X}}_{3}$ .
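The construction of an orthogonal substitution from a skew determinant with unit diagonal can be carried out numerically. The Python sketch below is an added illustration with hypothetical names: it forms $a_{ii}=(2{\text{B}}_{ii}-\Delta _{b})/\Delta _{b}$ and $a_{ik}=2{\text{B}}_{ik}/\Delta _{b}$ in exact rational arithmetic and checks that the rows of the resulting array are orthogonal and of unit length. The sample values of $\lambda ,\mu ,\nu$ are arbitrary.

```python
from fractions import Fraction


def minor_matrix(a, i, k):
    """Erase row i and column k (0-based) from the square array a."""
    return [row[:k] + row[k + 1:] for r, row in enumerate(a) if r != i]


def det(a):
    """Determinant by development along the first row (1 for the empty array)."""
    if not a:
        return 1
    return sum((-1) ** k * a[0][k] * det(minor_matrix(a, 0, k))
               for k in range(len(a)))


def cofactor(a, i, k):
    """Signed minor B_ik."""
    return (-1) ** (i + k) * det(minor_matrix(a, i, k))


def orthogonal_from_skew(b):
    """Coefficients a_ik = (2*B_ik - delta*[i == k]) / delta of the substitution."""
    n, delta = len(b), det(b)
    return [[Fraction(2 * cofactor(b, i, k) - (delta if i == k else 0), delta)
             for k in range(n)] for i in range(n)]


lam, mu, nu = 1, 2, 3  # arbitrary sample parameters
b = [[1, nu, -mu], [-nu, 1, lam], [mu, -lam, 1]]
a = orthogonal_from_skew(b)
gram = [[sum(a[i][j] * a[k][j] for j in range(3)) for k in range(3)] for i in range(3)]
print(gram == [[1, 0, 0], [0, 1, 0], [0, 0, 1]])  # True
```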

Functional determinants were first investigated by Jacobi in a work De Determinantibus Functionalibus. Suppose $n$ dependent variables $y_{1},y_{2},\dots y_{n}$ , each of which is a function of $n$ independent variables $x_{1},x_{2},\dots x_{n}$ , so that $y_{s}=f_{s}(x_{1},x_{2},\dots x_{n})$ . From the differential coefficients of the y’s with regard to the x’s we form the functional determinant

${\text{R}}={\begin{vmatrix}{\frac {\partial y_{1}}{\partial x_{1}}}&{\frac {\partial y_{1}}{\partial x_{2}}}&\dots &{\frac {\partial y_{1}}{\partial x_{n}}}\\{\frac {\partial y_{2}}{\partial x_{1}}}&{\frac {\partial y_{2}}{\partial x_{2}}}&\dots &{\frac {\partial y_{2}}{\partial x_{n}}}\\.&.&\dots &.\\{\frac {\partial y_{n}}{\partial x_{1}}}&{\frac {\partial y_{n}}{\partial x_{2}}}&\dots &{\frac {\partial y_{n}}{\partial x_{n}}}\\\end{vmatrix}}{\begin{matrix}={\binom {y_{1},~y_{2},\dots y_{n}}{x_{1},~x_{2},\dots x_{n}}}\\{\text{for brevity.}}\end{matrix}}$

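If we have new variables $z$ such that $z_{s}=\phi _{s}(y_{1},y_{2},\dots y_{n})$ , we have also $z_{s}=\psi _{s}(x_{1},x_{2},\dots x_{n})$ , and we may consider the three determinants

${\binom {y_{1},~y_{2},\dots y_{n}}{x_{1},~x_{2},\dots x_{n}}},\quad {\binom {z_{1},~z_{2},\dots z_{n}}{y_{1},~y_{2},\dots y_{n}}},\quad {\binom {z_{1},~z_{2},\dots z_{n}}{x_{1},~x_{2},\dots x_{n}}}.$

Forming the product of the first two by the product theorem, we obtain for the element in the i^th row and k^th column

${\frac {\partial z_{i}}{\partial y_{1}}}{\frac {\partial y_{1}}{\partial x_{k}}}+{\frac {\partial z_{i}}{\partial y_{2}}}{\frac {\partial y_{2}}{\partial x_{k}}}+\dots +{\frac {\partial z_{i}}{\partial y_{n}}}{\frac {\partial y_{n}}{\partial x_{k}}},$

which is ${\frac {\partial z_{i}}{\partial x_{k}}}$ , the partial differential coefficient of $z_{i}$ with regard to $x_{k}$ . Hence the product theorem

${\binom {z_{1},~z_{2},\dots z_{n}}{y_{1},~y_{2},\dots y_{n}}}{\binom {y_{1},~y_{2},\dots y_{n}}{x_{1},~x_{2},\dots x_{n}}}={\binom {z_{1},~z_{2},\dots z_{n}}{x_{1},~x_{2},\dots x_{n}}};$

and as a particular case

${\binom {y_{1},~y_{2},\dots y_{n}}{x_{1},~x_{2},\dots x_{n}}}{\binom {x_{1},~x_{2},\dots x_{n}}{y_{1},~y_{2},\dots y_{n}}}=1.$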
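The product theorem for functional determinants can be illustrated with two variables. In the Python sketch below, which is an added example, the functions $y_{1}=x_{1}^{2}-x_{2}^{2}$ , $y_{2}=2x_{1}x_{2}$ and $z_{1}=y_{1}+y_{2}$ , $z_{2}=y_{1}y_{2}$ are hypothetical choices, and their partial differential coefficients have been written out by hand; the three functional determinants are then compared at sample points.

```python
def det2(m):
    """Second-order determinant."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]


def y_of_x(x1, x2):
    """Sample functions y1 = x1^2 - x2^2, y2 = 2*x1*x2."""
    return x1 ** 2 - x2 ** 2, 2 * x1 * x2


def dy_dx(x1, x2):
    """Array of the partial differential coefficients of the y's in the x's."""
    return [[2 * x1, -2 * x2], [2 * x2, 2 * x1]]


def dz_dy(y1, y2):
    """Coefficients for z1 = y1 + y2, z2 = y1*y2 with regard to the y's."""
    return [[1, 1], [y2, y1]]


def dz_dx(x1, x2):
    """Coefficients after substitution: z1 = x1^2 - x2^2 + 2*x1*x2,
    z2 = 2*x1^3*x2 - 2*x1*x2^3, differentiated with regard to the x's."""
    return [[2 * x1 + 2 * x2, 2 * x1 - 2 * x2],
            [6 * x1 ** 2 * x2 - 2 * x2 ** 3, 2 * x1 ** 3 - 6 * x1 * x2 ** 2]]


def product_theorem_holds(x1, x2):
    """Compare (z; y)(y; x) with (z; x) at the point (x1, x2)."""
    y1, y2 = y_of_x(x1, x2)
    return det2(dz_dy(y1, y2)) * det2(dy_dx(x1, x2)) == det2(dz_dx(x1, x2))


print(product_theorem_holds(2, 1), product_theorem_holds(1, 3))  # prints: True True
```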

Theorem.—If the functions $y_{1},y_{2},\dots y_{n}$ be not independent of one another the functional determinant vanishes, and conversely if the determinant vanishes, $y_{1},y_{2},\dots y_{n}$ are not independent functions of $x_{1},x_{2},\dots x_{n}$ .

Linear Equations.—It is of importance to study the application of the theory of determinants to the solution of a system of linear equations. Suppose given the $n$ equations

${\begin{aligned}f_{1}&=a_{11}x_{1}+a_{12}x_{2}+\dots +a_{1n}x_{n}=0,\\f_{2}&=a_{21}x_{1}+a_{22}x_{2}+\dots +a_{2n}x_{n}=0,\\&\;\;.\qquad .\qquad .\qquad .\\f_{n}&=a_{n1}x_{1}+a_{n2}x_{2}+\dots +a_{nn}x_{n}=0.\end{aligned}}$

Denote by $\Delta$ the determinant $(a_{11}a_{22}\dots a_{nn})$ .

Multiplying the equations by the minors ${\text{A}}_{1\mu },{\text{A}}_{2\mu },\dots {\text{A}}_{n\mu }$ respectively, and adding, we obtain

$x_{\mu }(a_{1\mu }{\text{A}}_{1\mu }+a_{2\mu }{\text{A}}_{2\mu }+\dots +a_{n\mu }{\text{A}}_{n\mu })=x_{\mu }\Delta =0,$

since from results already given the remaining coefficients of $x_{1},x_{2},\dots x_{\mu -1},x_{\mu +1},\dots x_{n}$ vanish identically.

Hence if $\Delta$ does not vanish $x_{1}=x_{2}=\dots =x_{n}=0$ is the only solution; but if $\Delta$ vanishes the equations can be satisfied by a system of values other than zeros. For in this case the $n$ equations are not independent since identically

${\text{A}}_{1\mu }f_{1}+{\text{A}}_{2\mu }f_{2}+\dots +{\text{A}}_{n\mu }f_{n}=0,$

and assuming that the minors do not all vanish the satisfaction of $n-1$ of the equations implies the satisfaction of the n^th.

Consider then the system of n–1 equations

a₂₁x₁+ a₂₂x₂ +...+ a_2nx_n＝0
a₃₁x₁+ a₃₂x₂ +...+ a_3nx_n＝0
......
a_n1x₁+ a_n2x₂ +...+ a_nnx_n＝0,
which becomes on writing x_s/x_n＝y_s,
a₂₁y₁+ a₂₂y₂ +...+ a_2,n−1y_n−1 +a_2n＝0
a₃₁y₁+ a₃₂y₂ +...+ a_3,n−1y_n−1 +a_3n＝0
.......
a_n1y₁+ a_n2y₂ +...+ a_n,n−1y_n−1 +a_nn＝0.
We can solve these, assuming them independent, for the n−1 ratios y₁, y₂,...y_n−1.
Now
a₂₁A₁₁ + a₂₂A₁₂+...+a_2nA_1n＝0
a₃₁A₁₁ + a₃₂A₁₂+...+a_3nA_1n＝0
.......
a_n1A₁₁ + a_n2A₁₂+...+a_nnA_1n＝0
and therefore, by comparison with the given equations, x_i＝ρA_1i, where ρ is an arbitrary factor which remains constant as i varies.

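Hence y_i＝A_1i/A_1n, where A_1i and A_1n are minors of the complete determinant (a₁₁a₂₂...a_nn).

$\therefore ~y_{i}=(-)^{i+n}{\frac {\begin{vmatrix}a_{21}&a_{22}&\dots &a_{2,i-1}&a_{2,i+1}&\dots &a_{2n}\\a_{31}&a_{32}&\dots &a_{3,i-1}&a_{3,i+1}&\dots &a_{3n}\\.&.&\dots &.&.&\dots &.\\a_{n1}&a_{n2}&\dots &a_{n,i-1}&a_{n,i+1}&\dots &a_{nn}\end{vmatrix}}{\begin{vmatrix}a_{21}&a_{22}&\dots &a_{2,n-1}\\a_{31}&a_{32}&\dots &a_{3,n-1}\\.&.&\dots &.\\a_{n1}&a_{n2}&\dots &a_{n,n-1}\end{vmatrix}}},$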

or, in words, y_i is the quotient of the determinant obtained by erasing the i^th column by that obtained by erasing the n^th column, multiplied by (–1)ⁱ⁺ⁿ. For further information concerning the compatibility and independence of a system of linear equations, see Gordon, Vorlesungen über Invariantentheorie, Bd. 1, § 8.
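The rule x_i＝ρA_1i can be checked on a small numerical case. The following is only an illustrative sketch: the 3×3 matrix and the helper names (`minor`, `det`, `first_row_cofactors`) are assumptions chosen for the example, not part of the original treatment. The matrix has a vanishing determinant, so the signed minors of its first row should satisfy all three equations.

```python
from fractions import Fraction

def minor(matrix, row, col):
    """Matrix with the given row and column erased."""
    return [[v for j, v in enumerate(r) if j != col]
            for i, r in enumerate(matrix) if i != row]

def det(matrix):
    """Determinant by expansion along the first row (fine for small orders)."""
    if not matrix:
        return 1
    return sum((-1) ** j * matrix[0][j] * det(minor(matrix, 0, j))
               for j in range(len(matrix)))

def first_row_cofactors(matrix):
    """The signed minors A_11, A_12, ..., A_1n of the first row."""
    return [(-1) ** j * det(minor(matrix, 0, j)) for j in range(len(matrix))]

# Hypothetical singular system: the determinant of `a` is zero.
a = [[1, 2, 3],
     [4, 5, 6],
     [7, 8, 9]]

x = first_row_cofactors(a)          # x_i = rho * A_1i with rho = 1
residuals = [sum(c * v for c, v in zip(row, x)) for row in a]
ratios = [Fraction(v, x[-1]) for v in x[:-1]]   # y_i = A_1i / A_1n
```

Here the first residual is the determinant itself, expanded along its first row, and the other two vanish identically, as the text states.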

Resultants.—When we are given k homogeneous equations in k variables or k non-homogeneous equations in k − 1 variables, the equations being independent, it is always possible to derive from them a single equation R＝0, where in R the variables do not appear. R is a function of the coefficients which is called the "resultant" or "eliminant" of the k equations, and the process by which it is obtained is termed "elimination." We cannot combine the equations so as to eliminate the variables unless on the supposition that the equations are simultaneous, i.e. each of them satisfied by a common system of values; hence the equation R＝0 is derived on this supposition, and the vanishing of R expresses the condition that the equations can be satisfied by a common system of values assigned to the variables.

Consider two binary equations of orders m and n respectively expressed in non-homogeneous form, viz.

ƒ(x)＝ƒ＝a₀x^m – a₁x^{m–1} + a₂x^{m–2} – ...＝0,
φ(x)＝φ＝b₀x^n – b₁x^{n–1} + b₂x^{n–2} – ...＝0.
If α₁, α₂, ...α_m be the roots of ƒ＝0, β₁, β₂, ...β_n the roots of φ＝0, the condition that some root of φ＝0 may cause ƒ to vanish is clearly
R_ƒ,φ＝ƒ(β₁)ƒ(β₂)...ƒ(β_n)＝0;
so that R_ƒ,φ is the resultant of ƒ and φ, and expressed as a function of the roots, it is of degree m in each root β, and of degree n in each root α, and also a symmetric function alike of the roots α and of the roots β; hence, expressed in terms of the coefficients, it is homogeneous and of degree n in the coefficients of ƒ, and homogeneous and of degree m in the coefficients of φ.
Ex. gr.
ƒ＝a₀x² − a₁x+a₂＝0, φ＝b₀x² − b₁x+b₂.
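We have to multiply $a_{0}\beta _{1}^{2}-a_{1}\beta _{1}+a_{2}$ by $a_{0}\beta _{2}^{2}-a_{1}\beta _{2}+a_{2}$ and we obtain

$a_{0}^{2}\beta _{1}^{2}\beta _{2}^{2}-a_{0}a_{1}(\beta _{1}^{2}\beta _{2}+\beta _{1}\beta _{2}^{2})+a_{0}a_{2}(\beta _{1}^{2}+\beta _{2}^{2})+a_{1}^{2}\beta _{1}\beta _{2}-a_{1}a_{2}(\beta _{1}+\beta _{2})+a_{2}^{2},$

where

$\beta _{1}+\beta _{2}={\frac {b_{1}}{b_{0}}},\quad \beta _{1}\beta _{2}={\frac {b_{2}}{b_{0}}},\quad \beta _{1}^{2}+\beta _{2}^{2}={\frac {b_{1}^{2}-2b_{0}b_{2}}{b_{0}^{2}}},$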
and clearing of fractions
R_ƒ,φ＝(a₀b₂ – a₂b₀)² + (a₁b₀ – a₀b₁)(a₁b₂ – a₂b₁).
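The closed form for two quadratics can be compared with the root definition. The sketch below is illustrative only: the function names and the sample coefficients are assumptions, and the sign convention ƒ＝a₀x² − a₁x + a₂ is the one used in the example above. It builds φ from chosen roots β₁, β₂ and compares b₀²ƒ(β₁)ƒ(β₂), which is the product cleared of fractions, with the stated expression.

```python
def resultant_quadratics(a, b):
    """Closed form (a0*b2 - a2*b0)^2 + (a1*b0 - a0*b1)*(a1*b2 - a2*b1)."""
    a0, a1, a2 = a
    b0, b1, b2 = b
    return (a0 * b2 - a2 * b0) ** 2 + (a1 * b0 - a0 * b1) * (a1 * b2 - a2 * b1)

def f_value(a, x):
    """Value of a0*x^2 - a1*x + a2 (alternating signs, as in the text)."""
    a0, a1, a2 = a
    return a0 * x * x - a1 * x + a2

def phi_from_roots(b0, beta1, beta2):
    """Coefficients (b0, b1, b2) of b0*x^2 - b1*x + b2 with the given roots."""
    return (b0, b0 * (beta1 + beta2), b0 * beta1 * beta2)

a = (1, 3, 2)                         # f = x^2 - 3x + 2, roots 1 and 2
b = phi_from_roots(2, 3, 5)           # phi = 2x^2 - 16x + 30
from_roots = b[0] ** 2 * f_value(a, 3) * f_value(a, 5)
closed_form = resultant_quadratics(a, b)

# A common root (here 2) should make the resultant vanish.
shared = resultant_quadratics(a, phi_from_roots(1, 2, 5))
```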

We may equally express the result as
$\phi (\alpha _{1})\phi (\alpha _{2})\dots \phi (\alpha _{m})=0,$
or as
$\prod _{s,t}(\alpha _{s}-\beta _{t})=0.$

This expression of R shows that, as will afterwards appear, the resultant is a simultaneous invariant of the two forms.

The resultant being a product of mn root differences, is of degree mn in the roots, and hence is of weight mn in the coefficients of the forms; i.e. the sum of the suffixes in each term of the resultant is equal to mn.

Resultant Expressible as a Determinant.—From the theory of linear equations it can be gathered that the condition that p linear equations in p variables (homogeneous and independent) may be simultaneously satisfied is expressible as a determinant, viz. if
a₁₁x₁ + a₁₂x₂ +...+ a_1px_p＝0,
a₂₁x₁ + a₂₂x₂ +...+ a_2px_p＝0,
......
a_p1x₁ + a_p2x₂ +...+ a_ppx_p＝0,

be the system the condition is, in determinant form

(a₁₁a₂₂...a_pp)＝0;

in fact the determinant is the resultant of the equations.

Now, suppose ƒ and φ to have a common factor x – γ,

ƒ(x)＝ƒ₁(x)(x – γ); φ(x)＝φ₁(x)(x – γ),

ƒ₁ and φ₁ being of degrees m – 1 and n – 1 respectively; we have the identity φ₁(x)ƒ(x)＝ƒ₁(x)φ(x) of degree m + n – 1.

Assuming then φ₁ to have the coefficients B₁, B₂,...B_n
and ƒ₁ the coefficients A₁, A₂,...A_m,

we may equate coefficients of like powers of x in the identity, and obtain m + n homogeneous linear equations satisfied by the m + n quantities B₁, B₂,...B_n, A₁, A₂,...A_m. Forming the resultant of these equations we evidently obtain the resultant of ƒ and φ.

Thus to obtain the resultant of

ƒ＝a₀x³ + a₁x² + a₂x + a₃, φ＝b₀x² + b₁x + b₂

we assume the identity

(B₀x + B₁)(a₀x³ + a₁x² + a₂x+ a₃)＝(A₀x² + A₁x+ A₂)(b₀x² + b₁x+ b₂),

and derive the linear equations

B₀a₀		−A₀b₀			＝0,
B₀a₁	+B₁a₀	−A₀b₁	−A₁b₀		＝0,
B₀a₂	+B₁a₁	−A₀b₂	−A₁b₁	−A₂b₀	＝0,
B₀a₃	+B₁a₂		−A₁b₂	−A₂b₁	＝0,
	B₁a₃			−A₂b₂	＝0,

and by elimination we obtain the resultant

${\begin{vmatrix}a_{0}&0&b_{0}&0&0\\a_{1}&a_{0}&b_{1}&b_{0}&0\\a_{2}&a_{1}&b_{2}&b_{1}&b_{0}\\a_{3}&a_{2}&0&b_{2}&b_{1}\\0&a_{3}&0&0&b_{2}\\\end{vmatrix}}~{\begin{matrix}{\text{a numerical factor}}\\{\text{being disregarded.}}\end{matrix}}$

This is Euler’s method. Sylvester’s leads to the same expression, but in a simpler manner.

He forms n equations from ƒ by separate multiplication by x^{n–1}, x^{n–2},...x, 1, in succession, and similarly treats φ with m multipliers x^{m–1}, x^{m–2},...x, 1. From these m + n equations he eliminates the m + n powers x^{m+n–1}, x^{m+n–2},...x, 1, treating them as independent unknowns. Taking the same example as before, the process leads to the system of equations

a₀x⁴+	a₁x³+	a₂x²+	a₃x		＝0,
	a₀x³+	a₁x²+	a₂x+	a₃	＝0,
b₀x⁴+	b₁x³+	b₂x²			＝0,
	b₀x³+	b₁x²+	b₂x		＝0,
		b₀x²+	b₁x+	b₂	＝0,

whence by elimination the resultant

${\begin{vmatrix}a_{0}&a_{1}&a_{2}&a_{3}&0\\0&a_{0}&a_{1}&a_{2}&a_{3}\\b_{0}&b_{1}&b_{2}&0&0\\0&b_{0}&b_{1}&b_{2}&0\\0&0&b_{0}&b_{1}&b_{2}\\\end{vmatrix}}$

which reads by columns as the former determinant reads by rows, and is therefore identical with the former. E. Bézout’s method gives the resultant in the form of a determinant of order m or n, according as m is ≷ n. As modified by Cayley it takes a very simple form. He forms the equation
ƒ(x)φ(x ′) − ƒ(x ′)φ(x)＝0,
which can be satisfied when ƒ and φ possess a common factor. He first divides by the factor x − x ′, reducing it to the degree m − 1 in both x and x ′ where m > n; he then forms m equations by equating to zero the coefficients of the various powers of x ′; these equations involve the m powers x⁰, x, x²,... x^m−1 of x, and regarding these as the unknowns of a system of linear equations the resultant is reached in the form of a determinant of order m. Ex. gr. Put
(a₀x³+a₁x²+a₂x +a₃) (b₀x ′²+b₁x ′+b₂) − (a₀x ′³+a₁x ′²+a₂x ′ +a₃) (b₀x²+b₁x+b₂)＝0;
after division by x − x ′ the three equations are formed

a₀b₀x²+a₀b₁x+a₀b₂	＝0,
a₀b₁x²+(a₀b₂+a₁b₁−a₂b₀)x+a₁b₂−a₃b₀	＝0,
a₀b₂x²+(a₁b₂−a₃b₀)x+a₂b₂−a₃b₁	＝0

and thence the resultant

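${\begin{vmatrix}a_{0}b_{0}&a_{0}b_{1}&a_{0}b_{2}\\a_{0}b_{1}&a_{0}b_{2}+a_{1}b_{1}-a_{2}b_{0}&a_{1}b_{2}-a_{3}b_{0}\\a_{0}b_{2}&a_{1}b_{2}-a_{3}b_{0}&a_{2}b_{2}-a_{3}b_{1}\\\end{vmatrix}}$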

which is a symmetrical determinant.
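The determinants for the cubic and quadratic example can be compared numerically. This is a sketch under stated assumptions: the helper names are invented for illustration, and the middle entry of the 3×3 determinant is taken as a₀b₂ + a₁b₁ − a₂b₀, which is what direct division of ƒ(x)φ(x′) − ƒ(x′)φ(x) by x − x′ gives. Since every entry of the order-3 determinant is of the first degree in the a's, it carries one extra factor a₀ compared with the order-5 form, so the comparison is made up to sign and that factor.

```python
from fractions import Fraction

def det(matrix):
    """Determinant by Gaussian elimination over exact fractions."""
    m = [[Fraction(v) for v in row] for row in matrix]
    n = len(m)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return result

def sylvester_cubic_quadratic(a, b):
    """Order-5 determinant array for f = a0 x^3 + ... + a3, phi = b0 x^2 + b1 x + b2."""
    a0, a1, a2, a3 = a
    b0, b1, b2 = b
    return [[a0, a1, a2, a3, 0],
            [0, a0, a1, a2, a3],
            [b0, b1, b2, 0, 0],
            [0, b0, b1, b2, 0],
            [0, 0, b0, b1, b2]]

def bezout_cubic_quadratic(a, b):
    """Order-3 symmetrical array obtained after dividing by x - x'."""
    a0, a1, a2, a3 = a
    b0, b1, b2 = b
    return [[a0 * b0, a0 * b1, a0 * b2],
            [a0 * b1, a0 * b2 + a1 * b1 - a2 * b0, a1 * b2 - a3 * b0],
            [a0 * b2, a1 * b2 - a3 * b0, a2 * b2 - a3 * b1]]

def compare(a, b):
    """Return (order-5 determinant, order-3 determinant) for the same pair."""
    return det(sylvester_cubic_quadratic(a, b)), det(bezout_cubic_quadratic(a, b))
```

For instance, `compare((2, 1, 1, 1), (1, 0, -4))` gives two values whose magnitudes differ by the factor a₀ = 2, and a pair with a common root, such as x³ + x² + x + 1 and x² − 1, makes both vanish.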

Case of Three Variables.—In the next place we consider the resultants of three homogeneous polynomials in three variables. We can prove that if the three equations be satisfied by a system of values of the variable, the same system will also satisfy the Jacobian or functional determinant. For if u, v, w be the polynomials of orders m, n, p respectively, the Jacobian is (u₁ v₂ w₃), and by Euler’s theorem of homogeneous functions
xu₁ + yu₂ + zu₃＝mu
xv₁ + yv₂ + zv₃＝nv
xw₁ + yw₂ + zw₃＝pw;
denoting now the reciprocal determinant by (U₁ V₂ W₃) we obtain Jx＝muU₁ + nvV₁ + pwW₁; Jy=..., Jz=..., and it appears that the vanishing of u, v, and w implies the vanishing of J. Further, if m＝n＝ p, we obtain by differentiation
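$J+x{\frac {\partial J}{\partial x}}=m\left(u{\frac {\partial U_{1}}{\partial x}}+v{\frac {\partial V_{1}}{\partial x}}+w{\frac {\partial W_{1}}{\partial x}}+u_{1}U_{1}+v_{1}V_{1}+w_{1}W_{1}\right),$
or
$x{\frac {\partial J}{\partial x}}=(m-1)J+m\left(u{\frac {\partial U_{1}}{\partial x}}+v{\frac {\partial V_{1}}{\partial x}}+w{\frac {\partial W_{1}}{\partial x}}\right).$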

Hence the system of values also causes ∂J/∂x to vanish in this case; and by symmetry ∂J/∂y and ∂J/∂z also vanish.

The proof being of general application we may state that a system of values which causes the vanishing of k polynomials in k variables causes also the vanishing of the Jacobian, and in particular, when the forms are of the same degree, the vanishing also of the differential coefficients of the Jacobian in regard to each of the variables.

There is no difficulty in expressing the resultant by the method of symmetric functions. Taking two of the equations
ax^m + (by + cz) x^m–1 +... =0,
a′xⁿ + (b′y + c′z) x^n–1 +... =0,
we find that, eliminating x, the resultant is a homogeneous function of y and z of degree mn; equating this to zero and solving for the ratio of y to z we obtain mn solutions; if values of y and z, given by any solution, be substituted in each of the two equations, they will possess a common factor which gives a value of x which, combined with the chosen values of y and z, yields a system of values which satisfies both equations. Hence in all there are mn such systems. If, therefore, we have a third equation, and we substitute each system of values in it successively and form the product of the mn expressions thus formed, we obtain a function which vanishes if any one system of values, common to the first two equations, also satisfies the third. Hence this product is the required resultant of the three equations.

Now by the theory of symmetric functions, any symmetric function of the mn systems of values which satisfy the two equations can be expressed in terms of the coefficients of those equations. Hence, finally, the resultant is expressed in terms of the coefficients of the three equations, and since it is at once seen to be of degree mn in the coefficients of the third equation, by symmetry it must be of degrees np and pm in the coefficients of the first and second equations respectively. Its weight will be mnp (see Salmon’s Higher Algebra, 4th ed. § 77). The general theory of the resultant of k homogeneous equations in k variables presents no further difficulties when viewed in this manner.

The expression in form of a determinant presents in general considerable difficulties. If three equations, each of the second degree, in three variables be given, we have merely to eliminate the six products x², y², z², yz, zx, xy from the six equations
u＝v＝w＝∂J/∂x＝∂J/∂y＝∂J/∂z＝0; if we apply the same process to these equations each of degree three, we obtain similarly a determinant of order 21, but thereafter the process fails. Cayley, however, has shown that, whatever be the degrees of the three equations, it is possible to represent the resultant as the quotient of two determinants (Salmon, l.c. p. 89).

Discriminants.—The discriminant of a homogeneous polynomial in k variables is the resultant of the k polynomials formed by differentiations in regard to each of the variables.

It is the resultant of k polynomials each of degree m–1, and thus contains the coefficients of each form to the degree (m–1)^{k–1}; hence the total degree in the coefficients of the k forms is, by addition, k(m–1)^{k–1}; it may further be shown that the weight of each term of the resultant is constant and equal to m(m–1)^{k–1} (Salmon, l.c. p. 100).

A binary form which has a square factor has its discriminant equal to zero. This can be seen at once because the factor in question being once repeated in both differentials, the resultant of the latter must vanish.

Similarly, if a form in k variables be expressible as a quadratic function of k – 1 linear functions X₁, X₂, ... X_{k – 1}, the coefficients being any polynomials, it is clear that the k differentials have, in common, the system of roots derived from X₁＝X₂＝...＝X_{k – 1}＝0, and have in consequence a vanishing resultant. This implies the vanishing of the discriminant of the original form.

Expression in Terms of Roots.—Since x∂ƒ/∂x + y∂ƒ/∂y＝mƒ, if we take any root x₁, y₁ of ∂ƒ/∂x and substitute in mƒ we must obtain y₁(∂ƒ/∂y)_{x＝x₁, y＝y₁}; hence the resultant of ∂ƒ/∂x and ƒ is, disregarding numerical factors, y₁y₂...y_{m–1} × discriminant of ƒ＝a₀ × disct. of ƒ.

Now
ƒ＝(xy₁ – x₁y)(xy₂ – x₂y) ... (xy_m – x_my),
∂ƒ/∂x＝Σ y₁(xy₂ – x₂y) ... (xy_m – x_my),
and substituting in the latter any root of ƒ and forming the product, we find the resultant of ƒ and ∂ƒ/∂x, viz.

y₁y₂...y_m(x₁y₂ – x₂y₁)²(x₁y₃ – x₃y₁)²...(x_ry_s – x_sy_r)²...

and, dividing by y₁y₂...y_m, the discriminant of ƒ is seen to be equal to the product of the squares of all the differences of any two roots of the equation. The discriminant of the product of two forms is equal to the product of their discriminants multiplied by the square of their resultant. This follows at once from the fact that the discriminant is
$\prod (\alpha _{r}-\alpha _{s})^{2}\prod (\beta _{r}-\beta _{s})^{2}\left\{\prod (\alpha _{r}-\beta _{s})\right\}^{2}.$
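Both statements can be tried on polynomials with known roots. The sketch below is illustrative and rests on assumptions: it works in non-homogeneous form with leading coefficient 1, takes the discriminant as the absolute value of the determinant formed from ƒ and its derivative (so that the sign is disregarded), and uses helper names invented for the example.

```python
from fractions import Fraction
from math import prod

def det(matrix):
    """Determinant by Gaussian elimination over exact fractions."""
    m = [[Fraction(v) for v in row] for row in matrix]
    n = len(m)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return result

def poly_from_roots(roots):
    """Coefficients, highest power first, of the monic polynomial with these roots."""
    coeffs = [1]
    for r in roots:
        coeffs = [c - r * p for c, p in zip(coeffs + [0], [0] + coeffs)]
    return coeffs

def derivative(coeffs):
    n = len(coeffs) - 1
    return [c * (n - i) for i, c in enumerate(coeffs[:-1])]

def sylvester(f, g):
    """Array of shifted coefficient rows: deg(g) rows of f, then deg(f) rows of g."""
    m, n = len(f) - 1, len(g) - 1
    rows = [[0] * i + f + [0] * (n - 1 - i) for i in range(n)]
    rows += [[0] * i + g + [0] * (m - 1 - i) for i in range(m)]
    return rows

def resultant(f, g):
    return det(sylvester(f, g))

def discriminant(f):
    """Discriminant of a monic f, sign disregarded."""
    return abs(resultant(f, derivative(f)))

def squared_differences(roots):
    return prod((r - s) ** 2 for i, r in enumerate(roots) for s in roots[i + 1:])

alpha, beta = [1, 2], [4, 7]
f, g, fg = poly_from_roots(alpha), poly_from_roots(beta), poly_from_roots(alpha + beta)
lhs = discriminant(fg)
rhs = discriminant(f) * discriminant(g) * resultant(f, g) ** 2
```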

References for the Theory of Determinants.—T. Muir’s “List of Writings on Determinants,” Quarterly Journal of Mathematics. vol. xviii. pp. 110-149, October 1881, is the most important bibliographical article on the subject in any language; it contains 589 entries, arranged in chronological order, the first date being 1693 and the last 1880. The bibliography has been continued, and published at various dates (vol. xxi. pp. 299-320; vol. xxxvi. pp. 171-267) in the same periodical. These lists contain 1740 entries. T. Muir, History of the Theory of Determinants (2nd ed., London, 1906). School treatises are those of Thomson, Mansion, Bartl, Mollame, in English, French, German and Italian respectively.—Advanced treatises are those of William Spottiswoode (1851), Francesco Brioschi (1854), Richard Baltzer (1857), George Salmon (1859), N. Trudi (1862), Giovanni Garbieri (1874), Siegmund Gunther (1875), Georges J. Dostor (1877), Baraniecki (the most extensive of all) (1879), R. F. Scott (2nd ed., 1904), T. Muir (1881).

II. The Theory Of Symmetric Functions

Consider $n$ quantities $\alpha _{1},\alpha _{2},\alpha _{3},\dots \alpha _{n}$ .

Every rational integral function of these quantities, which does not alter its value however the $n$ suffixes $1,2,3,\dots n$ be permuted, is a rational integral symmetric function of the quantities. If we write $(1+\alpha _{1}x)(1+\alpha _{2}x)\dots (1+\alpha _{n}x)=1+a_{1}x+a_{2}x^{2}+\dots +a_{n}x^{n}$ , $a_{1},a_{2},\dots a_{n}$ are called the elementary symmetric functions.

$a_{1}=\alpha _{1}+\alpha _{2}+\dots +\alpha _{n}=\Sigma \alpha _{1}$
$a_{2}=\alpha _{1}\alpha _{2}+\alpha _{1}\alpha _{3}+\alpha _{2}\alpha _{3}+\dots =\Sigma \alpha _{1}\alpha _{2}$
⋅⋅⋅⋅⋅
$a_{n}=\alpha _{1}\alpha _{2}\alpha _{3}\dots \alpha _{n}$

The general monomial symmetric function is

$\Sigma \alpha _{1}^{p_{1}}\alpha _{2}^{p_{2}}\alpha _{3}^{p_{3}}\dots \alpha _{n}^{p_{n}}$ ,

the summation being for all permutations of the indices which result in different terms. The function is written

$(p_{1}p_{2}p_{3}\dots p_{n})$

for brevity, and repetitions of numbers in the bracket are indicated by exponents, so that $(p_{1}p_{1}p_{2})$ is written $(p_{1}^{2}p_{2}).$ The weight of the function is the sum of the numbers in the bracket, and the degree the highest of those numbers.

Ex. gr. The elementary functions are denoted by

$(1),(1^{2}),(1^{3}),\dots (1^{n})$ ,

are all of the first degree, and are of weights $1,2,3,\dots n$ respectively.

Remark.—In this notation $(0)=\Sigma \alpha _{1}^{0}={\tbinom {n}{1}}$ ; $(0^{2})=\Sigma \alpha _{1}^{0}\alpha _{2}^{0}={\tbinom {n}{2}}$ ; ... $(0^{s})={\tbinom {n}{s}}$ , &c. The binomial coefficients appear, in fact, as symmetric functions, and this is frequently of importance.

The order of the numbers in the bracket $(p_{1}p_{2}\dots p_{n})$ is immaterial; we may therefore always place them, as is most convenient, in descending order of magnitude; the numbers then constitute an ordered partition of the weight $w$ , and the leading number denotes the degree.

The sum of the monomial functions of a given weight is called the homogeneous-product-sum or complete symmetric function of that weight; it is denoted by $h_{w}$ ; it is connected with the elementary functions by the formula

${\frac {1}{1-a_{1}x+a_{2}x^{2}-a_{3}x^{3}+\dots }}=1+h_{1}x+h_{2}x^{2}+h_{3}x^{3}+\dots$ ,

which remains true when the symbols $a$ and $h$ are interchanged, as is at once evident by writing $-x$ for $x$ . This proves, also, that in any formula connecting $a_{1},a_{2},a_{3},\dots$ with $h_{1},h_{2},h_{3},\dots$ the symbols $a$ and $h$ may be interchanged.

Ex. gr., from $h_{2}=a_{1}^{2}-a_{2}$ we derive $a_{2}=h_{1}^{2}-h_{2}$ .
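The interchange of $a$ and $h$ can be tried on numbers. In this minimal sketch the three quantities and the function names are assumptions chosen for illustration. It computes the elementary functions and the homogeneous-product-sums directly from their definitions and checks the example just given, together with the relation between the coefficients of the two series, namely that the sum of $(-1)^{k}a_{k}h_{w-k}$ over $k$ vanishes for each weight $w\geq 1$.

```python
from itertools import combinations, combinations_with_replacement
from math import prod

def elementary(quantities, k):
    """a_k: sum of products of k distinct quantities."""
    return sum(prod(c) for c in combinations(quantities, k))

def complete(quantities, k):
    """h_k: sum of all products of k quantities, repetitions allowed."""
    return sum(prod(c) for c in combinations_with_replacement(quantities, k))

def series_relation(quantities, w):
    """Coefficient of x^w in (1 - a1 x + a2 x^2 - ...)(1 + h1 x + h2 x^2 + ...)."""
    return sum((-1) ** k * elementary(quantities, k) * complete(quantities, w - k)
               for k in range(w + 1))

q = [1, 2, 3]                      # hypothetical sample quantities
a1, a2 = elementary(q, 1), elementary(q, 2)
h1, h2 = complete(q, 1), complete(q, 2)
```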

The function $\Sigma \alpha _{1}^{p_{1}}\alpha _{2}^{p_{2}}\dots \alpha _{n}^{p_{n}}$ being as above denoted by a partition of the weight, viz. $(p_{1}p_{2}\dots p_{n})$ , it is necessary to bring under view other functions associated with the same series of numbers: such, for example, as

$\Sigma \alpha _{1}^{p_{1}}\alpha _{2}^{p_{3}}\Sigma \alpha _{1}^{p_{2}}\alpha _{2}^{p_{4}}\dots \alpha _{n-2}^{p_{n-2}}=(p_{1}p_{3})(p_{2}p_{4}\dots p_{n-2})$ .

The expression just written is in fact a partition of a partition, and to avoid confusion of language will be termed a separation of a partition. A partition is separated into separates so as to produce a separation of the partition by writing down a set of partitions, each separate partition in its own brackets, so that when all the parts of these partitions are reassembled in a single bracket the partition which is separated is reproduced. It is convenient to write the distinct partitions or separates in descending order as regards weight. If the successive weights of the separates $w_{1},w_{2},w_{3},\dots$ be enclosed in a bracket we obtain a partition of the weight $w$ which appertains to the separated partition. This partition is termed the specification of the separation. The degree of the separation is the sum of the degrees of the component separates. A separation is the symbolic representation of a product of monomial symmetric functions. A partition, $(p_{1}p_{1}p_{1}p_{2}p_{2}p_{3})=(p_{1}^{3}p_{2}^{2}p_{3})$ , can be separated in the manner $(p_{1}p_{2})(p_{1}p_{2})(p_{1}p_{3})=(p_{1}p_{2})^{2}(p_{1}p_{3})$ , and we may take the general form of a partition to be $(p_{1}^{\pi _{1}}p_{2}^{\pi _{2}}p_{3}^{\pi _{3}}\dots )$ and that of a separation $({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots$ when ${\text{J}}_{1},{\text{J}}_{2},{\text{J}}_{3}\dots$ denote the distinct separates involved.

Theorem.— The function symbolized by $(n)$ , viz. the sum of the n^th powers of the quantities, is expressible in terms of functions which are symbolized by separations of any partition $(n_{1}^{\nu _{1}}n_{2}^{\nu _{2}}n_{3}^{\nu _{3}}\ldots )$ of the number $n$ . The expression is—

$(-)^{\nu _{1}+\nu _{2}+\nu _{3}+\ldots }{\frac {(\nu _{1}+\nu _{2}+\nu _{3}+\dots -1){\text{!}}}{\nu _{1}{\text{!}}\nu _{2}{\text{!}}\nu _{3}{\text{!}}\ldots }}(n)$
$=\sum (-)^{j_{1}+j_{2}+j_{3}+\ldots }{\frac {(j_{1}+j_{2}+j_{3}+\dots -1){\text{!}}}{j_{1}{\text{!}}j_{2}{\text{!}}j_{3}{\text{!}}\ldots }}({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots$ ,

$({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots$ being a separation of $(n_{1}^{\nu _{1}}n_{2}^{\nu _{2}}n_{3}^{\nu _{3}}\ldots )$ and the summation being in regard to all such separations. For the particular case $(n_{1}^{\nu _{1}}n_{2}^{\nu _{2}}n_{3}^{\nu _{3}}\ldots )=(1^{n})$

$(-)^{n}{\frac {1}{n}}(n)=\sum (-)^{j_{1}+j_{2}+j_{3}+\ldots }{\frac {(j_{1}+j_{2}+j_{3}+\dots -1){\text{!}}}{j_{1}{\text{!}}j_{2}{\text{!}}j_{3}{\text{!}}\ldots }}(1^{s_{1}})^{j_{1}}(1^{s_{2}})^{j_{2}}(1^{s_{3}})^{j_{3}}\dots$
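The particular case for $(1^{n})$ expresses the sum of the n^th powers through the elementary functions, and it can be checked numerically. The sketch below is an illustration under assumptions: the separates of $(1^{n})$ are taken to be elementary functions $(1^{s})$, a separation then corresponds to a partition of $n$ in which the part $s$ occurs $j$ times, and the function names and sample quantities are invented for the example.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations
from math import factorial, prod

def partitions(n, largest=None):
    """All partitions of n as lists of parts in descending order."""
    largest = n if largest is None else largest
    if n == 0:
        yield []
        return
    for part in range(min(n, largest), 0, -1):
        for rest in partitions(n - part, part):
            yield [part] + rest

def elementary(quantities, k):
    """(1^k): sum of products of k distinct quantities."""
    return sum(prod(c) for c in combinations(quantities, k))

def power_sum_from_separations(quantities, n):
    """Solve the (1^n) formula for (n), summing over separations of (1^n)."""
    total = Fraction(0)
    for partition in partitions(n):
        mult = Counter(partition)            # separate (1^s) taken j times
        j_total = sum(mult.values())
        coeff = Fraction((-1) ** j_total * factorial(j_total - 1),
                         prod(factorial(j) for j in mult.values()))
        term = prod(elementary(quantities, s) ** j for s, j in mult.items())
        total += coeff * term
    return (-1) ** n * n * total

q = [1, 2, 3]                                 # hypothetical sample quantities
direct = [sum(v ** n for v in q) for n in range(1, 6)]
via_formula = [power_sum_from_separations(q, n) for n in range(1, 6)]
```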

To establish this write—

$1+\mu {\text{X}}_{1}+\mu ^{2}{\text{X}}_{2}+\mu ^{3}{\text{X}}_{3}+\dots =\prod _{\alpha }(1+\mu \alpha _{1}x_{1}+\mu ^{2}\alpha _{1}^{2}x_{2}+\mu ^{3}\alpha _{1}^{3}x_{3}+\dots )$ ,

the product on the right involving a factor for each of the quantities $\alpha _{1},\alpha _{2},\alpha _{3}\dots$ , and $\mu$ being arbitrary.

Multiplying out the right-hand side and comparing coefficients

${\text{X}}_{1}=(1)x_{1}$ ,
${\text{X}}_{2}=(2)x_{2}+(1^{2})x_{1}^{2}$ ,
${\text{X}}_{3}=(3)x_{3}+(21)x_{2}x_{1}+(1^{3})x_{1}^{3}$ ,
${\text{X}}_{4}=(4)x_{4}+(31)x_{3}x_{1}+(2^{2})x_{2}^{2}+(21^{2})x_{2}x_{1}^{2}+(1^{4})x_{1}^{4}$ ,
⋅⋅⋅⋅⋅⋅⋅
${\text{X}}_{m}=\Sigma (m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}m_{3}^{\mu _{3}}\dots )x_{m_{1}}^{\mu _{1}}x_{m_{2}}^{\mu _{2}}x_{m_{3}}^{\mu _{3}}\dots$ ,

the summation being for all partitions of $m$ .

Auxiliary Theorem.—The coefficient of $x_{l_{1}}^{\lambda _{1}}x_{l_{2}}^{\lambda _{2}}x_{l_{3}}^{\lambda _{3}}\dots$ in the product ${\frac {{\text{X}}_{m_{1}}^{\mu _{1}}{\text{X}}_{m_{2}}^{\mu _{2}}{\text{X}}_{m_{3}}^{\mu _{3}}\dots }{\mu _{1}{\text{!}}\mu _{2}{\text{!}}\mu _{3}{\text{!}}\dots }}$ is $\sum {\frac {({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots }{j_{1}{\text{!}}j_{2}{\text{!}}j_{3}{\text{!}}\dots }}$ where $({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots$ is a separation of $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\dots )$ of specification $(m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}m_{3}^{\mu _{3}}\dots )$ , and the sum is for all such separations.

To establish this observe the result.

${\frac {1}{p{\text{!}}}}{\text{X}}_{3}^{p}=\sum {\frac {(3)^{\pi _{1}}(21)^{\pi _{2}}(1^{3})^{\pi _{3}}}{\pi _{1}{\text{!}}\pi _{2}{\text{!}}\pi _{3}{\text{!}}}}x_{3}^{\pi _{1}}x_{2}^{\pi _{2}}x_{1}^{\pi _{2}+3\pi _{3}}$

and remark that $(3)^{\pi _{1}}(21)^{\pi _{2}}(1^{3})^{\pi _{3}}$ is a separation of $(3^{\pi _{1}}2^{\pi _{2}}1^{\pi _{2}+3\pi _{3}})$ of specification $(3^{p})$ . A similar remark may be made in respect of

${\frac {1}{\mu _{1}{\text{!}}}}{\text{X}}_{m_{1}}^{\mu _{1}},{\frac {1}{\mu _{2}{\text{!}}}}{\text{X}}_{m_{2}}^{\mu _{2}},{\frac {1}{\mu _{3}{\text{!}}}}{\text{X}}_{m_{3}}^{\mu _{3}},\dots$ ,

and therefore of the product of those expressions. Hence the theorem.

Now

$\log(1+\mu {\text{X}}_{1}+\mu ^{2}{\text{X}}_{2}+\mu ^{3}{\text{X}}_{3}+\dots )$
$={\underset {\alpha }{\Sigma }}\log(1+\mu \alpha _{1}x_{1}+\mu ^{2}\alpha _{1}^{2}x_{2}+\mu ^{3}\alpha _{1}^{3}x_{3}+\dots )$

whence, expanding by the exponential and multinomial theorems, a comparison of the coefficients of $\mu ^{n}$ gives

$(n)\sum (-)^{\nu _{1}+\nu _{2}+\nu _{3}+\ldots -1}{\frac {(\nu _{1}+\nu _{2}+\nu _{3}+\ldots -1){\text{!}}}{\nu _{1}{\text{!}}\nu _{2}{\text{!}}\nu _{3}{\text{!}}\ldots }}x_{n_{1}}^{\nu _{1}}x_{n_{2}}^{\nu _{2}}x_{n_{3}}^{\nu _{3}}\dots$
$=\sum (-)^{\nu _{1}+\nu _{2}+\nu _{3}+\ldots -1}{\frac {(\nu _{1}+\nu _{2}+\nu _{3}+\ldots -1){\text{!}}}{\nu _{1}{\text{!}}\nu _{2}{\text{!}}\nu _{3}{\text{!}}\ldots }}{\text{X}}_{n_{1}}^{\nu _{1}}{\text{X}}_{n_{2}}^{\nu _{2}}{\text{X}}_{n_{3}}^{\nu _{3}}\dots$

and, by the auxiliary theorem, any term ${\text{X}}_{m_{1}}^{\mu _{1}}{\text{X}}_{m_{2}}^{\mu _{2}}{\text{X}}_{m_{3}}^{\mu _{3}}\dots$ on the right-hand side is such that the coefficient of $x_{n_{1}}^{\nu _{1}}x_{n_{2}}^{\nu _{2}}x_{n_{3}}^{\nu _{3}}\dots$ in ${\frac {1}{\mu _{1}{\text{!}}\mu _{2}{\text{!}}\mu _{3}{\text{!}}\ldots }}{\text{X}}_{m_{1}}^{\mu _{1}}{\text{X}}_{m_{2}}^{\mu _{2}}{\text{X}}_{m_{3}}^{\mu _{3}}\dots$ is

$\sum {\frac {({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots }{j_{1}{\text{!}}j_{2}{\text{!}}j_{3}{\text{!}}\ldots }}$ ,

where since $(m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}m_{3}^{\mu _{3}}\dots )$ is the specification of $({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots$ , $\mu _{1}+\mu _{2}+\mu _{3}+\dots =j_{1}+j_{2}+j_{3}+\dots$ . Comparison of the coefficients of $x_{n_{1}}^{\nu _{1}}x_{n_{2}}^{\nu _{2}}x_{n_{3}}^{\nu _{3}}\dots$ therefore yields the result

$(-)^{\nu _{1}+\nu _{2}+\nu _{3}+\ldots }{\frac {(\nu _{1}+\nu _{2}+\nu _{3}+\ldots -1){\text{!}}}{\nu _{1}{\text{!}}\nu _{2}{\text{!}}\nu _{3}{\text{!}}\ldots }}(n)$
$=\sum (-)^{j_{1}+j_{2}+j_{3}+\ldots }{\frac {(j_{1}+j_{2}+j_{3}+\ldots -1){\text{!}}}{j_{1}{\text{!}}j_{2}{\text{!}}j_{3}{\text{!}}\ldots }}({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots$ ,

for the expression of $\Sigma \alpha ^{n}$ in terms of products of symmetric functions symbolized by separations of $(n_{1}^{\nu _{1}}n_{2}^{\nu _{2}}n_{3}^{\nu _{3}}\dots )$ .

Let $(n)_{a},(n)_{x},(n)_{\text{x}}$ denote the sums of the n^th powers of quantities whose elementary symmetric functions are $a_{1},a_{2},a_{3},\dots$ ; $x_{1},x_{2},x_{3},\dots$ ; ${\text{X}}_{1},{\text{X}}_{2},{\text{X}}_{3},\dots$ respectively: then the result arrived at above from the logarithmic expansion may be written

$(n)_{a}(n)_{x}=(n)_{\text{x}}$ ,

exhibiting $(n)_{\text{x}}$ as an invariant of the transformation given by the expressions of ${\text{X}}_{1},{\text{X}}_{2},{\text{X}}_{3},\dots$ in terms of $x_{1},x_{2},x_{3},\dots$ .

The inverse question is the expression of any monomial symmetric function by means of the power functions $(r)=s_{r}$ .

Theorem of Reciprocity.—If

${\text{X}}_{m_{1}}^{\mu _{1}}{\text{X}}_{m_{2}}^{\mu _{2}}{\text{X}}_{m_{3}}^{\mu _{3}}\dots =\dots +\theta (s_{1}^{\sigma _{1}}s_{2}^{\sigma _{2}}s_{3}^{\sigma _{3}}\dots )x_{l_{1}}^{\lambda _{1}}x_{l_{2}}^{\lambda _{2}}x_{l_{3}}^{\lambda _{3}}\dots +\dots$ ,

where $\theta$ is a numerical coefficient, then also

${\text{X}}_{s_{1}}^{\sigma _{1}}{\text{X}}_{s_{2}}^{\sigma _{2}}{\text{X}}_{s_{3}}^{\sigma _{3}}\dots =\dots +\theta (m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}m_{3}^{\mu _{3}}\dots )x_{l_{1}}^{\lambda _{1}}x_{l_{2}}^{\lambda _{2}}x_{l_{3}}^{\lambda _{3}}\dots +\dots$ .

We have found above that the coefficient of $(x_{l_{1}}^{\lambda _{1}}x_{l_{2}}^{\lambda _{2}}x_{l_{3}}^{\lambda _{3}}\dots )$ in the product ${\text{X}}_{m_{1}}^{\mu _{1}}{\text{X}}_{m_{2}}^{\mu _{2}}{\text{X}}_{m_{3}}^{\mu _{3}}\dots$ is

${\mu _{1}{\text{!}}\mu _{2}{\text{!}}\mu _{3}{\text{!}}\ldots }\sum {\frac {({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\dots }{j_{1}{\text{!}}j_{2}{\text{!}}j_{3}{\text{!}}\ldots }}$ ,

the sum being for all separations of $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\dots )$ which have the specification $(m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}m_{3}^{\mu _{3}}\dots )$ . We can multiply out this expression so as to obtain a series of monomials of the form $\theta (s_{1}^{\sigma _{1}}s_{2}^{\sigma _{2}}s_{3}^{\sigma _{3}}\dots )$ . It can be shown that the number $\theta$ enumerates distributions of a certain nature defined by the partitions $(m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}\dots )$ , $(s_{1}^{\sigma _{1}}s_{2}^{\sigma _{2}}\dots )$ , $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}\ldots )$ and it is seen intuitively that the number $\theta$ remains unaltered when the first two of these partitions are interchanged (see Combinatorial Analysis). Hence the theorem is established.

Putting $x_{1}=1$ and $x_{2}=x_{3}=x_{4}=\ldots =0,$ we find a particular law of reciprocity given by Cayley and Betti,

${\begin{aligned}(1^{m_{1}})^{\mu _{1}}(1^{m_{2}})^{\mu _{2}}(1^{m_{3}})^{\mu _{3}}\ldots &=\ldots +\theta (s_{1}^{\sigma _{1}}s_{2}^{\sigma _{2}}s_{3}^{\sigma _{3}}\ldots )+\ldots ,\\(1^{s_{1}})^{\sigma _{1}}(1^{s_{2}})^{\sigma _{2}}(1^{s_{3}})^{\sigma _{3}}\ldots &=\ldots +\theta (m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}m_{3}^{\mu _{3}}\ldots )+\ldots ;\end{aligned}}$

and another by putting $x_{1}=x_{2}=x_{3}=\ldots =1$ , for then $\mathrm {X} _{m}$ becomes $h_{m}$ , and we have

$h_{m_{1}}^{\mu _{1}}h_{m_{2}}^{\mu _{2}}h_{m_{3}}^{\mu _{3}}\ldots =\ldots +\theta ^{\prime }(s_{1}^{\sigma _{1}}s_{2}^{\sigma _{2}}s_{3}^{\sigma _{3}}\ldots )+\ldots ,$ $h_{s_{1}}^{\sigma _{1}}h_{s_{2}}^{\sigma _{2}}h_{s_{3}}^{\sigma _{3}}\ldots =\ldots +\theta ^{\prime }(m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}m_{3}^{\mu _{3}}\ldots )+\ldots ,$
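Both particular laws can be tried on a small case by multiplying out in a few variables. The sketch below is an illustration under assumptions: the partitions $(2^{2})$ and $(21^{2})$, the number of variables, and the helper names are chosen for the example. Polynomials are held as dictionaries from exponent tuples to coefficients, and the coefficient of a monomial symmetric function is read off from its leading term.

```python
from collections import defaultdict
from functools import reduce
from itertools import combinations_with_replacement, permutations

NVARS = 4  # enough variables for every partition used below

def elementary(k):
    """(1^k) as a polynomial in NVARS variables."""
    padded = (1,) * k + (0,) * (NVARS - k)
    return {exps: 1 for exps in set(permutations(padded))}

def complete(k):
    """h_k: every monomial of total degree k, each with coefficient 1."""
    out = {}
    for combo in combinations_with_replacement(range(NVARS), k):
        exps = [0] * NVARS
        for i in combo:
            exps[i] += 1
        out[tuple(exps)] = 1
    return out

def multiply(p, q):
    out = defaultdict(int)
    for e1, c1 in p.items():
        for e2, c2 in q.items():
            out[tuple(x + y for x, y in zip(e1, e2))] += c1 * c2
    return dict(out)

def product(*polys):
    return reduce(multiply, polys)

def coefficient(poly, *parts):
    """Coefficient of the monomial symmetric function (parts) in a symmetric poly."""
    padded = tuple(parts) + (0,) * (NVARS - len(parts))
    return poly.get(padded, 0)

# Elementary functions: (1^2)^2 against (21^2), and (1^2)(1)^2 against (2^2).
theta = coefficient(product(elementary(2), elementary(2)), 2, 1, 1)
theta_swapped = coefficient(product(elementary(2), elementary(1), elementary(1)), 2, 2)

# Homogeneous-product-sums: h_2^2 against (21^2), and h_2 h_1^2 against (2^2).
theta_h = coefficient(product(complete(2), complete(2)), 2, 1, 1)
theta_h_swapped = coefficient(product(complete(2), complete(1), complete(1)), 2, 2)
```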

Theorem of Expressibility.—“If a symmetric function be symbolized by $(\lambda \mu \nu \ldots )$ and $(\lambda _{1}\lambda _{2}\lambda _{3}\ldots ),$ $(\mu _{1}\mu _{2}\mu _{3}\ldots ),$ $(\nu _{1}\nu _{2}\nu _{3}\ldots )\ldots$ be any partitions of $\lambda ,\mu ,\nu \ldots$ respectively, the function $(\lambda \mu \nu \ldots )$ is expressible by means of functions symbolized by separations of

$(\lambda _{1}\lambda _{2}\lambda _{3}\ldots \mu _{1}\mu _{2}\mu _{3}\ldots \nu _{1}\nu _{2}\nu _{3}\ldots ).{\text{”}}$

For, writing as before,

$\mathrm {X} _{m_{1}}^{\mu _{1}}\mathrm {X} _{m_{2}}^{\mu _{2}}\mathrm {X} _{m_{3}}^{\mu _{3}}\ldots =\Sigma \Sigma \theta (s_{1}^{\sigma _{1}}s_{2}^{\sigma _{2}}s_{3}^{\sigma _{3}}\ldots )x_{l_{1}}^{\lambda _{1}}x_{l_{2}}^{\lambda _{2}}x_{l_{3}}^{\lambda _{3}}\ldots ,$ $=\Sigma {\text{P}}x_{l_{1}}^{\lambda _{1}}x_{l_{2}}^{\lambda _{2}}x_{l_{3}}^{\lambda _{3}}\ldots ,$

${\text{P}}$ is a linear function of separations of $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\ldots )$ of specification $(m_{1}^{\mu _{1}}m_{2}^{\mu _{2}}m_{3}^{\mu _{3}}\ldots ),$ and if $\mathrm {X} _{s_{1}}^{\sigma _{1}}\mathrm {X} _{s_{2}}^{\sigma _{2}}\mathrm {X} _{s_{3}}^{\sigma _{3}}\ldots =\Sigma {\text{P}}^{\prime }x_{l_{1}}^{\lambda _{1}}x_{l_{2}}^{\lambda _{2}}x_{l_{3}}^{\lambda _{3}}\ldots ,{\text{P}}^{\prime }$ is a linear function of separations of $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\ldots )$ of specification $(s_{1}^{\sigma _{1}}s_{2}^{\sigma _{2}}s_{3}^{\sigma _{3}}\ldots ).$ Suppose the separations of $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\ldots )$ to involve $k$ different specifications and form the $k$ identities

$\mathrm {X} _{m_{1s}}^{\mu _{1s}}\mathrm {X} _{m_{2s}}^{\mu _{2s}}\mathrm {X} _{m_{3s}}^{\mu _{3s}}\ldots =\Sigma {\text{P}}^{(s)}x_{l_{1}}^{\lambda _{1}}x_{l_{2}}^{\lambda _{2}}x_{l_{3}}^{\lambda _{3}}\ldots (s=1,2,\ldots k),$

where $(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}m_{3s}^{\mu _{3s}}\ldots )$ is one of the $k$ specifications.

The law of reciprocity shows that

${\text{P}}^{(s)}=\sum _{t=1}^{t=k}\theta _{st}(m_{1t}^{\mu _{1t}}m_{2t}^{\mu _{2t}}m_{3t}^{\mu _{3t}}\ldots ),$

viz.: a linear function of symmetric functions symbolized by the $k$ specifications; and that $\theta _{st}=\theta _{ts}.$ A table may be formed expressing the $k$ expressions ${\text{P}}^{(1)},{\text{P}}^{(2)},\ldots {\text{P}}^{(k)}$ as linear functions of the $k$ expressions $(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}m_{3s}^{\mu _{3s}}\ldots )$ , $s=1,2,\ldots k$ , and the numbers $\theta _{st}$ occurring therein possess row and column symmetry. By solving $k$ linear equations we similarly express the latter functions as linear functions of the former, and this table will also be symmetrical.

Theorem.—“The symmetric function $(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}m_{3s}^{\mu _{3s}}\ldots )$ whose partition is a specification of a separation of the function symbolized by $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\ldots )$ is expressible as a linear function of symmetric functions symbolized by separations of $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\ldots )$ and a symmetrical table may be thus formed.” It is now to be remarked that the partition $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\ldots )$ can be derived from $(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}m_{3s}^{\mu _{3s}}\ldots )$ by substituting for the numbers $m_{1s},m_{2s},m_{3s},\ldots$ certain partitions of those numbers (vide the definition of the specification of a separation).

Hence the theorem of expressibility enunciated above. A new statement of the law of reciprocity can be arrived at as follows. Since

${\text{P}}^{(s)}=\mu _{1s}!\mu _{2s}!\mu _{3s}!\ldots \sum {\frac {({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\ldots }{j_{1}!j_{2}!j_{3}!\ldots }},$

where $({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\ldots$ is a separation of $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\ldots )$ of specification $(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}m_{3s}^{\mu _{3s}}\ldots ),$ placing $s$ under the summation sign to denote the specification involved,

${\begin{aligned}\mu _{1s}!\mu _{2s}!\mu _{3s}!\ldots &\sum _{s}{\frac {({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\ldots }{j_{1}!j_{2}!j_{3}!\ldots }}=\sum _{t=1}^{t=k}\theta _{st}(m_{1t}^{\mu _{1t}}m_{2t}^{\mu _{2t}}m_{3t}^{\mu _{3t}}\ldots ),\\\mu _{1t}!\mu _{2t}!\mu _{3t}!\ldots &\sum _{t}{\frac {({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\ldots }{j_{1}!j_{2}!j_{3}!\ldots }}=\sum _{s=1}^{s=k}\theta _{ts}(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}m_{3s}^{\mu _{3s}}\ldots ),\end{aligned}}$

where $\theta _{st}=\theta _{ts}$ .

Theorem of Symmetry.—If we form the separation function

$\sum _{s}{\frac {({\text{J}}_{1})^{j_{1}}({\text{J}}_{2})^{j_{2}}({\text{J}}_{3})^{j_{3}}\ldots }{j_{1}!j_{2}!j_{3}!\ldots }}$

appertaining to the function $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}l_{3}^{\lambda _{3}}\ldots ),$ each separation having a specification $(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}m_{3s}^{\mu _{3s}}\ldots )$ , multiply by $\mu _{1s}!\mu _{2s}!\mu _{3s}!\ldots ,$ and take therein the coefficient of the function $(m_{1t}^{\mu _{1t}}m_{2t}^{\mu _{2t}}m_{3t}^{\mu _{3t}}\ldots ),$ we obtain the same result as if we formed the separation function in regard to the specification $(m_{1t}^{\mu _{1t}}m_{2t}^{\mu _{2t}}m_{3t}^{\mu _{3t}}\ldots ),$ multiplied by $\mu _{1t}!\mu _{2t}!\mu _{3t}!\ldots$ and took therein the coefficient of the function $(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}m_{3s}^{\mu _{3s}}\ldots ).$

Ex. gr., take $(l_{1}^{\lambda _{1}}l_{2}^{\lambda _{2}}\ldots )=(21^{4});(m_{1s}^{\mu _{1s}}m_{2s}^{\mu _{2s}}\ldots )=(321);(m_{1t}^{\mu _{1t}}m_{2t}^{\mu _{2t}}\ldots )=(31^{3});$ we find

${\begin{aligned}(21)(1^{2})(1)+(1^{3})(2)(1)&=\ldots +13(31^{3})+\ldots ,\\(21)(1)^{3}&=\ldots +13(321)+\ldots \end{aligned}}$
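The two coefficients in this example can be checked by direct enumeration. The following is a minimal sketch, not the method of the text: it assumes only that the coefficient of a monomial symmetric function in a product equals the number of ways of writing its exponent vector as a sum of rearrangements of the factor partitions. The function names `rearrangements` and `coefficient` are illustrative.

```python
from itertools import permutations

def rearrangements(partition, n):
    """Distinct exponent vectors of length n whose nonzero entries are the given parts."""
    if len(partition) > n:
        return set()
    padded = tuple(partition) + (0,) * (n - len(partition))
    return set(permutations(padded))

def coefficient(target, factors):
    """Coefficient of the monomial symmetric function `target` in the product
    of the monomial symmetric functions listed in `factors`."""
    n = len(target)

    def count(remaining, idx):
        if idx == len(factors):
            return 1 if not any(remaining) else 0
        total = 0
        for vec in rearrangements(factors[idx], n):
            rest = tuple(r - v for r, v in zip(remaining, vec))
            if min(rest) >= 0:
                total += count(rest, idx + 1)
        return total

    return count(tuple(target), 0)

# Coefficient of (31^3) in (21)(1^2)(1) + (1^3)(2)(1)
lhs = (coefficient((3, 1, 1, 1), [(2, 1), (1, 1), (1,)])
       + coefficient((3, 1, 1, 1), [(1, 1, 1), (2,), (1,)]))
# Coefficient of (321) in (21)(1)^3
rhs = coefficient((3, 2, 1), [(2, 1), (1,), (1,), (1,)])
print(lhs, rhs)  # 13 13
```

The enumeration is exponential in the number of parts and is intended only for small cases such as this one.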

The Differential Operators.—Starting with the relation

$(1+\alpha _{1}x)(1+\alpha _{2}x)\ldots (1+\alpha _{n}x)=1+a_{1}x+a_{2}x^{2}+\ldots +a_{n}x^{n}$

multiply each side by $1+\mu x,$ thus introducing a new quantity $\mu ;$ we obtain

$(1+\alpha _{1}x)(1+\alpha _{2}x)\ldots (1+\alpha _{n}x)(1+\mu x)=1+(a_{1}+\mu )x+(a_{2}+\mu a_{1})x^{2}+\ldots$

so that $f(a_{1},a_{2},a_{3},\ldots a_{n})=f,$ a rational integral function of the elementary functions, is converted into

$f(a_{1}+\mu ,a_{2}+\mu a_{1},\ldots a_{n}+\mu a_{n-1})=f+\mu d_{1}f+{\frac {\mu ^{2}}{2!}}{\overline {d_{1}^{2}}}f+{\frac {\mu ^{3}}{3!}}{\overline {d_{1}^{3}}}f+\ldots$

where

$d_{1}={\frac {\delta }{\delta a_{1}}}+a_{1}{\frac {\delta }{\delta a_{2}}}+a_{2}{\frac {\delta }{\delta a_{3}}}+\ldots +a_{n-1}{\frac {\delta }{\delta a_{n}}}$

and ${\overline {d_{1}^{s}}}$ denotes, not $s$ successive operations of $d_{1},$ but the operator of order $s$ obtained by raising $d_{1}$ to the $s^{th}$ power symbolically as in Taylor’s theorem in the Differential Calculus.

Write also ${\frac {1}{s!}}{\overline {d_{1}^{s}}}={\text{D}}_{s},$ so that

$f(a_{1}+\mu ,a_{2}+\mu a_{1},\ldots a_{n}+\mu a_{n-1})=f+\mu {\text{D}}_{1}f+\mu ^{2}{\text{D}}_{2}f+\mu ^{3}{\text{D}}_{3}f+\ldots .$

The introduction of the quantity $\mu$ converts the symmetric function $(\lambda _{1}\lambda _{2}\lambda _{3}\ldots )$ into

$(\lambda _{1}\lambda _{2}\lambda _{3}\ldots )+\mu ^{\lambda _{1}}(\lambda _{2}\lambda _{3}\ldots )+\mu ^{\lambda _{2}}(\lambda _{1}\lambda _{3}\ldots )+\mu ^{\lambda _{3}}(\lambda _{1}\lambda _{2}\ldots )+\ldots .$

Hence, if $f(a_{1},a_{2},\ldots a_{n})=(\lambda _{1}\lambda _{2}\lambda _{3}\ldots ),$

$(\lambda _{1}\lambda _{2}\lambda _{3}\ldots )+\mu ^{\lambda _{1}}(\lambda _{2}\lambda _{3}\ldots )+\mu ^{\lambda _{2}}(\lambda _{1}\lambda _{3}\ldots )+\mu ^{\lambda _{3}}(\lambda _{1}\lambda _{2}\ldots )+\ldots$ $=(1+\mu {\text{D}}_{1}+\mu ^{2}{\text{D}}_{2}+\mu ^{3}{\text{D}}_{3}+\ldots )(\lambda _{1}\lambda _{2}\lambda _{3}\ldots ).$

Comparing coefficients of like powers of $\mu$ we obtain

${\text{D}}_{\lambda _{1}}(\lambda _{1}\lambda _{2}\lambda _{3}\ldots )=(\lambda _{2}\lambda _{3}\ldots ),$

while ${\text{D}}_{s}(\lambda _{1}\lambda _{2}\lambda _{3}\ldots )=0$ unless the partition $(\lambda _{1}\lambda _{2}\lambda _{3}\ldots )$ contains a part $s.$ Further, if ${\text{D}}_{\lambda _{1}}{\text{D}}_{\lambda _{2}}$ denote successive operations of ${\text{D}}_{\lambda _{1}}$ and ${\text{D}}_{\lambda _{2}},$

${\text{D}}_{\lambda _{1}}{\text{D}}_{\lambda _{2}}(\lambda _{1}\lambda _{2}\lambda _{3}\ldots )=(\lambda _{3}\ldots ),$

and the operations are evidently commutative.

Also ${\text{D}}_{p_{1}}^{\pi _{1}}{\text{D}}_{p_{2}}^{\pi _{2}}{\text{D}}_{p_{3}}^{\pi _{3}}\ldots (p_{1}^{\pi _{1}}p_{2}^{\pi _{2}}p_{3}^{\pi _{3}}\ldots )=1,$ and the law of operation of the operators ${\text{D}}$ upon a monomial symmetric function is clear.
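The law of operation just stated may be sketched in a few lines. This is an illustration only: a monomial symmetric function is represented by the tuple of its parts, the empty tuple stands for unity, and `None` stands for zero. The name `D` for the function is an assumption made for the sketch.

```python
def D(s, partition):
    """Apply D_s to the monomial symmetric function with the given partition.

    Returns the partition with one part s obliterated, or None when the
    partition contains no part s (the function then vanishes)."""
    parts = list(partition)
    if s not in parts:
        return None
    parts.remove(s)          # removes a single occurrence of s
    return tuple(parts)

print(D(2, (3, 2, 2, 1)))    # (3, 2, 1)
print(D(4, (3, 2, 2, 1)))    # None

# Removing every part in turn leaves the empty partition, i.e. unity.
f = (3, 2, 2, 1)
for part in (3, 2, 2, 1):
    f = D(part, f)
print(f)                     # ()
```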

We have obtained the equivalent operations

$1+\mu {\text{D}}_{1}+\mu ^{2}{\text{D}}_{2}+\mu ^{3}{\text{D}}_{3}+\ldots ={\overline {exp}}\mu d_{1}$

where ${\overline {exp}}$ denotes (by the rule over $exp$ ) that the multiplication of operators is symbolic as in Taylor’s theorem. ${\overline {d_{1}^{s}}}$ denotes, in fact, an operator of order $s,$ but we may transform the right-hand side so that we are only concerned with the successive performance of linear operations. For this purpose write

$d_{s}=\partial _{a_{s}}+a_{1}\partial _{a_{s+1}}+a_{2}\partial _{a_{s+2}}+\ldots .$

It has been shown (vide “Memoir on Symmetric Functions of the Roots of Systems of Equations,” Phil. Trans. 1890, p. 490) that

${\overline {exp}}(m_{1}d_{1}+m_{2}d_{2}+m_{3}d_{3}+\ldots )=exp(M_{1}d_{1}+M_{2}d_{2}+M_{3}d_{3}+\ldots ),$

where now the multiplications on the dexter denote successive operations, provided that

$exp({\text{M}}_{1}\xi +{\text{M}}_{2}\xi ^{2}+{\text{M}}_{3}\xi ^{3}+\ldots )=1+m_{1}\xi +m_{2}\xi ^{2}+m_{3}\xi ^{3}+\ldots ,$

$\xi$ being an undetermined algebraic quantity.

Hence we derive the particular cases

${\overline {exp}}d_{1}=exp(d_{1}-{\frac {1}{2}}d_{2}+{\frac {1}{3}}d_{3}-\ldots );$ ${\overline {exp}}\mu d_{1}=exp(\mu d_{1}-{\frac {1}{2}}\mu ^{2}d_{2}+{\frac {1}{3}}\mu ^{3}d_{3}-\ldots ),$

and we can express ${\text{D}}_{s}$ in terms of $d_{1},d_{2},d_{3},\ldots ,$ products denoting successive operations, by the same law which expresses the elementary function $a_{s}$ in terms of the sums of powers $s_{1},s_{2},s_{3},\ldots .$ Further, we can express $d_{s}$ in terms of ${\text{D}}_{1},{\text{D}}_{2},{\text{D}}_{3},\ldots$ by the same law which expresses the power function $s_{s}$ in terms of the elementary functions $a_{1},a_{2},a_{3},\ldots .$
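The law referred to, connecting the elementary functions with the sums of powers, can be illustrated numerically. The sketch below uses Newton's recurrence, which is equivalent to the relation $1+a_{1}\xi +a_{2}\xi ^{2}+\ldots =exp(s_{1}\xi -{\tfrac {1}{2}}s_{2}\xi ^{2}+{\tfrac {1}{3}}s_{3}\xi ^{3}-\ldots )$; it acts on numbers rather than on operators, and the function name is illustrative.

```python
from fractions import Fraction

def elementary_from_power_sums(power_sums):
    """Return [a_1, ..., a_n] from [s_1, ..., s_n] using
    k*a_k = s_1*a_{k-1} - s_2*a_{k-2} + ... + (-1)^(k-1)*s_k*a_0, with a_0 = 1."""
    a = [Fraction(1)]
    for k in range(1, len(power_sums) + 1):
        total = sum((-1) ** (i - 1) * power_sums[i - 1] * a[k - i]
                    for i in range(1, k + 1))
        a.append(Fraction(total) / k)
    return a[1:]

# Quantities 1, 2, 3: s_1 = 6, s_2 = 14, s_3 = 36.
print(elementary_from_power_sums([6, 14, 36]))  # a_1, a_2, a_3 = 6, 11, 6
```

Replacing $s_{k}$ by $d_{k}$ and $a_{k}$ by ${\text{D}}_{k}$ in the same recurrence gives the corresponding operator relations, products then denoting successive operations.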

Operation of ${\text{D}}_{s}$ upon a Product of Symmetric Functions.—Suppose $f$ to be a product of symmetric functions $f_{1}f_{2}\ldots f_{m}.$ If in the identity $f=f_{1}f_{2}\ldots f_{m}$ we introduce a new root $\mu$ we change $a_{s}$ into $a_{s}+\mu a_{s-1},$ and we obtain

${\begin{aligned}(1&+\mu {\text{D}}_{1}+\mu ^{2}{\text{D}}_{2}+\ldots +\mu ^{s}{\text{D}}_{s}+\ldots )f\\=(1&+\mu {\text{D}}_{1}+\mu ^{2}{\text{D}}_{2}+\ldots +\mu ^{s}{\text{D}}_{s}+\ldots )f_{1}\\\times \,(1&+\mu {\text{D}}_{1}+\mu ^{2}{\text{D}}_{2}+\ldots +\mu ^{s}{\text{D}}_{s}+\ldots )f_{2}\\\times \,\quad &\cdot \qquad \quad \;\cdot \qquad \quad \;\cdot \qquad \quad \;\cdot \qquad \quad \;\cdot \\\times \,(1&+\mu {\text{D}}_{1}+\mu ^{2}{\text{D}}_{2}+\ldots +\mu ^{s}{\text{D}}_{s}+\ldots )f_{m}\end{aligned}}$

and now expanding and equating coefficients of like powers of $\mu$

${\begin{aligned}&{\text{D}}_{1}f=\Sigma ({\text{D}}_{1}f_{1})f_{2}f_{3}\ldots f_{m},\\&{\text{D}}_{2}f=\Sigma ({\text{D}}_{2}f_{1})f_{2}f_{3}\ldots f_{m}+\Sigma ({\text{D}}_{1}f_{1})({\text{D}}_{1}f_{2})f_{3}\ldots f_{m},\\&{\text{D}}_{3}f=\Sigma ({\text{D}}_{3}f_{1})f_{2}f_{3}\ldots f_{m}+\Sigma ({\text{D}}_{2}f_{1})({\text{D}}_{1}f_{2})f_{3}\ldots f_{m}+\Sigma ({\text{D}}_{1}f_{1})({\text{D}}_{1}f_{2})({\text{D}}_{1}f_{3})f_{4}\ldots f_{m},\\&\cdot \qquad \quad \;\;\cdot \qquad \quad \;\cdot \qquad \quad \;\;\cdot \qquad \quad \;\;\cdot \qquad \quad \;\;\cdot \qquad \quad \;\cdot \qquad \quad \;\cdot \qquad \quad \;\;\cdot \end{aligned}}$

the summation in a term covering every distribution of the operators of the type presenting itself in the term.

Writing these results

${\begin{aligned}&{\text{D}}_{1}f={\text{D}}_{(1)}f,\\&{\text{D}}_{2}f={\text{D}}_{(2)}f+{\text{D}}_{(1^{2})}f,\\&{\text{D}}_{3}f={\text{D}}_{(3)}f+{\text{D}}_{(21)}f+{\text{D}}_{(1^{3})}f,\end{aligned}}$ we may write in general

${\text{D}}_{s}f=\Sigma {\text{D}}_{(p_{1}p_{2}p_{3}\ldots )}f,$

the summation being for every partition $(p_{1}p_{2}p_{3}\ldots )$ of $s,$ and ${\text{D}}_{(p_{1}p_{2}p_{3}\ldots )}f$ being $=\Sigma ({\text{D}}_{p_{1}}f_{1})({\text{D}}_{p_{2}}f_{2})({\text{D}}_{p_{3}}f_{3})f_{4}\ldots f_{m}.$

Ex. gr. To operate with ${\text{D}}_{2}$ upon $(21^{3})(21^{4})(1^{5}),$ we have

${\begin{aligned}{\text{D}}_{(2)}f&=(1^{3})(21^{4})(1^{5})+(21^{3})(1^{4})(1^{5})\\{\text{D}}_{(1^{2})}f&=(21^{2})(21^{3})(1^{5})+(21^{3})(21^{3})(1^{4})+(21^{2})(21^{4})(1^{4}),\end{aligned}}$

and hence

${\begin{aligned}{\text{D}}_{2}f=(21^{4})(1^{5})(1^{3})+(21^{3})(1^{5})(1^{4})+(&21^{3})(21^{2})(1^{5})+(21^{3})^{2}(1^{4})\\&+(21^{4})(21^{2})(1^{4}).\end{aligned}}$
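The rule for operating upon a product can be sketched as follows. The sketch assumes the representation of a monomial symmetric function by the tuple of its parts; `remove_part` and `D_on_product` are illustrative names. Every way of writing $s$ as an ordered sum of non-negative integers, one for each factor, sends the corresponding operator to that factor, a zero leaving the factor untouched.

```python
from collections import Counter
from itertools import product

def remove_part(partition, s):
    """D_s on a single monomial function; None means the result vanishes."""
    if s == 0:
        return partition
    if s not in partition:
        return None
    parts = list(partition)
    parts.remove(s)
    return tuple(parts)

def D_on_product(s, factors):
    """Apply D_s to a product of monomial symmetric functions.

    Returns a Counter mapping each resulting product (a sorted tuple of
    partitions) to its coefficient."""
    result = Counter()
    for split in product(range(s + 1), repeat=len(factors)):
        if sum(split) != s:
            continue
        new = [remove_part(f, k) for f, k in zip(factors, split)]
        if all(f is not None for f in new):
            result[tuple(sorted(new))] += 1
    return result

f = [(2, 1, 1, 1), (2, 1, 1, 1, 1), (1, 1, 1, 1, 1)]   # (21^3)(21^4)(1^5)
for term, coeff in D_on_product(2, f).items():
    print(coeff, term)
```

For the product above the sketch yields five terms, each with coefficient unity, in agreement with the expression for ${\text{D}}_{2}f$ just obtained.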

Application to Symmetric Function Multiplication.—An example will explain this. Suppose we wish to find the coefficient of $(52^{4}1^{3})$ in the product $(21^{3})(21^{4})(1^{5})$ .

Write

$(21^{3})(21^{4})(1^{5})=\ldots +{\text{A}}(52^{4}1^{3})+\ldots ;$

then

${\text{D}}_{5}{\text{D}}_{2}^{4}{\text{D}}_{1}^{3}(21^{3})(21^{4})(1^{5})={\text{A}};$

every other term disappearing by the fundamental property of ${\text{D}}_{s}.$ Since

${\text{D}}_{5}(21^{3})(21^{4})(1^{5})=(1^{3})(1^{4})(1^{4}),$

we have:—

${\begin{aligned}&{\text{D}}_{2}^{4}{\text{D}}_{1}^{3}(1^{4})(1^{4})(1^{3})={\text{A}}\\&{\text{D}}_{2}^{3}{\text{D}}_{1}^{3}\{(1^{3})(1^{3})(1^{3})+2(1^{4})(1^{3})(1^{2})\}={\text{A}}\\&{\text{D}}_{2}^{2}{\text{D}}_{1}^{3}\{5(1^{3})(1^{2})(1^{2})+2(1^{4})(1^{2})(1)+2(1^{3})(1^{3})(1)\}={\text{A}}\\&{\text{D}}_{2}{\text{D}}_{1}^{3}\{12(1^{2})(1^{2})(1)+7(1^{3})(1)(1)+2(1^{4})(1)+6(1^{3})(1^{2})\}={\text{A}}\\&{\phantom {{\text{D}}\,}}{\text{D}}_{1}^{3}\{12(1)^{3}+44(1^{2})(1)+9(1^{3})\}={\text{A}},\end{aligned}}$

where ${\text{D}}_{1}^{3}(1)^{3}=6,$ ${\text{D}}_{1}^{3}(1^{2})(1)=3$ and ${\text{D}}_{1}^{3}(1^{3})=1.$ Finally ${\text{A}}=6\cdot 12+3\cdot 44+9=213.$
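The whole reduction can be mechanized. The following is a sketch under the same conventions as the rule for products given above (a partition is a tuple of parts, and a product is a sorted tuple of partitions); the names `apply_D` and `coefficient` are illustrative. Operating successively with one ${\text{D}}$ for each part of the sought partition reduces the product to a pure number, which is the required coefficient.

```python
from collections import Counter
from itertools import product

def remove_part(partition, s):
    """D_s on a single monomial function; None means the result vanishes."""
    if s == 0:
        return partition
    if s not in partition:
        return None
    parts = list(partition)
    parts.remove(s)
    return tuple(parts)

def apply_D(s, poly):
    """Apply D_s to a linear combination of products of monomial functions.
    `poly` maps a sorted tuple of partitions (one per factor) to a coefficient."""
    out = Counter()
    for factors, coeff in poly.items():
        for split in product(range(s + 1), repeat=len(factors)):
            if sum(split) != s:
                continue
            new = [remove_part(f, k) for f, k in zip(factors, split)]
            if all(f is not None for f in new):
                out[tuple(sorted(new))] += coeff
    return out

def coefficient(target, factors):
    """Coefficient of the monomial function `target` in the product `factors`."""
    poly = Counter({tuple(sorted(factors)): 1})
    for part in target:
        poly = apply_D(part, poly)
    return poly[tuple(() for _ in factors)]

A = coefficient((5, 2, 2, 2, 2, 1, 1, 1),
                [(2, 1, 1, 1), (2, 1, 1, 1, 1), (1, 1, 1, 1, 1)])
print(A)
```

No term is discarded in advance in this sketch; terms that cannot survive are eliminated only when an operator finds no part to obliterate.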

The operator $d_{1}=a_{0}\delta _{a_{1}}+a_{1}\delta _{a_{2}}+a_{2}\delta _{a_{3}}+\ldots ,$ the equation $d_{1}f=0$ of which is satisfied by every symmetric function whose partition contains no unit (called by Cayley non-unitary symmetric functions), is of particular importance in algebraic theories. This arises from the circumstance that the general operator

$\lambda _{0}a_{0}\delta _{a_{1}}+\lambda _{1}a_{1}\delta _{a_{2}}+\lambda _{2}a_{2}\delta _{a_{3}}+\ldots$

is transformed into the operator $d_{1}$ by the substitution

$(a_{0},a_{1},a_{2},\ldots a_{s},\ldots )=(a_{0},\lambda _{0}a_{1},\lambda _{0}\lambda _{1}a_{2},\ldots ,\lambda _{0}\lambda _{1}\ldots \lambda _{s-1}a_{s},\ldots ),$

so that the theory of the general operator is coincident with that of the particular operator $d_{1}.$ For example, the theory of invariants may be regarded as depending upon the consideration of the symmetric functions of the differences of the roots of the equation

$a_{0}x^{n}-{\tbinom {n}{1}}a_{1}x^{n-1}+{\tbinom {n}{2}}a_{2}x^{n-2}-\ldots =0;$

and such functions satisfy the differential equation

$a_{0}\delta _{a_{1}}+2a_{1}\delta _{a_{2}}+3a_{2}\delta _{a_{3}}+\ldots +na_{n-1}\delta _{a_{n}}=0.$

For such functions remain unaltered when each root receives the same infinitesimal increment $h;$ but writing $x-h$ for $x$ causes $a_{0},a_{1},a_{2},a_{3},\ldots$ to become respectively $a_{0},a_{1}+ha_{0},a_{2}+2ha_{1},a_{3}+3ha_{2},\ldots$ and $f(a_{0},a_{1},a_{2},a_{3},\ldots )$ becomes

$f+h(a_{0}\delta _{a_{1}}+2a_{1}\delta _{a_{2}}+3a_{2}\delta _{a_{3}}+\ldots )f,$

and hence the functions satisfy the differential equation. The important result is that the theory of invariants is from a certain point of view coincident with the theory of non-unitary symmetric functions. On the one hand we may say that non-unitary symmetric functions of the roots of $a_{0}x^{n}-a_{1}x^{n-1}+a_{2}x^{n-2}-\ldots =0,$ are symmetric functions of differences of the roots of

$a_{0}x^{n}-1!{\tbinom {n}{1}}a_{1}x^{n-1}+2!{\tbinom {n}{2}}a_{2}x^{n-2}-\ldots =0;$

and on the other hand that symmetric functions of the differences of the roots of

$a_{0}x^{n}-{\tbinom {n}{1}}a_{1}x^{n-1}+{\tbinom {n}{2}}a_{2}x^{n-2}-\ldots =0,$

are non-unitary symmetric functions of the roots of

$a_{0}x^{n}-{\frac {a_{1}}{1!}}x^{n-1}+{\frac {a_{2}}{2!}}x^{n-2}-\ldots =0.$
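The invariance under a common increment of the roots may be checked numerically. The sketch below assumes the binomial form $a_{0}x^{n}-{\tbinom {n}{1}}a_{1}x^{n-1}+\ldots ,$ for which a finite increment $h$ changes $a_{\text{K}}$ into $\sum _{k}{\tbinom {\text{K}}{k}}h^{{\text{K}}-k}a_{k}$ (agreeing to the first order in $h$ with the changes stated above). The two test functions are the familiar functions of differences $a_{0}a_{2}-a_{1}^{2}$ and $a_{0}^{2}a_{3}-3a_{0}a_{1}a_{2}+2a_{1}^{3}$; all names in the code are illustrative.

```python
from math import comb

def shift_roots(a, h):
    """Coefficients a_0..a_n of the binomial form after each root is increased by h:
    a_K becomes sum_k C(K, k) * h^(K-k) * a_k."""
    return [sum(comb(K, k) * h ** (K - k) * a[k] for k in range(K + 1))
            for K in range(len(a))]

def quadratic_function(a):
    return a[0] * a[2] - a[1] ** 2

def cubic_function(a):
    return a[0] ** 2 * a[3] - 3 * a[0] * a[1] * a[2] + 2 * a[1] ** 3

a = [2, 3, 5, 7]            # arbitrary integer coefficients a_0..a_3
b = shift_roots(a, 4)       # every root increased by h = 4
print(quadratic_function(a) == quadratic_function(b))   # True
print(cubic_function(a) == cubic_function(b))           # True
```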

An important notion in the theory of linear operators in general is that of MacMahon’s multilinear operator (“Theory of a Multilinear partial Differential Operator with Applications to the Theories of Invariants and Reciprocants,” Proc. Lond. Math. Soc. t. xviii. (1886), pp. 61-88). It is defined as having four elements, and is written

$(\mu ,\nu ;m,n)$ ${\begin{aligned}={\frac {1}{m}}{\bigg [}&\mu a_{0}^{m}\delta _{a_{n}}+(\mu +\nu ){\frac {m!}{(m-1)!1!}}a_{0}^{m-1}a_{1}\delta _{a_{n+1}}\\&+(\mu +2\nu ){\bigg \{}{\frac {m!}{(m-1)!1!}}a_{0}^{m-1}a_{2}+{\frac {m!}{(m-2)!2!}}a_{0}^{m-2}a_{1}^{2}{\bigg \}}\delta _{a_{n+2}}\\&+(\mu +3\nu ){\bigg \{}{\frac {m!}{(m-1)!1!}}a_{0}^{m-1}a_{3}+{\frac {m!}{(m-2)!1!1!}}a_{0}^{m-2}a_{1}a_{2}\\&\qquad \qquad \qquad \qquad \qquad \qquad \quad \;\;+{\frac {m!}{(m-3)!3!}}a_{0}^{m-3}a_{1}^{3}{\bigg \}}\delta _{a_{n+3}}\\&+\ldots {\bigg ]},\end{aligned}}$

the coefficient of $a_{0}^{k_{0}}a_{1}^{k_{1}}a_{2}^{k_{2}}\ldots$ being ${\frac {m!}{k_{0}!k_{1}!k_{2}!\ldots }}.$ The operators $a_{0}\delta _{a_{1}}+a_{1}\delta _{a_{2}}+\ldots ,$ $a_{0}\delta _{a_{1}}+2a_{1}\delta _{a_{2}}+\ldots$ are seen to be $(1,0;1,1)$ and $(1,1;1,1)$ respectively. Also the operator of the Theory of Pure Reciprocants (see Sylvester, Lectures on the New Theory of Reciprocants, Oxford, 1888) is

$(4,1;2,1)={\frac {1}{2}}{\bigg \{}4a_{0}^{2}\delta _{a_{1}}+10a_{0}a_{1}\delta _{a_{2}}+6(2a_{0}a_{2}+a_{1}^{2})\delta _{a_{3}}+\ldots {\bigg \}}.$

It will be noticed that

$(\mu ,\nu ;m,n)=\mu (1,0;m,n)+\nu (0,1;m,n).$
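The terms of the operator can be generated from the definition. The sketch below is an illustration under the stated rule only: the multiplier of $\delta _{a_{n+k}}$ is ${\tfrac {1}{m}}(\mu +k\nu )$ times the sum of the terms ${\tfrac {m!}{k_{0}!k_{1}!\ldots }}a_{0}^{k_{0}}a_{1}^{k_{1}}\ldots$ of degree $m$ and weight $k.$ The function name and the representation of a monomial by its tuple of exponents are assumptions of the sketch.

```python
from fractions import Fraction
from itertools import product
from math import factorial

def multilinear_terms(mu, nu, m, n, k):
    """Monomials multiplying d/da_{n+k} in the operator (mu, nu; m, n).

    Returns a dict mapping the exponent tuple (k_0, k_1, ..., k_k) of
    a_0, a_1, ..., a_k to its numerical coefficient. The index n only
    fixes which derivative the monomials multiply."""
    terms = {}
    for tail in product(range(m + 1), repeat=k):       # exponents of a_1..a_k
        weight = sum(i * e for i, e in enumerate(tail, start=1))
        if weight != k or sum(tail) > m:
            continue
        exps = (m - sum(tail),) + tail                  # prepend exponent of a_0
        multinomial = factorial(m)
        for e in exps:
            multinomial //= factorial(e)
        terms[exps] = Fraction((mu + k * nu) * multinomial, m)
    return terms

# The operator (4, 1; 2, 1) of the theory of pure reciprocants:
for k in range(3):
    print("d/da_%d:" % (1 + k), multilinear_terms(4, 1, 2, 1, k))
```

For $(4,1;2,1)$ the sketch gives $2a_{0}^{2},$ $5a_{0}a_{1}$ and $6a_{0}a_{2}+3a_{1}^{2}$ as the multipliers of the first three derivatives, agreeing with the expression above when the factor ${\tfrac {1}{2}}$ is distributed.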

The importance of the operator consists in the fact that taking any two operators of the system

$(\mu ,\nu ;m,n);(\mu ',\nu ';m',n'),$

the operator equivalent to

$(\mu ,\nu ;m,n)(\mu ',\nu ';m',n')-(\mu ',\nu ';m',n')(\mu ,\nu ;m,n),$

is itself an operator of the same system, namely $(\mu _{1},\nu _{1};m_{1},n_{1}),$ where

${\begin{aligned}\mu _{1}&=(m'+m-1){\bigg \{}{\frac {\mu '}{m'}}(\mu +n'\nu )-{\frac {\mu }{m}}(\mu '+n\nu '){\bigg \}},\\\nu _{1}&=(n'-n)\nu '\nu +{\frac {m-1}{m'}}\mu '\nu -{\frac {m'-1}{m}}\mu \nu ',\\m_{1}&=m'+m-1,\\n_{1}&=n'+n,\end{aligned}}$

and we conclude that quâ “alternation” the operators of the system form a “group.” It is thus possible to study simultaneously all the theories which depend upon operations of the group.

Symbolic Representation of Symmetric Functions.—Denote the elementary symmetric function $a_{s}$ by ${\tfrac {a_{1}^{s}}{s!}},{\tfrac {a_{2}^{s}}{s!}},{\tfrac {a_{3}^{s}}{s!}},\ldots$ at pleasure; then, taking $n$ equal to $\infty ,$ we may write

$1+a_{1}x+a_{2}x^{2}+\ldots =(1+\rho _{1}x)(1+\rho _{2}x)\ldots =e^{a_{1}x}=e^{a_{2}x}=e^{a_{3}x}=\ldots$

where

$a_{s}=\sum \rho _{1}\rho _{2}\ldots \rho _{s}={\tfrac {a_{1}^{s}}{s!}},{\tfrac {a_{2}^{s}}{s!}},{\tfrac {a_{3}^{s}}{s!}},\ldots .$

Further, let

$1+b_{1}x+b_{2}x^{2}+\ldots +b_{m}x^{m}=(1+\sigma _{1}x)(1+\sigma _{2}x)\ldots (1+\sigma _{m}x);$

so that

${\begin{aligned}1+a_{1}\sigma _{1}+a_{2}\sigma _{1}^{2}+\ldots =(1+\rho _{1}\sigma _{1})(1+\rho _{2}\sigma _{1})\ldots &=e^{\sigma _{1}a_{1}},\\1+a_{1}\sigma _{2}+a_{2}\sigma _{2}^{2}+\ldots =(1+\rho _{1}\sigma _{2})(1+\rho _{2}\sigma _{2})\ldots &=e^{\sigma _{2}a_{2}},\\\cdot \qquad \quad \;\cdot \qquad \quad \;\cdot \qquad \quad \;\cdot \qquad \quad \;\cdot \qquad \quad \;\cdot \quad &\qquad \;\cdot \\1+a_{1}\sigma _{m}+a_{2}\sigma _{m}^{2}+\ldots =(1+\rho _{1}\sigma _{m})(1+\rho _{2}\sigma _{m})\ldots &=e^{\sigma _{m}a_{m}};\end{aligned}}$

and, by multiplication,

$\mathop {\Pi } _{\sigma }(1+a_{1}\sigma +a_{2}\sigma ^{2}+\ldots )=\mathop {\Pi } _{\rho }(1+b_{1}\rho +b_{2}\rho ^{2}+\ldots +b_{m}\rho ^{m}),$ $=e^{\sigma _{1}a_{1}+\sigma _{2}a_{2}+..+\sigma _{m}a_{m}}.$

Denote by brackets $(\;)$ and $[\;]$ symmetric functions of the quantities $\rho$ and $\sigma$ respectively. Then

$1+a_{1}[1]+a_{1}^{2}[1^{2}]+a_{2}[2]+a_{1}^{3}[1^{3}]+a_{1}a_{2}[21]+a_{3}[3]+\ldots$ $+a_{p_{1}}a_{p_{2}}a_{p_{3}}\ldots a_{p_{m}}{\big [}p_{1}p_{2}p_{3}\ldots p_{m}{\big ]}+\ldots$ ${\begin{aligned}&=1+b_{1}(1)+b_{1}^{2}(1^{2})+b_{2}(2)+b_{1}^{3}(1^{3})+b_{1}b_{2}(21)+b_{3}(3)+\ldots \\&\qquad \quad \;+b_{1}^{q_{1}}b_{2}^{q_{2}}b_{3}^{q_{3}}\ldots b_{m}^{q_{m}}(m^{q_{m}}m-1^{q_{m-1}}\ldots 2^{q_{2}}1^{q_{1}})+\ldots \\&=e^{\sigma _{1}a_{1}+\sigma _{2}a_{2}..+\sigma _{m}a_{m}}.\end{aligned}}$

Expanding the right-hand side by the exponential theorem, and then expressing the symmetric functions of $\sigma _{1},\sigma _{2},\sigma _{3},\ldots \sigma _{m},$ which arise, in terms of $b_{1},b_{2},\ldots b_{m},$ we obtain by comparison with the middle series the symbolical representation of all symmetric functions in brackets $(\;)$ appertaining to the quantities $\rho _{1},\rho _{2},\rho _{3},\ldots$ To obtain particular theorems the quantities $\sigma _{1},\sigma _{2},\sigma _{3},\ldots \sigma _{m}$ are auxiliaries which are at our entire disposal. Thus to obtain Stroh’s theory of seminvariants put

$b_{1}=\sigma _{1}+\sigma _{2}+\ldots +\sigma _{m}=[1]=0;$

we then obtain the expression of non-unitary symmetric functions of the quantities $\rho$ as functions of differences of the symbols $a_{1},a_{2},a_{3},\ldots$

Ex. gr. $b_{2}^{2}(2^{2})$ with $m=2$ must be a term in

$e^{\sigma _{1}a_{1}+\sigma _{2}a_{2}}=e^{\sigma _{1}(a_{1}-a_{2})}=\ldots +{\frac {1}{4!}}\sigma _{1}^{4}(a_{1}-a_{2})^{4}+\ldots ,$

and since $b_{2}^{2}=\sigma _{1}^{4}$ we must have

${\begin{aligned}(2^{2})&={\frac {1}{24}}(a_{1}-a_{2})^{4}={\frac {1}{24}}(a_{1}^{4}+a_{2}^{4})-{\frac {1}{6}}(a_{1}^{3}a_{2}+a_{1}a_{2}^{3})+{\frac {1}{4}}a_{1}^{2}a_{2}^{2}\\&=2a_{4}-2a_{1}a_{3}+a_{2}^{2}\end{aligned}}$

as is well known.
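The final relation $(2^{2})=2a_{4}-2a_{1}a_{3}+a_{2}^{2}$ can be verified numerically for any set of quantities $\rho .$ The following is a small check only, with illustrative function names and an arbitrary choice of four quantities.

```python
from itertools import combinations
from math import prod

def elementary(roots, s):
    """Elementary symmetric function a_s of the given quantities."""
    return sum(prod(c) for c in combinations(roots, s))

def m22(roots):
    """The symmetric function (2^2): sum of rho_i^2 * rho_j^2 over pairs i < j."""
    return sum(x * x * y * y for x, y in combinations(roots, 2))

rho = (1, 2, 3, 5)
a1, a2, a3, a4 = (elementary(rho, s) for s in (1, 2, 3, 4))
print(m22(rho), 2 * a4 - 2 * a1 * a3 + a2 ** 2)   # 399 399
```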

Again, if $\sigma _{1},\sigma _{2},\sigma _{3},\ldots \sigma _{m}$ be the $m,$ $m^{th}$ roots of $-1,$ $b_{1}=b_{2}=\ldots =b_{m-1}=0$ and $b_{m}=1,$ leading to

$1+(m)+(m^{2})+(m^{3})+\ldots =e^{\sigma _{1}a_{1}+\sigma _{2}a_{2}+..+\sigma _{m}a_{m}}$

and

$\therefore (m^{s})={\frac {1}{(ms)!}}(\sigma _{1}a_{1}+\sigma _{2}a_{2}+\ldots +\sigma _{m}a_{m})^{sm},$ and we see further that $(\sigma _{1}a_{1}+\sigma _{2}a_{2}+\ldots +\sigma _{m}a_{m})^{k}$ vanishes identically unless $k\equiv 0{\pmod {m}}$ . If $m$ be infinite and

$1+b_{1}x+b_{2}x^{2}+\ldots =(1+\sigma _{1}x)(1+\sigma _{2}x)\ldots =e^{\beta _{1}x}=e^{\beta _{2}x}=\ldots ,$

we have the symbolic identity

$e^{\sigma _{1}a_{1}+\sigma _{2}a_{2}+\sigma _{3}a_{3}+\ldots }=e^{\rho _{1}\beta _{1}+\rho _{2}\beta _{2}+\rho _{3}\beta _{3}+\ldots },$

and

$(\sigma _{1}a_{1}+\sigma _{2}a_{2}+\sigma _{3}a_{3}+\ldots )^{p}=(\rho _{1}\beta _{1}+\rho _{2}\beta _{2}+\rho _{3}\beta _{3}+\ldots )^{p}.$

Instead of the above symbols we may use equivalent differential operators. Thus let

$\delta _{a}=a_{1}\delta _{a_{0}}+2a_{2}\delta _{a_{1}}+3a_{3}\delta _{a_{2}}+\ldots$

and let $a,b,c,\ldots$ be equivalent quantities. Any function of differences of $\delta _{a},\delta _{b},\delta _{c},\ldots$ being formed, the expansion being carried out, an operand $a_{0}$ or $b_{0}$ or $c_{0}\ldots$ being taken and $b,c,\ldots$ being subsequently put equal to $a$ , a non-unitary symmetric function will be produced.

Ex. gr. ${\begin{aligned}(&\delta _{a}-\delta _{b})^{2}(\delta _{a}-\delta _{c})=(\delta _{a}^{2}-2\delta _{a}\delta _{b}+\delta _{b}^{2})(\delta _{a}-\delta _{c})\\&=\delta _{a}^{3}-2\delta _{a}^{2}\delta _{b}+\delta _{a}\delta _{b}^{2}-\delta _{a}^{2}\delta _{c}+2\delta _{a}\delta _{b}\delta _{c}-\delta _{b}^{2}\delta _{c}\\&=6a_{3}-4a_{2}b_{1}+2a_{1}b_{2}-2a_{2}c_{1}+2a_{1}b_{1}c_{1}-2b_{2}c_{1}\\&=2(a_{1}^{3}-3a_{1}a_{2}+3a_{3})=2(3).\end{aligned}}$

The whole theory of these forms is consequently contained implicitly in the operation $\delta .$

Symmetric Functions of Several Systems of Quantities.—It will suffice to consider two systems of quantities as the corresponding theory for three or more systems is obtainable by an obvious enlargement of the nomenclature and notation.

Taking the systems of quantities to be

$a_{1},a_{2},a_{3},\ldots$ $\beta _{1},\beta _{2},\beta _{3},\ldots$

we start with the fundamental relation

${\begin{aligned}&(1+a_{1}x+\beta _{1}y)(1+a_{2}x+\beta _{2}y)(1+a_{3}x+\beta _{3}y)\ldots \\=\,&1+a_{10}x+a_{01}y+a_{20}x^{2}+a_{11}xy+a_{02}y^{2}+\ldots +a_{pq}x^{p}y^{q}+\ldots \end{aligned}}$

As shown by L. Schläfli^[2] this equation may be directly formed and exhibited as the resultant of two given equations, and an arbitrary linear non-homogeneous equation in two variables. The right-hand side may be also written

$1+\Sigma a_{1}x+\Sigma \beta _{1}y+\Sigma a_{1}a_{2}x^{2}+\Sigma a_{1}\beta _{2}xy+\Sigma \beta _{1}\beta _{2}y^{2}+\ldots$

The most general symmetric function to be considered is

$\Sigma a_{1}^{p_{1}}\beta _{1}^{q_{1}}a_{2}^{p_{2}}\beta _{2}^{q_{2}}a_{3}^{p_{3}}\beta _{3}^{q_{3}}\ldots$

conveniently written in the symbolic form

$({\overline {p_{1}q_{1}}}{\overline {p_{2}q_{2}}}{\overline {p_{3}q_{3}}}\ldots ).$

Observe that the summation is in regard to the expressions obtained by permuting the $n$ suffixes $1,2,3,\ldots n.$ The weight of the function is bipartite and consists of the two numbers $\Sigma p$ and $\Sigma q;$ the symbolic expression of the symmetric function is a partition into biparts (multiparts) of the bipartite (multipartite) number ${\overline {\Sigma p,\Sigma q}}.$ Each part of the partition is a bipartite number, and in representing the partition it is convenient to indicate repetitions of parts by power symbols. In this notation the fundamental relation is written

${\begin{aligned}(&1+a_{1}x+\beta _{1}y)(1+a_{2}x+\beta _{2}y)(1+a_{3}x+\beta _{3}y)\ldots \\=1&+({\overline {10}})x+({\overline {01}})y+({\overline {10}}^{2})x^{2}+({\overline {10}}\,{\overline {01}})xy+({\overline {01}}^{2})y^{2}\\&+({\overline {10}}^{3})x^{3}+({\overline {10}}^{2}{\overline {01}})x^{2}y+({\overline {10}}\,{\overline {01}}^{2})xy^{2}+({\overline {01}}^{3})y^{3}+\ldots \end{aligned}}$

where in general $a_{pq}=({\overline {10}}^{p}{\overline {01}}^{q}).$

All symmetric functions are expressible in terms of the quantities $a_{pq}$ in a rational integral form; from this property they are termed elementary functions; further they are said to be single-unitary since each part of the partition denoting $a_{pq}$ involves but a single unit.

The number of partitions of a biweight ${\overline {pq}}$ into exactly $\mu$ biparts is given (after Euler) by the coefficient of $a^{\mu }x^{p}y^{q}$ in the expansion of the generating function

${\frac {1}{(1-ax)(1-ay)(1-ax^{2})(1-axy)(1-ay^{2})(1-ax^{3})(1-ax^{2}y)(1-axy^{2})(1-ay^{3})\ldots }}$
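The coefficients of this generating function can be obtained by direct enumeration. The sketch below is an illustration, not part of the original exposition: it counts the partitions of a bipartite number by choosing the biparts in non-increasing order, so that each partition is counted once. The function name is illustrative.

```python
from functools import lru_cache

def count_bipartitions(p, q, parts):
    """Number of partitions of the bipartite number (p, q) into exactly
    `parts` biparts, each bipart being a pair other than (0, 0)."""

    @lru_cache(maxsize=None)
    def go(p, q, parts, max_part):
        if parts == 0:
            return 1 if (p, q) == (0, 0) else 0
        total = 0
        for a in range(p + 1):
            for b in range(q + 1):
                # biparts are taken in non-increasing (lexicographic) order
                if (a, b) == (0, 0) or (a, b) > max_part:
                    continue
                total += go(p - a, q - b, parts - 1, (a, b))
        return total

    return go(p, q, parts, (p, q))

# Biweight 21: (21); (20 01), (11 10); (10 10 01)
print([count_bipartitions(2, 1, k) for k in (1, 2, 3)])   # [1, 2, 1]
```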

The partitions with one bipart correspond to the sums of powers in the single system or unipartite theory; they are readily expressed in terms of the elementary functions. For write $({\overline {pq}})=s_{pq}$ and take logarithms of both sides of the fundamental relation; we obtain

$s_{10}x+s_{01}y=\Sigma (a_{1}x+\beta _{1}y)$ $s_{20}x^{2}+2s_{11}xy+s_{02}y^{2}=\Sigma (a_{1}x+\beta _{1}y)^{2},\,\&{\text{c}}.,$

and

$s_{10}x+s_{01}y-{\frac {1}{2}}(s_{20}x^{2}+2s_{11}xy+s_{02}y^{2})+\ldots$ $=\log {(1+a_{10}x+a_{01}y+\ldots +a_{pq}x^{p}y^{q}+\ldots )}$

From this formula we obtain by elementary algebra

$(-)^{p+q-1}{\frac {(p+q-1)!}{p!\,q!}}s_{pq}=\sum _{\pi }(-)^{\Sigma \pi -1}{\frac {(\Sigma \pi -1)!}{\pi _{1}!\,\pi _{2}!\ldots }}a_{p_{1}q_{1}}^{\pi _{1}}a_{p_{2}q_{2}}^{\pi _{2}}\ldots$

corresponding to Thomas Waring’s formula for the single system. The analogous formula appertaining to $n$ systems of quantities which expresses $s_{pqr\ldots }$ in terms of elementary functions can be at once written down.

Ex. gr. We can verify the relations

${\begin{aligned}&s_{30}=a_{10}^{3}-3a_{20}a_{10}+3a_{30},\\&s_{21}=a_{10}^{2}a_{01}-a_{20}a_{01}-a_{11}a_{10}+a_{21}.\end{aligned}}$
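Both relations may be verified numerically for two small systems of quantities. In the sketch below the elementary function $a_{pq}$ is computed directly as the coefficient of $x^{p}y^{q}$ in the fundamental product; the function names and the sample values are illustrative.

```python
from itertools import combinations
from math import prod

def a_pq(p, q, alpha, beta):
    """Coefficient of x^p y^q in prod_i (1 + alpha_i x + beta_i y)."""
    n = len(alpha)
    total = 0
    for I in combinations(range(n), p):
        rest = [j for j in range(n) if j not in I]
        for J in combinations(rest, q):
            total += prod(alpha[i] for i in I) * prod(beta[j] for j in J)
    return total

def s_pq(p, q, alpha, beta):
    """Single bipart function: sum of alpha_i^p * beta_i^q."""
    return sum(x ** p * y ** q for x, y in zip(alpha, beta))

alpha, beta = (1, 2, 3), (4, 5, 7)
a = lambda p, q: a_pq(p, q, alpha, beta)

print(s_pq(3, 0, alpha, beta), a(1, 0) ** 3 - 3 * a(2, 0) * a(1, 0) + 3 * a(3, 0))
print(s_pq(2, 1, alpha, beta),
      a(1, 0) ** 2 * a(0, 1) - a(2, 0) * a(0, 1) - a(1, 1) * a(1, 0) + a(2, 1))
```

Each printed pair consists of two equal numbers when the relation holds for the chosen quantities.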

The formula actually gives the expression of $({\overline {pq}})$ by means of separations of

$({\overline {10}}^{p}\,{\overline {01}}^{q}),$

which is one of the partitions of $({\overline {pq}}).$ This is the true standpoint from which the theorem should be regarded. It is but a particular case of a general theory of expressibility.

To invert the formula we may write

$1+a_{10}x+a_{01}y+\ldots +a_{pq}x^{p}y^{q}+\ldots$ $=exp\;\{(s_{10}x+s_{01}y)-{\frac {1}{2}}(s_{20}x^{2}+2s_{11}xy+s_{02}y^{2})+\ldots \},$

and thence derive the formula—

$(-)^{p+q-1}a_{pq}$ $=\sum \;{\bigg \{}{\frac {(p_{1}+q_{1}-1)!}{p_{1}!q_{1}!}}{\bigg \}}^{\pi _{1}}{\bigg \{}{\frac {(p_{2}+q_{2}-1)!}{p_{2}!q_{2}!}}{\bigg \}}^{\pi _{2}}\ldots {\frac {(-)^{\Sigma \pi -1}}{\pi _{1}!\,\pi _{2}!\ldots }}s_{p_{1}q_{1}}^{\pi _{1}}s_{p_{2}q_{2}}^{\pi _{2}}\ldots ,$

which expresses the elementary function in terms of the single bipart functions. The similar theorem for $n$ systems of quantities can be at once written down.

It will be shown later that every rational integral symmetric function is similarly expressible.

The Function $h_{pq}$ .—As the definition of $h_{pq}$ we take

$1+h_{10}x+h_{01}y+\ldots +h_{pq}x^{p}y^{q}+\ldots$ $={\frac {1}{(1-a_{1}x-\beta _{1}y)(1-a_{2}x-\beta _{2}y)\ldots }};$

and now expanding the right-hand side

$h_{pq}=\sum {\binom {p_{1}+q_{1}}{p_{1}}}{\binom {p_{2}+q_{2}}{p_{2}}}\ldots ({\overline {p_{1}q_{1}}}\,{\overline {p_{2}q_{2}}}\ldots ),$

the summation being for all partitions of the biweight. Further writing

$1+h_{10}x+h_{01}y+\ldots +h_{pq}x^{p}y^{q}+\ldots$ $={\frac {1}{1-a_{10}x-a_{01}y+\ldots +(-)^{p+q}a_{pq}x^{p}y^{q}+\ldots }},$

we find that the effect of changing the signs of both $x$ and $y$ is merely to interchange the symbols $a$ and $h;$ hence in any relation connecting the quantities $h_{pq}$ with the quantities $a_{pq}$ we are at liberty to interchange the symbols $a$ and $h.$ By the exponential and multinomial theorems we obtain the results—

$(-)^{p+q-1}h_{pq}=\sum _{\pi }(-)^{\Sigma \pi -1}{\frac {(\Sigma \pi )!}{\pi _{1}!\,\pi _{2}!\ldots }}a_{p_{1}q_{1}}^{\pi _{1}}a_{p_{2}q_{2}}^{\pi _{2}}\ldots ;$ $h_{pq}=\sum {\bigg \{}{\frac {(p_{1}+q_{1}-1)!}{p_{1}!\,q_{1}!}}{\bigg \}}^{\pi _{1}}{\bigg \{}{\frac {(p_{2}+q_{2}-1)!}{p_{2}!\,q_{2}!}}{\bigg \}}^{\pi _{2}}\ldots {\frac {1}{\pi _{1}!\,\pi _{2}!\ldots }}s_{p_{1}q_{1}}^{\pi _{1}}s_{p_{2}q_{2}}^{\pi _{2}}\ldots .$

Differential Operations.—If, in the identity

${\begin{aligned}&(1+a_{1}x+\beta _{1}y)(1+a_{2}x+\beta _{2}y)\ldots (1+a_{n}x+\beta _{n}y)\\=\,&1+a_{10}x+a_{01}y+a_{20}x^{2}+a_{11}xy+a_{02}y^{2}+\ldots ,\end{aligned}}$

we multiply each side by $(1+\mu x+\nu y),$ the right-hand side becomes

$1+(a_{10}+\mu )x+(a_{01}+\nu )y+\ldots +(a_{pq}+\mu a_{p-1,q}+\nu a_{p,q-1})x^{p}y^{q}+\ldots ;$

hence any rational integral function of the coefficients $a_{10},a_{01},\ldots a_{pq},\ldots$ say $f(a_{10},a_{01},\ldots )\equiv f$ is converted into

${\overline {exp}}(\mu d_{10}+\nu d_{01})f$ ${\text{where }}d_{10}=\sum a_{p-1,q}{\frac {d}{da_{pq}}},d_{01}=\sum a_{p,q-1}{\frac {d}{da_{pq}}}.$

The rule over $exp$ will serve to denote that $\mu d_{10}+\nu d_{01}$ is to be raised to the various powers symbolically as in Taylor’s theorem.

Writing ${\text{D}}_{pq}={\frac {1}{p!\,q!}}{\overline {d_{10}^{p}d_{01}^{q}}},$ ${\overline {exp}}(\mu d_{10}+\nu d_{01})f=(1+\mu {\text{D}}_{10}+\nu {\text{D}}_{01}+\ldots +\mu ^{p}\nu ^{q}{\text{D}}_{pq}+\ldots )f;$

now, since the introduction of the new quantities $\mu ,\nu$ results in the addition to the function $({\overline {p_{1}q_{1}}}\,{\overline {p_{2}q_{2}}}\,{\overline {p_{3}q_{3}}}\ldots )$ of the new terms

$\mu ^{p_{1}}\nu ^{q_{1}}({\overline {p_{2}q_{2}}}\,{\overline {p_{3}q_{3}}}\ldots )+\mu ^{p_{2}}\nu ^{q_{2}}({\overline {p_{1}q_{1}}}\,{\overline {p_{3}q_{3}}}\ldots )+\mu ^{p_{3}}\nu ^{q_{3}}({\overline {p_{1}q_{1}}}\,{\overline {p_{2}q_{2}}}\ldots )+\ldots ,$

we find

${\text{D}}_{p_{1}q_{1}}({\overline {p_{1}q_{1}}}\,{\overline {p_{2}q_{2}}}\,{\overline {p_{3}q_{3}}}\ldots )=({\overline {p_{2}q_{2}}}\,{\overline {p_{3}q_{3}}}\ldots );$

and thence

${\text{D}}_{p_{1}q_{1}}{\text{D}}_{p_{2}q_{2}}{\text{D}}_{p_{3}q_{3}}\ldots ({\overline {p_{1}q_{1}}}\,{\overline {p_{2}q_{2}}}\,{\overline {p_{3}q_{3}}}\ldots )=1;$

while ${\text{D}}_{rs}f=0$ unless the part ${\overline {rs}}$ is involved in $f.$ We may then state that ${\text{D}}_{pq}$ is an operation which obliterates one part ${\overline {pq}}$ when such part is present, but in the contrary case causes the function to vanish. From the above ${\text{D}}_{pq}$ is an operator of order $pq,$ but it is convenient for some purposes to obtain its expression in the form of a number of terms, each of which denotes $pq$ successive linear operations: to accomplish this write

$d_{pq}=\sum a_{rs}{\frac {d}{da_{p+r,q+s}}}$

and note the general result^[3]

${\overline {exp}}(m_{10}d_{10}+m_{01}d_{01}+\ldots +m_{pq}d_{pq}+\ldots )$ $=exp({\text{M}}_{10}d_{10}+{\text{M}}_{01}d_{01}+\ldots +{\text{M}}_{pq}d_{pq}+\ldots );$

where the multiplications on the left- and right-hand sides of the equation are symbolic and unsymbolic respectively, provided that $m_{pq},{\text{M}}_{pq}$ are quantities which satisfy the relation

$exp({\text{M}}_{10}\xi +{\text{M}}_{01}\eta +\ldots +{\text{M}}_{pq}\xi ^{p}\eta ^{q}+\ldots )$ $=1+m_{10}\xi +m_{01}\eta +\ldots +m_{pq}\xi ^{p}\eta ^{q}+\ldots ;$

where $\xi ,\eta$ are undetermined algebraic quantities. In the present particular case putting $m_{10}=\mu ,$ $m_{01}=\nu$ and $m_{pq}=0$ otherwise

${\text{M}}_{10}\xi +{\text{M}}_{01}\eta +\ldots +{\text{M}}_{pq}\xi ^{p}\eta ^{q}+\ldots =\log(1+\mu \xi +\nu \eta )$

or

${\text{M}}_{pq}=(-)^{p+q-1}{\frac {(p+q-1)!}{p!\,q!}}\mu ^{p}\nu ^{q};$

and the result is thus

${\begin{aligned}{\overline {exp}}(\mu d_{10}+\nu d_{01})&=exp\{\mu d_{10}+\nu d_{01}-{\tfrac {1}{2}}(\mu ^{2}d_{20}+2\mu \nu d_{11}+\nu ^{2}d_{02})+\ldots \}\\&=1+\mu {\text{D}}_{10}+\nu {\text{D}}_{01}+\ldots +\mu ^{p}\nu ^{q}{\text{D}}_{pq}+\ldots ;\end{aligned}}$

and thence

$\mu d_{10}+\nu d_{01}-{\tfrac {1}{2}}(\mu ^{2}d_{20}+2\mu \nu d_{11}+\nu ^{2}d_{02})+\ldots$ $=\log(1+\mu {\text{D}}_{10}+\nu {\text{D}}_{01}+\ldots +\mu ^{p}\nu ^{q}{\text{D}}_{pq}+\ldots ).$

From these formulae we derive two important relations, viz.

$(-)^{p+q-1}{\frac {(p+q-1)!}{p!\,q!}}d_{pq}=\sum _{\pi }(-)^{\Sigma \pi -1}{\frac {(\Sigma \pi -1)!}{\pi _{1}!\,\pi _{2}!\ldots }}{\text{D}}_{p_{1}q_{1}}^{\pi _{1}}{\text{D}}_{p_{2}q_{2}}^{\pi _{2}}\ldots ,$

$(-)^{p+q-1}{\text{D}}_{pq}=\sum _{\pi }{\bigg \{}{\frac {(p_{1}+q_{1}-1)!}{p_{1}!\,q_{1}!}}{\bigg \}}^{\pi _{1}}{\bigg \{}{\frac {(p_{2}+q_{2}-1)!}{p_{2}!\,q_{2}!}}{\bigg \}}^{\pi _{2}}\ldots {\frac {(-)^{\Sigma \pi -1}}{\pi _{1}!\,\pi _{2}!\ldots }}d_{p_{1}q_{1}}^{\pi _{1}}d_{p_{2}q_{2}}^{\pi _{2}}\ldots ,$

the last written relation having, in regard to each term on the right-hand side, to do with $\Sigma \pi$ successive linear operations. Recalling the formulae above which connect $s_{pq}$ and $a_{pq},$ we see that $d_{pq}$ and ${\text{D}}_{pq}$ are in co-relation with these quantities respectively, and may be said to be operations which correspond to the partitions $({\overline {pq}}),$ $({\overline {10}}^{p}\,{\overline {01}}^{q})$ respectively. We might conjecture from this observation that every partition is in correspondence with some operation; this is found to be the case, and it has been shown (loc. cit. p. 493) that the operation

${\frac {1}{\pi _{1}!}}{\frac {1}{\pi _{2}!}}\ldots d_{p_{1}q_{1}}^{\pi _{1}}d_{p_{2}q_{2}}^{\pi _{2}}\ldots$ (multiplication symbolic)

corresponds to the partition $({\overline {p_{1}q_{1}}}^{\pi _{1}}\,{\overline {p_{2}q_{2}}}^{\pi _{2}}\ldots ).$ The partitions being taken as denoting symmetric functions we have complete correspondence between the algebras of quantity and operation, and from any algebraic formula we can at once write down an operation formula. This fact is of extreme importance in the theory of algebraic forms, and is easily representable whatever be the number of the systems of quantities.

We may remark the particular result
( − )^p+q−1(p + q − 1)!/p!q!d_pqs_pq = D_pq(pq) = 1;
d_pq causes every other single part function to vanish, and must cause any monomial function to vanish which does not comprise one of the partitions of the biweight pq amongst its parts.

Since
d_pq = ( − )^p+q−1(p + q − 1)!/p!q!d/ds_pq
the solutions of the partial differential equation d_pq = 0 are the single bipart forms, omitting s_pq, and we have seen that the solutions of D_pq = 0 are those monomial functions in which the part pq is absent.

One more relation is easily obtained, viz.
$\dfrac{\partial}{\partial a_{pq}} = d_{pq} - h_{10}d_{p+1,q} - h_{01}d_{p,q+1} + \cdots + (-)^{r+s}h_{rs}d_{p+r,q+s} + \cdots.$

References for Symmetric Functions.—Albert Girard, Invention nouvelle en l’algèbre (Amsterdam, 1629); Thomas Waring, Meditationes Algebraicae (London, 1782); Lagrange, Mém. de l’acad. de Berlin (1768); Meyer-Hirsch, Sammlung von Aufgaben aus der Theorie der algebraischen Gleichungen (Berlin, 1809); Serret, Cours d'algèbre supérieure, t. iii. (Paris, 1885); Unferdinger, Sitzungsber. d. Acad. d. Wissensch. i. Wien, Bd. lx. (Vienna, 1869); L. Schläfli, “Ueber die Resultante eines Systemes mehrerer algebraischen Gleichungen,” Vienna Transactions, t. iv. 1852; MacMahon, “Memoirs on a New Theory of Symmetric Functions,” American Journal of Mathematics, Baltimore, Md. 1888–1890; “Memoir on Symmetric Functions of Roots of Systems of Equations,” Phil. Trans. 1890.

III. The Theory of Binary Forms

A binary form of order n is a homogeneous polynomial of the nth degree in two variables. It may be written in the form
$ax_1^n + bx_1^{n-1}x_2 + cx_1^{n-2}x_2^2 + \cdots;$
or in the form
$ax_1^n + \binom{n}{1}bx_1^{n-1}x_2 + \binom{n}{2}cx_1^{n-2}x_2^2 + \cdots;$
which Cayley denotes by
$(a, b, c, \ldots)(x_1, x_2)^n,$
$\binom{n}{1}, \binom{n}{2}, \ldots$ being a notation for the successive binomial coefficients $n$, $\tfrac{1}{2}n(n-1)$, …. Other forms are
$ax_1^n + nbx_1^{n-1}x_2 + n(n-1)cx_1^{n-2}x_2^2 + \cdots,$
the binomial coefficients $\binom{n}{s}$ being replaced by $s!\binom{n}{s}$, and
$ax_1^n + \dfrac{1}{1!}bx_1^{n-1}x_2 + \dfrac{1}{2!}cx_1^{n-2}x_2^2 + \cdots,$
the special convenience of which will appear later. For present purposes the form will be written
$\bar a_0x_1^n + \binom{n}{1}\bar a_1x_1^{n-1}x_2 + \binom{n}{2}\bar a_2x_1^{n-2}x_2^2 + \cdots + \bar a_nx_2^n,$
the notation adopted by German writers; the literal coefficients have a rule placed over them to distinguish them from umbral coefficients which are introduced almost at once. The coefficients $\bar a_0, \bar a_1, \bar a_2, \ldots, \bar a_n$, n + 1 in number, are arbitrary. If the form, sometimes termed a quantic, be equated to zero the n + 1 coefficients are equivalent to but n, since one can be made unity by division and the equation is to be regarded as one for the determination of the ratio of the variables.

If the variables of the quantic 𝑓(x₁, x₂) be subjected to the linear transformation
x₁ = α₁₁ξ₁ + α₁₂ξ₂,
x₂ = α₂₁ξ₁ + α₂₂ξ₂,
ξ₁, ξ₂ being new variables replacing x₁, x₂ and the coefficients α₁₁, α₁₂, α₂₁, α₂₂, termed the coefficients of substitution (or of transformation), being constants, we arrive at a transformed quantic
$f(\xi_1, \xi_2) = a'_0\xi_1^n + \binom{n}{1}a'_1\xi_1^{n-1}\xi_2 + \binom{n}{2}a'_2\xi_1^{n-2}\xi_2^2 + \cdots + a'_n\xi_2^n$
in the new variables which is of the same order as the original quantic; the new coefficients $a'_0, a'_1, a'_2, \ldots, a'_n$ are linear functions of the original coefficients, and also linear functions of the products, of the nth degree, of the coefficients of substitution.

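By solving the equations of transformation we obtain
$r\xi_1 = \alpha_{22}x_1 - \alpha_{12}x_2,$
$r\xi_2 = -\alpha_{21}x_1 + \alpha_{11}x_2,$
where
$r = \begin{vmatrix}\alpha_{11} & \alpha_{12}\\ \alpha_{21} & \alpha_{22}\end{vmatrix} = \alpha_{11}\alpha_{22} - \alpha_{12}\alpha_{21};$
r is termed the determinant of substitution or modulus of transformation; we assume x₁, x₂ to be independents, so that r must differ from zero.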

In the theory of forms we seek functions of the coefficients and variables of the original quantic which, save as to a power of the modulus of transformation, are equal to the like functions of the coefficients and variables of the transformed quantic. We may have such a function which does not involve the variables, viz.
$F(a'_0, a'_1, a'_2, \ldots, a'_n) = r^{\lambda}F(\bar a_0, \bar a_1, \bar a_2, \ldots, \bar a_n),$
the function $F(\bar a_0, \bar a_1, \bar a_2, \ldots, \bar a_n)$ is then said to be an invariant of the quantic quâ linear transformation. If, however, F involve as well the variables, viz.
$F(a'_0, a'_1, a'_2, \ldots; \xi_1, \xi_2) = r^{\lambda}F(\bar a_0, \bar a_1, \bar a_2, \ldots; x_1, x_2),$
the function $F(\bar a_0, \bar a_1, \bar a_2, \ldots; x_1, x_2)$ is said to be a covariant of the quantic. The expression “invariantive forms” includes both invariants and covariants, and frequently also other analogous forms which will be met with. Occasionally the word “invariants” includes covariants; when this is so it will be implied by the text. Invariantive forms will be found to be homogeneous functions alike of the coefficients and of the variables. Instead of a single quantic we may have several
$f(\bar a_0, \bar a_1, \bar a_2, \ldots; x_1, x_2),\ \phi(\bar b_0, \bar b_1, \bar b_2, \ldots; x_1, x_2),\ \ldots$
which have different coefficients, the same variables, and are of the same or different degrees in the variables; we may transform them all by the same substitution, so that they become
$f(a'_0, a'_1, a'_2, \ldots; \xi_1, \xi_2),\ \phi(b'_0, b'_1, b'_2, \ldots; \xi_1, \xi_2),\ \ldots$
If then we find
$F(a'_0, a'_1, a'_2, \ldots, b'_0, b'_1, b'_2, \ldots, \ldots; \xi_1, \xi_2) = r^{\lambda}F(\bar a_0, \bar a_1, \bar a_2, \ldots, \bar b_0, \bar b_1, \bar b_2, \ldots, \ldots; x_1, x_2),$
the function F on the right, which multiplies $r^{\lambda}$, is said to be a simultaneous invariant or covariant of the system of quantics. This notion is fundamental in the present theory because we will find that one of the most valuable artifices for finding invariants of a single quantic is first to find simultaneous invariants of several different quantics, and subsequently to make all the quantics identical. Moreover, instead of having one pair of variables x₁, x₂ we may have several pairs y₁, y₂; z₁, z₂; … in addition, and transform each pair to a new pair by substitutions having the same coefficients α₁₁, α₁₂, α₂₁, α₂₂, and arrive at functions of the original coefficients and variables (of one or more quantics) which possess the above-defined invariant property. A particular quantic of the system may be of the same or different degrees in the pairs of variables which it involves, and these degrees may vary from quantic to quantic of the system. Such quantics have been termed by Cayley multipartite.
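The defining property F(a′) = r^λ F(a) can be checked numerically. The following is a minimal sketch, not part of the original text: the helper names `substitute`, `discriminant` and `quartic_i` are hypothetical, the forms are assumed to be written with binomial coefficients as above, and exact rational arithmetic is used so that no rounding enters.

```python
from fractions import Fraction
from math import comb

def substitute(coeffs, l1, m1, l2, m2):
    """Coefficients of a binary form after x1 = l1*X1 + m1*X2, x2 = l2*X1 + m2*X2.

    `coeffs` holds a_0..a_n of  sum_k C(n,k) a_k x1^(n-k) x2^k ; the result holds
    the new coefficients in the same (binomial) convention.
    """
    n = len(coeffs) - 1
    plain = [0] * (n + 1)  # plain[j] = coefficient of X1^(n-j) X2^j
    for k, a in enumerate(coeffs):
        # expand (l1 X1 + m1 X2)^(n-k) * (l2 X1 + m2 X2)^k
        for i in range(n - k + 1):      # power of X2 taken from the first factor
            for j in range(k + 1):      # power of X2 taken from the second factor
                plain[i + j] += (comb(n, k) * a
                                 * comb(n - k, i) * l1 ** (n - k - i) * m1 ** i
                                 * comb(k, j) * l2 ** (k - j) * m2 ** j)
    # divide out the binomial coefficients again
    return [Fraction(plain[j], comb(n, j)) for j in range(n + 1)]

def discriminant(c):
    """a0*a2 - a1^2 for the quadratic (a0, a1, a2)."""
    a0, a1, a2 = c
    return a0 * a2 - a1 ** 2

def quartic_i(c):
    """a0*a4 - 4*a1*a3 + 3*a2^2 for the quartic (a0, ..., a4)."""
    a0, a1, a2, a3, a4 = c
    return a0 * a4 - 4 * a1 * a3 + 3 * a2 ** 2

quadratic = [1, 2, 5]
l1, m1, l2, m2 = 2, 1, 3, 4
r = l1 * m2 - m1 * l2            # modulus of the substitution
new_quadratic = substitute(quadratic, l1, m1, l2, m2)
print(discriminant(new_quadratic) == r ** 2 * discriminant(quadratic))
```

Here the power of the modulus is 2 for the discriminant of the quadratic and 4 for the quartic function; a function of the coefficients chosen at random would not, in general, reproduce itself in this way.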

Symbolic Form.—Restricting consideration, for the present, to binary forms in a single pair of variables, we must introduce the symbolic form of Aronhold, Clebsch and Gordan; they write the form
$(a_1x_1 + a_2x_2)^n = a_1^nx_1^n + \binom{n}{1}a_1^{n-1}a_2x_1^{n-1}x_2 + \cdots + a_2^nx_2^n = a_x^n,$
wherein a₁, a₂ are umbrae, such that
$a_1^n,\ a_1^{n-1}a_2,\ \ldots,\ a_1a_2^{n-1},\ a_2^n$
are symbolical representations of the real coefficients $\bar a_0, \bar a_1, \ldots, \bar a_{n-1}, \bar a_n$, and in general $a_1^{n-k}a_2^k$ is the symbol for $\bar a_k$. If we restrict ourselves to this set of symbols we can uniquely pass from a product of real coefficients to the symbolic representations of such product, but we cannot, uniquely, from the symbols recover the real form. This is clear because we can write
$\bar a_1\bar a_2 = a_1^{n-1}a_2\cdot a_1^{n-2}a_2^2 = a_1^{2n-3}a_2^3,$
while the same product of umbrae arises from
$\bar a_0\bar a_3 = a_1^n\cdot a_1^{n-3}a_2^3 = a_1^{2n-3}a_2^3.$
Hence it becomes necessary to have more than one set of umbrae, so that we may have more than one symbolical representation of the same real coefficients. We consider the quantic to have any number of equivalent representations $a_x^n \equiv b_x^n \equiv c_x^n \equiv \ldots$, so that $a_1^{n-k}a_2^k \equiv b_1^{n-k}b_2^k \equiv c_1^{n-k}c_2^k \equiv \ldots = \bar a_k$; and if we wish to denote, by umbrae, a product of coefficients of degree s we employ s sets of umbrae.

Ex. gr. We write
$\bar a_1\bar a_2 = a_1^{n-1}a_2\cdot b_1^{n-2}b_2^2,$
$\bar a_3^3 = a_1^{n-3}a_2^3\cdot b_1^{n-3}b_2^3\cdot c_1^{n-3}c_2^3,$
and so on whenever we require to represent a product of real coefficients symbolically; we then have a one-to-one correspondence between the products of real coefficients and their symbolic forms. If we have a function of degree s in the coefficients, we may select any s sets of umbrae for use, and having made a selection we may, when only one quantic is under consideration at any time, permute the sets of umbrae in any manner without altering the real significance of the symbolism.

Ex. gr. To express the function $\bar a_0\bar a_2 - \bar a_1^2$, which is the discriminant of the binary quadratic
$\bar a_0x_1^2 + 2\bar a_1x_1x_2 + \bar a_2x_2^2 = a_x^2 = b_x^2,$
in a symbolic form we have
$2(\bar a_0\bar a_2 - \bar a_1^2) = \bar a_0\bar a_2 + \bar a_0\bar a_2 - 2\bar a_1\cdot\bar a_1 = a_1^2b_2^2 + a_2^2b_1^2 - 2a_1a_2b_1b_2$
$= (a_1b_2 - a_2b_1)^2.$
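The passage from umbrae back to real coefficients is purely mechanical and may be sketched in a few lines. The function name `bracket_power` is hypothetical; the sketch assumes a single binary n-ic with two equivalent sets of umbrae a, b, and uses only the rule that $a_1^{n-k}a_2^k$ and $b_1^{n-k}b_2^k$ both stand for the real coefficient of index k.

```python
from collections import Counter
from math import comb

def bracket_power(n):
    """Real expression of the symbolic product (ab)^n for a binary n-ic.

    Returns {(i, j): c} with i <= j, meaning the sum of c * a_i * a_j.
    """
    real = Counter()
    for k in range(n + 1):
        # term of (a1*b2 - a2*b1)^n:  C(n,k) (-1)^k  a1^(n-k) a2^k  b1^k b2^(n-k)
        coeff = comb(n, k) * (-1) ** k
        # a1^(n-k) a2^k is the symbol for a_k ; b1^k b2^(n-k) is the symbol for a_(n-k)
        i, j = sorted((k, n - k))
        real[(i, j)] += coeff
    return {key: c for key, c in real.items() if c != 0}

print(bracket_power(2))   # the quadratic: twice a0*a2 - a1^2
```

For n = 2 the result is 2a₀a₂ − 2a₁², in agreement with the example just given; for an odd exponent every term cancels, which is the symbolic statement that (ab) raised to an odd power vanishes for a single form.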

Such an expression as $a_1b_2 - a_2b_1$, which is
$\dfrac{\partial a_x}{\partial x_1}\dfrac{\partial b_x}{\partial x_2} - \dfrac{\partial a_x}{\partial x_2}\dfrac{\partial b_x}{\partial x_1},$
is usually written (ab) for brevity; in the same notation the determinant, whose rows are a₁, a₂, a₃; b₁, b₂, b₃; c₁, c₂, c₃ respectively, is written (abc) and so on. It should be noticed that the real function denoted by (ab)² is not the square of a real function denoted by (ab). For a single quantic of the first order (ab) is the symbol of a function of the coefficients which vanishes identically; thus
$(ab) = a_1b_2 - a_2b_1 = \bar a_0\bar a_1 - \bar a_1\bar a_0 = 0$
and, indeed, from a remark made above we see that (ab) remains unchanged by interchange of a and b; but (ab) = −(ba), and these two facts necessitate (ab) = 0.

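To find the effect of linear transformation on the symbolic form of quantic we will disuse the coefficients α₁₁, α₁₂, α₂₁, α₂₂, and employ λ₁, μ₁, λ₂, μ₂. For the substitution
x₁ = λ₁ξ₁ + μ₁ξ₂, x₂ = λ₂ξ₁ + μ₂ξ₂,
of modulus
$\begin{vmatrix}\lambda_1 & \mu_1\\ \lambda_2 & \mu_2\end{vmatrix} = (\lambda_1\mu_2 - \lambda_2\mu_1) = (\lambda\mu),$
the quadratic form
$\bar a_0x_1^2 + 2\bar a_1x_1x_2 + \bar a_2x_2^2 = a_x^2 = f(x),$
becomes
$\bar A_0\xi_1^2 + 2\bar A_1\xi_1\xi_2 + \bar A_2\xi_2^2 = A_\xi^2 = \phi(\xi),$
where
$\bar A_0 = \bar a_0\lambda_1^2 + 2\bar a_1\lambda_1\lambda_2 + \bar a_2\lambda_2^2,$
$\bar A_1 = \bar a_0\lambda_1\mu_1 + \bar a_1(\lambda_1\mu_2 + \lambda_2\mu_1) + \bar a_2\lambda_2\mu_2,$
$\bar A_2 = \bar a_0\mu_1^2 + 2\bar a_1\mu_1\mu_2 + \bar a_2\mu_2^2.$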

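We pass to the symbolic forms
$a_x^2 = (a_1x_1 + a_2x_2)^2,\quad A_\xi^2 = (A_1\xi_1 + A_2\xi_2)^2,$
by writing for
$\bar a_0, \bar a_1, \bar a_2$ the symbols $a_1^2,\ a_1a_2,\ a_2^2,$
$\bar A_0, \bar A_1, \bar A_2$ the symbols $A_1^2,\ A_1A_2,\ A_2^2,$
and then
$\bar A_0 = a_1^2\lambda_1^2 + 2a_1a_2\lambda_1\lambda_2 + a_2^2\lambda_2^2 = (a_1\lambda_1 + a_2\lambda_2)^2 = a_\lambda^2,$
$\bar A_1 = (a_1\lambda_1 + a_2\lambda_2)(a_1\mu_1 + a_2\mu_2) = a_\lambda a_\mu,$
$\bar A_2 = (a_1\mu_1 + a_2\mu_2)^2 = a_\mu^2;$
so that
$A_\xi^2 = a_\lambda^2\xi_1^2 + 2a_\lambda a_\mu\xi_1\xi_2 + a_\mu^2\xi_2^2 = (a_\lambda\xi_1 + a_\mu\xi_2)^2;$
whence the umbrae A₁, A₂ become $a_\lambda$, $a_\mu$ respectively and
$\phi(\xi) = (a_\lambda\xi_1 + a_\mu\xi_2)^2.$
The practical result of the transformation is to change the umbrae a₁, a₂ into the umbrae
$a_\lambda = a_1\lambda_1 + a_2\lambda_2,\quad a_\mu = a_1\mu_1 + a_2\mu_2$
respectively.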

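By similarly transforming the binary $n^{\text{ic}}$ form $a_x^n$ we find
$\bar A_0 = (a_1\lambda_1 + a_2\lambda_2)^n = a_\lambda^n = A_1^n,$
$\bar A_1 = (a_1\lambda_1 + a_2\lambda_2)^{n-1}(a_1\mu_1 + a_2\mu_2) = a_\lambda^{n-1}a_\mu = A_1^{n-1}A_2,$
$\cdots$
$\bar A_k = (a_1\lambda_1 + a_2\lambda_2)^{n-k}(a_1\mu_1 + a_2\mu_2)^k = a_\lambda^{n-k}a_\mu^k = A_1^{n-k}A_2^k,$
so that the umbrae A₁, A₂ are $a_\lambda$, $a_\mu$ respectively.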

Theorem.—When the binary form
$a_x^n = (a_1x_1 + a_2x_2)^n$
is transformed to
$A_\xi^n = (A_1\xi_1 + A_2\xi_2)^n$
by the substitutions
x₁ = λ₁ξ₁ + μ₁ξ₂, x₂ = λ₂ξ₁ + μ₂ξ₂,
the umbrae A₁, A₂ are expressed in terms of the umbrae a₁, a₂ by the formulae
A₁ = λ₁a₁ + λ₂a₂, A₂ = μ₁a₁ + μ₂a₂.
We gather that A₁, A₂ are transformed to a₁, a₂ in such wise that the determinant of transformation reads by rows as the original determinant reads by columns, and that the modulus of the transformation is, as before, (λμ). For this reason the umbrae A₁, A₂ are said to be contragredient to x₁, x₂. If we solve the equations connecting the original and transformed umbrae we find
(λμ)(−a₂) = λ₁(−A₂) + μ₁A₁,
(λμ)a₁ = λ₂(−A₂) + μ₂A₁,
and we find that, except for the factor (λμ), −a₂ and +a₁ are transformed to −A₂ and +A₁ by the same substitutions as x₁ and x₂ are transformed to ξ₁ and ξ₂. For this reason the umbrae −a₂, a₁ are said to be cogredient to x₁ and x₂. We frequently meet with cogredient and contragredient quantities, and we have in general the following definitions:

(1) "If two equally numerous sets of quantities x, y, z, … x′, y′, z′, … are such that whenever one set x, y, z, … is expressed in terms of new quantities X, Y, Z, … the second set x′, y′, z′, … is expressed in terms of other new quantities X′, Y′, Z′, … by the same scheme of linear substitution the two sets are said to be cogredient quantities."

(2) "Two sets of quantities x, y, z, …; ξ, η, ζ, … are said to be contragredient when the linear substitutions for the first set are
x = λ₁X + μ₁Y + ν₁Z + …,
y = λ₂X + μ₂Y + ν₂Z + …,
z = λ₃X + μ₃Y + ν₃Z + …,
…
and these are associated with the following formulae appertaining to the second set,
Ξ = λ₁ξ + λ₂η + λ₃ζ + …,
Η = μ₁ξ + μ₂η + μ₃ζ + …,
Ζ = ν₁ξ + ν₂η + ν₃ζ + …,
…
wherein it should be noticed that new quantities are expressed in terms of the old, as regards the latter set, and not vice versa."

Ex. gr. The symbols d/dx, d/dy, d/dz, … are contragredient with the variables x, y, z, … for when
$(x, y, z, \ldots) = \begin{pmatrix}\lambda_1, & \mu_1, & \nu_1, & \ldots\\ \lambda_2, & \mu_2, & \nu_2, & \ldots\\ \lambda_3, & \mu_3, & \nu_3, & \ldots\\ . & . & . & .\end{pmatrix}(X, Y, Z, \ldots),$
we find
$\left(\dfrac{d}{dX}, \dfrac{d}{dY}, \dfrac{d}{dZ}, \ldots\right) = \begin{pmatrix}\lambda_1, & \lambda_2, & \lambda_3, & \ldots\\ \mu_1, & \mu_2, & \mu_3, & \ldots\\ \nu_1, & \nu_2, & \nu_3, & \ldots\\ . & . & . & .\end{pmatrix}\left(\dfrac{d}{dx}, \dfrac{d}{dy}, \dfrac{d}{dz}, \ldots\right).$

Observe the notation, which is that introduced by Cayley into the theory of matrices which he himself created.
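A convenient numerical test of contragrediency, offered here as an illustration rather than as part of the original argument, is that the sum xξ + yη + zζ is unaltered when the first set is transformed by a matrix and the second by its transpose in the manner of definition (2). The matrix and vectors below are arbitrary assumed values.

```python
def apply(matrix, vector):
    """Matrix times column vector, with the matrix given as a list of rows."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]

def transpose(matrix):
    return [list(column) for column in zip(*matrix)]

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

# rows (lambda_i, mu_i, nu_i) of the substitution  (x, y, z) = M (X, Y, Z)
M = [[2, 1, 0],
     [1, 3, 1],
     [5, 1, 4]]

new_first = [1, -2, 3]           # X, Y, Z
old_second = [5, 0, -1]          # xi, eta, zeta

old_first = apply(M, new_first)                # x, y, z in terms of X, Y, Z
new_second = apply(transpose(M), old_second)   # Xi, Eta, Zeta in terms of xi, eta, zeta

print(dot(old_first, old_second), dot(new_first, new_second))
```

Both printed numbers agree, whatever matrix is chosen; note that for the second set the new quantities are computed from the old, exactly as the definition requires.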

Just as cogrediency leads to a theory of covariants, so contragrediency leads to a theory of contravariants. If u, a quantic in x, y, z, …, be expressed in terms of new variables X, Y, Z …; and if, ξ, η, ζ, …, be quantities contragredient to x, y, z, …; there are found to exist functions of ξ, η, ζ …, and of the coefficients in u, which need, at most, be multiplied by powers of the modulus to be made equal to the same functions of Ξ, Η, Ζ, … of the transformed coefficients of u; such functions are called contravariants of u. There also exist functions, which involve both sets of variables as well as the coefficients of u, possessing a like property; such have been termed mixed concomitants, and they, like contravariants, may appertain as well to a system of forms as to a single form.

As between the original and transformed quantic we have the umbral relations
A₁ = λ₁a₁ + λ₂a₂, A₂ = μ₁a₁ + μ₂a₂,
and for a second form
B₁ = λ₁b₁ + λ₂b₂, B₂ = μ₁b₁ + μ₂b₂.
The original forms are $a_x^n$, $b_x^n$, and we may regard them either as different forms or as equivalent representations of the same form. In other words, B, b may be regarded as different or alternative symbols to A, a. In either case
(AB) = A₁B₂ − A₂B₁ = (λμ)(ab);
and, from the definition, (ab) possesses the invariant property. We cannot, however, say that it is an invariant unless it is expressible in terms of the real coefficients. Since (ab) = a₁b₂ − a₂b₁, that this may be the case each form must be linear; and if the forms be different (ab) is an invariant (simultaneous) of the two forms, its real expression being a₀b₁ − a₁b₀. This will be recognized as the resultant of the two linear forms. If the two linear forms be identical, the umbral sets a₁, a₂; b₁, b₂ are alternative, are ultimately put equal to one another and (ab) vanishes. A single linear form has, in fact, no invariant. When either of the forms is of an order higher than the first (ab), as not being expressible in terms of the actual coefficients of the forms, is not an invariant and has no significance. Introducing now other sets of symbols C, D, …; c, d, … we may write
$(AB)^i(AC)^j(BC)^k\cdots = (\lambda\mu)^{i+j+k+\cdots}(ab)^i(ac)^j(bc)^k\cdots,$
so that the symbolic product
(ab)ⁱ(ac)^j(bc)^k…,
possesses the invariant property. If the forms be all linear and different, the function is an invariant, viz. the ith power of that appertaining to $a_x$ and $b_x$ multiplied by the jth power of that appertaining to $a_x$ and $c_x$ multiplied by &c. If any two of the linear forms, say $p_x$, $q_x$, be supposed identical, any symbolic expression involving the factor (pq) is zero. Notice, therefore, that the symbolic product $(ab)^i(ac)^j(bc)^k\cdots$ may be always viewed as a simultaneous invariant of a number of different linear forms $a_x, b_x, c_x, \ldots$. In order that $(ab)^i(ac)^j(bc)^k\cdots$ may be a simultaneous invariant of a number of different forms $a_x^{n_1}, b_x^{n_2}, c_x^{n_3}, \ldots$, where n₁, n₂, n₃, … may be the same or different, it is necessary that every product of umbrae which arises in the expansion of the symbolic product be of degree n₁ in a₁, a₂; in the case of b₁, b₂ of degree n₂; in the case of c₁, c₂ of degree n₃; and so on. For these only will the symbolic product be replaceable by a linear function of products of real coefficients. Hence the condition is
i + j + … = n₁,
i + k + … = n₂,
j + k + … = n₃,
....
If the forms $a_x^n, b_x^n, c_x^n, \ldots$ be identical the symbols are alternative, and provided that the form does not vanish it denotes an invariant of the single form $a_x^n$.

There may be a number of forms $a_x^n, b_x^n, c_x^n, \ldots$ and we may suppose such identities between the symbols that on the whole only two, three, or more of the sets of umbrae are not equivalent; we will then obtain invariants of two, three, or more sets of binary forms. The symbolic expression of a covariant is equally simple, because we see at once that since $A_\xi, B_\xi, C_\xi, \ldots$ are equal to $a_x, b_x, c_x, \ldots$ respectively, the linear forms $a_x, b_x, c_x, \ldots$ possess the invariant property, and we may write
$(AB)^i(AC)^j(BC)^k\cdots A_\xi^{\rho}B_\xi^{\sigma}C_\xi^{\tau}\cdots$
$= (\lambda\mu)^{i+j+k+\cdots}(ab)^i(ac)^j(bc)^k\cdots a_x^{\rho}b_x^{\sigma}c_x^{\tau}\cdots,$
and assert that the symbolic product
$(ab)^i(ac)^j(bc)^k\cdots a_x^{\rho}b_x^{\sigma}c_x^{\tau}\cdots$
possesses the invariant property. It is always an invariant or covariant appertaining to a number of different linear forms, and as before it may vanish if two such linear forms be identical. In general it will be a simultaneous covariant of the different forms $a_x^{n_1}, b_x^{n_2}, c_x^{n_3}, \ldots$ if
i + j + … + ρ = n₁,
i + k + … + σ = n₂,
j + k + … + τ = n₃,
….

It will also be a covariant if the symbolic product be factorizable into portions each of which satisfies these conditions. If the forms be identical the sets of symbols are ultimately equated, and the form, provided it does not vanish, is a covariant of the form $a_x^n$.

The expression (ab)⁴ properly appertains to a quartic; for a quadratic it may also be written (ab)²(cd)², and would denote the square of the discriminant to a factor près. For the quartic
$(ab)^4 = (a_1b_2 - a_2b_1)^4 = a_1^4b_2^4 - 4a_1^3a_2b_1b_2^3 + 6a_1^2a_2^2b_1^2b_2^2 - 4a_1a_2^3b_1^3b_2 + a_2^4b_1^4$
$= \bar a_0\bar a_4 - 4\bar a_1\bar a_3 + 6\bar a_2^2 - 4\bar a_1\bar a_3 + \bar a_0\bar a_4$
$= 2(\bar a_0\bar a_4 - 4\bar a_1\bar a_3 + 3\bar a_2^2),$
one of the well-known invariants of the quartic.

For the cubic (ab)²a_xb_x is a covariant because each symbol a, b occurs three times; we can first of all find its real expression as a simultaneous covariant of two cubics, and then, by supposing the two cubics to merge into identity, find the expression of the quadratic covariant, of the single cubic, commonly known as the Hessian.

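By simple multiplication
$(a_1^3b_1b_2^2 - 2a_1^2a_2b_1^2b_2 + a_1a_2^2b_1^3)x_1^2$
$+ (a_1^3b_2^3 - a_1a_2^2b_1^2b_2 - a_1^2a_2b_1b_2^2 + a_2^3b_1^3)x_1x_2$
$+ (a_1^2a_2b_2^3 - 2a_1a_2^2b_1b_2^2 + a_2^3b_1^2b_2)x_2^2;$
and transforming to the real form,
$(\bar a_0\bar b_2 - 2\bar a_1\bar b_1 + \bar a_2\bar b_0)x_1^2 + (\bar a_0\bar b_3 - \bar a_1\bar b_2 - \bar a_2\bar b_1 + \bar a_3\bar b_0)x_1x_2$
$+ (\bar a_1\bar b_3 - 2\bar a_2\bar b_2 + \bar a_3\bar b_1)x_2^2,$
the simultaneous covariant; and now, putting b = a, we obtain twice the Hessian
$(\bar a_0\bar a_2 - \bar a_1^2)x_1^2 + (\bar a_0\bar a_3 - \bar a_1\bar a_2)x_1x_2 + (\bar a_1\bar a_3 - \bar a_2^2)x_2^2.$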

It will be shown later that all invariants, single or simultaneous, are expressible in terms of symbolic products. The degree of the covariant in the coefficients is equal to the number of different symbols a, b, c, … that occur in the symbolic expression; the degree in the variables (i.e. the order of the covariant) is ρ + σ + τ + … and the weight^[4] of the coefficient of the leading term $x_1^{\rho+\sigma+\tau+\cdots}$ is equal to i + j + k + …. It will be apparent that there are four numbers associated with a covariant, viz. the orders of the quantic and covariant, and the degree and weight of the leading coefficient; calling these n, ε, θ, w respectively we can see that they are not independent integers, but that they are invariably connected by a certain relation nθ − 2w = ε. For, if φ(a₀, … x₁, x₂) be a covariant of order ε appertaining to a quantic of order n,
$\phi(A_0, \ldots; \xi_1, \xi_2) = (\lambda\mu)^w\phi(a_0, \ldots; \lambda_1\xi_1 + \mu_1\xi_2, \lambda_2\xi_1 + \mu_2\xi_2);$
we find that the left- and right-hand sides are of degrees nθ and 2w + ε respectively in λ₁, μ₁, λ₂, μ₂, and thence nθ = 2w + ε.
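The four numbers can be read directly off a symbolic product, and the relation nθ − 2w = ε then verified. The sketch below is illustrative only: the function name `covariant_numbers` and the dictionary encoding of a symbolic product (exponents of determinant factors, and exponents of the linear factors a_x, b_x, …) are assumptions made for the example, and a single quantic is supposed, so that every symbol must occur to the same total degree n.

```python
def covariant_numbers(brackets, linear):
    """Return (n, epsilon, theta, w) for a symbolic product of a single quantic.

    brackets: {('a', 'b'): i, ...}  exponents of the determinant factors (ab), ...
    linear:   {'a': rho, ...}       exponents of the linear factors a_x, ...
    """
    symbols = sorted(set(linear) | {s for pair in brackets for s in pair})
    orders = set()
    for s in symbols:
        in_brackets = sum(e for pair, e in brackets.items() if s in pair)
        orders.add(in_brackets + linear.get(s, 0))
    if len(orders) != 1:
        raise ValueError("symbols do not all occur to the same degree")
    n = orders.pop()                 # order of the quantic
    theta = len(symbols)             # degree in the coefficients
    w = sum(brackets.values())       # weight of the leading coefficient
    epsilon = sum(linear.values())   # order of the covariant
    return n, epsilon, theta, w

# Hessian of the cubic, (ab)^2 a_x b_x
n, epsilon, theta, w = covariant_numbers({("a", "b"): 2}, {"a": 1, "b": 1})
print(n, epsilon, theta, w, n * theta - 2 * w == epsilon)
```

The same check applies to (ab)⁴ for the quartic, where ε = 0 and the relation reduces to w = ½nθ.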

Symbolic Identities.— For the purpose of manipulating symbolic expressions it is necessary to be in possession of certain simple identities which connect certain symbolic products. From the three equations
a_x = a₁x₁ + a₂x₂, b_x = b₁x₁ + b₂x₂, c_x = c₁x₁ + c₂x₂,
we find by eliminating x₁, and x₂ the relation
a_x(bc) + b_x(ca) + c_x(ab) = 0...(I.)
Introduce now new umbrae d₁, d₂ and recall that +d₂ −d₁ are cogredient with x₁, and x₂. We may in any relation substitute for any pair of quantities any other cogredient pair so that writing +d₂, −d₁ for x₁ and x₂, and noting that g_x then becomes (gd), the above-written identity becomes
(ad)(bc) + (bd)(ca) + (cd)(ab) = 0... (II.)
Similarly in (I.), writing for c₁, c₂ the cogredient pair -y2, +y1, we obtain
a_xb_y − a_yb_x = (ab)(xy). ... (III.)
Again in (I.) transposing $a_x(bc)$ to the other side and squaring, we obtain
$2(ac)(bc)a_xb_x = (bc)^2a_x^2 + (ac)^2b_x^2 - (ab)^2c_x^2,\quad\text{(IV.)}$
and herein writing d₂, −d₁ for x₁, x₂,
$2(ac)(bc)(ad)(bd) = (bc)^2(ad)^2 + (ac)^2(bd)^2 - (ab)^2(cd)^2.\quad\text{(V.)}$
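Since the identities (I.) to (V.) hold for ordinary quantities as well as for umbrae, they may be spot-checked by substituting integers. The sketch below does this for randomly chosen pairs; the helper names `bracket`, `linear` and `identities_hold` are hypothetical, and a numerical check of this kind is of course no substitute for the algebraic derivation.

```python
import random

def bracket(p, q):
    """Determinant factor (pq) = p1*q2 - p2*q1."""
    return p[0] * q[1] - p[1] * q[0]

def linear(p, x):
    """Linear form p_x = p1*x1 + p2*x2."""
    return p[0] * x[0] + p[1] * x[1]

def identities_hold(a, b, c, d, x, y):
    """Check identities (I.) to (V.) for one choice of integer pairs."""
    ax, bx, cx = linear(a, x), linear(b, x), linear(c, x)
    ab, ac, bc, ca = bracket(a, b), bracket(a, c), bracket(b, c), bracket(c, a)
    ad, bd, cd = bracket(a, d), bracket(b, d), bracket(c, d)
    checks = [
        ax * bc + bx * ca + cx * ab == 0,                                        # (I.)
        ad * bc + bd * ca + cd * ab == 0,                                        # (II.)
        ax * linear(b, y) - linear(a, y) * bx == ab * bracket(x, y),             # (III.)
        2 * ac * bc * ax * bx == bc**2 * ax**2 + ac**2 * bx**2 - ab**2 * cx**2,  # (IV.)
        2 * ac * bc * ad * bd == bc**2 * ad**2 + ac**2 * bd**2 - ab**2 * cd**2,  # (V.)
    ]
    return all(checks)

def random_pair(rng):
    return (rng.randint(-9, 9), rng.randint(-9, 9))

rng = random.Random(1911)  # fixed seed so the run is repeatable
results = [identities_hold(*(random_pair(rng) for _ in range(6))) for _ in range(200)]
print(all(results))
```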

As an illustration multiply (IV.) throughout by $a_x^{n-2}b_x^{n-2}c_x^{n-2}$ so that each term may denote a covariant of an $n^{\text{ic}}$:
$2(ac)(bc)a_x^{n-1}b_x^{n-1}c_x^{n-2}$
$= (bc)^2a_x^{n}b_x^{n-2}c_x^{n-2} + (ac)^2a_x^{n-2}b_x^{n}c_x^{n-2} - (ab)^2a_x^{n-2}b_x^{n-2}c_x^{n}.$

Each term on the right-hand side may be shown by permutation of a, b, c to be the symbolical representation of the same covariant; they are equivalent symbolic products, and we may accordingly write
$2(ac)(bc)a_x^{n-1}b_x^{n-1}c_x^{n-2} = (ab)^2a_x^{n-2}b_x^{n-2}c_x^{n},$
a relation which shows that the form on the left is the product of the two covariants
$(ab)^2a_x^{n-2}b_x^{n-2}$ and $c_x^n$.

The identities are, in particular, of service in reducing symbolic products to standard forms. A symbolical expression may be always so transformed that the power of any determinant factor (ab) is even. For we may in any product interchange a and b without altering its signification; therefore
$(ab)^{2m+1}\phi_1 = -(ab)^{2m+1}\phi_2,$
where $\phi_1$ becomes $\phi_2$ by the interchange, and hence
$(ab)^{2m+1}\phi_1 = \tfrac{1}{2}(ab)^{2m+1}(\phi_1 - \phi_2);$
and identity (I.) will always result in transforming $\phi_1 - \phi_2$ so as to make it divisible by (ab).

Ex. gr.
$(ab)(ac)b_xc_x = -(ab)(bc)a_xc_x = \tfrac{1}{2}(ab)c_x\{(ac)b_x - (bc)a_x\} = \tfrac{1}{2}(ab)^2c_x^2;$
so that the covariant of the quadratic on the left is half the product of the quadratic itself and its only invariant. To obtain the corresponding theorem concerning the general form of even order we multiply throughout by $(ab)^{2m-2}c_x^{2m-2}$ and obtain
$(ab)^{2m-1}(ac)b_xc_x^{2m-1} = \tfrac{1}{2}(ab)^{2m}c_x^{2m}.$

Paying attention merely to the determinant factors there is no form with one factor since (ab) vanishes identically. For two factors the standard form is (ab)²; for three factors (ab)²(ac); for four factors (ab)⁴ and (ab)²(cd)²; for five factors (ab)⁴(ac) and (ab)²(ac)(de)²; for six factors (ab)⁶, (ab)²(bc)²(ca)², and (ab)²(cd)²(ef)². It will be a useful exercise for the reader to interpret the corresponding covariants of the general quantic, to show that some of them are simple powers or products of other covariants of lower degrees and order.

The Polar Process.—The μth polar of $a_x^n$ with regard to y is
$a_x^{n-\mu}a_y^{\mu},$
i.e. μ of the symbolic factors of the form are replaced by μ others in which new variables y₁, y₂ replace the old variables x₁, x₂. The operation of taking the polar results in a symbolic product, and the repetition of the process in regard to new cogredient sets of variables results in symbolic forms. It is therefore an invariant process. All the forms obtained are invariants in regard to linear transformations, in accordance with the same scheme of substitutions, of the several sets of variables.

An important associated operation is
$\Omega = \dfrac{\partial^2}{\partial x_1\,\partial y_2} - \dfrac{\partial^2}{\partial x_2\,\partial y_1},$
which, operating upon any polar, causes it to vanish. Moreover, its operation upon any invariant form produces an invariant form. Every symbolic product, involving several sets of cogredient variables, can be exhibited as a sum of terms, each of which is a polar multiplied by a product of powers of the determinant factors (xy), (xz), (yz), ….

Transvection.—We have seen that (ab) is a simultaneous invariant of the two different linear forms $a_x$, $b_x$, and we observe that (ab) is equivalent to
$\dfrac{\partial f}{\partial x_1}\dfrac{\partial\phi}{\partial x_2} - \dfrac{\partial f}{\partial x_2}\dfrac{\partial\phi}{\partial x_1},$
where $f = a_x$, $\phi = b_x$. If $f = a_x^m$, $\phi = b_x^n$ be any two binary forms, we generalize by forming the function
$\dfrac{(m-k)!\,(n-k)!}{m!\,n!}\left(\dfrac{\partial f}{\partial x_1}\dfrac{\partial\phi}{\partial x_2} - \dfrac{\partial f}{\partial x_2}\dfrac{\partial\phi}{\partial x_1}\right)^k,$
in which the kth power is to be expanded symbolically, products of the differential symbols denoting repeated differentiation. This is called the kth transvectant of f over φ; it may be conveniently denoted by $(f, \phi)^k$. Since
$(a_x^m, b_x^n)^k = (ab)^ka_x^{m-k}b_x^{n-k},$
it is clear that the kth transvectant is a simultaneous covariant of the two forms.

It has been shown by Gordan that every symbolic product is expressible as a sum of transvectants.

If m > n there are n + 1 transvectants corresponding to the values 0, 1, 2, … n of k; if k = 0 we have the product of the two forms, and for all values of k > n the transvectants vanish. In general we may have any two forms
$\phi_x^p = (\phi_1x_1 + \phi_2x_2)^p,\quad \psi_x^q = (\psi_1x_1 + \psi_2x_2)^q,$
φ₁, φ₂; ψ₁, ψ₂ being the umbrae, as usual, and for the kth transvectant we have
$(\phi_x^p, \psi_x^q)^k = (\phi\psi)^k\phi_x^{p-k}\psi_x^{q-k},$
a simultaneous covariant of the two forms. We may suppose $\phi_x^p$, $\psi_x^q$ to be any two covariants appertaining to a system, and the process of transvection supplies a means of proceeding from them to other covariants.
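Unsymbolically, the kth transvectant may be computed from the partial differential coefficients of the two forms. The following sketch assumes the usual expansion of the symbolic kth power, namely a sum over i of (−)^i C(k, i) times the product of ∂^k f/∂x₁^(k−i)∂x₂^i and ∂^k φ/∂x₁^i∂x₂^(k−i), multiplied by the numerical factor (m−k)!(n−k)!/m!n!. Forms are stored as dictionaries from exponent pairs to coefficients, and the names `diff`, `mul` and `transvectant` are hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def diff(poly, var, times=1):
    """Differentiate {(i, j): c} `times` times w.r.t. x1 (var=0) or x2 (var=1)."""
    for _ in range(times):
        out = {}
        for exps, c in poly.items():
            e = exps[var]
            if e:
                new = list(exps)
                new[var] -= 1
                out[tuple(new)] = out.get(tuple(new), 0) + c * e
        poly = out
    return poly

def mul(p, q):
    """Product of two polynomials in x1, x2."""
    out = {}
    for (i1, j1), c1 in p.items():
        for (i2, j2), c2 in q.items():
            key = (i1 + i2, j1 + j2)
            out[key] = out.get(key, 0) + c1 * c2
    return out

def transvectant(f, m, g, n, k):
    """k-th transvectant of f (order m) over g (order n)."""
    if k > min(m, n):
        return {}
    total = {}
    for i in range(k + 1):
        df = diff(diff(f, 0, k - i), 1, i)      # d^k f / dx1^(k-i) dx2^i
        dg = diff(diff(g, 0, i), 1, k - i)      # d^k g / dx1^i dx2^(k-i)
        sign = (-1) ** i * comb(k, i)
        for key, c in mul(df, dg).items():
            total[key] = total.get(key, 0) + sign * c
    scale = Fraction(factorial(m - k) * factorial(n - k), factorial(m) * factorial(n))
    return {key: scale * c for key, c in total.items() if c != 0}

# quadratic with a0 = 1, a1 = 2, a2 = 5:  x1^2 + 4 x1 x2 + 5 x2^2
quadratic = {(2, 0): 1, (1, 1): 4, (0, 2): 5}
print(transvectant(quadratic, 2, quadratic, 2, 2))   # twice the discriminant a0*a2 - a1^2
```

The first transvectant of a form over itself comes out empty, and the second transvectant of x₁³ + x₂³ over itself is 2x₁x₂, in agreement with the symbolic value (ab)²a_xb_x for that cubic.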

The two forms $a_x^m$, $b_x^n$, or $\phi_x^p$, $\psi_x^q$, may be identical; we then have the kth transvectant of a form over itself which may, or may not, vanish identically; and, in the latter case, is a covariant of the single form. It is obvious that, when k is uneven, the kth transvectant of a form over itself does vanish. We have seen that transvection is equivalent to the performance of partial differential operations upon the two forms, but, practically, we may regard the process as merely substituting $(ab)^k$, $(\phi\psi)^k$ for $a_x^kb_x^k$, $\phi_x^k\psi_x^k$ respectively in the symbolic product subjected to transvection. It is essentially an operation performed upon the product of two forms. If, then, we require the transvectants of the two forms ƒ + λƒ′, φ + μφ′, we take their product
ƒφ + λƒ′φ + μƒφ′ + λμƒ′φ′,
and the kth transvectant is simply obtained by operating upon each term separately, viz.
$(f, \phi)^k + \lambda(f', \phi)^k + \mu(f, \phi')^k + \lambda\mu(f', \phi')^k;$
and, moreover, if we require to find the kth transvectant of one linear system of forms over another we have merely to multiply the two systems, and take the kth transvectant of the separate products.

The process of transvection is connected with the operation Ω; for
$\dfrac{(m-k)!\,(n-k)!}{m!\,n!}\,\Omega^k(a_x^mb_y^n) = (ab)^ka_x^{m-k}b_y^{n-k},$
or
$\dfrac{(m-k)!\,(n-k)!}{m!\,n!}\,\{\Omega^k(a_x^mb_y^n)\}_{y=x} = (f, \phi)^k;$
so also is the polar process, for since
$f_y^k = a_x^{m-k}a_y^k,\quad \phi_y^k = b_x^{n-k}b_y^k,$
if we take the kth transvectant of $f_y^k$ over $\phi_y^k$, regarding y₁, y₂ as the variables,
$(f_y^k, \phi_y^k)^k = (ab)^ka_x^{m-k}b_x^{n-k} = (f, \phi)^k;$
or the kth transvectant of the kth polars, in regard to y, is equal to the kth transvectant of the forms. Moreover, the kth transvectant $(ab)^ka_x^{m-k}b_x^{n-k}$ is derivable from the kth polar of $a_x^m$, viz. $a_x^{m-k}a_y^k$, by substituting for y₁, y₂ the cogredient quantities b₂, −b₁, and multiplying by $b_x^{n-k}$.

First and Second Transvectants.—A few words must be said about the first two transvectants as they are of exceptional interest. Since, if $f = a_x^m$, $\phi = b_x^n$,
$(f, \phi)^1 = (ab)a_x^{m-1}b_x^{n-1} = \dfrac{1}{mn}\left(\dfrac{\partial f}{\partial x_1}\dfrac{\partial\phi}{\partial x_2} - \dfrac{\partial f}{\partial x_2}\dfrac{\partial\phi}{\partial x_1}\right) = \dfrac{1}{mn}J,$
the first transvectant differs but by a numerical factor from the Jacobian or functional determinant, of the two forms. We can find an expression for the first transvectant of $(f, \phi)^1$ over another form $c_x^p$. For, $f_y = a_x^{m-1}a_y$ and $\phi_y = b_x^{n-1}b_y$ denoting first polars,
$(m + n)(f\phi)_y = nf\cdot\phi_y + mf_y\cdot\phi,$
and
$f\cdot\phi_y - f_y\cdot\phi = (a_xb_y - a_yb_x)a_x^{m-1}b_x^{n-1} = (xy)(f, \phi)^1;$
$\therefore\ (f\phi)_y = f_y\cdot\phi + \dfrac{n}{m+n}(xy)(f, \phi)^1.$
Put m − 1 for m, n − 1 for n, and multiply through by (ab); then
$\{(f, \phi)^1\}_y = (ab)a_x^{m-2}a_yb_x^{n-1} + \dfrac{n-1}{m+n-2}(xy)(f, \phi)^2$
$= (ab)a_x^{m-1}b_x^{n-2}b_y - \dfrac{m-1}{m+n-2}(xy)(f, \phi)^2.$
Multiply by $c_x^{p-1}$ and for y₁, y₂ write c₂, −c₁; then the right-hand side becomes
$(ab)(bc)a_x^{m-1}b_x^{n-2}c_x^{p-1} + \dfrac{m-1}{m+n-2}c_x^p(f, \phi)^2,$
of which the first term, writing $c_x^p = \psi$, is
$a_x^{m-2}b_x^{n-2}c_x^{p-2}\cdot(ab)(bc)a_xc_x = \tfrac{1}{2}a_x^{m-2}b_x^{n-2}c_x^{p-2}\{(ac)^2b_x^2 - (bc)^2a_x^2 - (ab)^2c_x^2\}$
$= \tfrac{1}{2}(f, \psi)^2\cdot\phi - \tfrac{1}{2}(\phi, \psi)^2\cdot f - \tfrac{1}{2}(f, \phi)^2\cdot\psi;$
and, if $(f, \phi)^1 = k_x^{m+n-2}$,
$\{(f, \phi)^1\}_y = k_x^{m+n-3}k_y,$
and this, on multiplying by $c_x^{p-1}$ and writing c₂, −c₁ for y₁, y₂, becomes
$(kc)k_x^{m+n-3}c_x^{p-1} = ((f, \phi)^1, \psi)^1;$
$\therefore\ ((f, \phi)^1, \psi)^1 = \dfrac{m-n}{2(m+n-2)}(f, \phi)^2\cdot\psi + \tfrac{1}{2}(f, \psi)^2\cdot\phi - \tfrac{1}{2}(\phi, \psi)^2\cdot f,$
and thence it appears that the first transvectant of $(f, \phi)^1$ over ψ is always expressible by means of forms of lower degree in the coefficients wherever each of the forms f, φ, ψ is of higher degree than the first in x₁, x₂.

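The second transvectant of a form over itself is called the Hessian of the form. It is
$(f, f')^2 = (ab)^2a_x^{n-2}b_x^{n-2} = H_x^{2n-4} = H;$
unsymbolically it is a numerical multiple of the determinant
$\dfrac{\partial^2f}{\partial x_1^2}\dfrac{\partial^2f}{\partial x_2^2} - \left(\dfrac{\partial^2f}{\partial x_1\,\partial x_2}\right)^2.$
It is also the first transvectant of the differential coefficients of the form with regard to the variables, viz. $\left(\dfrac{\partial f}{\partial x_1}, \dfrac{\partial f}{\partial x_2}\right)^1$. For the quadratic it is the discriminant (ab)² and for the cubic the quadratic covariant $(ab)^2a_xb_x$.

In general for a form in n variables the Hessian is
$\begin{vmatrix}\dfrac{\partial^2f}{\partial x_1^2} & \dfrac{\partial^2f}{\partial x_1\,\partial x_2} & \cdots & \dfrac{\partial^2f}{\partial x_1\,\partial x_n}\\ \vdots & \vdots & & \vdots\\ \dfrac{\partial^2f}{\partial x_n\,\partial x_1} & \dfrac{\partial^2f}{\partial x_n\,\partial x_2} & \cdots & \dfrac{\partial^2f}{\partial x_n^2}\end{vmatrix}$
and there is a remarkable theorem which states that if H = 0 and n = 2, 3, or 4 the original form can be exhibited as a form in 1, 2, 3 variables respectively.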

The Form ƒ+λφ.—An important method for the formation of covariants is connected with the form ƒ+λφ, where ƒ and φ are of the same order in the variables and λ is an arbitrary constant. If the invariants and covariants of this composite quantic be formed we obtain functions of λ such that the coefficients of the various powers of λ are simultaneous invariants of ƒ and φ. In particular, when φ is a covariant of ƒ, we obtain in this manner covariants of ƒ.

The Partial Differential Equations.—It will be shown later that covariants may be studied by restricting attention to the leading coefficient, viz. that affecting $x_1^{\epsilon}$ where ε is the order of the covariant.

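An important fact, discovered by Cayley, is that these coefficients, and also the complete covariants, satisfy certain partial differential equations which suffice to determine them, and to ascertain many of their properties. These equations can be arrived at in many ways; the method here given is due to Gordan. λ₁, λ₂, μ₁, μ₂ being as usual the coefficients of substitution, let
$\lambda_1\dfrac{\partial}{\partial\lambda_1} + \lambda_2\dfrac{\partial}{\partial\lambda_2} = D_{\lambda\lambda},\quad \lambda_1\dfrac{\partial}{\partial\mu_1} + \lambda_2\dfrac{\partial}{\partial\mu_2} = D_{\lambda\mu},$
$\mu_1\dfrac{\partial}{\partial\lambda_1} + \mu_2\dfrac{\partial}{\partial\lambda_2} = D_{\mu\lambda},\quad \mu_1\dfrac{\partial}{\partial\mu_1} + \mu_2\dfrac{\partial}{\partial\mu_2} = D_{\mu\mu},$
be linear operators. Then if j, J be the original and transformed forms of an invariant
$J = (\lambda\mu)^wj,$
w being the weight of the invariant.

Operation upon J results as follows:
$D_{\lambda\lambda}J = wJ;\quad D_{\lambda\mu}J = 0;\quad D_{\mu\lambda}J = 0;\quad D_{\mu\mu}J = wJ.$

The first and fourth of these indicate that $(\lambda\mu)^w$ is a homogeneous function of λ₁, λ₂, and of μ₁, μ₂ separately, and the second and third arise from the fact that (λμ) is caused to vanish by both $D_{\lambda\mu}$ and $D_{\mu\lambda}$. Since $J = F(A_0, A_1, \ldots A_k, \ldots)$, where $A_k = a_\lambda^{n-k}a_\mu^k$, we find that the results are equivalent to
$\sum_k(D_{\lambda\lambda}A_k)\dfrac{\partial J}{\partial A_k} = wJ;\quad \sum_k(D_{\lambda\mu}A_k)\dfrac{\partial J}{\partial A_k} = 0;\quad \sum_k(D_{\mu\lambda}A_k)\dfrac{\partial J}{\partial A_k} = 0;\quad \sum_k(D_{\mu\mu}A_k)\dfrac{\partial J}{\partial A_k} = wJ,$
according to the well-known law for the changes of independent variables. Now
$D_{\lambda\lambda}A_k = (n-k)A_k;\quad D_{\lambda\mu}A_k = kA_{k-1};\quad D_{\mu\lambda}A_k = (n-k)A_{k+1};\quad D_{\mu\mu}A_k = kA_k;$
whence
$\sum_k(n-k)A_k\dfrac{\partial J}{\partial A_k} = wJ;\quad \sum_kkA_{k-1}\dfrac{\partial J}{\partial A_k} = 0;\quad \sum_k(n-k)A_{k+1}\dfrac{\partial J}{\partial A_k} = 0;\quad \sum_kkA_k\dfrac{\partial J}{\partial A_k} = wJ;$
equations which are valid when λ₁, λ₂, μ₁, μ₂ have arbitrary values, and therefore when the values are such that J = j, $A_k = a_k$. Hence
$na_0\dfrac{\partial j}{\partial a_0} + (n-1)a_1\dfrac{\partial j}{\partial a_1} + (n-2)a_2\dfrac{\partial j}{\partial a_2} + \cdots = wj,$
$a_0\dfrac{\partial j}{\partial a_1} + 2a_1\dfrac{\partial j}{\partial a_2} + 3a_2\dfrac{\partial j}{\partial a_3} + \cdots = 0,$
$na_1\dfrac{\partial j}{\partial a_0} + (n-1)a_2\dfrac{\partial j}{\partial a_1} + (n-2)a_3\dfrac{\partial j}{\partial a_2} + \cdots = 0,$
$a_1\dfrac{\partial j}{\partial a_1} + 2a_2\dfrac{\partial j}{\partial a_2} + 3a_3\dfrac{\partial j}{\partial a_3} + \cdots = wj,$
the complete system of equations satisfied by an invariant. The fourth shows that every term of the invariant is of the same weight. Moreover, if we add the first to the fourth we obtain
$\sum_ka_k\dfrac{\partial j}{\partial a_k} = \dfrac{2w}{n}j = \theta j,$
where θ is the degree of the invariant; this shows, as we have before observed, that for an invariant w = ½nθ. The second and third are those upon the solution of which the theory of the invariant may be said to depend. An instantaneous deduction from the relation w = ½nθ is that forms of uneven orders possess only invariants of even degree in the coefficients. The two operators
$\Omega = a_0\dfrac{\partial}{\partial a_1} + 2a_1\dfrac{\partial}{\partial a_2} + \cdots + na_{n-1}\dfrac{\partial}{\partial a_n},$
$O = na_1\dfrac{\partial}{\partial a_0} + (n-1)a_2\dfrac{\partial}{\partial a_1} + \cdots + a_n\dfrac{\partial}{\partial a_{n-1}}$
have been much studied by Sylvester, Hammond, Hilbert and Elliott (Elliott, Algebra of Quantics, ch. vi.). An important reference is “The Differential Equations satisfied by Concomitants of Quantics,” by A. R. Forsyth, Proc. Lond. Math. Soc. vol. xix.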
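The two annihilating equations are easily tested on a known invariant. The sketch below applies the operators Ω and O, as written above, to the quartic invariant a₀a₄ − 4a₁a₃ + 3a₂²; polynomials in the coefficients are stored as dictionaries from exponent tuples to integers, and the function names are hypothetical.

```python
def d(poly, k):
    """Partial derivative with respect to a_k of {exponents: coefficient}."""
    out = {}
    for exps, c in poly.items():
        if exps[k]:
            new = list(exps)
            new[k] -= 1
            out[tuple(new)] = out.get(tuple(new), 0) + c * exps[k]
    return out

def times_a(poly, k, factor):
    """Multiply a polynomial by factor * a_k."""
    out = {}
    for exps, c in poly.items():
        new = list(exps)
        new[k] += 1
        out[tuple(new)] = c * factor
    return out

def add(p, q):
    """Sum of two polynomials, dropping terms that cancel."""
    out = dict(p)
    for key, c in q.items():
        out[key] = out.get(key, 0) + c
    return {key: c for key, c in out.items() if c != 0}

def omega(poly, n):
    """a0 d/da1 + 2 a1 d/da2 + ... + n a_(n-1) d/da_n."""
    total = {}
    for k in range(1, n + 1):
        total = add(total, times_a(d(poly, k), k - 1, k))
    return total

def big_o(poly, n):
    """n a1 d/da0 + (n-1) a2 d/da1 + ... + a_n d/da_(n-1)."""
    total = {}
    for k in range(n):
        total = add(total, times_a(d(poly, k), k + 1, n - k))
    return total

# a0*a4 - 4*a1*a3 + 3*a2^2, exponents listed for (a0, a1, a2, a3, a4)
quartic_invariant = {(1, 0, 0, 0, 1): 1, (0, 1, 0, 1, 0): -4, (0, 0, 2, 0, 0): 3}
print(omega(quartic_invariant, 4), big_o(quartic_invariant, 4))
```

Both results are empty, i.e. identically zero, whereas a single term such as a₀a₄ taken alone is not annihilated.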

The Evectant Process.—If we have a symbolic product, which contains the symbol a only in determinant factors such as (ab), we may write x 2 ,-x 1 for a 1, a 2 , and thus obtain a product in which (ab) is replaced by b x, (ac) by c x and so on. In particular, when the product denotes an invariant we may transform each of the symbols a, b,...to x in succession, and take the sum of the resultant products; we thus obtain a covariant which is called the first evectant of the original invariant. The second evectant is obtained by similarly operating upon all the symbols remaining which only occur in determinant factors, and so on for the higher evectants.

Ex. gr. From $(ac)^{2}(bd)^{2}(ad)(bc)$ we obtain

$(bd)^{2}(bc)c_{x}^{2}d_{x}+(ac)^{2}(ad)c_{x}d_{x}^{2}-(bd)^{2}(ad)a_{x}^{2}b_{x}-(ac)^{2}(bc)a_{x}b_{x}^{2}=4(bd)^{2}(bc)c_{x}^{2}d_{x}$ ,

the first evectant; and thence $4c_{x}^{3}d_{x}^{3}$ the second evectant; in fact the two evectants are, to numerical factors près, the cubic covariant Q, and the square of the original cubic.

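If $\theta$ be the degree of an invariant $j$

$\theta j=a_{0}{\frac {\partial j}{\partial a_{0}}}+a_{1}{\frac {\partial j}{\partial a_{1}}}+\ldots +a_{n}{\frac {\partial j}{\partial a_{n}}}$
$=a_{1}^{n}{\frac {\partial j}{\partial a_{0}}}+a_{1}^{n-1}a_{2}{\frac {\partial j}{\partial a_{1}}}+\ldots +a_{2}^{n}{\frac {\partial j}{\partial a_{n}}}$ ,

and, herein transforming from $a$ to $x$, we obtain the first evectant

$\Sigma (-)^{k}x_{1}^{k}x_{2}^{n-k}{\frac {\partial j}{\partial a_{k}}}$ .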

Combinants.-An important class of invariants, of several binary forms of the same order, was discovered by Sylvester. The invariants in question are invariants quâ linear transformation of the forms themselves as well as quâ linear transformation of the variables. If the forms be $a_{x}^{n},b_{x}^{n},c_{x}^{n},\ldots$ the Aronhold process, given by the operation $\delta$ as between any two of the forms, causes such an invariant to vanish. Thus it has annihilators of the forms

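$a_{0}{\frac {\partial }{\partial b_{0}}}+a_{1}{\frac {\partial }{\partial b_{1}}}+a_{2}{\frac {\partial }{\partial b_{2}}}+\ldots$ , $b_{0}{\frac {\partial }{\partial a_{0}}}+b_{1}{\frac {\partial }{\partial a_{1}}}+b_{2}{\frac {\partial }{\partial a_{2}}}+\ldots$ ,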

and Gordan, in fact, takes the satisfaction of these conditions as defining those invariants which Sylvester termed "combinants." The existence of such forms seems to have been brought to Sylvester's notice by observation of the fact that the resultant of $a_{x}^{n}$ and $b_{x}^{n}$ must be a factor of the resultant of $\lambda a_{x}^{n}+\mu b_{x}^{n}$ and $\lambda 'a_{x}^{n}+\mu 'b_{x}^{n}$, for a common factor of the first pair must be also a common factor of the second pair; so that the condition for the existence of such common factor must be the same in the two cases. A leading proposition states that, if an invariant of $\lambda a_{x}^{n}$ and $\mu b_{x}^{n}$ be considered as a form in the variables $\lambda$ and $\mu$, and an invariant of the latter be taken, the result will be a combinant of $a_{x}^{n}$ and $b_{x}^{n}$. The idea can be generalized so as to have regard to ternary and higher forms each of the same order and of the same number of variables.

For further information see Gordan, Vorlesungen über Invariantentheorie, Bd. ii. § 6 (Leipzig, 1887); E. B. Elliott, Algebra of Quantics, Art. 264 (Oxford, 1895).

Associated Forms.-A system of forms, such that every form appertaining to the binary form is expressible as a rational and integral function of the members of the system, is difficult to obtain. If, however, we specify that all forms are to be rational, but not necessarily integral functions, a new system of forms arises which is easily obtainable. A binary form of order $n$ contains $n$ independent constants, three of which by linear transformation can be given determinate values; the remaining $n-3$ coefficients, together with the determinant of transformation, give us $n-2$ parameters, and in consequence one relation must exist between any $n-1$ invariants of the form, and fixing upon $n-2$ invariants every other invariant is a rational function of its members. Similarly regarding $x_{1}$, $x_{2}$ as additional parameters, we see that every covariant is expressible as a rational function of $n$ fixed covariants. We can so determine these $n$ covariants that every other covariant is expressed in terms of them by a fraction whose denominator is a power of the binary form.

First observe that with $f=a_{x}^{n}=b_{x}^{n}=\ldots$, $f_{1}=a_{1}a_{x}^{n-1}$, $f_{2}=a_{2}a_{x}^{n-1}$, $f_{x}=f_{1}x_{1}+f_{2}x_{2}$, we find

$(ab)={\frac {(af)b_{x}-(bf)a_{x}}{f_{x}}}$ ,

and that thence every symbolic product is equal to a rational function of covariants in the form of a fraction whose denominator is a power of $f_{x}$. Making the substitution in any symbolic product the only determinant factors that present themselves in the numerator are of the form $(af),(bf),(cf),\ldots$ and every symbol $a$ finally appears in the form

$\psi _{k}=(af)^{k}a_{x}^{n-k}$ .

$\psi _{k}$ has $f$ as a factor, and may be written $f\cdot u_{k}$; for observing that $\psi _{0}=f=f\cdot u_{0}$; $\psi _{1}=0=f\cdot u_{1}$; where $u_{0}=1$, $u_{1}=0$, assume that

$\psi _{k}=(af)^{k}a_{x}^{n-k}=f\cdot u_{k}=a_{x}^{n}\cdot u_{x}^{k(n-2)}$ .

Taking the first polar with regard to $y$

$(n-k)(af)^{k}a_{x}^{n-k-1}a_{y}+k(n-1)(af)^{k-1}a_{x}^{n-k}(ab)b_{x}^{n-2}b_{y}=k(n-2)a_{x}^{n}u_{x}^{k(n-2)-1}u_{y}+na_{x}^{n-1}a_{y}u_{x}^{k(n-2)}$ ,

and, writing $f_{2}$ and $-f_{1}$ for $y_{1}$ and $y_{2}$,

$(n-k)(af)^{k+1}a_{x}^{n-k-1}+k(n-1)(ab)(af)^{k-1}(bf)a_{x}^{n-k}b_{x}^{n-2}=k(n-2)f\cdot (uf)u_{x}^{k(n-2)-1}$ .

Moreover the second term on the left, on interchanging the equivalent symbols $a$, $b$ and taking half the sum of the two expressions, is seen to contain, whether $k$ be uneven or even, the factor $(af)b_{x}-(bf)a_{x}=(ab)f$, and therefore

$(n-k)\psi _{k+1}+{\text{M}}\cdot f=k(n-2)f\cdot (uf)u_{x}^{k(n-2)-1}$ ;

and $\psi _{k+1}$ is seen to be of the form $f\cdot u_{k+1}$. We may write therefore $\psi _{k}=f\cdot u_{k}$ generally. These forms, $n$ in number, are called "associated forms" of $f$ ("Schwesterformen," "formes associées").

Every covariant is rationally expressible by means of the forms $f,u_{2},u_{3},\ldots u_{n}$ since, as we have seen, $u_{0}=1$, $u_{1}=0$. It is easy to find the relations

$u_{2}={\tfrac {1}{2}}(f,f')^{2}$ , $u_{3}=((f,f')^{2},f'')$ , $u_{4}={\tfrac {1}{2}}(f,f')^{4}\cdot f^{2}-{\tfrac {3}{4}}\{(f,f')^{2}\}^{2}$ ,

and so on.
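These relations can be checked on leading coefficients. The following is a minimal sketch, not taken from the original text: it assumes the form is written with binomial coefficients, $f=a_{0}x_{1}^{n}+na_{1}x_{1}^{n-1}x_{2}+\ldots$, so that at $x=(1,0)$ the product $(af)^{k}a_{x}^{n-k}$ reduces to $\Sigma _{j}(-)^{j}{\tbinom {k}{j}}a_{j}a_{1}^{k-j}a_{0}^{j}$. The function names and the sample coefficients are hypothetical.

```python
from fractions import Fraction
from math import comb

def psi_leading(k, a):
    """Leading coefficient of psi_k = (af)^k a_x^(n-k), evaluated at x = (1, 0)."""
    return sum((-1) ** j * comb(k, j) * a[j] * a[1] ** (k - j) * a[0] ** j
               for j in range(k + 1))

def u_leading(k, a):
    """Leading coefficient of u_k, from psi_k = f * u_k and f(1, 0) = a[0]."""
    return Fraction(psi_leading(k, a), a[0])

a = [2, 3, 5, 7, 11]                                   # sample a_0 .. a_4
H0 = 2 * (a[0] * a[2] - a[1] ** 2)                     # leading coefficient of (f,f')^2
H1 = a[0] * a[3] - a[1] * a[2]                         # next coefficient of (f,f')^2
I0 = 2 * (a[0] * a[4] - 4 * a[1] * a[3] + 3 * a[2] ** 2)  # leading coefficient of (f,f')^4

u2_expected = Fraction(H0, 2)                          # u_2 = (1/2)(f,f')^2
u3_expected = H0 * a[1] - H1 * a[0]                    # u_3 = ((f,f')^2, f'')
u4_expected = Fraction(I0, 2) * a[0] ** 2 - Fraction(3, 4) * H0 ** 2
```

Agreement of the leading coefficients is enough here, since a covariant is determined by its leading coefficient.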

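To exhibit any covariant as a function of $u_{0},u_{1},\ldots u_{n}$, take $a_{y}^{n}=(a_{1}y_{1}+a_{2}y_{2})^{n}$ and transform it by the substitution

$f_{1}y_{1}+f_{2}y_{2}=\xi$ , $x_{2}y_{1}-x_{1}y_{2}=\eta$ ,

where $f_{1}=a_{1}a_{x}^{n-1}$, $f_{2}=a_{2}a_{x}^{n-1}$; thence

$f\cdot y_{1}=x_{1}\xi +f_{2}\eta$ ; $f\cdot y_{2}=x_{2}\xi -f_{1}\eta$ , $f\cdot a_{y}=a_{x}\xi +(af)\eta$ ,

$f^{n-1}\cdot a_{y}^{n}=\xi ^{n}+{\tbinom {n}{2}}u_{2}\xi ^{n-2}\eta ^{2}+{\tbinom {n}{3}}u_{3}\xi ^{n-3}\eta ^{3}+\ldots +u_{n}\eta ^{n}$ .

Now a covariant of $a_{x}^{n}=f$ is obtained from the similar covariant of $a_{y}^{n}$ by writing therein $x_{1}$, $x_{2}$ for $y_{1}$, $y_{2}$, and, since $y_{1}$, $y_{2}$ have been linearly transformed to $\xi$ and $\eta$, it is merely necessary to form the covariants in respect of the form $(u_{1}\xi +u_{2}\eta )^{n}$, and then division, by the proper power of $f$, gives the covariant in question as a function of $f$, $u_{0}=1$, $u_{2},u_{3},\ldots u_{n}$.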

Summary of Results.-We will now give a short account of the results to which the foregoing processes lead. Of any form $a_{x}^{n}$ there exists a finite number of invariants and covariants, in terms of which all other covariants are rational and integral functions (cf. Gordan, Bd. ii. § 21). This finite number of forms is said to constitute the complete system. Of two or more binary forms there are also complete systems containing a finite number of forms. There are also algebraic systems, as above mentioned, involving fewer covariants which are such that all other covariants are rationally expressible in terms of them; but these smaller systems do not possess the same mathematical interest as those first mentioned.

The Binary Quadratic.-The complete system consists of the form itself, $a_{x}^{2}$, and the discriminant, which is the second transvectant of the form upon itself, viz.: $(f,f')^{2}=(ab)^{2}$; or, in real coefficients, $2(a_{0}a_{2}-a_{1}^{2})$. The first transvectant, $(f,f')^{1}=(ab)a_{x}b_{x}$, vanishes identically. Calling the discriminant D, the solution of the quadratic $a_{x}^{2}=0$ is given by the formula

$a_{x}^{2}={\frac {1}{a_{0}}}\left(a_{0}x_{1}+a_{1}x_{2}+x_{2}{\sqrt {-{\frac {\text{D}}{2}}}}\right)\left(a_{0}x_{1}+a_{1}x_{2}-x_{2}{\sqrt {-{\frac {\text{D}}{2}}}}\right)$ .

If the form $a_{x}^{2}$ be written as the product of its linear factors $p_{x}q_{x}$, the discriminant takes the form $-{\tfrac {1}{2}}(pq)^{2}$. The vanishing of this invariant is the condition for equal roots. The simultaneous system of two quadratic forms $a_{x}^{2}$, $\alpha _{x}^{2}$, say $f$ and $\phi$, consists of six forms, viz.

the two quadratic forms $f$, $\phi$; the two discriminants $(f,f')^{2}$, $(\phi ,\phi ')^{2}$, and the first and second transvectants of $f$ upon $\phi$, $(f,\phi )^{1}$ and $(f,\phi )^{2}$, which may be written $(a\alpha )a_{x}\alpha _{x}$ and $(a\alpha )^{2}$. These fundamental or ground forms are connected by the relation

$-2\{(f,\phi )^{1}\}^{2}=f^{2}(\phi ,\phi ')^{2}-2f\phi (f,\phi )^{2}+\phi ^{2}(f,f')^{2}$ .

If the covariant $(f,\phi )^{1}$ vanishes $f$ and $\phi$ are clearly proportional, and if the second transvectant of $(f,\phi )^{1}$ upon itself vanishes, $f$ and $\phi$ possess a common linear factor; and the condition is both necessary and sufficient. In this case $(f,\phi )^{1}$ is a perfect square, since its discriminant vanishes. If $(f,\phi )^{1}$ be not a perfect square, and $r_{x}$, $s_{x}$ be its linear factors, it is possible to express $f$ and $\phi$ in the canonical forms $\lambda _{1}(r_{x})^{2}+\lambda _{2}(s_{x})^{2}$, $\mu _{1}(r_{x})^{2}+\mu _{2}(s_{x})^{2}$ respectively. In fact, if $f$ and $\phi$ have these forms, it is easy to verify that $(f,\phi )^{1}=(\lambda \mu )(rs)r_{x}s_{x}$. The fundamental system connected with $n$ quadratic forms consists of (i.) the $n$ forms themselves $f_{1},f_{2},\ldots f_{n}$, (ii.) the ${\tbinom {n}{2}}$ functional determinants $(f_{i},f_{k})^{1}$, (iii.) the ${\tbinom {n+1}{2}}$ invariants $(f_{i},f_{k})^{2}$, (iv.) the ${\tbinom {n}{3}}$ forms $(f_{i},(f_{k},f_{m}))^{2}$, each such form remaining unaltered for any permutations of $i,k,m$. Between these forms various relations exist (cf. Gordan, § 134).
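The relation connecting the six ground forms of two quadratics can be tested numerically. The sketch below is an illustration only, under the assumption that the relation reads $-2\{(f,\phi )^{1}\}^{2}=f^{2}(\phi ,\phi ')^{2}-2f\phi (f,\phi )^{2}+\phi ^{2}(f,f')^{2}$ and that each quadratic is written $a_{0}x_{1}^{2}+2a_{1}x_{1}x_{2}+a_{2}x_{2}^{2}$; the function names are hypothetical.

```python
def ground_forms(f, g, x1, x2):
    """Values of the six ground forms of two quadratics at the point (x1, x2).

    f = (a0, a1, a2) stands for a0*x1^2 + 2*a1*x1*x2 + a2*x2^2, likewise g.
    """
    a0, a1, a2 = f
    b0, b1, b2 = g
    fv = a0 * x1 * x1 + 2 * a1 * x1 * x2 + a2 * x2 * x2
    gv = b0 * x1 * x1 + 2 * b1 * x1 * x2 + b2 * x2 * x2
    disc_f = 2 * (a0 * a2 - a1 * a1)            # (f,f')^2
    disc_g = 2 * (b0 * b2 - b1 * b1)            # (phi,phi')^2
    mixed = a0 * b2 - 2 * a1 * b1 + a2 * b0     # (f,phi)^2 = (a alpha)^2
    # (f,phi)^1 = (a alpha) a_x alpha_x, expanded in real coefficients
    jac = ((a0 * x1 + a1 * x2) * (b1 * x1 + b2 * x2)
           - (a1 * x1 + a2 * x2) * (b0 * x1 + b1 * x2))
    return fv, gv, disc_f, disc_g, mixed, jac

def relation_defect(f, g, x1, x2):
    """Left side minus right side of the assumed relation; zero if it holds."""
    fv, gv, disc_f, disc_g, mixed, jac = ground_forms(f, g, x1, x2)
    return -2 * jac ** 2 - (fv ** 2 * disc_g - 2 * fv * gv * mixed + gv ** 2 * disc_f)
```

For instance `relation_defect((1, 2, 3), (4, 5, 6), 1, 1)` evaluates both sides to -288 and returns 0.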

The Binary Cubic.-The complete system consists of

$f=a_{x}^{3}$ , $(f,f')^{2}=(ab)^{2}a_{x}b_{x}=\Delta _{x}^{2}$ , $(f,\Delta )=(ab)^{2}(ca)b_{x}c_{x}^{2}={\text{Q}}_{x}^{3}$ ,

and $(\Delta ,\Delta ')^{2}=(ab)^{2}(cd)^{2}(ad)(bc)={\text{R}}$.

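To prove that this system is complete we have to consider $(f,\Delta )^{2}$, $(\Delta ,\Delta ')^{1}$, $(f,{\text{Q}})^{1}$, $(f,{\text{Q}})^{2}$, $(f,{\text{Q}})^{3}$, $(\Delta ,{\text{Q}})^{1}$, $(\Delta ,{\text{Q}})^{2}$, and each of these can be shown either to be zero or to be a rational integral function of $f$, $\Delta$, Q and R. These forms are connected by the relation $2{\text{Q}}^{2}+\Delta ^{3}+{\text{R}}f^{2}=0$.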

The discriminant of $f$ is equal to the discriminant of $\Delta$, and is therefore $(\Delta ,\Delta ')^{2}={\text{R}}$; if it vanishes both $f$ and $\Delta$ have two roots equal, $\Delta$ is a rational factor of $f$ and Q is a perfect cube; the cube root being equal, to a numerical factor près, to the square root of $\Delta$. The Hessian $\Delta =\Delta _{x}^{2}$ is such that $(f,\Delta )^{2}=0$, and if $f$ is expressible in the form $\lambda (p_{x})^{3}+\mu (q_{x})^{3}$, that is as the sum of two perfect cubes, we find that $\Delta _{x}^{2}$ must be equal to $p_{x}q_{x}$ for then $\{\lambda (p_{x})^{3}+\mu (q_{x})^{3},p_{x}q_{x}\}^{2}=0$. Hence, if $p_{x}$, $q_{x}$ be the linear factors of the Hessian $\Delta _{x}^{2}$, the cubic can be put into the form $\lambda (p_{x})^{3}+\mu (q_{x})^{3}$ and immediately solved. This method of solution fails when the discriminant R vanishes, for then the Hessian has equal roots, as also the cubic $f$. The Hessian in that case is a factor of $f$, and Q is the third power of the linear factor which occurs to the second power in $f$. If, moreover, $\Delta$ vanishes identically $f$ is a perfect cube.
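The relation connecting the four ground forms of the cubic can be tested in the same numerical way. This is a sketch under stated assumptions rather than part of the original text: the cubic is taken as $a_{0}x^{3}+3a_{1}x^{2}y+3a_{2}xy^{2}+a_{3}y^{3}$, Q is computed as the Jacobian ${\tfrac {1}{6}}(f_{x}\Delta _{y}-f_{y}\Delta _{x})$, and the relation is taken to be $2{\text{Q}}^{2}+\Delta ^{3}+{\text{R}}f^{2}=0$. The function names are hypothetical.

```python
def cubic_ground_forms(a, x, y):
    """Values of f, Delta, Q at (x, y) and the invariant R of a binary cubic.

    a = (a0, a1, a2, a3) stands for a0*x^3 + 3*a1*x^2*y + 3*a2*x*y^2 + a3*y^3.
    """
    a0, a1, a2, a3 = a
    f = a0 * x ** 3 + 3 * a1 * x * x * y + 3 * a2 * x * y * y + a3 * y ** 3
    # Hessian Delta = d0*x^2 + 2*d1*x*y + d2*y^2
    d0 = 2 * (a0 * a2 - a1 * a1)
    d1 = a0 * a3 - a1 * a2
    d2 = 2 * (a1 * a3 - a2 * a2)
    delta = d0 * x * x + 2 * d1 * x * y + d2 * y * y
    # Q = (f, Delta)^1 = (f_x/3)*(Delta_y/2) - (f_y/3)*(Delta_x/2)
    fx3 = a0 * x * x + 2 * a1 * x * y + a2 * y * y
    fy3 = a1 * x * x + 2 * a2 * x * y + a3 * y * y
    q = fx3 * (d1 * x + d2 * y) - fy3 * (d0 * x + d1 * y)
    r = 2 * (d0 * d2 - d1 * d1)                 # (Delta, Delta')^2
    return f, delta, q, r

def cubic_relation_defect(a, x, y):
    """Value of 2*Q^2 + Delta^3 + R*f^2; zero if the assumed relation holds."""
    f, delta, q, r = cubic_ground_forms(a, x, y)
    return 2 * q * q + delta ** 3 + r * f * f
```

For the sum of two cubes, `cubic_ground_forms((1, 0, 0, 1), x, y)` gives $\Delta =2xy$, so that the linear factors of the Hessian are the two cube roots, as stated above.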

The Binary Quartic.—The fundamental system consists of five forms $a_{x}^{4}=f$ ; $(f,f')^{2}=(ab)^{2}a_{x}^{2}b_{x}^{2}=\Delta _{x}^{4}$ ; $(f,f')^{4}=(ab)^{4}=i$ ; $(f,\Delta )^{1}=(a\Delta )a_{x}^{3}\Delta _{x}^{3}=(ab)^{2}(cb)a_{x}^{2}b_{x}c_{x}^{3}=t$ ; $(f,\Delta )^{4}=(a\Delta )^{4}=(ab)^{2}(bc)^{2}(ca)^{2}=j$ , viz. two invariants, two quartics and a sextic. They are connected by the relation

$2t^{2}={\frac {1}{2}}if^{2}\Delta -\Delta ^{3}-{\frac {1}{3}}jf^{3}$ .

The discriminant, whose vanishing is the condition that $f$ may possess two equal roots, has the expression $j^{2}-{\tfrac {1}{6}}i^{3}$; it is nine times the discriminant of the cubic resolvent $k^{3}-{\tfrac {1}{2}}ik-{\tfrac {1}{3}}j$, and has also the expression $4(t,t')^{6}$. The quartic has four equal roots, that is to say, is a perfect fourth power, when the Hessian vanishes identically; and conversely. This can be verified by equating to zero the five coefficients of the Hessian $(ab)^{2}a_{x}^{2}b_{x}^{2}$. Gordan has also shown that the vanishing of the Hessian of the binary $n^{ic}$ is the necessary and sufficient condition to ensure the form being a perfect $n$th power. The vanishing of the invariants $i$ and $j$ is the necessary and sufficient condition to ensure the quartic having three equal roots. On the one hand, assuming the quartic to have the form $4x_{1}^{3}x_{2}$, we find $i=j=0$, and on the other hand, assuming $i=j=0$, we find that the quartic must have the form $a_{0}x_{1}^{4}+4a_{1}x_{1}^{3}x_{2}$ which proves the proposition. The quartic will have two pairs of equal roots, that is, will be a perfect square, if it and its Hessian merely differ by a numerical factor. For it is easy to establish the formula $(yx)^{2}\Delta _{x}^{4}=2f\cdot f_{y}^{2}-2(f_{y}^{1})^{2}$ connecting the Hessian with the quartic and its first and second polars; now $\alpha$, a root of $f$, is also a root of $\Delta _{x}^{4}$, and consequently the first polar $f_{y}^{1}={\tfrac {1}{4}}\left(y_{1}{\frac {\partial f}{\partial x_{1}}}+y_{2}{\frac {\partial f}{\partial x_{2}}}\right)$ must also vanish for the root $\alpha$, and thence ${\frac {\partial f}{\partial x_{1}}}$ and ${\frac {\partial f}{\partial x_{2}}}$ must also vanish for the same root; which proves that $\alpha$ is a double root of $f$, and $f$ therefore a perfect square. When $f=6x_{1}^{2}x_{2}^{2}$ it will be found that $\Delta =-f$. The simplest form to which the quartic is in general reducible is $f=x_{1}^{4}+6mx_{1}^{2}x_{2}^{2}+x_{2}^{4}$, involving one parameter $m$; then $\Delta _{x}^{4}=2m(x_{1}^{4}+x_{2}^{4})+2(1-3m^{2})x_{1}^{2}x_{2}^{2}$; $i=2(1+3m^{2})$; $j=6m(1-m^{2})$; $t=(1-9m^{2})(x_{1}^{2}-x_{2}^{2})(x_{1}^{2}+x_{2}^{2})x_{1}x_{2}$.
The sextic covariant $t$ is seen to be factorizable into three quadratic factors $\phi =x_{1}x_{2}$, $\chi =x_{1}^{2}+x_{2}^{2}$, $\psi =x_{1}^{2}-x_{2}^{2}$, which are such that the three mutual second transvectants vanish identically; they are for this reason termed conjugate quadratic factors. It is on a consideration of these factors of $t$ that Cayley bases his solution of the quartic equation. For, since $-2t^{2}=\Delta ^{3}-{\tfrac {1}{2}}if^{2}\Delta -{\tfrac {1}{3}}j(-f)^{3}$, he compares the right-hand side with the cubic resolvent $k^{3}-{\tfrac {1}{2}}i\lambda ^{2}k-{\tfrac {1}{3}}j\lambda ^{3}$ of $f=0$, and notices that they become identical on substituting $\Delta$ for $k$, and $-f$ for $\lambda$; hence, if $k_{1},k_{2},k_{3}$ be the roots of the resolvent

$-2t^{2}=(\Delta +k_{1}f)(\Delta +k_{2}f)(\Delta +k_{3}f)$ ;

and now, if all the roots of $f$ be different, so also are those of the resolvent, since the latter, and $f$, have practically the same discriminant; consequently each of the three factors, of $-2t^{2}$, must be perfect squares and taking the square root

$t={\frac {1}{\sqrt {-2}}}\phi \cdot \chi \cdot \psi$ ;

and it can be shown that $\phi ,\chi ,\psi$ are the three conjugate quadratic factors of $t$ above mentioned. We have $\Delta +k_{1}f=\phi ^{2}$, $\Delta +k_{2}f=\chi ^{2}$, $\Delta +k_{3}f=\psi ^{2}$, and Cayley shows that a root of the quartic can be expressed in a determinant form, of which the three rows involve $1,k_{1},\phi$; $1,k_{2},\chi$; $1,k_{3},\psi$ respectively, the remaining roots being obtained by varying the signs which occur in the radicals $\phi ,\chi ,\psi$. The transformation to the normal form reduces the quartic to a quadratic. The new variables $y_{1}$, $y_{2}$ are the linear factors of $\phi$. If $\phi =r_{x}\cdot s_{x}$, the normal form of $a_{x}^{4}$ can be shown to be given by

$(rs)^{4}\cdot a_{x}^{4}=(ar)^{4}s_{x}^{4}+6(ar)^{2}(as)^{2}r_{x}^{2}s_{x}^{2}+(as)^{4}r_{x}^{4}$ ;

$\phi$ is any one of the conjugate quadratic factors of $t$, so that, in determining $r_{x},s_{x}$ from ${\sqrt {\Delta +k_{1}f}}=0$, $k_{1}$ is any root of the resolvent. The transformation to the normal form, by the solution of a cubic and a quadratic, therefore, supplies a solution of the quartic. If $(\lambda \mu )$ is the modulus of the transformation by which $a_{x}^{4}$ is reduced to the normal form, $i$ becomes $(\lambda \mu )^{4}i$, and $j$, $(\lambda \mu )^{6}j$; hence ${\frac {i^{3}}{j^{2}}}$ is absolutely unaltered by transformation, and is termed the absolute invariant. Since therefore ${\frac {i^{3}}{j^{2}}}={\frac {2}{9}}{\frac {(1+3m^{2})^{3}}{m^{2}(1-m^{2})^{2}}}$ we have a cubic equation for determining $m^{2}$ as a function of the absolute invariant.
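The expressions for the one-parameter form of the quartic lend themselves to a direct check against the relation $2t^{2}={\tfrac {1}{2}}if^{2}\Delta -\Delta ^{3}-{\tfrac {1}{3}}jf^{3}$. The following sketch is an illustration only; it assumes the readings $i=2(1+3m^{2})$ and $j=6m(1-m^{2})$ for the canonical form $x_{1}^{4}+6mx_{1}^{2}x_{2}^{2}+x_{2}^{4}$, and the function names are hypothetical.

```python
from fractions import Fraction

def quartic_canonical(m, x, y):
    """f, Hessian, i, j, t for the quartic x^4 + 6*m*x^2*y^2 + y^4 at (x, y)."""
    f = x ** 4 + 6 * m * x * x * y * y + y ** 4
    hess = 2 * m * (x ** 4 + y ** 4) + 2 * (1 - 3 * m * m) * x * x * y * y
    i = 2 * (1 + 3 * m * m)
    j = 6 * m * (1 - m * m)
    t = (1 - 9 * m * m) * (x * x - y * y) * (x * x + y * y) * x * y
    return f, hess, i, j, t

def syzygy_defect(m, x, y):
    """2*t^2 minus (i*f^2*Delta/2 - Delta^3 - j*f^3/3); zero if the relation holds."""
    f, hess, i, j, t = quartic_canonical(m, x, y)
    rhs = Fraction(1, 2) * i * f * f * hess - hess ** 3 - Fraction(1, 3) * j * f ** 3
    return 2 * t * t - rhs

def absolute_invariant(m):
    """i^3 / j^2 for the canonical form, as an exact fraction (m must not be 0 or 1)."""
    _, _, i, j, _ = quartic_canonical(m, 1, 0)
    return Fraction(i ** 3, j ** 2)
```

With $m=2$ the absolute invariant comes out as $2\cdot 13^{3}/(9\cdot 4\cdot 9)$, in agreement with the closed expression in $m^{2}$.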

Remark.—Hermite has shown (Crelle, Bd. lii.) that the substitution, $z={\frac {i}{j}}{\frac {\Delta }{f}}$ , reduces ${\frac {x_{2}\partial x_{1}-x_{1}\partial x_{2}}{\sqrt {j}}}$ to the form

${\frac {1}{2i}}{\sqrt {-{\frac {j}{2}}}}{\frac {\partial z}{\sqrt {{\frac {1}{3}}-{\frac {1}{2}}z+{\frac {j^{2}}{i^{3}}}z^{3}}}}$ .

The Binary Quintic.—The complete system consists of 23 forms, of which the simplest are $f=a_{x}^{5}$; the Hessian ${\text{H}}=(f,f')^{2}=(ab)^{2}a_{x}^{3}b_{x}^{3}$; the quadratic covariant $i=(f,f')^{4}=(ab)^{4}a_{x}b_{x}$; and the nonic covariant ${\text{T}}=(f,(f',f'')^{2})^{1}=(f,{\text{H}})^{1}=(a{\text{H}})a_{x}^{4}{\text{H}}_{x}^{5}=(ab)^{2}(ca)a_{x}^{2}b_{x}^{3}c_{x}^{4}$; the remaining 19 are expressible as transvectants of compounds of these four.

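There are

four invariants: $(i,i')^{2},~(i^{3},{\text{H}})^{6},~(f^{2},i^{5})^{10},~(f{\text{T}},i^{7})^{14}$ ;
four linear forms: $(f,i^{2})^{4},~(f,i^{3})^{5},~(i^{4},{\text{T}})^{8},~(i^{5},{\text{T}})^{9}$ ;
three quadratic forms: $i,~({\text{H}},i^{2})^{4},~({\text{H}},i^{3})^{5}$ ;
three cubic forms: $(f,i)^{2},~(f,i^{2})^{3},~(i^{3},{\text{T}})^{6}$ ;
two quartic forms: $({\text{H}},i)^{2},~({\text{H}},i^{2})^{3}$ ;
three quintic forms: $f,~(f,i)^{1},~(i^{2},{\text{T}})^{4}$ ;
two sextic forms: ${\text{H}},~({\text{H}},i)^{1}$ ;
one septic form: $(i,{\text{T}})^{2}$ ;
one nonic form: ${\text{T}}$ .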

We will write the cubic covariant $(f,i)^{2}=j$, and then remark that the result, $(f,j)^{3}=0$, can be readily established. The form $j$ is completely defined by the relation $(f,j)^{3}=0$ as no other covariant possesses this property.

Certain covariants of the quintic involve the same determinant factors as appeared in the system of the quartic; these are $f$, H, $i$, T and $j$, and are of special importance. Further, it is convenient to have before us two other quadratic covariants, viz. $\tau =(jj')^{2}j_{x}j'_{x}$; $\theta =(i\tau )i_{x}\tau _{x}$; four other linear covariants, viz. $\alpha =-(ji)^{2}j_{x}$; $\beta =(i\alpha )i_{x}$; $\gamma =(\tau \alpha )\tau _{x}$; $\delta =(\tau \beta )\tau _{x}$. Further, in the case of invariants, we write ${\text{A}}=(i,i')^{2}$ and take three new forms ${\text{B}}=(i,\tau )^{2}$; ${\text{C}}=(\tau ,\tau ')^{2}$; ${\text{R}}=(\beta \gamma )$. Hermite expresses the quintic in a forme-type in which the constants are invariants and the variables linear covariants. If $\alpha$, $\delta$ be the linear forms, above defined, he raises the identity $a_{x}(\alpha \delta )=\alpha _{x}(a\delta )-\delta _{x}(a\alpha )$ to the fifth power (and in general to the power $n$) obtaining

$(\alpha \delta )^{5}f=(a\delta )^{5}\alpha _{x}^{5}-5(a\delta )^{4}(a\alpha )\alpha _{x}^{4}\delta _{x}+\ldots -(a\alpha )^{5}\delta _{x}^{5}$ ;

and then expresses the coefficients, on the right, in terms of the fundamental invariants. On this principle the covariant $j$ is expressible in the form

${\text{R}}^{2}j=\delta ^{3}+{\tfrac {3}{2}}{\text{B}}\delta ^{2}\alpha +{\tfrac {3}{4}}{\text{A}}{\text{C}}\delta \alpha ^{2}+{\tfrac {3}{8}}{\text{C}}(3{\text{A}}{\text{B}}-4{\text{C}})\alpha ^{3}$

when $\delta$, $\alpha$ are the above defined linear forms.

Hence, solving the cubic, ${\text{R}}^{2}j=(\delta -m_{1}\alpha )(\delta -m_{2}\alpha )(\delta -m_{3}\alpha )$ wherein $m_{1}$, $m_{2}$, $m_{3}$ are invariants.

Sylvester showed that the quintic might, in general, be expressed as the sum of three fifth powers, viz. in the canonical form

$f=k_{1}(p_{x})^{5}+k_{2}(q_{x})^{5}+k_{3}(r_{x})^{5}$ .

Now, evidently, the third transvectant of $f$, expressed in this form, with the cubic $p_{x}q_{x}r_{x}$ is zero, and hence from a property of the covariant $j$ we must have $j=p_{x}q_{x}r_{x}$; showing that the linear forms involved are the linear factors of $j$. We may therefore write

$f=k_{1}(\delta -m_{1}\alpha )^{5}+k_{2}(\delta -m_{2}\alpha )^{5}+k_{3}(\delta -m_{3}\alpha )^{5}$ ;

and we have merely to determine the constants $k_{1}$, $k_{2}$, $k_{3}$. To determine them notice that ${\text{R}}=(\alpha \delta )$ and then

$(f,\alpha ^{5})^{5}=-{\text{R}}^{5}(k_{1}+k_{2}+k_{3})$ ,
$(f,\alpha ^{4}\delta )^{5}=-5{\text{R}}^{5}(m_{1}k_{1}+m_{2}k_{2}+m_{3}k_{3})$ ,
$(f,\alpha ^{3}\delta ^{2})^{5}=-10{\text{R}}^{5}(m_{1}^{2}k_{1}+m_{2}^{2}k_{2}+m_{3}^{2}k_{3})$ ,

three equations for determining $k_{1}$, $k_{2}$, $k_{3}$. This canonical form depends upon $j$ having three unequal linear factors. When C vanishes $j$ has the form $j=p_{x}^{2}q_{x}$, and $(f,j)^{3}=(ap)^{2}(aq)a_{x}^{2}=0$. Hence, from the identity $a_{x}(pq)=p_{x}(aq)-q_{x}(ap)$, we obtain

$(pq)^{5}f=(aq)^{5}p_{x}^{5}-5(ap)(aq)^{4}p_{x}^{4}q_{x}-(ap)^{5}q_{x}^{5}$ ,

the required canonical form. Now, when ${\text{C}}=0$, clearly (see ante) ${\text{R}}^{2}j=\delta ^{2}\rho$ where $\rho =\delta +{\tfrac {3}{2}}{\text{B}}\alpha$; and Gordan then proves the relation

$6{\text{R}}^{4}\cdot f={\text{B}}\delta ^{5}+5{\text{B}}\delta ^{4}\rho -4{\text{A}}^{2}\rho ^{5}$ ,

which is Bring's form of quintic at which we can always arrive, by linear transformation, whenever the invariant C vanishes. Remark.-The invariant C is a numerical multiple of the resultant of the covariants $i$ and $j$, and if ${\text{C}}=0$, $\rho$ is the common factor of $i$ and $j$. The discriminant is the resultant of ${\frac {\partial f}{\partial x_{1}}}$ and ${\frac {\partial f}{\partial x_{2}}}$ and of degree 8 in the coefficients; since it is a rational and integral function of the fundamental invariants it is expressible as a linear function of ${\text{A}}^{2}$ and B; it is independent of C, and is therefore unaltered when C vanishes; we may therefore take $f$ in the canonical form

$6{\text{R}}^{4}f={\text{B}}\delta ^{5}+5{\text{B}}\delta ^{4}\rho -4{\text{A}}^{2}\rho ^{5}$ . The two equations

${\frac {\partial f}{\partial \delta }}=5{\text{B}}(\delta ^{4}+4\delta ^{3}\rho )=0$ ,
${\frac {\partial f}{\partial \rho }}=5({\text{B}}\delta ^{4}-4{\text{A}}^{2}\rho ^{4})=0$ ,

yield by elimination of $\delta$ and $\rho$ the discriminant

${\text{D}}=64{\text{B}}-{\text{A}}^{2}$ .
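The elimination can be followed step by step. The sketch below is an illustration, not part of the original text: it takes the right-hand side ${\text{B}}\delta ^{5}+5{\text{B}}\delta ^{4}\rho -4{\text{A}}^{2}\rho ^{5}$ as a form in $\delta$, $\rho$, uses the branch $\delta =-4\rho$ on which the first equation is satisfied (the branch $\delta =0$ is ignored), and evaluates the second equation there. The function names are hypothetical.

```python
def bring_partials(A, B, delta, rho):
    """Partial derivatives of B*delta^5 + 5*B*delta^4*rho - 4*A^2*rho^5."""
    d_delta = 5 * B * (delta ** 4 + 4 * delta ** 3 * rho)
    d_rho = 5 * (B * delta ** 4 - 4 * A ** 2 * rho ** 4)
    return d_delta, d_rho

def residual_on_branch(A, B, rho=1):
    """Second partial derivative on the branch delta = -4*rho of the first."""
    d_delta, d_rho = bring_partials(A, B, -4 * rho, rho)
    assert d_delta == 0          # the first equation holds on this branch
    return d_rho                 # equals 20 * rho^4 * (64*B - A^2)
```

The residual is $20\rho ^{4}(64{\text{B}}-{\text{A}}^{2})$, so both equations have a common solution with $\rho \neq 0$ exactly when $64{\text{B}}-{\text{A}}^{2}$ vanishes; for example `residual_on_branch(8, 1)` is zero.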

The general equation of degree 5 cannot be solved algebraically, but the roots can be expressed by means of elliptic modular functions. For an algebraic solution the invariants must fulfil certain conditions. When $R=0$ , and neither of the expressions ${\text{A}}{\text{C}}-{\text{B}}^{2}$ , $2{\text{A}}{\text{B}}-3{\text{C}}$ vanishes, the covariant $\alpha _{x}$ is a linear factor of $f$ ; but, when ${\text{R}}={\text{A}}{\text{C}}-{\text{B}}^{2}=2{\text{A}}{\text{B}}-3{\text{C}}=0$ , $\alpha _{x}$ also vanishes, and then $f$ is a product of the form $j_{x}^{3}$ and of the Hessian of $j_{x}^{3}$ . When $\alpha _{x}$ and the invariants ${\text{B}}$ and ${\text{C}}$ all vanish, either A or $j$ must vanish; in the former case $j$ is a perfect cube, its Hessian vanishing, and further $f$ contains $j$ as a factor; in the latter case, if $\rho _{x}$ , $\sigma _{x}$ be the linear factors of $i$ , $f$ can be expressed as $(\rho \sigma )^{5}f=c_{1}\rho _{x}^{5}+c_{2}\sigma _{x}^{5}$ ; if both ${\text{A}}$ and $j$ vanish $i$ also vanishes identically, and so also does $f$ . If, however, the condition be the vanishing of $i$ , $f$ contains a linear factor to the fourth power.

The Binary Sextic.—The complete system consists of 26 forms, of which the simplest are $f=a_{x}^{6}$ ; the Hessian ${\text{H}}=(ab)^{2}a_{x}^{4}b_{x}^{4}$ ; the quartic $i=(ab)^{4}a_{x}^{2}b_{x}^{2}$ ; the covariants $l=(ai)^{4}a_{x}^{2}$ ; ${\text{T}}=(ab)^{2}(cb)a_{x}^{4}b_{x}^{3}c_{x}^{5}$ ; and the invariants ${\text{A}}=(ab)^{6}$ ; ${\text{B}}=(ii')^{4}$ . There are

5 invariants: $(a,b)^{6},~(i,i')^{4},~(l,l')^{2},~(f,l^{3})^{6},~((f,i),l^{4})^{8}$ ;
6 of order 2: $l,~(i,l)^{2},~(f,l^{2})^{4},~(i,l^{2})^{3},~(f,l^{3})^{5},~((f,i),l^{3})^{6}$ ;
5 of order 4: $i,~(f,l)^{2},~(i,l),~(f,l^{2})^{3},~((f,i),l^{2})^{4}$ ;
5 of order 6: $f,~p=(ai)^{2}a_{x}^{4}i_{x}^{2},~(f,l),~((f,i),l)^{2},~(p,l)$ ;
3 of order 8: ${\text{H}},~(f,i),~({\text{H}},l)$ ;
1 of order 10: $({\text{H}},i)$ ;
1 of order 12: ${\text{T}}$ .

For a further discussion of the binary sextic see Gordan, loc. cit., Clebsch, loc. cit. The complete systems of the quintic and sextic were first obtained by Gordan in 1868 (Journ. f. Math. lxix. 323-354). August von Gall in 1880 obtained the complete system of the binary octavic (Math. Ann. xvii. 31-52, 139-152, 456); and, in 1888, that of the binary septimic, which proved to be much more complicated (Math. Ann. xxxi. 318-336). Single binary forms of higher and finite order have not been studied with complete success, but the system of the binary form of infinite order has been completely determined by Sylvester, Cayley, MacMahon and Stroh, each of whom contributed to the theory.

As regards simultaneous binary forms, the system of two quadratics, and of any number of quadratics, is alluded to above and has long been known. The system of the quadratic and cubic, consisting of 15 forms, and that of two cubics, consisting of 26 forms, were obtained by Salmon and Clebsch; that of the cubic and quartic we owe to Sigmund Gundelfinger (Programm Stuttgart, 1869, 1-43); that of the quadratic and quintic to Winter (Programm Darmstadt, 1880); that of the quadratic and sextic to von Gall (Programm Lemgo, 1873); that of two quartics to Gordan (Math. Ann. ii. 227-281, 1870); and to Eugenio Bertini (Batt. Giorn. xiv. 1-14, 1876; also Math. Ann. xi. 30-41, 1877). The system of four forms, of which two are linear and two quadratic, has been investigated by Perrin (S. M. F. Bull. xv. 45-61, 1887).

Ternary and Higher Forms.—The ternary form of order $n$ is represented symbolically by

$(a_{1}x_{1}+a_{2}x_{2}+a_{3}x_{3})^{n}=a_{x}^{n}$ ;

and, as usual, $b,c,d,\dots$ are alternative symbols, so that

$a_{x}^{n}=b_{x}^{n}=c_{x}^{n}=d_{x}^{n}=\dots$ .

To form an invariant or covariant we have merely to form a product of factors of two kinds, viz. determinant factors $(abc)$ , $(abd)$ , $(bce)$ , etc., and other factors $a_{x},~b_{x},~c_{x},\dots$ in such manner, that each of the symbols $a,~b,~c,\dots$ occurs $n$ times. Such a symbolic product, if it does not vanish identically, denotes an invariant or a covariant, according as factors $a_{x},~b_{x},~c_{x},\dots$ do not or do appear. To obtain the real form we multiply out, and, in the result, substitute for the products of symbols the real coefficients which they denote.

For example, take the ternary quadratic

$(a_{1}x_{1}+a_{2}x_{2}+a_{3}x_{3})^{2}=a_{x}^{2}$ ,

or in real form $ax_{1}^{2}+bx_{2}^{2}+cx_{3}^{2}+2fx_{2}x_{3}+2gx_{3}x_{1}+2hx_{1}x_{2}$ . We can see that $(abc)a_{x}b_{x}c_{x}$ is not a covariant, because it vanishes identically, the interchange of $a$ and $b$ changing its sign instead of leaving it unchanged; but $(abc)^{2}$ is an invariant. If $a_{x}^{2}$ , $b_{x}^{2}$ , $c_{x}^{2}$ be different forms we obtain, after development of the squared determinant and conversion to the real form (employing single and double dashes to distinguish the real coefficients of $b_{x}^{2}$ and $c_{x}^{2}$ ),

$a(b'c''+b''c'-2f'f'')+b(c'a''+c''a'-2g'g'')$
$+c(a'b''+a''b'-2h'h'')+2f(g'h''+g''h'-a'f''-a''f')$
$+2g(h'f''+h''f'-b'g''-b''g')+2h(f'g''+f''g'-c'h''-c''h')$ ;

a simultaneous invariant of the three forms, and now suppressing the dashes we obtain

$6(abc+2fgh-af^{2}-bg^{2}-ch^{2})$ ,

the expression in brackets being the well-known invariant of $a_{x}^{2}$ , the vanishing of which expresses the condition that the form may break up into two linear factors, or, geometrically, that the conic may represent two right lines. The complete system consists of the form itself and this invariant.
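The passage from the simultaneous invariant to the invariant of a single form can be confirmed by direct substitution. The following sketch is illustrative only; each ternary quadratic is represented by its six real coefficients $(a,b,c,f,g,h)$, and the function names are hypothetical.

```python
def simultaneous_invariant(q1, q2, q3):
    """Simultaneous invariant of three ternary quadratics (undashed, dashed, double-dashed)."""
    a, b, c, f, g, h = q1
    a1, b1, c1, f1, g1, h1 = q2
    a2, b2, c2, f2, g2, h2 = q3
    return (a * (b1 * c2 + b2 * c1 - 2 * f1 * f2)
            + b * (c1 * a2 + c2 * a1 - 2 * g1 * g2)
            + c * (a1 * b2 + a2 * b1 - 2 * h1 * h2)
            + 2 * f * (g1 * h2 + g2 * h1 - a1 * f2 - a2 * f1)
            + 2 * g * (h1 * f2 + h2 * f1 - b1 * g2 - b2 * g1)
            + 2 * h * (f1 * g2 + f2 * g1 - c1 * h2 - c2 * h1))

def discriminant(q):
    """The invariant abc + 2fgh - af^2 - bg^2 - ch^2 of a single ternary quadratic."""
    a, b, c, f, g, h = q
    return a * b * c + 2 * f * g * h - a * f * f - b * g * g - c * h * h

conic = (1, 2, 3, 4, 5, 6)          # sample coefficients
line_pair = (1, -1, 0, 0, 0, 0)     # x1^2 - x2^2 = (x1 - x2)(x1 + x2)
```

Taking the three forms equal gives six times the discriminant, and the discriminant vanishes for a form that breaks up into two linear factors.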

The ternary cubic has been investigated by Cayley, Aronhold, Hermite, Brioschi and Gordan. The principal reference is to Gordan (Math. Ann. i. 90-128, 1869, and vi. 436-512, 1873). The complete covariant and contravariant system includes no fewer than 34 forms; from its complexity it is desirable to consider the cubic in a simple canonical form; that chosen by Cayley was $ax^{3}+by^{3}+cz^{3}+6dxyz$ (Amer. J. Math. iv. 1-16, 1881). Another form, associated with the theory of elliptic functions, has been considered by Dingeldey (Math. Ann. xxxi. 157-176, 1888), viz. $xy^{2}-4z^{3}+g_{2}x^{2}y+g_{3}x^{3}$ , and also the special form $axz^{2}-4by^{3}$ of the cuspidal cubic. An investigation, by non-symbolic methods, is due to F. C. J. Mertens (Wien. Ber. xcv. 942-991, 1887). Hesse showed independently that the general ternary cubic can be reduced, by linear transformation, to the form

$x^{3}+y^{3}+z^{3}+6mxyz$ ,

a form which involves 9 independent constants, as should be the case; it must, however, be remarked that the counting of constants is not a sure guide to the existence of a conjectured canonical form. Thus the ternary quartic is not, in general, expressible as a sum of five 4th powers as the counting of constants might have led one to expect, a theorem due to Sylvester. Hesse’s canonical form shows at once that there cannot be more than two independent invariants; for if there were three we could, by elimination of the modulus of transformation, obtain two functions of the coefficients equal to functions of $m$ , and thus, by elimination of $m$ , obtain a relation between the coefficients, showing them not to be independent, which is contrary to the hypothesis.

The simplest invariant is ${\text{S}}=(abc)(abd)(acd)(bcd)$ of degree 4, which for the canonical form of Hesse is $m(1-m^{3})$ ; its vanishing indicates that the form is expressible as a sum of three cubes. The Hessian is symbolically $(abc)^{2}a_{x}b_{x}c_{x}={\text{H}}_{x}^{3}$ , and for the canonical form $(1+2m^{3})xyz-m^{2}(x^{3}+y^{3}+z^{3})$ . By the process of Aronhold we can form the invariant ${\text{S}}$ for the cubic $a_{x}^{3}+\lambda {\text{H}}_{x}^{3}$ , and then the coefficient of $\lambda$ is the second invariant ${\text{T}}$ . Its symbolic expression, to a numerical factor près, is

$({\text{H}}bc)({\text{H}}bd)({\text{H}}cd)(bcd)$ ,

and it is clearly of degree 6.

One more covariant is requisite to make an algebraically complete set. This is of degree 8 in the coefficients, and degree 6 in the variables, and, for the canonical form, has the expression

$-9m^{6}(x^{3}+y^{3}+z^{3})^{2}-(2m+5m^{4}+20m^{7})(x^{3}+y^{3}+z^{3})xyz$
$-(15m^{2}+78m^{5}-12m^{8})x^{2}y^{2}z^{2}+(1+8m^{3})^{2}(y^{3}z^{3}+z^{3}x^{3}+x^{3}y^{3})$ .

Passing on to the ternary quartic we find that the number of ground forms is apparently very great. Gordan (Math. Ann. xvii. 217-233), limiting himself to a particular case of the form, has determined 54 ground forms, and G. Maisano (Batt. G. xix. 198-237, 1881) has determined all up to and including the 5th degree in the coefficients.

The system of two ternary quadratics consists of 20 forms; it has been investigated by Gordan (Clebsch-Lindemann’s Vorlesungen i. 288, also Math. Ann. xix. 529-552); Perrin (S. M. F. Bull. xviii. 1-80, 1890); Rosanes (Math. Ann. vi. 264); and Gerbaldi (Annali (2), xvii. 161-196).

Ciamberlini has found a system of 127 forms appertaining to three ternary quadratics (Batt. G. xxiv. 141-157).

A. R. Forsyth has discussed the algebraically complete sets of ground forms of ternary and quaternary forms (see Amer. J. xii. 1-60, 115-160, and Camb. Phil. Trans. xiv. 409-466, 1889). He proves, by means of the six linear partial differential equations satisfied by the concomitants, that, if any concomitant be expanded in powers of $x_{1}$ , $x_{2}$ , $x_{3}$ , the point variables—and of $u_{1}$ , $u_{2}$ , $u_{3}$ , the contragredient line variables—it is completely determinate if its leading coefficient be known. For the unipartite ternary quantic of order $n$ he finds that the fundamental system contains ${\tfrac {1}{2}}(n+4)(n-1)$ individuals. He successfully considers the systems of two and three simultaneous ternary quadratics. In Part III. of the Memoir he discusses bi-ternary quantics, and in particular those which are lineo-linear, quadrato-linear, cubo-linear, quadrato-quadratic, cubo-cubic, and the system of two lineo-linear quantics. He shows that the system of the bi-ternary $n^{o}m^{ic}$ comprises

${\frac {1}{4}}(n+1)(n+2)(m+1)(m+2)-3$ individuals.

Bibliographical references to ternary forms are given by Forsyth (Amer. J. xii. p. 16) and by Cayley (Amer. J. iv., 1881). Clebsch, in 1872, in papers in Abh. d. K. Akad. d. U. zu Göttingen, t. xvii. and Math. Ann. t. v., established the important result that in the case of a form in $n$ variables, the concomitants of the form, or of a system of such forms, involve in the aggregate $n-1$ classes of variables. For instance, those of a ternary form involve two classes which may be geometrically interpreted as point and line co-ordinates in a plane; those of a quaternary form involve three classes which may be geometrically interpreted as point, line and plane co-ordinates in space.

IV. Enumerating Generating Functions

Professor Michael Roberts (Quart. Math. J. iv.) was the first to remark that the study of covariants may be reduced to the study of their leading coefficients, and that from any relations connecting the latter are immediately derivable the relations connecting the former. It has been shown above that a covariant, in general, satisfies four partial differential equations. Two of these show that the leading coefficient of any covariant is an isobaric and homogeneous function of the coefficients of the form; the remaining two may be regarded as operators which cause the vanishing of the covariant. These may be written, for the binary $n^{ic}$ ,

$\Sigma ka_{k-1}{\frac {d}{da_{k}}}-x_{2}{\frac {d}{dx_{1}}}=0$ ;
$\Sigma (n-k)a_{k+1}{\frac {d}{da_{k}}}-x_{1}{\frac {d}{dx_{2}}}=0$ ;

or in the form

$\Omega -x_{2}{\frac {d}{dx_{1}}}=0$ , $\mathrm {O} -x_{1}{\frac {d}{dx_{2}}}=0$ ;

where

$\Omega =a_{0}{\frac {d}{da_{1}}}+2a_{1}{\frac {d}{da_{2}}}+\ldots +na_{n-1}{\frac {d}{da_{n}}}$ ,
$\mathrm {O} =na_{1}{\frac {d}{da_{0}}}+(n-1)a_{2}{\frac {d}{da_{1}}}+\ldots +a_{n}{\frac {d}{da_{n-1}}}$ .

Let a covariant of degree $\epsilon$ in the variables, and of degree $\theta$ in the coefficients (the weight of the leading coefficient being $w$ and $n\theta -2w=\epsilon$ ), be

${\text{C}}_{0}x_{1}^{\epsilon }+\epsilon {\text{C}}_{1}x_{1}^{\epsilon -1}x_{2}+\ldots$ .

Operating with $\Omega -x_{2}{\frac {d}{dx_{1}}}$ we find $\Omega {\text{C}}_{0}=0$ ; that is to say, ${\text{C}}_{0}$ satisfies one of the two partial differential equations satisfied by an invariant. It is for this reason called a seminvariant, and every seminvariant is the leading coefficient of a covariant. The whole theory of invariants of a binary form depends upon the solutions of the equation $\Omega =0$ . Before discussing these it is best to transform the binary form by substituting $1{\text{!}}a_{1},~2{\text{!}}a_{2},~3{\text{!}}a_{3},\ldots n{\text{!}}a_{n}$ , for $a_{1},~a_{2},~a_{3}\ldots a_{n}$ respectively; it then becomes

$a_{0}x_{1}^{n}+na_{1}x_{1}^{n-1}x_{2}+n(n-1)a_{2}x_{1}^{n-2}x_{2}^{2}+\ldots +n{\text{!}}a_{n}x_{2}^{n}$ ,

and $\Omega$ takes the simpler form

$a_{0}{\frac {d}{da_{1}}}+a_{1}{\frac {d}{da_{2}}}+a_{2}{\frac {d}{da_{3}}}+\ldots +a_{n-1}{\frac {d}{da_{n}}}$ .

One advantage we have obtained is that, if we now write $a_{0}=0$ , and substitute $a_{s-1}$ for $a_{s}$ , when $s$ > 0, we obtain

$a_{0}{\frac {d}{da_{1}}}+a_{1}{\frac {d}{da_{2}}}+a_{2}{\frac {d}{da_{3}}}+\ldots +a_{n-2}{\frac {d}{da_{n-1}}}$

which is the form of $\Omega$ for a binary $(n-1)^{ic}$ .

Hence by merely diminishing each suffix in a seminvariant by unity, we obtain another seminvariant of the same degree, and of weight $w-\theta$ , appertaining to the $(n-1)^{ic}$ . Also, if we increase each suffix in a seminvariant, we obtain terms, free from $a_{0}$ , of some seminvariant of degree $\theta$ and weight $w+\theta$ . Ex. gr. from the invariant $a_{2}^{2}-2a_{1}a_{3}+2a_{0}a_{4}$ of the quartic the diminishing process yields $a_{1}^{2}-2a_{0}a_{2}$ , the leading coefficient of the Hessian of the cubic, and the increasing process leads to $a_{3}^{2}-2a_{2}a_{4}+2a_{1}a_{5}$ which only requires the additional term $-2a_{0}a_{6}$ to become a seminvariant of the sextic. A more important advantage, springing from the new form of $\Omega$ , arises from the fact that if

$x^{n}-a_{1}x^{n-1}+a_{2}x^{n-2}-\ldots (-)^{n}a_{n}=(x-\alpha _{1})(x-\alpha _{2})\ldots (x-\alpha _{n})$ ,

the sums of powers $\Sigma \alpha ^{2},~\Sigma \alpha ^{3},~\Sigma \alpha ^{4},~\ldots \Sigma \alpha ^{n}$ all satisfy the equation $\Omega =0$ . Hence, excluding $a_{0}$ , we may, in partition notation, write down the fundamental solutions of the equation, viz.—

$(2),~(3),~(4),\dots (n)$ ,

and say that with $a_{0}$ , we have an algebraically complete system. Every symmetric function denoted by partitions, not involving the figure unity (say a non-unitary symmetric function), which remains unchanged by any increase of $n$ , is also a seminvariant, and we may take if we please another fundamental system, viz.—

$a_{0},(2),~(3),~(22),~(32),\dots (2^{{\frac {1}{2}}n})$ or $(32^{{\frac {1}{2}}(n-3)})$ .

Observe that, if we subject any symmetric function $(p_{1}p_{2}p_{3}\dots )$ to the diminishing process, it becomes $a_{0}^{p_{1}-p_{2}}(p_{2}p_{3}\dots )$ .
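The action of the simplified $\Omega$ and the two suffix processes can be checked mechanically. The following is only a sketch under stated assumptions: a polynomial is stored as a dictionary from sorted tuples of suffixes to coefficients (so that $a_{1}a_{3}$ is `(1, 3)`), and the names `omega` and `shift` are illustrative and do not come from the text. It reproduces the quartic example given above, and also tests the homogeneous form $a_{1}^{3}-3a_{0}a_{1}a_{2}+3a_{0}^{2}a_{3}$ of the sum of cubes $(3)$.

```python
from collections import Counter

def omega(poly):
    """Apply Omega = a_0 d/da_1 + a_1 d/da_2 + ... to a polynomial.

    A polynomial is a dict {sorted tuple of suffixes: coefficient};
    for example a_1*a_3 is (1, 3) and a_2**2 is (2, 2).
    """
    out = Counter()
    for suffixes, coeff in poly.items():
        for pos, k in enumerate(suffixes):
            if k >= 1:  # d/da_k, then multiplication by a_(k-1)
                lowered = suffixes[:pos] + (k - 1,) + suffixes[pos + 1:]
                out[tuple(sorted(lowered))] += coeff
    return {m: c for m, c in out.items() if c != 0}

def shift(poly, delta):
    """Shift every suffix by delta; terms needing a negative suffix are dropped."""
    out = {}
    for suffixes, coeff in poly.items():
        moved = tuple(k + delta for k in suffixes)
        if all(k >= 0 for k in moved):
            out[moved] = coeff
    return out

# a_2^2 - 2 a_1 a_3 + 2 a_0 a_4, the invariant of the quartic
quartic_invariant = {(2, 2): 1, (1, 3): -2, (0, 4): 2}

diminished = shift(quartic_invariant, -1)   # a_1^2 - 2 a_0 a_2
increased = shift(quartic_invariant, +1)    # a_3^2 - 2 a_2 a_4 + 2 a_1 a_5
completed = dict(increased)
completed[(0, 6)] = -2                      # add the term -2 a_0 a_6

# homogeneous form of the power sum (3)
power_sum_3 = {(1, 1, 1): 1, (0, 1, 2): -3, (0, 0, 3): 3}

print(omega(quartic_invariant))  # {}
print(omega(diminished))         # {}
print(omega(increased))          # {(0, 5): 2}
print(omega(completed))          # {}
print(omega(power_sum_3))        # {}
```

The increased form is not annihilated by itself; the residue $2a_{0}a_{5}$ is exactly what the additional term $-2a_{0}a_{6}$ removes.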

Next consider the solutions of $\Omega =0$ which are of degree $\theta$ and weight $w$ . The general term in a solution involves the product $a_{0}^{\pi _{0}}a_{1}^{\pi _{1}}a_{2}^{\pi _{2}}\dots a_{n}^{\pi _{n}}$ wherein $\Sigma \pi =\theta$ , $\Sigma s\pi _{s}=w$ ; the number of such products that may appear depends upon the number of partitions of $w$ into $\theta$ or fewer parts limited not to exceed $n$ in magnitude. Let this number be denoted by $(w;~\theta ,~n)$ . In order to obtain the seminvariants we would write down the $(w;~\theta ,~n)$ terms each associated with a literal coefficient; if we now operate with $\Omega$ we obtain a linear function of $(w-1;~\theta ,~n)$ products, for the vanishing of which the literal coefficients must satisfy $(w-1;~\theta ,~n)$ linear equations; hence $(w;~\theta ,~n)-(w-1;~\theta ,~n)$ of these coefficients may be assumed arbitrarily, and the number of linearly independent solutions of $\Omega =0$ , of the given degree and weight, is precisely $(w;~\theta ,~n)-(w-1;~\theta ,~n)$ . This theory is due to Cayley; its validity depends upon showing that the $(w-1;~\theta ,~n)$ linear equations satisfied by the literal coefficients are independent; this has only recently been established by E. B. Elliott. These seminvariants are said to form an asyzygetic system. It is shown in the article on Combinatorial Analysis that $(w;~\theta ,~n)$ is the coefficient of $a^{\theta }z^{w}$ in the ascending expansion of the fraction

${\frac {1}{1-a.~1-az.~1-az^{2}.~\ldots 1-az^{n}}}$ .

Hence $(w;~\theta ,~n)-(w-1;~\theta ,~n)$ is given by the coefficient of $a^{\theta }z^{w}$ in the fraction

${\frac {1-z}{1-a.~1-az.~1-az^{2}.~\ldots 1-az^{n}}}$ ,

the enumerating generating function of asyzygetic seminvariants. We may, by a well-known theorem, write the result as a coefficient of $z^{w}$ in the expansion of

${\frac {1-z^{n+1}.~1-z^{n+2}.~\dots 1-z^{n+\theta }}{1-z^{2}.~1-z^{3}.~\dots 1-z^{\theta }}}$ ;

and since this expression is unaltered by the interchange of $n$ and $\theta$ we prove Hermite’s Law of Reciprocity, which states that the asyzygetic forms of degree $\theta$ for the $n^{ic}$ are equinumerous with those of degree $n$ for the $\theta ^{ic}$ .
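Cayley's enumeration and the law of reciprocity lend themselves to a direct numerical check. The sketch below assumes nothing beyond the definitions above; the function names are illustrative. `box_partitions` computes $(w;~\theta ,~n)$ by a simple recurrence on the largest part, and `hermite_series` expands the last-written fraction as a power series in $z$.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def box_partitions(w, theta, n):
    """(w; theta, n): partitions of w into at most theta parts, none exceeding n."""
    if w < 0:
        return 0
    if w == 0:
        return 1
    if theta == 0 or n == 0:
        return 0
    # Either no part equals n, or one part equal to n is removed.
    return box_partitions(w, theta, n - 1) + box_partitions(w - n, theta - 1, n)

def asyzygetic_count(w, theta, n):
    """(w; theta, n) - (w - 1; theta, n)."""
    return box_partitions(w, theta, n) - box_partitions(w - 1, theta, n)

def hermite_series(theta, n, max_w):
    """Coefficients of z^0..z^max_w in
    prod_{i=1..theta} (1 - z^(n+i)) / prod_{i=2..theta} (1 - z^i), for theta >= 1."""
    series = [1] + [0] * max_w
    for i in range(1, theta + 1):            # multiply by (1 - z^(n+i))
        e = n + i
        for t in range(max_w, e - 1, -1):
            series[t] -= series[t - e]
    for i in range(2, theta + 1):            # divide by (1 - z^i)
        for t in range(i, max_w + 1):
            series[t] += series[t - i]
    return series

# the discriminant of the cubic: degree 4, weight 6
print(asyzygetic_count(6, 4, 3))  # 1
# the quadratic invariant of the quartic: degree 2, weight 4
print(asyzygetic_count(4, 2, 4))  # 1
```

Interchanging the last two arguments of `asyzygetic_count` leaves every value unchanged, which is the law of reciprocity in numerical form.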

The degree of the covariant in the variables is $\epsilon =n\theta -2w$ ; consequently we are only concerned with positive terms in the developments and $(w;~\theta ,~n)-(w-1;~\theta ,~n)$ will be negative unless $n\theta -2w\geq 0$ . It is convenient to enumerate the seminvariants of degree $\theta$ and order $\epsilon =n\theta -2w$ by a generating function; so, in the first written generating function for seminvariants, write ${\tfrac {1}{z^{2}}}$ for $z$ and $az^{n}$ for $a$ ; we obtain

${\frac {1-z^{-2}}{1-az^{n}.~1-az^{n-2}.~1-az^{n-4}.\dots 1-az^{-n+4}.~1-az^{-n+2}.~1-az^{-n}}}$

in which we have to take the coefficient of $a^{\theta }z^{n\theta -2w}$ , the expansion being in ascending powers of $a$ . As we have to do only with that part of the expansion which involves positive powers of $z$ , we must try to isolate that portion, say $A_{n}(z)$ . For $n=2$ we can prove that the complete function may be written

${\text{A}}_{2}(z)-{\frac {1}{z^{2}}}{\text{A}}_{2}\left({\frac {1}{z}}\right)$ ,

where

${\text{A}}_{2}(z)={\frac {1}{1-az^{2}.~1-a^{2}}}$ ;

and this is the reduced generating function which tells us, by its denominator factors, that the complete system of the quadratic is composed of the form itself of degree order 1, 2 shown by $az^{2}$ , and of the Hessian of degree order 2, 0 shown by $a^{2}$ .

Again, for the cubic, we can find

${\text{A}}_{3}(z)={\frac {1-a^{6}z^{6}}{1-az^{3}.~1-a^{2}z^{2}.~1-a^{3}z^{3}.~1-a^{4}}}$ ,

where the ground forms are indicated by the denominator factors, viz.: these are the cubic itself of degree order 1, 3; the Hessian of degree order 2, 2; the cubi-covariant G of degree order 3, 3, and the quartic invariant of degree order 4, 0. Further, the numerator factor establishes that these are not all algebraically independent, but are connected by a syzygy of degree order 6, 6.

Similarly for the quartic

${\text{A}}_{4}(z)={\frac {1-a^{6}z^{12}}{1-az^{4}.~1-a^{2}.~1-a^{2}z^{4}.~1-a^{3}.~1-a^{3}z^{6}}}$ ,

establishing the 5 ground forms and the syzygy which connects them.
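The reduced generating functions for the cubic and quartic may be compared, coefficient by coefficient, with Cayley's count $(w;~\theta ,~n)-(w-1;~\theta ,~n)$ taken at $w={\tfrac {1}{2}}(n\theta -\epsilon )$ . The following sketch is an illustration only: each denominator factor is recorded as a (degree, order) pair, the numerator as a single syzygy pair, and all names are assumptions made for the example.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def box_partitions(w, theta, n):
    """(w; theta, n): partitions of w into at most theta parts, none exceeding n."""
    if w < 0:
        return 0
    if w == 0:
        return 1
    if theta == 0 or n == 0:
        return 0
    return box_partitions(w, theta, n - 1) + box_partitions(w - n, theta - 1, n)

def cayley_count(theta, order, n):
    """Asyzygetic covariants of degree theta and the given order for the binary n-ic."""
    twice_w = n * theta - order
    if order < 0 or twice_w < 0 or twice_w % 2:
        return 0
    w = twice_w // 2
    return box_partitions(w, theta, n) - box_partitions(w - 1, theta, n)

def combos(theta, order, gens):
    """Ways to reach (theta, order) as a non-negative combination of (degree, order) pairs."""
    if not gens:
        return 1 if (theta, order) == (0, 0) else 0
    (d, o), rest = gens[0], gens[1:]
    total = 0
    while theta >= 0 and order >= 0:
        total += combos(theta, order, rest)
        theta, order = theta - d, order - o
    return total

def reduced_coefficient(theta, order, gens, syzygy):
    """Coefficient of a^theta z^order in (1 - syzygy term) / prod (1 - generator term)."""
    return combos(theta, order, gens) - combos(theta - syzygy[0], order - syzygy[1], gens)

# denominator factors and numerator term read off from A_3(z) and A_4(z)
CUBIC = (((1, 3), (2, 2), (3, 3), (4, 0)), (6, 6))
QUARTIC = (((1, 4), (2, 0), (2, 4), (3, 0), (3, 6)), (6, 12))

print(reduced_coefficient(6, 6, *CUBIC), cayley_count(6, 6, 3))  # 2 2
```

For degree 6 and order 6 of the cubic the denominator alone would give three products of ground forms; the numerator term removes one, in agreement with the two asyzygetic forms counted by partitions.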

The process is not applicable with complete success to quintic and higher ordered binary forms. This arises from the circumstance that the simple syzygies between the ground forms are not all independent, but are connected by second syzygies, and these again by third syzygies, and so on; this introduces new difficulties which have not been completely overcome. As regards invariants a little further progress has been made by Cayley, who established the two generating functions for the quintic

${\frac {1-a^{36}}{1-a^{4}.~1-a^{8}.~1-a^{12}.~1-a^{18}}}$ ,

and for the sextic

${\frac {1-a^{30}}{1-a^{2}.~1-a^{4}.~1-a^{6}.~1-a^{10}.~1-a^{15}}}$ .

Accounts of further attempts in this direction will be found in Cayley’s Memoirs on Quantics (Collected Papers), in the papers of Sylvester and Franklin (Amer. J. i.-iv.), and in Elliott’s Algebra of Quantics, chap. viii.

Perpetuants.—Many difficulties, connected with binary forms of finite order, disappear altogether when we come to consider the form of infinite order. In this case the ground forms, called also perpetuants, have been enumerated and actual representative seminvariant forms established. Putting $n$ equal to ∞, in a generating function obtained above, we find that the function, which enumerates the asyzygetic seminvariants of degree $\theta$ , is

${\frac {1}{1-z^{2}.~1-z^{3}.~1-z^{4}.\ldots 1-z^{\theta }}}$

that is to say, of the weight $w$ , we have one form corresponding to each non-unitary partition of $w$ into the parts 2, 3, 4,... $\theta$ . The extraordinary advantage of the transformation of $\Omega$ to association with non-unitary symmetric functions is now apparent; for we may take, as representative forms, the symmetric functions which are symbolically denoted by the partitions referred to. Ex. gr., of degree 3 weight 8, we have the two forms $(3^{2}2)$ , $a_{0}(2^{4})$ . If we wish merely to enumerate those whose partitions contain the figure $\theta$ , and do not therefore contain any power of $a_{0}$ as a factor, we have the generator

${\frac {z^{\theta }}{1-z^{2}.~1-z^{3}.~1-z^{4}.~\ldots 1-z^{\theta }}}$ .

If $\theta =2$ , every form is obviously a ground form or perpetuant, and the series of forms is denoted by $(2),~(2^{2}),~(2^{3}),\dots (2^{\kappa +1})\dots$ . Similarly, if $\theta =3$ , every form $(3^{\kappa +1}2^{\lambda })$ is a perpetuant. For these two cases the perpetuants are enumerated by

${\frac {z^{2}}{1-z^{2}}}$ , and ${\frac {z^{3}}{1-z^{2}.~1-z^{3}}}$

respectively.

When $\theta =4$ it is clear that no form, whose partition contains a part 3, can be reduced; but every form, whose partition is composed of the parts 4 and 2, is by elementary algebra reducible by means of perpetuants of degree 2. These latter forms are enumerated by ${\frac {z^{4}}{1-z^{2}.~1-z^{4}}}$ ; hence the generator of quartic perpetuants must be

${\frac {z^{4}}{1-z^{2}.~1-z^{3}.~1-z^{4}}}-{\frac {z^{4}}{1-z^{2}.~1-z^{4}}}={\frac {z^{7}}{1-z^{2}.~1-z^{3}.~1-z^{4}}}$ ;

and the general form of perpetuants is $(4^{\kappa +1}~3^{\lambda +1}~2^{\mu })$ .
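The enumeration for $\theta =4$ can be confirmed by listing partitions outright. The sketch below is a brute-force illustration with hypothetical function names: it lists the non-unitary partitions of $w$ whose largest part is 4, discards those composed of the parts 4 and 2 alone, and compares the remainder with the coefficient of $z^{w}$ in the generator just found.

```python
def partitions_with_parts(w, parts):
    """All partitions of w (tuples, largest part first) using only the given part sizes."""
    sizes = sorted(parts, reverse=True)

    def build(remaining, start):
        if remaining == 0:
            yield ()
            return
        for i in range(start, len(sizes)):
            p = sizes[i]
            if p <= remaining:
                for tail in build(remaining - p, i):
                    yield (p,) + tail

    return list(build(w, 0)) if w >= 0 else []

def quartic_perpetuants(w):
    """Partitions of w into parts 4, 3, 2 that contain both a 4 and a 3."""
    return [p for p in partitions_with_parts(w, (2, 3, 4)) if 4 in p and 3 in p]

def generator_coefficient(w):
    """Coefficient of z^w in z^7 / ((1 - z^2)(1 - z^3)(1 - z^4))."""
    return len(partitions_with_parts(w - 7, (2, 3, 4)))

print(quartic_perpetuants(7))   # [(4, 3)]
print(quartic_perpetuants(11))  # [(4, 4, 3), (4, 3, 2, 2)]
```

The lowest weight is 7, furnished by the partition $(43)$ , and at weight 11 the two forms $(4^{2}3)$ and $(432^{2})$ appear.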

When $\theta \geq 5$ , the reducible forms are connected by syzygies which there is some difficulty in enumerating. Sylvester, Cayley and MacMahon succeeded, by a laborious process, in establishing the generators for $\theta =5$ , and $\theta =6$ , viz.:

${\frac {z^{15}}{1-z^{2}.~1-z^{3}.~1-z^{4}.~1-z^{5}}}$ , ${\frac {z^{31}}{1-z^{2}.~1-z^{3}.~1-z^{4}.~1-z^{5}.~1-z^{6}}}$ ;

but the true method of procedure is that of Stroh which we are about to explain.

Method of Stroh.—In the section on “Algebraic Forms” , it was noted that Stroh considers

$(\sigma _{1}\alpha _{1}+\sigma _{2}\alpha _{2}+\ldots +\sigma _{\theta }\alpha _{\theta })^{w}$ ,

where $\sigma _{1}+\sigma _{2}+\ldots +\sigma _{\theta }=0$ and ${\frac {\alpha _{1}^{s}}{s{\text{!}}}}={\frac {\alpha _{2}^{s}}{s{\text{!}}}}=\ldots ={\frac {\alpha _{\theta }^{s}}{s{\text{!}}}}=a_{s}$ symbolically, to be the fundamental form of seminvariant of degree $\theta$ and weight $w$ ; he observes that every form of this degree and weight is a linear function of such symbolic expressions. We may write

$(1+\sigma _{1}\xi )(1+\sigma _{2}\xi )\ldots (1+\sigma _{\theta }\xi )=1+{\text{A}}_{2}\xi ^{2}+{\text{A}}_{3}\xi ^{3}+\ldots +{\text{A}}_{\theta }\xi ^{\theta }$ .

If we expand the symbolic expression by the multinomial theorem, and remember that any symbolic product $\alpha _{1}^{\pi _{1}}\alpha _{2}^{\pi _{2}}\alpha _{3}^{\pi _{3}}\dots$ retains the same value, however the suffixes be permuted, we shall obtain a sum of terms, such as $w{\text{!}}~{\frac {\alpha _{1}^{\pi _{1}}}{\pi _{1}{\text{!}}}}{\frac {\alpha _{2}^{\pi _{2}}}{\pi _{2}{\text{!}}}}{\frac {\alpha _{3}^{\pi _{3}}}{\pi _{3}{\text{!}}}}\ldots \Sigma \sigma _{1}^{\pi _{1}}\sigma _{2}^{\pi _{2}}\sigma _{3}^{\pi _{3}}\ldots$ , which in real form is $w{\text{!}}~a_{\pi _{1}}a_{\pi _{2}}a_{\pi _{3}}\ldots \Sigma \sigma _{1}^{\pi _{1}}\sigma _{2}^{\pi _{2}}\sigma _{3}^{\pi _{3}}\ldots$ ; and, if we express $\Sigma \sigma _{1}^{\pi _{1}}\sigma _{2}^{\pi _{2}}\sigma _{3}^{\pi _{3}}\ldots$ in terms of ${\text{A}}_{2},~{\text{A}}_{3},\dots$ , and arrange the whole as a linear function of products of ${\text{A}}_{2},~{\text{A}}_{3},\dots$ , each coefficient will be a seminvariant, and the aggregate of the coefficients will give us the complete asyzygetic system of the given degree and weight.

When the proper degree $\theta$ is < $w$ a factor $a_{0}^{w-\theta }$ must be of course understood.

Ex. gr.

${\frac {1}{2{\text{!}}}}(\sigma _{1}\alpha _{1}+\sigma _{2}\alpha _{2}+\sigma _{3}\alpha _{3}+\sigma _{4}\alpha _{4})^{2}={\frac {\alpha _{1}^{2}}{2{\text{!}}}}\Sigma \sigma _{1}^{2}+\alpha _{1}\alpha _{2}\Sigma \sigma _{1}\sigma _{2}$
$=a_{2}(-2{\text{A}}_{2})+a_{1}^{2}{\text{A}}_{2}=(a_{1}^{2}-2a_{2}){\text{A}}_{2}=(2){\text{A}}_{2}\equiv a_{0}^{2}(2){\text{A}}_{2}$ .

In general any product ${\text{A}}_{\pi _{1}}{\text{A}}_{\pi _{2}}{\text{A}}_{\pi _{3}}\ldots$ will have, as coefficient, a seminvariant which, when expressed by partitions, will have as leading partition (preceding in dictionary order all others) the partition $(\pi _{1}\pi _{2}\pi _{3}\dots )$ . Now the symbolic expression of the seminvariant can be expanded by the binomial theorem so as to be exhibited as a sum of products of seminvariants of lower degrees, if $\sigma _{1}\alpha _{1}+\sigma _{2}\alpha _{2}+\ldots +\sigma _{\theta }\alpha _{\theta }$ can be broken up into any two portions

$(\sigma _{1}\alpha _{1}+\sigma _{2}\alpha _{2}+\ldots +\sigma _{s}\alpha _{s})+(\sigma _{s+1}\alpha _{s+1}+\sigma _{s+2}\alpha _{s+2}+\ldots +\sigma _{\theta }\alpha _{\theta })$

such that $\sigma _{1}+\sigma _{2}+\ldots +\sigma _{s}=0$ , for then

$\sigma _{s+1}+\sigma _{s+2}+\ldots +\sigma _{\theta }=0$

and each portion raised to any power denotes a seminvariant. Stroh assumes that every reducible seminvariant can in this way be reduced. The existence of such a relation, as $\sigma _{1}+\sigma _{2}+\ldots +\sigma _{s}=0$ , necessitates the vanishing of a certain function of the coefficients ${\text{A}}_{2},~{\text{A}}_{3},\ldots {\text{A}}_{\theta }$ , and as a consequence one product of these coefficients can be eliminated from the expanded form and no seminvariant, which appears as a coefficient to such a product (which may be the whole or only a part of the complete product, with which the seminvariant is associated), will be capable of reduction.

Ex. gr. for $\theta =2$ , $(\sigma _{1}\alpha _{1}+\sigma _{2}\alpha _{2})^{w}$ ; either $\sigma _{1}$ or $\sigma _{2}$ will vanish if $\sigma _{1}\sigma _{2}={\text{A}}_{2}=0$ ; but every term, in the development, is of the form $(222\ldots ){\text{A}}_{2}^{{\frac {1}{2}}w}$ and therefore vanishes; so that none are left to undergo reduction. Therefore every form of degree 2, except of course that one whose weight is zero, is a perpetuant. The generating function is ${\frac {z^{2}}{1-z^{2}}}$ .

For $\theta =3$ , $(\sigma _{1}\alpha _{1}+\sigma _{2}\alpha _{2}+\sigma _{3}\alpha _{3})^{w}$ ; the condition is clearly $\sigma _{1}\sigma _{2}\sigma _{3}={\text{A}}_{3}=0$ , and since every seminvariant, of proper degree 3, is associated, as coefficient, with a product containing ${\text{A}}_{3}$ , all such are perpetuants. The general form is $(3^{\kappa }2^{\lambda })$ and the generating function ${\frac {z^{3}}{1-z^{2}.~1-z^{3}}}$ .

For $\theta =4$ , $(\sigma _{1}\alpha _{1}+\sigma _{2}\alpha _{2}+\sigma _{3}\alpha _{3}+\sigma _{4}\alpha _{4})^{w}$ ; the condition is

$\sigma _{1}\sigma _{2}\sigma _{3}\sigma _{4}(\sigma _{1}+\sigma _{2})(\sigma _{1}+\sigma _{3})(\sigma _{1}+\sigma _{4})={\text{A}}_{4}{\text{A}}_{3}=0$ .

Hence every product of ${\text{A}}_{2}$ , ${\text{A}}_{3}$ , ${\text{A}}_{4}$ , which contains the product ${\text{A}}_{4}{\text{A}}_{3}$ disappears before reduction; this means that every seminvariant, whose partition contains the parts 4, 3, is a perpetuant. The general form of perpetuant is $(4^{\kappa }3^{\lambda }2^{\mu })$ and the generating function

${\frac {z^{7}}{1-z^{2}.~1-z^{3}.~1-z^{4}}}$ .

In general when $\theta$ is even and $=2\phi$ , the condition is

$\sigma _{1}\sigma _{2}\ldots \sigma _{2\phi }\Pi (\sigma _{1}+\sigma _{2})\Pi (\sigma _{1}+\sigma _{2}+\sigma _{3})\ldots \Pi (\sigma _{1}+\sigma _{2}+\ldots +\sigma _{\phi })=0$ ;

and we can determine the lowest weight of a perpetuant; the degree in the quantities $\sigma$ is

$2\phi +{\tbinom {2\phi }{2}}+{\tbinom {2\phi }{3}}+\ldots +{\frac {1}{2}}{\tbinom {2\phi }{\phi }}=2^{2\phi -1}-1=2^{\theta -1}-1$ .

Again, if $\theta$ is uneven $=2\phi +1$ , the condition is

$\sigma _{1}\sigma _{2}\ldots \sigma _{2\phi +1}\Pi (\sigma _{1}+\sigma _{2})\Pi (\sigma _{1}+\sigma _{2}+\sigma _{3})\ldots \Pi (\sigma _{1}+\sigma _{2}+\ldots +\sigma _{\phi })=0$ ;

and the degree, in the quantities $\sigma$ , is

$2\phi +1+{\tbinom {2\phi +1}{2}}+{\tbinom {2\phi +1}{3}}+\ldots +{\tbinom {2\phi +1}{\phi }}$
$=2^{2\phi }-1=2^{\theta -1}-1$ .

Hence the lowest weight of a perpetuant is $2^{\theta -1}-1$ , when $\theta$ is >2. The generating function is thus

${\frac {z^{2^{\theta -1}-1}}{(1-z^{2})(1-z^{3})(1-z^{4})\ldots (1-z^{\theta })}}$ .

The actual form of a perpetuant of degree $\theta$ has been shown by MacMahon to be

$(\theta ^{\kappa _{\theta }+1},~{\overline {\theta -1}}^{\kappa _{\theta -1}+1},~{\overline {\theta -2}}^{\kappa _{\theta -2}+2},~{\overline {\theta -3}}^{\kappa _{\theta -3}+4},\ldots ,~3^{\kappa _{3}+2^{\theta -4}},~2^{\kappa _{2}})$ ,

$\kappa _{\theta },\kappa _{\theta -1},\ldots \kappa _{2}$ being given any zero or positive integer values.
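The two degree formulae, and the weight of the simplest perpetuant in the general form just written, are readily verified by arithmetic. The sketch below is illustrative and its function names are assumptions: `condition_degree` sums the binomial coefficients counted in the condition, and `lowest_partition` writes down the partition obtained by putting every $\kappa$ equal to zero, for $\theta \geq 4$ .

```python
from math import comb

def condition_degree(theta):
    """Degree in the sigmas of the reducibility condition for a single form."""
    phi, odd = divmod(theta, 2)
    if odd:
        return sum(comb(theta, k) for k in range(1, phi + 1))
    # Even theta: the sums of phi sigmas pair off with their complements,
    # so only half of them are counted.
    return sum(comb(theta, k) for k in range(1, phi)) + comb(theta, phi) // 2

def lowest_partition(theta):
    """Parts theta, theta-1, (theta-2)^2, (theta-3)^4, ..., 3^(2^(theta-4)), for theta >= 4."""
    parts = [theta]
    for j in range(1, theta - 2):             # the parts theta-1 down to 3
        parts += [theta - j] * 2 ** (j - 1)
    return parts

print(condition_degree(4), condition_degree(5), condition_degree(6))  # 7 15 31
print(lowest_partition(5), sum(lowest_partition(5)))                  # [5, 4, 3, 3] 15
```

In each case the weight of the lowest partition agrees with $2^{\theta -1}-1$ .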

Simultaneous Seminvariants of two Binary Forms.—Taking the two forms to be

$a_{0}x_{1}^{p}+pa_{1}x_{1}^{p-1}x_{2}+p(p-1)a_{2}x_{1}^{p-2}x_{2}^{2}+\ldots +p{\text{!}}a_{p}x_{2}^{p}$ ,
$b_{0}x_{1}^{q}+qb_{1}x_{1}^{q-1}x_{2}+q(q-1)b_{2}x_{1}^{q-2}x_{2}^{2}+\ldots +q{\text{!}}b_{q}x_{2}^{q}$ ,

every leading coefficient of a simultaneous covariant vanishes by the operation of

$\Omega _{a}+\Omega _{b}=a_{0}{\frac {d}{da_{1}}}+a_{1}{\frac {d}{da_{2}}}+\ldots +a_{p-1}{\frac {d}{da_{p}}}+b_{0}{\frac {d}{db_{1}}}+b_{1}{\frac {d}{db_{2}}}+\ldots +b_{q-1}{\frac {d}{db_{q}}}$ .

Observe that we may employ the principle of suffix diminution to obtain from any seminvariant one appertaining to a $(p-1)^{ic}$ and a $(q-1)^{ic}$ , and that suffix augmentation produces a portion of a higher seminvariant, the degree in each case remaining unaltered. Remark, too, that we are in association with non-unitary symmetric functions of two systems of quantities which will be denoted by partitions in brackets $(~~)_{a}$ , $(~~)_{b}$ respectively. Solving the equation

$(\Omega _{a}+\Omega _{b})u=0$ ,

by the ordinary theory of linear partial differential equations, we obtain $p+q+1$ independent solutions, of which $p$ appertain to $\Omega _{a}u=0$ and $q$ to $\Omega _{b}u=0$ ; the remaining one is ${\text{J}}_{ab}=a_{0}b_{1}-a_{1}b_{0}$ , the leading coefficient of the Jacobian of the two forms. This constitutes an algebraically complete system, and, in terms of its members, all seminvariants can be rationally expressed. A similar theorem holds in the case of any number of binary forms, the mixed seminvariants being derived from the Jacobians of the several pairs of forms. If the seminvariant be of degree $\theta ,~\theta '$ in the coefficients, the forms of orders $p,~q$ respectively, and the weight $w$ , the degree of the covariant in the variables will be $p\theta +q\theta '-2w=\epsilon$ , an easy generalization of the theorem connected with a single form. The general term of a seminvariant of degree $\theta ,\theta '$ and weight $w$ will be

$a_{0}^{\rho _{0}}a_{1}^{\rho _{1}}a_{2}^{\rho _{2}}\dots a_{p}^{\rho _{p}}b_{0}^{\sigma _{0}}b_{1}^{\sigma _{1}}b_{2}^{\sigma _{2}}\dots b_{q}^{\sigma _{q}}$

where $\sum _{s=0}^{p}\rho _{s}=\theta$ , $\sum _{s=0}^{q}\sigma _{s}=\theta '$ and $\sum _{s=1}^{p}s\rho _{s}+\sum _{s=1}^{q}s\sigma _{s}=w$ .

The number of such terms is the number of partitions of $w$ into $\theta +\theta '$ parts, the part magnitudes, in the two portions, being limited not to exceed $p$ and $q$ respectively. Denote this number by $(w;\theta ,p;\theta ',q)$ . The number of linearly independent seminvariants of the given type will then be denoted by

$(w;\theta ,p;\theta ',q)-(w-1;\theta ,p;\theta ',q)$ ;

and will be given by the coefficient of $a^{\theta }b^{\theta '}z^{w}$ in

${\frac {1-z}{1-a.~1-az.~1-az^{2}.~\dots ~1-az^{p}.~1-b.~1-bz.~1-bz^{2}.~\dots ~1-bz^{q}}}$ ;

that is, by the coefficient of $z^{w}$ in

${\frac {1-z^{p+1}.~1-z^{p+2}.~\dots ~1-z^{p+\theta }.~1-z^{q+1}.~1-z^{q+2}.~\dots ~1-z^{q+\theta '}}{1-z.~1-z^{2}.~1-z^{3}.~\dots ~1-z^{\theta }.~1-z^{2}.~1-z^{3}.~\dots ~1-z^{\theta '}}}$ ;

which preserves its expression when $\theta$ and $p$ and $\theta '$ and $q$ are separately or simultaneously interchanged.

Taking the first generating function, and writing $az^{p}$ , $bz^{q}$ and ${\frac {1}{z^{2}}}$ for $a$ , $b$ and $z$ respectively, we obtain the coefficient of $a^{\theta }b^{\theta '}z^{p\theta +q\theta '-2w}$ , that is of $a^{\theta }b^{\theta '}z^{\epsilon }$ , in

${\frac {1-z^{-2}}{1-az^{p}.~1-az^{p-2}~\dots 1-az^{-p+2}.~1-az^{-p}.~1-bz^{q}.~1-bz^{q-2}.~\dots 1-bz^{-q+2}.~1-bz^{-q}}}$ ;

the unreduced generating function which enumerates the covariants of degrees $\theta ,\theta '$ in the coefficients and order $\epsilon$ in the variables. Thus, for two linear forms, $p=q=1$ , we find

${\frac {1-z^{-2}}{1-az.~1-az^{-1}.~1-bz.~1-bz^{-1}}}$ ,

the positive part of which is

${\frac {1}{1-az.~1-bz.~1-ab}}$ ;

establishing the ground forms of degrees-order (1, 0; 1), (0, 1; 1), (1, 1; 0), viz:—the linear forms themselves and their Jacobian ${\text{J}}_{ab}$ . Similarly, for a linear and a quadratic, $p=1$ , $q=2$ , and the reduced form is found to be

${\frac {1-a^{2}b^{2}z^{2}}{1-az.~1-bz^{2}.~1-abz.~1-b^{2}.~1-a^{2}b}}$ ,

where the denominator factors indicate the forms themselves, their Jacobian, the invariant of the quadratic and their resultant; connected, as shown by the numerator, by a syzygy of degrees-order (2, 2; 2).

The complete theory of the perpetuants appertaining to two or more forms of infinite order has not yet been established. For two forms the seminvariants of degrees 1, 1 are enumerated by ${\frac {1}{1-z}}$ , and the only one which is reducible is $a_{0}b_{0}$ of weight zero; hence the perpetuants of degrees 1, 1 are enumerated by

${\frac {1}{1-z}}-1={\frac {z}{1-z}}$ ;

and the series is evidently

$a_{0}b_{1}-a_{1}b_{0}$ ,
$a_{0}b_{2}-a_{1}b_{1}+a_{2}b_{0}$ ,
$a_{0}b_{3}-a_{1}b_{2}+a_{2}b_{1}-a_{3}b_{0}$ ,

one for each of the weights 1, 2, 3,...ad infin.
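That each member of this series is annihilated by $\Omega _{a}+\Omega _{b}$ may be confirmed by direct operation. In the sketch below, which is an illustration under assumed conventions and not a method taken from the text, a term is stored as a pair of sorted suffix tuples, one for the letters $a$ and one for the letters $b$ ; the names `omega_ab` and `perpetuant_11` are hypothetical.

```python
from collections import Counter

def lower(suffixes):
    """Yield every result of lowering one suffix k >= 1 to k - 1, once per occurrence."""
    for pos, k in enumerate(suffixes):
        if k >= 1:
            yield tuple(sorted(suffixes[:pos] + (k - 1,) + suffixes[pos + 1:]))

def omega_ab(poly):
    """Apply Omega_a + Omega_b to {(a-suffixes, b-suffixes): coefficient}."""
    out = Counter()
    for (a, b), coeff in poly.items():
        for a_low in lower(a):
            out[(a_low, b)] += coeff
        for b_low in lower(b):
            out[(a, b_low)] += coeff
    return {m: c for m, c in out.items() if c != 0}

def perpetuant_11(w):
    """a_0 b_w - a_1 b_(w-1) + a_2 b_(w-2) - ... with alternating signs."""
    return {((k,), (w - k,)): (-1) ** k for k in range(w + 1)}

print(perpetuant_11(1))            # {((0,), (1,)): 1, ((1,), (0,)): -1}
print(omega_ab(perpetuant_11(3)))  # {}
```

The terms destroy one another in pairs: the term in $a_{j}b_{w-1-j}$ arises once from lowering the $a$ suffix and once from lowering the $b$ suffix, with opposite signs.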

For the degrees 1, 2, the asyzygetic forms are enumerated by ${\frac {1}{1-z.~1-z^{2}}}$ , and the actual forms for the weights 0, 1, 2, 3 are

$a_{0}b_{0}^{2}$ ,
$(a_{0}b_{1}-a_{1}b_{0})b_{0}$ ,
$(a_{0}b_{2}-a_{1}b_{1}+a_{2}b_{0})b_{0}$ ,
$a_{0}(b_{1}^{2}-2b_{0}b_{2})$ ,
$(a_{0}b_{3}-a_{1}b_{2}+a_{2}b_{1}-a_{3}b_{0})b_{0}$ ,
$a_{0}(b_{1}b_{2}-3b_{0}b_{3})-a_{1}(b_{1}^{2}-2b_{0}b_{2})$ ;

amongst these forms are included all the asyzygetic forms of degrees 1, 1, multiplied by $b_{0}$ , and also all the perpetuants of the second binary form multiplied by $a_{0}$ ; hence we have to subtract from the generating function ${\frac {1}{1-z}}$ and ${\frac {z^{2}}{1-z^{2}}}$ , and obtain the generating function of perpetuants of degrees 1, 2.

${\frac {1}{1-z.~1-z^{2}}}-{\frac {1}{1-z}}-{\frac {z^{2}}{1-z^{2}}}={\frac {z^{3}}{1-z.~1-z^{2}}}$ .

The first perpetuant is the last seminvariant written, viz.:—

$a_{0}(b_{1}b_{2}-3b_{0}b_{3})-a_{1}(b_{1}^{2}-2b_{0}b_{2})$ ,

or, in partition notation,

$a_{0}(21)_{b}-(1)_{a}(2)_{b}$ ;

and, in this form, it is at once seen to satisfy the partial differential equation. It is important to notice that the expression

$(\theta )_{a}(\theta '1^{s})_{b}-(\theta 1)_{a}(\theta '1^{s-1})_{b}+(\theta 1^{2})_{a}(\theta '1^{s-2})_{b}-\ldots \pm (\theta 1^{s})_{a}(\theta ')_{b}$

denotes a seminvariant, if $\theta ,\theta '$ be neither of them unity, for, after operation, the terms destroy one another in pairs: when $\theta =0$ , $(\theta )_{a}$ must be taken to denote $a_{0}$ and so for $\theta '$ . In general it is a seminvariant of degrees $\theta ,\theta '$ , and weight $\theta +\theta '+s$ ; for this there is an exception, viz., when $\theta =0$ , or when $\theta '=0$ , the corresponding partial degrees are 1 and 1. When $\theta =\theta '=0$ , we have the general perpetuant of degrees 1, 1. There is a still more general form of the seminvariant; we may have instead of $\theta ,\theta '$ any collections of non-unitary integers not exceeding $\theta ,\theta '$ in magnitude respectively. Ex. gr.

$(2^{\lambda _{2}}3^{\lambda _{3}}\ldots \theta ^{\lambda _{\theta }})_{a}(1^{s}2^{\mu _{2}}3^{\mu _{3}}\ldots \theta '^{\mu _{\theta '}})_{b}$
$~-(12^{\lambda _{2}}3^{\lambda _{3}}\ldots \theta ^{\lambda _{\theta }})_{a}(1^{s-1}2^{\mu _{2}}3^{\mu _{3}}\ldots \theta '^{\mu _{\theta '}})_{b}$
$~+(1^{2}2^{\lambda _{2}}3^{\lambda _{3}}\ldots \theta ^{\lambda _{\theta }})_{a}(1^{s-2}2^{\mu _{2}}3^{\mu _{3}}\ldots \theta '^{\mu _{\theta '}})_{b}$
$~~-\ldots$
$~+(-)^{s}(1^{s}2^{\lambda _{2}}3^{\lambda _{3}}\ldots \theta ^{\lambda _{\theta }})_{a}(2^{\mu _{2}}3^{\mu _{3}}\ldots \theta '^{\mu _{\theta '}})_{b}$ ,

is a seminvariant; and since these terms are clearly enumerated by

${\frac {1}{1-z.~1-z^{2}.~\dots ~1-z^{\theta }.~1-z^{2}.~1-z^{3}.~\dots ~1-z^{\theta '}}}$ ,

an expression which also enumerates the asyzygetic seminvariants, we may regard the form, written, as denoting the general form of asyzygetic seminvariant; a very important conclusion. For the case in hand, from the simplest perpetuant of degrees 1, 2, we derive the perpetuants of weight $w$ ,

$a_{0}(21^{w-2})_{b}-a_{1}(21^{w-3})_{b}+a_{2}(21^{w-4})_{b}-\ldots \pm a_{w-2}(2)_{b}$ ,
$a_{0}(2^{2}1^{w-4})_{b}-a_{1}(2^{2}1^{w-5})_{b}+a_{2}(2^{2}1^{w-6})_{b}-\ldots \pm a_{w-4}(2^{2})_{b}$ ,
$a_{0}(2^{3}1^{w-6})_{b}-a_{1}(2^{3}1^{w-7})_{b}+a_{2}(2^{3}1^{w-8})_{b}-\ldots \pm a_{w-6}(2^{3})_{b}$ ,

a series of ${\frac {1}{2}}(w-2)$ or of ${\frac {1}{2}}(w-1)$ forms according as $w$ is even or uneven. Their number for any weight $w$ is the number of ways of composing $w-3$ with the parts 1, 2, and thus the generating function is verified. We cannot, by this method, easily discuss the perpetuants of degrees 2, 2, because a syzygy presents itself as early as weight 2. It is better now to proceed by the method of Stroh.
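The count just stated, and the subtraction by which the generating function for degrees 1, 2 was reached, can both be checked on truncated power series. The helper `series` below is a hypothetical name introduced for this sketch; it expands $z^{k}$ divided by a product of factors $1-z^{f}$ .

```python
def series(max_w, shift, factors):
    """Coefficients of z^0..z^max_w in z^shift / prod (1 - z^f) over the given factors."""
    coeffs = [0] * (max_w + 1)
    if shift <= max_w:
        coeffs[shift] = 1
    for f in factors:
        for t in range(f, max_w + 1):   # division by (1 - z^f)
            coeffs[t] += coeffs[t - f]
    return coeffs

N = 20
asyzygetic = series(N, 0, (1, 2))       # 1 / ((1 - z)(1 - z^2))
degrees_11 = series(N, 0, (1,))         # 1 / (1 - z)
second_form = series(N, 2, (2,))        # z^2 / (1 - z^2)
perpetuants = series(N, 3, (1, 2))      # z^3 / ((1 - z)(1 - z^2))

difference = [x - y - t for x, y, t in zip(asyzygetic, degrees_11, second_form)]
print(difference == perpetuants)        # True
print(perpetuants[3:9])                 # [1, 1, 2, 2, 3, 3]
```

For $w\geq 3$ the coefficient is ${\tfrac {1}{2}}(w-2)$ or ${\tfrac {1}{2}}(w-1)$ according as $w$ is even or uneven, which in integer arithmetic is `(w - 1) // 2` in both cases.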

We have the symbolic expression of a seminvariant

${\frac {1}{w{\text{!}}}}(\sigma _{1}\alpha _{1}+\sigma _{2}\alpha _{2}+\ldots +\sigma _{\theta }\alpha _{\theta }+\tau _{1}\beta _{1}+\tau _{2}\beta _{2}+\ldots +\tau _{\theta '}\beta _{\theta '})^{w}$

where

${\frac {\alpha _{1}^{s}}{s{\text{!}}}}={\frac {\alpha _{2}^{s}}{s{\text{!}}}}=\ldots =a_{s}$ ; ${\frac {\beta _{1}^{s}}{s{\text{!}}}}={\frac {\beta _{2}^{s}}{s{\text{!}}}}=\ldots =b_{s}$ ;

and $\sigma _{1}+\sigma _{2}+\ldots +\sigma _{\theta }+\tau _{1}+\tau _{2}+\ldots +\tau _{\theta '}=0$ .

Proceeding as we did in the case of the single binary form we find that for a given total degree $\theta +\theta '$ , the condition which expresses reducibility is of total degree $2^{\theta +\theta '-1}-1$ in the coefficients $\sigma$ and $\tau$ ; combining this with the knowledge of the generating function of asyzygetic forms of degrees $\theta$ , $\theta '$ , we find that the perpetuants of these degrees are enumerated by

${\frac {z^{2^{\theta +\theta '-1}-1}}{1-z.~1-z^{2}.~1-z^{3}.~\dots ~1-z^{\theta }.~1-z^{2}.~1-z^{3}.~\dots ~1-z^{\theta '}}}$ ,

and this is true for $\theta +\theta '=2$ as well as for other values of $\theta +\theta '$ (compare the case of the single binary form).

Observe that, if there be more than two binary forms, the weight of the simplest perpetuant of degrees $\theta ,\theta ',\theta '',\dots$ is $2^{\theta +\theta '+\theta ''+\ldots -1}-1$ , as can be seen by reasoning of a similar kind.

To obtain information concerning the actual forms of the perpetuants, write

$(1+\sigma _{1}x)(1+\sigma _{2}x)\ldots (1+\sigma _{\theta }x)=1+{\text{A}}_{1}x+{\text{A}}_{2}x^{2}+\ldots +{\text{A}}_{\theta }x^{\theta }$
$(1+\tau _{1}x)(1+\tau _{2}x)\ldots (1+\tau _{\theta '}x)=1+{\text{B}}_{1}x+{\text{B}}_{2}x^{2}+\ldots +{\text{B}}_{\theta '}x^{\theta '}$

where ${\text{A}}_{1}+{\text{B}}_{1}=0$ .

For the case $\theta =1$ , $\theta '=1$ , the condition is

$\sigma _{1}\tau _{1}={\text{A}}_{1}{\text{B}}_{1}=0$ ,

which since ${\text{A}}_{1}+{\text{B}}_{1}=0$ , is really a condition of weight unity. For $w=1$ the form is ${\text{A}}_{1}a_{1}+{\text{B}}_{1}b_{1}$ , which we may write $a_{0}b_{1}-a_{1}b_{0}=a_{0}(1)_{b}-(1)_{a}b_{0}$ ; the remaining perpetuants, enumerated by ${\frac {z}{1-z}}$ , have been set forth above.

For the case $\theta =1$ , $\theta '=2$ , the condition is $\sigma _{1}\tau _{1}\tau _{2}={\text{A}}_{1}{\text{B}}_{2}=0$ ; and the simplest perpetuant, derived directly from the product ${\text{A}}_{1}{\text{B}}_{2}$ , is $(1)_{a}(2)_{b}-(21)_{b}$ ; the remainder of those enumerated by ${\frac {z^{3}}{1-z.~1-z^{2}}}$ may be represented by the form

$(1^{\lambda _{1}+1})_{a}(2^{\mu _{2}+1})_{b}-(1^{\lambda _{1}})_{a}(2^{\mu _{2}+1}1)_{b}+\ldots \pm (2^{\mu _{2}+1}1^{\lambda _{1}+1})_{b}$ ;

$\lambda _{1}$ and $\mu _{2}$ each assuming all integer (including zero) values. For the case $\theta =\theta '=2$ , the condition is

$\sigma _{1}\sigma _{2}\tau _{1}\tau _{2}(\sigma _{1}+\sigma _{2})(\sigma _{1}+\tau _{1})(\sigma _{1}+\tau _{2})={\text{A}}_{2}^{2}{\text{B}}_{1}{\text{B}}_{2}+{\text{A}}_{1}{\text{A}}_{2}{\text{B}}_{2}^{2}=0$ .

To represent the simplest perpetuant, of weight 7, we may take as base either ${\text{A}}_{2}^{2}{\text{B}}_{1}{\text{B}}_{2}$ or ${\text{A}}_{1}{\text{A}}_{2}{\text{B}}_{2}^{2}$ , and since ${\text{A}}_{1}+{\text{B}}_{1}=0$ the former is equivalent to ${\text{A}}_{1}{\text{A}}_{2}^{2}{\text{B}}_{2}$ and the latter to ${\text{A}}_{2}{\text{B}}_{1}{\text{B}}_{2}^{2}$ ; so that we have, apparently, a choice of four products. ${\text{A}}_{2}^{2}{\text{B}}_{1}{\text{B}}_{2}$ gives $(2^{2})_{a}(21)_{b}-(2^{2}1)_{a}(2)_{b}$ and ${\text{A}}_{1}{\text{A}}_{2}^{2}{\text{B}}_{2}$ , $(2^{2}1)_{a}(2)_{b}-(2^{2})_{a}(21)_{b}$ ; these two merely differ in sign; and similarly ${\text{A}}_{2}{\text{B}}_{1}{\text{B}}_{2}^{2}$ yields $(2)_{a}(2^{2}1)_{b}-(21)_{a}(2^{2})_{b}$ , and that due to ${\text{A}}_{1}{\text{A}}_{2}{\text{B}}_{2}^{2}$ merely differs from it in sign. We will choose from the forms in such manner that the product of letters ${\text{A}}$ is either a power of ${\text{A}}_{1}$ , or does not contain ${\text{A}}_{1}$ ; this rule leaves us with ${\text{A}}_{2}^{2}{\text{B}}_{1}{\text{B}}_{2}$ and ${\text{A}}_{2}{\text{B}}_{1}{\text{B}}_{2}^{2}$ ; of these forms we will choose that one which in letters ${\text{B}}$ is earliest in ascending dictionary order; this is ${\text{A}}_{2}^{2}{\text{B}}_{1}{\text{B}}_{2},$ and our earliest perpetuant is

$(2^{2})_{a}(21)_{b}-(2^{2}1)_{a}(2)_{b}$ ,

and thence the general form enumerated by the generating function ${\frac {z^{7}}{(1-z)(1-z^{2})^{2}}}$ is

$(2^{\lambda _{2}+2})_{a}(2^{\mu _{2}+1}1^{\mu _{1}+1})_{b}-(2^{\lambda _{2}+2}1)_{a}(2^{\mu _{2}+1}1^{\mu _{1}})_{b}+\ldots \pm (2^{\lambda _{2}+2}1^{\mu _{1}+1})_{a}(2^{\mu _{2}+1})_{b}$ .

For the case $\theta =1$ , $\theta '=3$ , the condition is

$\sigma _{1}\tau _{1}\tau _{2}\tau _{3}(\sigma _{1}+\tau _{1})(\sigma _{1}+\tau _{2})(\sigma _{1}+\tau _{3})={\text{A}}_{1}{\text{B}}_{3}^{2}+{\text{A}}_{1}^{2}{\text{B}}_{2}{\text{B}}_{3}=0$ .
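The expression of this condition in terms of ${\text{A}}_{1}$ , ${\text{B}}_{2}$ , ${\text{B}}_{3}$ may be tested numerically. The sketch below is an illustration only, with hypothetical function names: it chooses integer values for the $\tau$ , fixes $\sigma _{1}$ by the relation $\sigma _{1}+\tau _{1}+\tau _{2}+\tau _{3}=0$ , and compares the two sides exactly.

```python
from itertools import combinations
from math import prod
import random

def elementary(values, k):
    """Elementary symmetric function of order k of the given values."""
    return sum(prod(c) for c in combinations(values, k))

def condition_holds(taus):
    """Compare the product form with A_1 B_3^2 + A_1^2 B_2 B_3 for theta = 1, theta' = 3."""
    sigma = -sum(taus)                    # sigma_1 + tau_1 + tau_2 + tau_3 = 0
    a1 = sigma                            # A_1 = sigma_1
    b2, b3 = elementary(taus, 2), elementary(taus, 3)
    lhs = sigma * prod(taus) * prod(sigma + t for t in taus)
    rhs = a1 * b3 ** 2 + a1 ** 2 * b2 * b3
    return lhs == rhs

random.seed(0)
samples = [[random.randint(-9, 9) for _ in range(3)] for _ in range(200)]
print(all(condition_holds(t) for t in samples))  # True
```

The agreement follows from $(\sigma _{1}+\tau _{1})(\sigma _{1}+\tau _{2})(\sigma _{1}+\tau _{3})={\text{A}}_{1}{\text{B}}_{2}+{\text{B}}_{3}$ , the terms in $\sigma _{1}^{3}$ destroying one another because ${\text{B}}_{1}=-{\text{A}}_{1}$ .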

By the rules adopted we take ${\text{A}}_{1}^{2}{\text{B}}_{2}{\text{B}}_{3}$ , which gives

$(1^{2})_{a}(32)_{b}-(1)_{a}(321)_{b}+a_{0}(321^{2})_{b}$ ,

the simplest perpetuant of weight 7; and thence the general form enumerated by the generating function

${\frac {z^{7}}{1-z.~1-z^{2}.~1-z^{3}}}$ ,

viz.: $(1^{\lambda _{1}+2})_{a}(3^{\mu _{3}+1}2^{\mu _{2}+1})_{b}-\ldots \pm a_{0}(3^{\mu _{3}+1}2^{\mu _{2}+1}1^{\lambda _{1}+2})_{b}$ .

For the case $\theta =2$ , $\theta '=3$ , the condition is

$\sigma _{1}\sigma _{2}\tau _{1}\tau _{2}\tau _{3}(\sigma _{1}+\sigma _{2})(\sigma _{1}+\tau _{1})(\sigma _{1}+\tau _{2})(\sigma _{1}+\tau _{3})(\sigma _{2}+\tau _{1})(\sigma _{2}+\tau _{2})(\sigma _{2}+\tau _{3})\times (\tau _{1}+\tau _{2})(\tau _{1}+\tau _{3})(\tau _{2}+\tau _{3})=0$ .

The calculation results in

$-{\text{A}}_{2}^{4}{\text{B}}_{3}{\text{B}}_{2}{\text{B}}_{1}^{2}+2{\text{A}}_{2}^{3}{\text{B}}_{3}{\text{B}}_{2}^{2}{\text{B}}_{1}^{2}-{\text{A}}_{2}^{2}{\text{B}}_{3}{\text{B}}_{2}^{3}{\text{B}}_{1}^{2}+{\text{A}}_{2}^{4}{\text{B}}_{3}^{2}{\text{B}}_{1}-2{\text{A}}_{2}^{3}{\text{B}}_{3}^{2}{\text{B}}_{2}{\text{B}}_{1}-{\text{A}}_{2}^{2}{\text{B}}_{3}^{2}{\text{B}}_{2}{\text{B}}_{1}^{3}+{\text{A}}_{2}^{2}{\text{B}}_{3}^{2}{\text{B}}_{2}^{2}{\text{B}}_{1}+{\text{A}}_{2}{\text{B}}_{3}^{2}{\text{B}}_{2}^{2}{\text{B}}_{1}^{3}+{\text{A}}_{2}^{2}{\text{B}}_{3}^{3}{\text{B}}_{1}^{2}-2{\text{A}}_{2}{\text{B}}_{3}^{3}{\text{B}}_{2}{\text{B}}_{1}^{2}+{\text{A}}_{2}{\text{B}}_{3}^{4}{\text{B}}_{1}=0$ .
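So long an expansion invites a numerical check. The sketch below is illustrative, and the table `TERMS` and the function names are assumptions of the example: it records the eleven terms as printed, chooses integer values of the $\sigma$ and $\tau$ whose sum is zero, and compares the expansion with the product of fifteen factors from which it was derived.

```python
from itertools import combinations
from math import prod
import random

# (coefficient, exponent of A_2, of B_3, of B_2, of B_1) for the eleven printed terms
TERMS = [
    (-1, 4, 1, 1, 2), (2, 3, 1, 2, 2), (-1, 2, 1, 3, 2), (1, 4, 2, 0, 1),
    (-2, 3, 2, 1, 1), (-1, 2, 2, 1, 3), (1, 2, 2, 2, 1), (1, 1, 2, 2, 3),
    (1, 2, 3, 0, 2), (-2, 1, 3, 1, 2), (1, 1, 4, 0, 1),
]

def elementary(values, k):
    """Elementary symmetric function of order k of the given values."""
    return sum(prod(c) for c in combinations(values, k))

def expansion_matches(sigmas, taus):
    """Compare the fifteen-factor product with the printed expansion (theta = 2, theta' = 3)."""
    a2 = elementary(sigmas, 2)
    b1, b2, b3 = (elementary(taus, k) for k in (1, 2, 3))
    lhs = (prod(sigmas) * prod(taus) * sum(sigmas)
           * prod(s + t for s in sigmas for t in taus)
           * prod(x + y for x, y in combinations(taus, 2)))
    rhs = sum(c * a2 ** i * b3 ** j * b2 ** k * b1 ** l for c, i, j, k, l in TERMS)
    return lhs == rhs

def random_sample(rng):
    """Two sigmas and three taus with total sum zero."""
    s1, s2, t1, t2 = (rng.randint(-6, 6) for _ in range(4))
    return [s1, s2], [t1, t2, -(s1 + s2 + t1 + t2)]

rng = random.Random(0)
print(all(expansion_matches(*random_sample(rng)) for _ in range(200)))  # True
```

Every term of the expansion is of weight 15, in agreement with $2^{\theta +\theta '-1}-1$ for $\theta +\theta '=5$ .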

By the rules adopted we take ${\text{A}}_{2}^{4}{\text{B}}_{3}{\text{B}}_{2}{\text{B}}_{1}^{2}$ , giving the simplest perpetuant of weight 15, viz:—

(2^{4})_{a}(321^{2})_{b}-(2^{4}1)_{a}(321)_{b}+(2^{4}1^{2})_{a}(32)_{b};

and thence the general form

(2^{\lambda _{2}+4})_{a}(3^{\mu _{3}+1}2^{\mu _{2}+1}1^{\mu _{1}+2})_{b}-...\pm (2^{\lambda _{2}+4}1^{\mu _{1}+2})_{a}(3^{\mu _{3}+1}2^{\mu _{2}+1})_{b},

due to the generating function

{\frac {z^{15}}{(1-z)(1-z^{2})^{2}(1-z^{3})}}.

For the case $\theta =1$ , $\theta '=4$ , the condition is

\sigma _{1}\tau _{1}\tau _{2}\tau _{3}\tau _{4}(\sigma _{1}+\tau _{1})(\sigma _{1}+\tau _{2})(\sigma _{1}+\tau _{3})(\sigma _{1}+\tau _{4})\prod (\tau _{s}+\tau _{t})=0;

the calculation gives

{\text{A}}_{1}{\text{B}}_{4}({\text{A}}_{1}^{2}{\text{B}}_{2}+{\text{A}}_{1}{\text{B}}_{3}+{\text{B}}_{4})(-{\text{B}}_{3}^{2}-{\text{A}}_{1}{\text{B}}_{2}{\text{B}}_{3}-{\text{A}}_{1}^{2}{\text{B}}_{4})=0.
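The agreement of the factored condition with this result can be checked numerically. The Python sketch below is illustrative only. It assumes that ${\text{A}}_{k}$ and ${\text{B}}_{k}$ denote the elementary symmetric functions of the $\sigma$ and of the $\tau$ respectively, that $\sigma _{1}+\tau _{1}+\tau _{2}+\tau _{3}+\tau _{4}=0$ (so that ${\text{B}}_{1}=-{\text{A}}_{1}$), and that the product runs over the six pairs $\tau _{s}+\tau _{t}$ with $s<t$; the function names are hypothetical.

```python
from itertools import combinations
from math import prod


def elementary(values, k):
    """k-th elementary symmetric function of the given numbers."""
    return sum(prod(c) for c in combinations(values, k))


def check_condition(taus):
    """Return both sides of the theta = 1, theta' = 4 condition for four integer taus."""
    sigma = -sum(taus)  # assumption: sigma_1 + tau_1 + ... + tau_4 = 0
    A1 = sigma
    B2, B3, B4 = (elementary(taus, k) for k in (2, 3, 4))
    lhs = (sigma * prod(taus)
           * prod(sigma + t for t in taus)
           * prod(s + t for s, t in combinations(taus, 2)))
    rhs = (A1 * B4 * (A1**2 * B2 + A1 * B3 + B4)
           * (-B3**2 - A1 * B2 * B3 - A1**2 * B4))
    return lhs, rhs


lhs, rhs = check_condition((1, 2, 3, 5))
print(lhs == rhs)
```

Exact integer arithmetic is used, so the comparison is not affected by rounding; any other choice of four integers may be substituted for the sample values.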

Selecting the product ${\text{A}}_{1}^{4}{\text{B}}_{4}{\text{B}}_{3}{\text{B}}_{2}^{2}$ , we find the simplest perpetuant

(1^{4})_{a}(432^{2})_{b}-(1^{3})_{a}(432^{2}1)_{b}+(1^{2})_{a}(432^{2}1^{2})_{b}-(1)_{a}(432^{2}1^{3})_{b}+a_{0}(432^{2}1^{4})_{b},

and thence the general form

(1^{\lambda _{1}+4})_{a}(4^{\mu _{4}+1}3^{\mu _{3}+1}2^{\mu _{2}+2})_{b}-...\pm a_{0}(4^{\mu _{4}+1}3^{\mu _{3}+1}2^{\mu _{2}+2}1^{\lambda _{1}+4})_{b},

due to the generating function

{\frac {z^{15}}{(1-z)(1-z^{2})(1-z^{3})(1-z^{4})}}.
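To see what this generating function enumerates, one may expand it and compare with a direct count of the exponents in the general form. The Python sketch below is a hedged illustration: it reads the coefficient of $z^{w}$ as the number of sets $(\lambda _{1},\mu _{2},\mu _{3},\mu _{4})$ of non-negative integers giving weight $w=15+\lambda _{1}+2\mu _{2}+3\mu _{3}+4\mu _{4}$. The function names and the cut-off weight 30 are assumptions made for the example.

```python
def series_coefficients(parts, shift, max_weight):
    """Coefficients of z^shift / prod(1 - z^p for p in parts), up to z^max_weight."""
    coeffs = [0] * (max_weight + 1)
    if shift <= max_weight:
        coeffs[shift] = 1
    for p in parts:
        # Dividing by (1 - z^p) is a running sum with stride p.
        for w in range(p, max_weight + 1):
            coeffs[w] += coeffs[w - p]
    return coeffs


def count_forms(weight):
    """Count (lambda_1, mu_2, mu_3, mu_4) >= 0 with 15 + l1 + 2*m2 + 3*m3 + 4*m4 = weight."""
    extra = weight - 15
    if extra < 0:
        return 0
    # lambda_1 is fixed by the remainder once mu_4, mu_3, mu_2 are chosen.
    return sum(1
               for m4 in range(extra // 4 + 1)
               for m3 in range((extra - 4 * m4) // 3 + 1)
               for m2 in range((extra - 4 * m4 - 3 * m3) // 2 + 1))


counts = series_coefficients([1, 2, 3, 4], 15, 30)
print(counts[15:21])
```

The first non-zero coefficient occurs at $z^{15}$, in agreement with the simplest perpetuant of these degrees being of weight 15.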

The series may be continued, but the calculations soon become very laborious.

V. Restricted Substitutions

We may regard the factors of a binary $n^{ic}$ equated to zero as denoting $n$ straight lines through the origin, the co-ordinates being Cartesian and the axes inclined at any angle. Taking the variables to be $x,y$ and effecting the linear transformation

x=\lambda _{1}{\text{X}}+\mu _{1}{\text{Y}}

y=\lambda _{2}{\text{X}}+\mu _{2}{\text{Y}}

so that

{\frac {y}{x}}={\frac {\lambda _{2}+\mu _{2}{\frac {\text{Y}}{\text{X}}}}{\lambda _{1}+\mu _{1}{\frac {\text{Y}}{\text{X}}}}},{\frac {\text{Y}}{\text{X}}}={\frac {\lambda _{1}{\frac {y}{x}}-\lambda _{2}}{\mu _{2}-\mu _{1}{\frac {y}{x}}}};

it is seen that the two lines, on which lie $(x,y)$ , $({\text{X}},{\text{Y}})$ , have a definite projective correspondence. The linear transformation replaces points on lines through the origin by corresponding points on projectively corresponding lines through the origin; it therefore replaces a pencil of lines by another pencil, which corresponds projectively, and harmonic and other properties of pencils which are unaltered by linear transformation we may expect to find indicated in the invariant system. Or, instead of looking upon a linear substitution as replacing a pencil of lines by a projectively corresponding pencil retaining the same axes of co-ordinates, we may look upon the substitution as changing the axes of co-ordinates retaining the same pencil. Then a binary $n^{ic}$ , equated to zero, represents $n$ straight lines through the origin, and the $x,y$ of any line through the origin are given constant multiples of the sines of the angles which that line makes with two fixed lines, the axes of co-ordinates. As new axes of co-ordinates we may take any other pair of lines through the origin, and for the ${\text{X}},{\text{Y}}$ corresponding to $x,y$ any new constant multiples of the sines of the angles which the line makes with the new axes. The substitution for $x,y$ in terms of ${\text{X}},{\text{Y}}$ is the most general linear substitution in virtue of the four degrees of arbitrariness introduced, viz. two by the choice of axes, two by the choice of multiples. If now the $n^{ic}$ denote a given pencil of lines, an invariant is the criterion of the pencil possessing some particular property which is independent alike of the axes and of the multiples, and a covariant expresses that the pencil of lines which it denotes is a fixed pencil whatever be the axes or the multiples.

Besides the invariants and covariants, hitherto studied, there are others which appertain to particular cases of the general linear substitution. Thus what have been called seminvariants are not all of them invariants for the general substitution, but are invariants for the particular substitution

x_{1}=\lambda _{1}\xi _{1}+\mu _{1}\xi _{2},

x_{2}=\mu _{2}\xi _{2}.

Again, in plane geometry, the most general equations of substitution which change from old axes inclined at $\omega$ to new axes inclined at $\omega '=\beta -\alpha$ , the new axes making angles $\alpha ,\beta$ with the old axis of $x$ , without change of origin, are

x={\frac {\sin {(\omega -\alpha )}}{\sin {\omega }}}{\text{X}}+{\frac {\sin {(\omega -\beta )}}{\sin {\omega }}}{\text{Y}},

y={\frac {\sin {\alpha }}{\sin {\omega }}}{\text{X}}+{\frac {\sin {\beta }}{\sin {\omega }}}{\text{Y}},

a transformation of modulus

{\frac {\sin {\omega '}}{\sin {\omega }}}.

The theory of invariants originated in the discussion, by George Boole, of this system so important in geometry. Of the quadratic

ax^{2}+2bxy+cy^{2},

he discovered the two invariants

ac-b^{2},\quad a-2b\cos \omega +c,

and it may be verified that, if the transformed of the quadratic be

{\text{AX}}^{2}+2{\text{BXY}}+{\text{CY}}^{2},

${\text{AC}}-{\text{B}}^{2}=\left({\frac {\sin {\omega '}}{\sin {\omega }}}\right)^{2}(ac-b^{2}),$

{\text{A}}-{\text{2B}}\cos {\omega '}+{\text{C}}=\left({\frac {\sin {\omega '}}{\sin {\omega }}}\right)^{2}(a-2b\cos {\omega }+c).

The fundamental fact that he discovered was the invariance of $x^{2}+2\cos {\omega }~xy+y^{2}$ , viz.—

x^{2}+2\cos {\omega }~xy+y^{2}={\text{X}}^{2}+2\cos {\omega '}~{\text{XY}}+{\text{Y}}^{2},

from which it appears that the Boolian invariants of $ax^{2}+2bxy+cy^{2}$ are nothing more than the full invariants of the simultaneous quadratics

ax^{2}+2bxy+cy^{2},\quad x^{2}+2\cos {\omega }~xy+y^{2},

the word invariant including here covariant. In general the Boolian system, of the general $n^{ic}$ , is coincident with the simultaneous system of the $n^{ic}$ and the quadratic $x^{2}+2\cos {\omega }~xy+y^{2}$ .
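The relations stated for the Boolian invariants may be verified numerically. The Python sketch below is illustrative only: it applies the change of axes given above to a quadratic $ax^{2}+2bxy+cy^{2}$ and compares the two invariants before and after. The sample coefficients and angles are arbitrary assumptions, and the function names are hypothetical.

```python
import math


def change_axes(a, b, c, omega, alpha, beta):
    """Return (A, B, C) for a x^2 + 2b xy + c y^2 after the change of axes."""
    s = math.sin(omega)
    p, q = math.sin(omega - alpha) / s, math.sin(omega - beta) / s  # x = p X + q Y
    r, t = math.sin(alpha) / s, math.sin(beta) / s                  # y = r X + t Y
    A = a * p * p + 2 * b * p * r + c * r * r
    B = a * p * q + b * (p * t + q * r) + c * r * t
    C = a * q * q + 2 * b * q * t + c * t * t
    return A, B, C


def boole_invariants(a, b, c, omega):
    """The two Boolian invariants of the quadratic for axes inclined at omega."""
    return a * c - b * b, a - 2 * b * math.cos(omega) + c


# Arbitrary illustrative values (assumptions, not taken from the text).
a, b, c = 2.0, -0.5, 3.0
omega, alpha, beta = 1.1, 0.3, 1.7
omega_new = beta - alpha

A, B, C = change_axes(a, b, c, omega, alpha, beta)
factor = (math.sin(omega_new) / math.sin(omega)) ** 2  # square of the modulus

old = boole_invariants(a, b, c, omega)
new = boole_invariants(A, B, C, omega_new)
print(all(math.isclose(n, factor * o) for n, o in zip(new, old)))
```

Floating-point arithmetic is used here, so the comparison is made to within a tolerance rather than exactly. Taking $a=c=1$, $b=\cos \omega$ in `change_axes` gives a check of the invariance of $x^{2}+2\cos {\omega }~xy+y^{2}$ itself.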

Orthogonal System.—In particular, if we consider the transformation from one pair of rectangular axes to another pair of rectangular axes we obtain an orthogonal system which we will now briefly inquire into. We have $\cos {\omega '}=\cos {\omega }=0$ and the substitution

x_{1}=\cos {\theta }{\text{X}}_{1}-\sin {\theta }{\text{X}}_{2}

x_{2}=\sin {\theta }{\text{X}}_{1}+\cos {\theta }{\text{X}}_{2},

with modulus unity. This is called the direct orthogonal substitution, because the sense of rotation from the axis of ${\text{X}}_{1}$ to the axis of ${\text{X}}_{2}$ is the same as that from that of $x_{1}$ to that of $x_{2}$ . If the senses of rotation be opposite we have the skew orthogonal substitution

x_{1}=\cos {\theta }{\text{X}}_{1}+\sin {\theta }{\text{X}}_{2},

x_{2}=\sin {\theta }{\text{X}}_{1}-\cos {\theta }{\text{X}}_{2},

of modulus $-1$ . In both cases ${\frac {d}{dx_{1}}}$ and ${\frac {d}{dx_{2}}}$ are cogredient with $x_{1}$ and $x_{2}$ ; for, in the case of direct substitution,

{\frac {d}{dx_{1}}}=\cos {\theta }{\frac {d}{d{\text{X}}_{1}}}-\sin {\theta }{\frac {d}{d{\text{X}}_{2}}},

{\frac {d}{dx_{2}}}=\sin {\theta }{\frac {d}{d{\text{X}}_{1}}}+\cos {\theta }{\frac {d}{d{\text{X}}_{2}}};

and for skew substitution

{\frac {d}{dx_{1}}}=\cos {\theta }{\frac {d}{d{\text{X}}_{1}}}+\sin {\theta }{\frac {d}{d{\text{X}}_{2}}},

{\frac {d}{dx_{2}}}=\sin {\theta }{\frac {d}{d{\text{X}}_{1}}}-\cos {\theta }{\frac {d}{d{\text{X}}_{2}}}.

Hence, in both cases, contragrediency and cogrediency are identical, and contravariants are included in covariants. Consider the binary $n^{ic}$ , $(a_{1}x_{1}+a_{2}x_{2})^{n}=a_{x}^{n},$ and the direct substitution

x_{1}=\lambda {\text{X}}_{1}-\mu {\text{X}}_{2}

x_{2}=\mu {\text{X}}_{1}+\lambda {\text{X}}_{2}

where $\lambda ^{2}+\mu ^{2}=1$ ; $\lambda ,\mu$ replacing $\cos \theta ,\sin \theta$ respectively. In the notation

a_{x}=a_{1}x_{1}+a_{2}x_{2},

observe that

a_{a}=a_{1}^{2}+a_{2}^{2},\quad a_{b}=a_{1}b_{1}+a_{2}b_{2}.

Suppose that

a_{x}=b_{x}=c_{x}=...

is transformed into

{\text{A}}_{\text{X}}={\text{B}}_{\text{X}}={\text{C}}_{\text{X}}=...

then of course $({\text{AB}})=(ab),$ the fundamental fact which appertains to the theory of the general linear substitution; now here we have additional and equally fundamental facts; for since

{\text{A}}_{1}=\lambda a_{1}+\mu a_{2},{\text{A}}_{2}=-\mu a_{1}+\lambda a_{2},

${\text{A}}_{\text{A}}={\text{A}}_{1}^{2}+{\text{A}}_{2}^{2}=(\lambda ^{2}+\mu ^{2})(a_{1}^{2}+a_{2}^{2})=a_{a};$
${\text{A}}_{\text{B}}={\text{A}}_{1}{\text{B}}_{1}+{\text{A}}_{2}{\text{B}}_{2}=(\lambda ^{2}+\mu ^{2})(a_{1}b_{1}+a_{2}b_{2})=a_{b};$

({\text{XA}})={\text{X}}_{1}{\text{A}}_{2}-{\text{X}}_{2}{\text{A}}_{1}=(\lambda x_{1}+\mu x_{2})(-\mu a_{1}+\lambda a_{2})

-(-\mu x_{1}+\lambda x_{2})(\lambda a_{1}+\mu a_{2})=(\lambda ^{2}+\mu ^{2})(x_{1}a_{2}-x_{2}a_{1})=(xa);

showing that, in the present theory, $a_{a},a_{b},$ and $(xa)$ possess the invariant property. Since $x_{x}={\text{X}}_{\text{X}}$ we have six types of symbolic factors which may be used to form invariants and covariants, viz.—

(ab),a_{a},a_{b},(xa),a_{x},x_{x}.
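The invariance of these six factors under the direct orthogonal substitution can be confirmed in exact arithmetic. The Python sketch below is illustrative only: it takes $\lambda ={\tfrac {3}{5}}$, $\mu ={\tfrac {4}{5}}$ as an assumed rational example with $\lambda ^{2}+\mu ^{2}=1$, and treats the symbols $a,b$ and the variables $x$ as ordinary number pairs. Because cogrediency and contragrediency coincide here, one and the same function transforms both. All names and sample values are hypothetical.

```python
from fractions import Fraction

LAM, MU = Fraction(3, 5), Fraction(4, 5)  # illustrative choice with LAM^2 + MU^2 = 1


def rotate(p):
    """Apply (p1, p2) -> (LAM p1 + MU p2, -MU p1 + LAM p2), as for A_1, A_2 in the text."""
    p1, p2 = p
    return (LAM * p1 + MU * p2, -MU * p1 + LAM * p2)


def bracket(p, q):
    """Determinant factor (pq) = p1 q2 - p2 q1."""
    return p[0] * q[1] - p[1] * q[0]


def inner(p, q):
    """Factor p_q = p1 q1 + p2 q2."""
    return p[0] * q[0] + p[1] * q[1]


def six_factors(a, b, x):
    """The factors (ab), a_a, a_b, (xa), a_x, x_x in that order."""
    return (bracket(a, b), inner(a, a), inner(a, b),
            bracket(x, a), inner(a, x), inner(x, x))


a, b, x = (2, -1), (5, 3), (7, 4)
before = six_factors(a, b, x)
after = six_factors(rotate(a), rotate(b), rotate(x))
print(before == after)
```

Under a general linear substitution only $(ab)$ and $a_{x}$ would survive this test (the former up to a power of the modulus); the other four factors are peculiar to the orthogonal theory.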

The general form of covariant is therefore

(ab)^{h_{1}}(ac)^{h_{2}}(bc)^{h_{3}}...a_{a}^{i_{1}}b_{b}^{i_{2}}c_{c}^{i_{3}}...a_{b}^{j_{1}}a_{c}^{j_{2}}b_{c}^{j_{3}}...

$\times (xa)^{k_{1}}(xb)^{k_{2}}(xc)^{k_{3}}...a_{x}^{l_{1}}b_{x}^{l_{2}}c_{x}^{l_{3}}...x_{x}^{m}$ $=({\text{A}}{\text{B}})^{h_{1}}({\text{A}}{\text{C}})^{h_{2}}({\text{B}}{\text{C}})^{h_{3}}...{\text{A}}_{\text{A}}^{i_{1}}{\text{B}}_{\text{B}}^{i_{2}}{\text{C}}_{\text{C}}^{i_{3}}...{\text{A}}_{\text{B}}^{j_{1}}{\text{A}}_{\text{C}}^{j_{2}}{\text{B}}_{\text{C}}^{j_{3}}...$

\times ({\text{X}}{\text{A}})^{k_{1}}({\text{X}}{\text{B}})^{k_{2}}({\text{X}}{\text{C}})^{k_{3}}...{\text{A}}_{\text{X}}^{l_{1}}{\text{B}}_{\text{X}}^{l_{2}}{\text{C}}_{\text{X}}^{l_{3}}...{\text{X}}_{\text{X}}^{m}.

If this be of order $\epsilon$ and appertain to an $n^{ic}$

\Sigma k+\Sigma l+2m=\epsilon ,

$h_{1}+h_{2}+...+2i_{1}+j_{1}+j_{2}+...+k_{1}+l_{1}=n,$
$h_{1}+h_{3}+...+2i_{2}+j_{1}+j_{3}+...+k_{2}+l_{2}=n,$

h_{2}+h_{3}+...+2i_{3}+j_{2}+j_{3}+...+k_{3}+l_{3}=n;

viz., the symbols a, b, c,... must each occur n times. It may denote a simultaneous orthogonal invariant of forms of orders $n_{1},n_{2},n_{3},...$ ; the symbols must then present themselves $n_{1},n_{2},n_{3}...$ times respectively. The number of different symbols $a,b,c,...$ denotes the degree $\theta$ of the covariant in the coefficients. The coefficients of the covariants are homogeneous, but not in general isobaric functions, of the coefficients of the original form or forms. Of the above general form of covariant there are important transformations due to the symbolic identities:—

(ab)^{2}=a_{a}b_{b}-a_{b}^{2};(xa)^{2}=a_{a}x_{x}-a_{x}^{2};

as a consequence any even power of a determinant factor may be expressed in terms of the other symbolic factors, and any uneven power may be expressed as the product of its first power and a function of the other symbolic factors. Hence in the above general form of covariant we may suppose the exponents

h_{1},h_{2},h_{3},...k_{1},k_{2},k_{3},...

of the determinant factors to be, each of them, either zero or unity. Or, if we please, we may leave the determinant factors untouched and consider the exponents $j_{1},j_{2},j_{3},...l_{1},l_{2},l_{3},...$ to be, each of them, either zero or unity. Or, lastly, we may leave the exponents h, k, j, l, untouched and consider the product

a_{a}^{i_{1}}b_{b}^{i_{2}}c_{c}^{i_{3}}...x_{x}^{m},

to be reduced either to the form $g_{g}^{i}$ where g is a symbol of the series $a,b,c,...$ or to a power of $x_{x}.$ To assist us in handling the symbolic products we have not only the identity

(ab)c_{x}+(bc)a_{x}+(ca)b_{x}=0,

but also

(ab)x_{x}+(bx)a_{x}+(xa)b_{x}=0,

(ab)a_{c}+(bc)a_{a}+(ca)a_{b}=0,

and many others which may be derived from these in the manner which will be familiar to students of the works of Aronhold, Clebsch and Gordan. Previous to continuing the general discussion it is useful to have before us the orthogonal invariants and covariants of the binary linear and quadratic forms.
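The symbolic identities quoted in this section are polynomial identities in the components of the symbols, and so may be tested by substituting arbitrary integers. The Python sketch below is illustrative only; it treats $a,b,c,x$ as integer pairs and returns the difference of the two sides of each identity. The function names and sample values are hypothetical.

```python
def bracket(p, q):
    """Determinant factor (pq) = p1 q2 - p2 q1."""
    return p[0] * q[1] - p[1] * q[0]


def inner(p, q):
    """Factor p_q = p1 q1 + p2 q2."""
    return p[0] * q[0] + p[1] * q[1]


def identity_residuals(a, b, c, x):
    """Left side minus right side for each of the five identities; all should vanish."""
    return [
        # (ab)^2 = a_a b_b - a_b^2
        bracket(a, b) ** 2 - (inner(a, a) * inner(b, b) - inner(a, b) ** 2),
        # (xa)^2 = a_a x_x - a_x^2
        bracket(x, a) ** 2 - (inner(a, a) * inner(x, x) - inner(a, x) ** 2),
        # (ab)c_x + (bc)a_x + (ca)b_x = 0
        bracket(a, b) * inner(c, x) + bracket(b, c) * inner(a, x) + bracket(c, a) * inner(b, x),
        # (ab)x_x + (bx)a_x + (xa)b_x = 0
        bracket(a, b) * inner(x, x) + bracket(b, x) * inner(a, x) + bracket(x, a) * inner(b, x),
        # (ab)a_c + (bc)a_a + (ca)a_b = 0
        bracket(a, b) * inner(a, c) + bracket(b, c) * inner(a, a) + bracket(c, a) * inner(a, b),
    ]


print(identity_residuals((2, -1), (5, 3), (-4, 7), (6, 1)))
```

The last two identities are seen to be the first of the three-term identities with $c$ replaced by $x$, and with $x$ replaced by $a$, respectively.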

For the linear forms ${\bar {a}}_{0}x_{1}+{\bar {a}}_{1}x_{2}=a_{x}=b_{x}$ there are four fundamental forms

(i.)	$a_{x}={\bar {a}}_{0}x_{1}+{\bar {a}}_{1}x_{2}$	of degree-order	(1, 1),
(ii.)	$x_{x}=x_{1}^{2}+x_{2}^{2}$	”	(0, 2),
(iii.)	$(xa)={\bar {a}}_{1}x_{1}-{\bar {a}}_{0}x_{2}$	”	(1, 1),
(iv.)	$a_{b}={\bar {a}}_{0}^{2}+{\bar {a}}_{1}^{2}$	”	(2, 0),

(iii.) and (iv.) being the linear covariant and the quadrinvariant respectively. Every other concomitant is a rational integral function of these four forms. The linear covariant, obviously the Jacobian of $a_{x}$ and $x_{x},$ is the line perpendicular to $a_{x},$ and the vanishing of the quadrinvariant $a_{b}$ is the condition that $a_{x}$ passes through one of the circular points at infinity. In general any pencil of lines, connected with the line $a_{x}$ by descriptive or metrical properties, has for its equation a rational integral function of the four forms equated to zero.

For the quadratic ${\bar {a}}_{0}x_{1}^{2}+2{\bar {a}}_{1}x_{1}x_{2}+{\bar {a}}_{2}x_{2}^{2},$ we have

(i.)	$a_{x}^{2}={\bar {a}}_{0}x_{1}^{2}+2{\bar {a}}_{1}x_{1}x_{2}+{\bar {a}}_{2}x_{2}^{2},$
(ii.)	$x_{x}=x_{1}^{2}+x_{2}^{2},$
(iii.)	$(ab)^{2}=2({\bar {a}}_{0}{\bar {a}}_{2}-{\bar {a}}_{1}^{2}),$
(iv.)	$a_{a}={\bar {a}}_{0}+{\bar {a}}_{2},$
(v.)	$(xa)a_{x}={\bar {a}}_{1}x_{1}^{2}+({\bar {a}}_{2}-{\bar {a}}_{0})x_{1}x_{2}-{\bar {a}}_{1}x_{2}^{2}.$

This is the fundamental system; we may, if we choose, replace $(ab)^{2}$ by $a_{b}^{2}={\bar {a}}_{0}^{2}+2{\bar {a}}_{1}^{2}+{\bar {a}}_{2}^{2}$ since the identity $a_{a}b_{b}-a_{b}^{2}=(ab)^{2}$ shows the syzygetic relation

({\bar {a}}_{0}+{\bar {a}}_{2})^{2}-({\bar {a}}_{0}^{2}+2{\bar {a}}_{1}^{2}+{\bar {a}}_{2}^{2})=2({\bar {a}}_{0}{\bar {a}}_{2}-{\bar {a}}_{1}^{2}).

There is no linear covariant, since it is impossible to form a symbolic product which will contain $x$ once and at the same time appertain to a quadratic. (v.) is the Jacobian; geometrically it denotes the bisectors of the angles between the lines $a_{x}^{2},$ or, as we may say, the common harmonic conjugates of the lines $a_{x}^{2},$ and the lines $x_{x}.$ The linear invariant $a_{a}$ is such that, when equated to zero, it determines the lines $a_{x}^{2}$ as harmonically conjugate to the lines $x_{x};$ or, in other words, it is the condition that $a_{x}^{2}$ may denote lines at right angles.
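The invariants of the quadratic and the syzygetic relation between them can be checked by carrying the coefficients through a direct orthogonal substitution. The Python sketch below is illustrative only: it assumes the substitution $x_{1}=\lambda {\text{X}}_{1}-\mu {\text{X}}_{2}$, $x_{2}=\mu {\text{X}}_{1}+\lambda {\text{X}}_{2}$ with the rational sample values $\lambda ={\tfrac {3}{5}}$, $\mu ={\tfrac {4}{5}}$, and the function names are hypothetical.

```python
from fractions import Fraction


def rotate_quadratic(a0, a1, a2, lam, mu):
    """Coefficients of a0 x1^2 + 2 a1 x1 x2 + a2 x2^2 after the direct substitution."""
    A0 = a0 * lam * lam + 2 * a1 * lam * mu + a2 * mu * mu
    A1 = (a2 - a0) * lam * mu + a1 * (lam * lam - mu * mu)
    A2 = a0 * mu * mu - 2 * a1 * lam * mu + a2 * lam * lam
    return A0, A1, A2


def orthogonal_invariants(a0, a1, a2):
    """Return a_a, (ab)^2 and a_b^2 expressed in the actual coefficients."""
    return (a0 + a2,
            2 * (a0 * a2 - a1 * a1),
            a0 * a0 + 2 * a1 * a1 + a2 * a2)


lam, mu = Fraction(3, 5), Fraction(4, 5)  # illustrative values with lam^2 + mu^2 = 1
old = orthogonal_invariants(2, -3, 7)
new = orthogonal_invariants(*rotate_quadratic(2, -3, 7, lam, mu))

linear, discriminant, a_b_squared = old
print(old == new, linear ** 2 - a_b_squared == discriminant)
```

For the pair of lines $x_{1}^{2}-x_{2}^{2}=0$ the first invariant ${\bar {a}}_{0}+{\bar {a}}_{2}$ vanishes, in accordance with the lines $x_{1}=\pm x_{2}$ being at right angles.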

References.—Cayley, “Memoirs on Quantics,” in the Collected Mathematical Papers (Cambridge, 1898); Salmon, Lessons Introductory to the Modern Higher Algebra (Dublin, 1885); E. B. Elliott, Algebra of Quantics (Oxford, 1895); F. Brioschi, Teoria dei Covarianti (Rome, 1861); W. Fiedler, Die Elemente der neueren Geometrie und der Algebra der binären Formen (Leipzig, 1862); A. Clebsch, Theorie der binären Algebraischen Formen (Leipzig, 1872); Vorlesungen über Geometrie (Leipzig, 1875); Faà de Bruno, Théorie des formes binaires (Turin, 1876); P. Gordan, Vorlesungen über Invariantentheorie, Bd. i. “Determinanten” (Leipzig, 1885); Bd. ii. “Binäre Formen” (Leipzig, 1887); G. Rubini, Teoria delle forme in generale, e specialmente delle binarie (Lecce, 1886); E. Study, Methoden zur Theorie der Ternären Formen (Leipzig, 1889); Lie, Theorie der Transformationsgruppen (Leipzig, 1888–1890); Franz Meyer, Bericht über den gegenwärtigen Stand der Invariantentheorie; Jahresbericht der Deutschen Mathematiker-Vereinigung, Bd. i. (Berlin, 1892); Encyklopädie der mathematischen Wissenschaften, Bd. i., Heft 3, 4, by Heinrich Burkhardt and Franz Meyer (Leipzig, 1899); J. H. Grace and A. Young, The Algebra of Invariants (Cambridge, 1903). (P. A. M.)

↑ The elementary theory is given in the article Determinant.
↑ Vienna Transactions, t. iv. 1852.
↑ Phil. Trans., 1890, p. 490.
↑ The weight of a term $a_{0}^{k_{0}}a_{1}^{k_{1}}...a_{n}^{k_{n}}$ is defined as being $k_{1}+2k_{2}+...+nk_{n}$.
