Special Functions – Orthogonal polynomials – General theory

Three-term recurrence relation

Theorem: A sequence of orthogonal polynomials \(\{p_n(x)\}_{n=0}^{\infty}\) satisfies

\[p_{n+1}(x)=(A_nx+B_n)p_n(x)+C_np_{n-1}(x),\quad n=1,2,3,\ldots,\]

where

\[A_n=\frac{k_{n+1}}{k_n},\quad n=0,1,2,\ldots\quad\text{and}\quad C_n=-\frac{A_n}{A_{n-1}}\cdot\frac{h_n}{h_{n-1}},\quad n=1,2,3,\ldots.\]

Proof: Since \(\text{degree}\left[p_n(x)\right]=n\) for each \(n\in\{0,1,2,\ldots\}\) the sequence of orthogonal polynomials \(\left\{p_n(x)\right\}_{n=0}^{\infty}\) is linearly independent. Let \(A_n=k_{n+1}/k_n\). Then \(p_{n+1}(x)-A_nxp_n(x)\) is a polynomial of degree \(\leq n\). Hence

\[p_{n+1}(x)-A_nxp_n(x)=\sum_{k=0}^nc_kp_k(x).\]

The orthogonality property now gives

\[\langle p_{n+1}(x)-A_nxp_n(x),\,p_k(x)\rangle =\sum_{m=0}^nc_m\langle p_m(x),\,p_k(x)\rangle=c_k\langle p_k(x),\,p_k(x)\rangle=c_k\,h_k.\]

This implies

\[h_k\,c_k=\langle p_{n+1}(x)-A_nxp_n(x),\,p_k(x)\rangle=\langle p_{n+1}(x),\,p_k(x)\rangle-A_n\langle xp_n(x),\,p_k(x)\rangle =-A_n\langle p_n(x),\,xp_k(x)\rangle.\]

For \(k < n-1\) we have \(\text{degree}\left[xp_k(x)\right] < n\) which implies that \(\langle p_n(x),\,xp_k(x)\rangle=0\). Hence: \(c_k=0\) for \(k < n-1\). This proves that the polynomials satisfy the three-term recurrence relation

\[p_{n+1}(x)-A_nxp_n(x)=c_np_n(x)+c_{n-1}p_{n-1}(x),\quad n=1,2,3,\ldots.\]

Further we have

\[h_{n-1}\,c_{n-1}=-A_n\langle p_n(x),\,xp_{n-1}(x)\rangle=-A_n\,\frac{k_{n-1}}{k_n}\,h_n \quad\Longrightarrow\quad c_{n-1}=-\frac{A_n}{A_{n-1}}\cdot\frac{h_n}{h_{n-1}}.\]

This proves the theorem.

Note that the three-term recurrence relation for a sequence of monic (\(k_n=1\)) orthogonal polynomials \(\{p_n(x)\}_{n=0}^{\infty}\) has the form

\[p_{n+1}(x)=xp_n(x)+B_np_n(x)+C_np_{n-1}(x)\quad\text{with}\quad C_n=-\frac{h_n}{h_{n-1}},\quad n=1,2,3,\ldots.\]

Christoffel-Darboux formula

Theorem: A sequence of orthogonal polynomials \(\left\{p_n(x)\right\}_{n=0}^{\infty}\) satisfies

\[\sum_{k=0}^n\frac{p_k(x)p_k(y)}{h_k}=\frac{k_n}{h_n\,k_{n+1}}\cdot \frac{p_{n+1}(x)p_n(y)-p_{n+1}(y)p_n(x)}{x-y},\quad n=0,1,2,\ldots\]

and

\[\sum_{k=0}^n\frac{\left\{p_k(x)\right\}^2}{h_k}=\frac{k_n}{h_n\,k_{n+1}}\cdot \left(p_{n+1}'(x)p_n(x)-p_{n+1}(x)p_n'(x)\right),\quad n=0,1,2,\ldots.\]

The first formula is called the Christoffel-Darboux formula and the second one its confluent form.

Proof: The three-term recurrence relation implies that

\[p_{n+1}(x)p_n(y)=(A_nx+B_n)p_n(x)p_n(y)+C_np_{n-1}(x)p_n(y)\]

and

\[p_{n+1}(y)p_n(x)=(A_ny+B_n)p_n(y)p_n(x)+C_np_{n-1}(y)p_n(x).\]

Subtraction gives

\[p_{n+1}(x)p_n(y)-p_{n+1}(y)p_n(x)=A_n(x-y)p_n(x)p_n(y)+C_n\left[p_{n-1}(x)p_n(y)-p_{n-1}(y)p_n(x)\right].\]

This leads to the telescoping sum

\begin{align*} (x-y)\sum_{k=1}^n\frac{p_k(x)p_k(y)}{h_k}&=\sum_{k=1}^n\frac{p_{k+1}(x)p_k(y)-p_{k+1}(y)p_k(x)}{A_k\,h_k} -\sum_{k=1}^n\frac{p_k(x)p_{k-1}(y)-p_k(y)p_{k-1}(x)}{A_{k-1}\,h_{k-1}}\\[2.5mm] &=\frac{p_{n+1}(x)p_n(y)-p_{n+1}(y)p_n(x)}{A_n\,h_n}-\frac{k_0^2(x-y)}{h_0}. \end{align*}

This implies that

\[(x-y)\sum_{k=0}^n\frac{p_k(x)p_k(y)}{h_k}=\frac{p_{n+1}(x)p_n(y)-p_{n+1}(y)p_n(x)}{A_n\,h_n} =\frac{k_n}{h_n\,k_{n+1}}\left(p_{n+1}(x)p_n(y)-p_{n+1}(y)p_n(x)\right),\]

which proves the Christoffel-Darboux formula. The confluent form then follows by taking the limit \(y\rightarrow x\):

\begin{align*} \lim\limits_{y\rightarrow x}\frac{p_{n+1}(x)p_n(y)-p_{n+1}(y)p_n(x)}{x-y}&=\lim\limits_{y\rightarrow x} \frac{p_n(x)\left(p_{n+1}(x)-p_{n+1}(y)\right)-p_{n+1}(x)\left(p_n(x)-p_n(y)\right)}{x-y}\\[2.5mm] &=p_n(x)p_{n+1}'(x)-p_{n+1}(x)p_n'(x). \end{align*}

Zeros

Theorem: If \(\{p_n(x)\}_{n=0}^{\infty}\) is a sequence of orthogonal polynomials on the interval \((a,b)\) with respect to the weight function \(w(x)\), then the polynomial \(p_n(x)\) has exactly \(n\) real simple zeros in the interval \((a,b)\).

Proof: Since \(\text{degree}[p_n(x)]=n\) the polynomial has at most \(n\) real zeros. Suppose that \(p_n(x)\) has \(m\leq n\) distinct real zeros \(x_1,x_2,\ldots,x_m\) in \((a,b)\) of odd multiplicity. Then the polynomial

\[p_n(x)(x-x_1)(x-x_2)\cdots(x-x_m)\]

does not change sign on the interval \((a,b)\). This implies that

\[\int_a^bw(x)p_n(x)(x-x_1)(x-x_2)\cdots(x-x_m)\,dx\neq 0.\]

By orthogonality this integral equals zero if \(m < n\). Hence: \(m=n\), which implies that \(p_n(x)\) has \(n\) distinct real zeros of odd multiplicity in \((a,b)\). This proves that all \(n\) zeros are distinct and simple (have multiplicity equal to one).

Theorem: If \(\{p_n(x)\}_{n=0}^{\infty}\) is a sequence of orthogonal polynomials on the interval \((a,b)\) with respect to the weight function \(w(x)\), then the zeros of \(p_n(x)\) and \(p_{n+1}(x)\) separate each other.

Proof: This follows from the confluent form of the Christoffel-Darboux formula. Note that

\[h_n=\int_a^bw(x)\left\{p_n(x)\right\}^2\,dx>0,\quad n=0,1,2,\ldots.\]

This implies that

\[\frac{k_n}{h_n\,k_{n+1}}\cdot\left(p_{n+1}'(x)p_n(x)-p_{n+1}(x)p_n'(x)\right)=\sum_{k=0}^n\frac{\left\{p_k(x)\right\}^2}{h_k}>0.\]

Hence

\[\frac{k_n}{k_{n+1}}\cdot\left(p_{n+1}'(x)p_n(x)-p_{n+1}(x)p_n'(x)\right)>0.\]

Now suppose that \(x_{n,k}\) and \(x_{n,k+1}\) are two consecutive zeros of \(p_n(x)\) with \(x_{n,k} < x_{n,k+1}\). Since all \(n\) zeros of \(p_n(x)\) are real and simple \(p_n'(x_{n,k})\) and \(p_n'(x_{n,k+1})\) should have opposite signs. Hence we have

\[p_n(x_{n,k})=0=p_n(x_{n,k+1})\quad\text{and}\quad p_n'(x_{n,k})\cdot p_n'(x_{n,k+1})<0.\]

This implies that \(p_{n+1}(x_{n,k})\cdot p_{n+1}(x_{n,k+1})\) should be negative too. Then the continuity of \(p_{n+1}(x)\) implies that there should be at least one zero of \(p_{n+1}(x)\) between \(x_{n,k}\) and \(x_{n,k+1}\). However, this holds for each two consecutive zeros of \(p_n(x)\). This proves the theorem.

Moreover, if \(\{x_{n,k}\}_{k=1}^n\) and \(\{x_{n+1,k}\}_{k=1}^{n+1}\) denote the consecutive zeros of \(p_n(x)\) and \(p_{n+1}(x)\) respectively, then we have

\[a < x_{n+1,1} < x_{n,1} < x_{n+1,2} < x_{n,2} < \cdots < x_{n+1,n} < x_{n,n} < x_{n+1,n+1} < b.\]

Gauss quadrature

If \(f\) is a continuous function on \((a,b)\) and \(x_1 < x_2 < \cdots < x_n\) are \(n\) distinct points in \((a,b)\), then there exists exactly one polynomial \(P\) with degree \(\leq n-1\) such that \(P(x_i)=f(x_i)\) for all \(i=1,2,\ldots,n\). This polynomial \(P\) can easily be found by using Lagrange interpolation as follows. Define

\[p(x)=(x-x_1)(x-x_2)\cdots(x-x_n)\]

and consider

\[P(x)=\sum_{i=1}^nf(x_i)\frac{p(x)}{(x-x_i)p'(x_i)} =\sum_{i=1}^nf(x_i)\frac{(x-x_1)\cdots(x-x_{i-1})(x-x_{i+1})\cdots(x-x_n)}{(x_i-x_1)\cdots(x_i-x_{i-1})(x_i-x_{i+1})\cdots(x_i-x_n)}.\]

Let \(\{p_n(x)\}_{n=0}^{\infty}\) be a sequence of orthogonal polynomials on the interval \((a,b)\) with respect to the weight function \(w(x)\). Then for \(x_1 < x_2 < \cdots < x_n\) we take the \(n\) distinct real zeros of the polynomial \(p_n(x)\). If \(f\) is a polynomial of degree \(\leq 2n-1\), then \(f(x)-P(x)\) is a polynomial of degree \(\leq 2n-1\) with at least the zeros \(x_1 \[f(x)=P(x)+r(x)p_n(x),\]

where \(r(x)\) is a polynomial of degree \(\leq n-1\). This can also be written as

\[f(x)=\sum_{i=1}^nf(x_i)\frac{p_n(x)}{(x-x_i)p_n'(x_i)}+r(x)p_n(x).\]

This implies that

\[\int_a^bw(x)f(x)\,dx=\sum_{i=1}^nf(x_i)\int_a^b\frac{w(x)p_n(x)}{(x-x_i)p_n'(x_i)}\,dx+\int_a^bw(x)r(x)p_n(x)\,dx.\]

Since \(\text{degree}[r(x)]\leq n-1\) the orthogonality property implies that the latter integral equals zero. This implies that

\[\int_a^bw(x)f(x)\,dx=\sum_{i=1}^n\lambda_{n,i}f(x_i)\quad\text{with}\quad \lambda_{n,i}:=\int_a^b\frac{w(x)p_n(x)}{(x-x_i)p_n'(x_i)}\,dx,\quad i=1,2,\ldots,n.\]

This is the Gauss quadrature formula. This gives the value of the integral in the case that \(f\) is a polynomial of degree \(\leq 2n-1\) if the value of \(f(x_i)\) is known for the \(n\) zeros \(x_1 < x_2 < \cdots < x_n\) of the polynomial \(p_n(x)\).

If \(f\) is not a polynomial of degree \(\leq 2n-1\) this leads to an approximation of the integral:

\[\int_a^bw(x)f(x)\,dx\approx\sum_{i=1}^n\lambda_{n,i}f(x_i)\quad\text{with}\quad \lambda_{n,i}:=\int_a^b\frac{w(x)p_n(x)}{(x-x_i)p_n'(x_i)}\,dx,\quad i=1,2,\ldots,n.\]

The coefficients \(\{\lambda_{n,i}\}_{i=1}^n\) are called Christoffel numbers. Note that these numbers do not depend on the function \(f\). These Christoffel numbers are all positive. This can be shown as follows. We have

\[\lambda_{n,i}=\int_a^bw(x)\ell_{n,i}(x)\,dx\quad\text{with}\quad\ell_{n,i}(x):=\frac{p_n(x)}{(x-x_i)p_n'(x_i)},\quad i=1,2,\ldots,n.\]

Then \(\ell_{n,i}^2(x)-\ell_{n,i}(x)\) is a polynomial of degree \(\leq 2n-2\) which vanishes at the zeros \(\{x_{n,k}\}_{k=1}^n\) of \(p_n(x)\). Hence

\[\ell_{n,i}^2(x)-\ell_{n,i}(x)=p_n(x)q(x)\quad\text{for some polynomial \(q\) of degree \(\leq n-2\).}\]

This implies that

\[\int_a^bw(x)\left(\ell_{n,i}^2(x)-\ell_{n,i}(x)\right)\,dx=\int_a^bw(x)p_n(x)q(x)\,dx=0\]

by orthogonality. Hence we have

\[\lambda_{n,i}=\int_a^bw(x)\ell_{n,i}(x)\,dx=\int_a^bw(x)\left\{\ell_{n,i}(x)\right\}^2\,dx>0.\]

Now we can also prove

Theorem: Let \(\{p_n(x)\}_{n=0}^{\infty}\) be a sequence of orthogonal polynomials on the interval \((a,b)\) with respect to the weight function \(w(x)\) and let \(m < n\). Then we have: between any two zeros of \(p_m(x)\) there is at least one zero of \(p_n(x)\).

Proof: Suppose that \(x_{m,k}\) and \(x_{m,k+1}\) are two consecutive zeros of \(p_m(x)\) and that there is no zero of \(p_n(x)\) in \((x_{m,k},x_{m,k+1})\). Then consider the polynomial

\[g(x)=\frac{p_m(x)}{(x-x_{m,k})(x-x_{m,k+1})}.\]

Then we have

\[g(x)p_m(x)\geq 0\quad\text{for}\quad x\notin(x_{m,k},x_{m,k+1}).\]

Now the Gauss quadrature formula gives

\[\int_a^bw(x)g(x)p_m(x)\,dx=\sum_{i=1}^n\lambda_{n,i}g(x_{n,i})p_m(x_{n,i}),\]

where \(\{x_{n,i}\}_{i=1}^n\) are the zeros of \(p_n(x)\). Since there are no zeros of \(p_n(x)\) in \((x_{m,k},x_{m,k+1})\) we conclude that \(g(x_{n,i})p_m(x_{n,i})\geq 0\) for all \(i=1,2,\ldots,n\). Further we have \(\lambda_{n,i}>0\) for all \(i=1,2,\ldots,n\) which implies that the sum at the right-hand side cannot vanish. However, the integral at the left-hand side is zero by orthogonality. This contradiction proves that there should be at least one zero of \(p_n(x)\) between the two consecutive zeros of \(p_m(x)\).


Last modified on May 22, 2021
© Roelof Koekoek

Metamenu