Convergence of Solvers and the Importance of Well-Posedness
1. Introduction
Black-box solvers are best viewed as versatile tools that handle systems of equations without exposing much mathematical detail. They are often closed-source: they take equations or scenarios as inputs and return the solution as output. For example, a finite element solver may solve the steady-state heat equation given the physical parameters (such as the diffusivity), the boundary conditions, and a mesh resolution that you provide; see Fig. 1. for a diagram of this process. But to ensure that the solution is accurate for whatever scenario you input, it is important to perform convergence studies against known manufactured solutions, or at least fine-grid reference solutions, and, most importantly, to test the robustness of the scheme.

The topic of this blog post is to illustrate the importance of verification, validation, and robustness testing when using such black-box solvers, or any solver for that matter. I have personally come across an inordinate number of research articles that present a new solver or numerical scheme but do not verify or validate the scheme against known solutions, do a thorough analysis of the system, or test the robustness of the system. In such cases, even though your solver may converge, it is hard to tell whether it has converged to the ‘right solution’ (if there is a ‘right solution’ in the first place!).
We will build our own solver to solve the elliptic Poisson equation in $1D$
\begin{equation} \label{eq:elliptic_eq} -\frac{d^2 u}{dx^2} = f \text{ in } (0, 1), \; \frac{du}{dx}\bigg\rvert_{x = 0} = \frac{du}{dx}\bigg\rvert_{x=1} = 0, \end{equation}
using the conjugate gradient method, and we will show how convergence of our solver does not necessarily imply a physically meaningful solution (although an experienced reader may have already guessed the issue at hand!).
2. Computational Solver
We now provide the details of our computational solver. We consider the finite element method to discretize \ref{eq:elliptic_eq} in space and then use the conjugate gradient (CG) method to solve the resulting system of equations.
2.1. Numerical Discretization
We begin by discretizing \ref{eq:elliptic_eq} using $P^1$ Lagrange finite elements (the standard Galerkin method). Let $\Omega = (0, 1)$ be divided into $M$ cells $(x_{j}, x_{j+1})$, $0 \leq j \leq M-1$, of uniform width $h = \frac{1}{M}$. We seek an approximation $u_h \approx u$, with $u_h \in V_h \subset H^1(0, 1)$, such that [1, 2]
\begin{equation} \label{eq:variational_form} \int_{0}^{1} \frac{du_h}{dx} \frac{d\phi}{dx} = \int_{0}^{1} f \phi, \; \forall \phi \in V_h, \end{equation}
where $V_h$ is the subspace of piecewise-linear functions with basis functions given by
\[\phi_{j}(x) = \begin{cases} (x - x_{j-1})h^{-1}, & x \in (x_{j-1}, x_j), \\ (x_{j+1} - x)h^{-1}, & x \in (x_j, x_{j+1}), \\ 0, & \text{otherwise}, \end{cases} \quad 1 \leq j \leq M-1,\]with the boundary basis functions\[\phi_0(x) = 1 - x h^{-1} \text{ for } x \in (0, x_1), \quad \phi_M(x) = (x - x_{M-1})h^{-1} \text{ for } x \in (x_{M-1}, x_M),\]both extended by zero elsewhere. By expanding $u_h = \sum_{j=0}^{M} U_j \phi_j$ and choosing $\phi = \phi_j$ in \ref{eq:variational_form} we obtain the discretized system
\begin{equation} \label{eq:linear_system} AU = F, \end{equation}
where $U = [U_0 \; U_1 \; \dots \; U_{M}]^T \in \mathbb{R}^{M+1}$, and $A \in \mathbb{R}^{(M+1)\times (M+1)}$, $F \in \mathbb{R}^{M+1}$ are given by
\[\label{eq:discretized_system} A = \frac{1}{h}\begin{bmatrix} 1 & -1 & 0 & \dots & 0 & 0 \\ -1 & 2 & -1 & \dots & 0 & 0 \\ 0 & -1 & 2 & \dots & 0 & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & 0 & \dots & 2 & -1 \\ 0 & 0 & 0 & \dots & -1 & 1 \end{bmatrix}, \; F = \begin{bmatrix} \frac{h}{2}f(x_0) \\ hf(x_1) \\ hf(x_2) \\ \vdots \\ hf(x_{M-1}) \\ \frac{h}{2}f(x_{M}) \end{bmatrix},\]where we have used the trapezoidal rule to approximate $\int_{0}^{1} f(x)\phi_j(x)dx$ to obtain $F$. Note that the matrix $A$ is symmetric positive semi-definite.
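For concreteness, the assembly of $A$ and $F$ above can be sketched in a few lines of NumPy (the function name and signature are my own choices, not from any particular library):

```python
import numpy as np

def assemble_system(M, f):
    """Assemble the P1 stiffness matrix A and load vector F on a
    uniform mesh of M cells over (0, 1), using the trapezoidal rule
    for the right-hand side, as in the text."""
    h = 1.0 / M
    x = np.linspace(0.0, 1.0, M + 1)

    # Tridiagonal stiffness matrix (1/h) * tridiag(-1, 2, -1), with
    # 1 instead of 2 in the first and last diagonal entries because
    # of the pure Neumann boundary conditions.
    A = (np.diag(2.0 * np.ones(M + 1))
         + np.diag(-np.ones(M), k=1)
         + np.diag(-np.ones(M), k=-1)) / h
    A[0, 0] = A[M, M] = 1.0 / h

    # Trapezoidal-rule load vector: h * f(x_j), halved at the endpoints.
    F = h * f(x)
    F[0] *= 0.5
    F[M] *= 0.5
    return A, F, x
```

Note that every row of $A$ sums to zero, a fact we will return to below.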
2.2. Linear Solver
The system \ref{eq:linear_system} is linear and symmetric and can be solved using the CG method [1, 3]: given $U^{(0)} \in \mathbb{R}^{M+1}$, we set $r^{(0)} = F - A U^{(0)}, \; p^{(0)} = r^{(0)}$, and iterate as follows
\[\begin{cases} \alpha^{(m-1)} = \frac{ {r^{(m-1)}}^T r^{(m-1)} }{ {p^{(m-1)}}^T A p^{(m-1)} }, \\ U^{(m)} = U^{(m-1)} + \alpha^{(m-1)} p^{(m-1)}, \\ r^{(m)} = F - A U^{(m)}, \\ \beta^{(m-1)} = \frac{ {r^{(m)}}^T r^{(m)} }{ {r^{(m-1)}}^T r^{(m-1)} }, \\ p^{(m)} = r^{(m)} + \beta^{(m-1)} p^{(m-1)}. \end{cases}\]For symmetric positive definite matrices, convergence is guaranteed in at most $M+1$ iterations (ignoring round-off error). We iterate the CG algorithm until a prescribed tolerance $\epsilon$ is met, i.e., we terminate the iteration when
\begin{equation} \lVert A U^{(m)} - F \rVert_2^2 \leq \epsilon. \nonumber \end{equation}
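The iteration above translates almost line by line into code. Here is a minimal NumPy sketch (the function name and the `max_iter` safeguard are my own additions):

```python
import numpy as np

def conjugate_gradient(A, F, U0, eps=1e-12, max_iter=None):
    """Plain CG iteration as written above; terminates when the
    squared residual norm drops below eps."""
    U = U0.astype(float).copy()
    r = F - A @ U                      # initial residual r^(0)
    p = r.copy()                       # initial search direction p^(0)
    if max_iter is None:
        max_iter = len(F)              # at most M+1 steps in exact arithmetic
    n_iter = 0
    while r @ r > eps and n_iter < max_iter:
        Ap = A @ p
        alpha = (r @ r) / (p @ Ap)     # step length alpha^(m-1)
        U = U + alpha * p              # update the iterate
        r_new = r - alpha * Ap         # equivalent to F - A @ U
        beta = (r_new @ r_new) / (r @ r)
        p = r_new + beta * p           # next search direction
        r = r_new
        n_iter += 1
    return U, n_iter
```

For example, on the small SPD system $A = \begin{bmatrix}4 & 1\\ 1 & 3\end{bmatrix}$, $F = [1, 2]^T$, the iteration reaches the exact solution in two steps.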
3. Results
We now use our solver on a given source function $f$. The source $f$ is chosen to represent a pulse function and is shown in Fig. 2.

We use $M = 25$ cells, an initial guess of ${u_h}^{(0)} = 0$, and a prescribed tolerance of $\epsilon = 10^{-8}$. The results are shown in Fig. 3.

It can be observed that the solution $u_h$ is $0$ everywhere except at $x = 0.48$, where the source function takes the value $2$. Also, the total number of iterations taken by the CG method is $25$.
Robustness testing: changing the initial guess. The results in Fig. 3. show the convergence of the CG method; in fact, the solver does not struggle to converge. However, to test the robustness of our solver, we provide a different initial guess: we choose ${u_h}^{(0)} = 1$. If our computational method is indeed robust, then we should still expect convergence to the same solution profile as in Fig. 3., regardless of the (reasonable!) initial guess. The results with this new initial guess are shown in Fig. 4.

The results in Fig. 4. show a similar profile for $u_h$ as in Fig. 3., but the values differ by $1$! That is, even though we have convergence in both scenarios, the two solutions differ, almost exactly, by a constant of $1$.
The reader may have guessed the reason for this behaviour, and now we make it clear. Even though our CG solver converges, the problem itself is not well-posed in the first place! That is, the solution to \ref{eq:elliptic_eq} with the given boundary conditions is not unique: it is easy to verify that if $u_h$ solves \ref{eq:variational_form}, then so does $u_h + c$ for any constant $c \in \mathbb{R}$. Thus, for different initial guesses the solver converges to different solutions, which rightfully differ by a constant ($1$ in this case). If one probes further (or already did when setting up the system), one finds that the matrix $A$ is singular (and hence not positive definite, but only positive semi-definite). Finally, if instead of Neumann boundary conditions we had imposed homogeneous Dirichlet boundary conditions $u(0) = u(1) = 0$, we would not have run into this issue, since the problem would have been well-posed.
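The singularity of $A$ is easy to check directly: the constant vector lies in its null space, since every row of the pure-Neumann stiffness matrix sums to zero. A minimal NumPy check (rebuilding the matrix from the text):

```python
import numpy as np

M = 25
h = 1.0 / M

# Pure-Neumann P1 stiffness matrix from the text:
# (1/h) * tridiag(-1, 2, -1), with 1 in the corner diagonal entries.
A = (np.diag(2.0 * np.ones(M + 1))
     + np.diag(-np.ones(M), k=1)
     + np.diag(-np.ones(M), k=-1)) / h
A[0, 0] = A[M, M] = 1.0 / h

# The constant vector is in the null space, so A is singular:
ones = np.ones(M + 1)
print(np.linalg.norm(A @ ones))    # 0.0
print(np.linalg.matrix_rank(A))    # 25, one less than the size M + 1
```

This is exactly the discrete counterpart of the statement that $u_h + c$ solves the variational problem whenever $u_h$ does.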
This little example highlights the importance of due diligence when it comes to mathematical equations. One may have a convergent solution, but the solution may not make physical sense if the well-posedness of the system has not been established beforehand. Moreover, in this case, a grid convergence study would not have helped either, since for any fixed initial guess the solution still converges, with order $O(h)$, to one member of the family of solutions.
Note on the choice of quadrature. When computing $\int_{0}^{1} f \phi_j$ in \ref{eq:discretized_system} we make use of the trapezoidal rule. In our example, the function $f$ is non-zero only on a set of measure zero, so the Lebesgue integral $\int_{0}^{1} f \phi_j = 0$ and the trapezoidal rule is a poor approximation of it. However, we could have obtained the same results as above by using a piecewise-constant $f$ that takes the values $2$ and $-1$ on intervals rather than at points, for which the trapezoidal rule would have been a much better approximation.
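To make this concrete, here is a small NumPy check of how the trapezoidal-rule load vector "sees" a point value that the exact integral does not (the placement of the pulse at $x = 0.48$ follows the example; the precise shape of $f$ is my assumption):

```python
import numpy as np

M = 25
h = 1.0 / M
x = np.linspace(0.0, 1.0, M + 1)

# f is nonzero only at the single node x = 0.48 (value 2), i.e. on a
# set of measure zero, so the exact integral of f * phi_j is 0.
f = np.zeros(M + 1)
f[np.isclose(x, 0.48)] = 2.0

# The trapezoidal-rule load vector nevertheless picks up the point value:
F = h * f
F[0] *= 0.5
F[-1] *= 0.5
print(F.max())   # 2h = 0.08, not 0
```

A piecewise-constant $f$ supported on intervals would remove this mismatch between the quadrature and the exact integral.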
Further Reading and Thoughts
The issue of well-posedness becomes highly nontrivial for coupled multiphysics systems. For example, we may have a system of equations of the form
\begin{equation} \label{eq:multiphysics_coupled} L_1(u, v) = 0, \; L_2(u, v) = 0, \end{equation}
where $L_1, L_2$ are two differential operators. For systems like \ref{eq:multiphysics_coupled}, a common strategy is a partitioned iterative approach: given ${u_h}^{(m-1)}, {v_h}^{(m-1)}$, first solve for ${u_h}^{(m)}$ and then for ${v_h}^{(m)}$ via
\[\begin{cases} {L_1}_h({u_h}^{(m)}, {v_h}^{(m-1)}) = 0, \\ {L_2}_h ({u_h}^{(m)}, {v_h}^{(m)}) = 0, \end{cases}\]where ${L_1}_h$ and ${L_2}_h$ are appropriate approximations of $L_1$ and $L_2$, respectively. Such an iterative approach may converge, but it is important to do a thorough analysis of the well-posedness of the system beforehand to avoid spurious non-physical results of the kind illustrated above.
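The structure of such a partitioned iteration can be sketched on a deliberately trivial stand-in problem. The two scalar "physics" below and their coefficients are illustrative choices of mine, not from the text; the point is only the alternating solve pattern:

```python
# Toy coupled system standing in for L1, L2:
#   L1(u, v) = 2u + v - 3 = 0,   L2(u, v) = u + 2v - 3 = 0,
# whose exact solution is u = v = 1.

def solve_L1(v):
    """Solve L1(u, v) = 0 for u, holding v fixed."""
    return (3.0 - v) / 2.0

def solve_L2(u):
    """Solve L2(u, v) = 0 for v, holding u fixed."""
    return (3.0 - u) / 2.0

u, v = 0.0, 0.0              # initial guesses u^(0), v^(0)
for m in range(50):          # partitioned (Gauss-Seidel-type) sweeps
    u = solve_L1(v)          # update u^(m) using v^(m-1)
    v = solve_L2(u)          # update v^(m) using u^(m)

print(u, v)                  # converges to (1.0, 1.0)
```

Here the sweep happens to be a contraction, so it converges; for a genuinely coupled multiphysics system neither convergence of the sweep nor well-posedness of the underlying problem comes for free, which is precisely the point of this section.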
It is well known that the CG method converges for symmetric positive definite (non-singular) systems; its behaviour on singular systems, such as the one encountered here, is more delicate; see [4, 5] for a more thorough analysis.
References
[1] Alexandre Ern and Jean-Luc Guermond, Theory and Practice of Finite Elements, Springer, 2004.
[2] Claes Johnson, Numerical Solution of Partial Differential Equations by the Finite Element Method, Cambridge University Press, 1987.
[3] C. T. Kelley, Iterative Methods for Linear and Nonlinear Equations, Society for Industrial and Applied Mathematics, 1995.
[4] Jonathan R. Shewchuk, An Introduction to the Conjugate Gradient Method Without the Agonizing Pain, Carnegie Mellon University, 1994.
[5] Ken Hayami, Convergence of the Conjugate Gradient Method on Singular Systems, arXiv:1809.00793, 2018.
