
Satisfiability in Rationals and Linear Programming

Introduction

The study of satisfiability sits at the intersection of mathematical logic, computer science, and optimization theory. At its core, satisfiability asks a fundamental question: given a set of constraints, does there exist an assignment to the variables that makes all constraints true? This seemingly simple question leads to remarkable theoretical depth when we examine different constraint languages and their computational properties.

When we restrict ourselves to Boolean variables with logical connectives, we encounter the famous SAT problem, the first problem proven to be NP-complete. However, when we enrich our constraint language with arithmetic over the rational numbers, something fascinating happens: the decision problem becomes solvable in polynomial time. This contrast between the intractability of Boolean SAT and the tractability of linear arithmetic satisfiability reveals deep structure in the landscape of computational problems.

This article explores the theoretical connection between satisfiability problems over rational arithmetic and linear programming. We examine how linear programming provides both a decision procedure for rational satisfiability and an optimization framework, while also serving as a foundation for understanding why adding integer constraints dramatically changes computational complexity.

Foundations: Rational Satisfiability and Linear Constraints

What Is Rational Satisfiability?

Let us begin with a precise definition. A system of linear constraints over the rationals (LRA) consists of a finite set of variables $x_1, x_2, \ldots, x_n$ taking values in the rational numbers $\mathbb{Q}$, together with constraints of the form:

$$a_1 x_1 + a_2 x_2 + \cdots + a_n x_n \leq b$$

where $a_1, a_2, \ldots, a_n \in \mathbb{Q}$ and $b \in \mathbb{Q}$. We may also include constraints of the form $c \leq a_1 x_1 + \cdots + a_n x_n$ (which is equivalent to $-a_1 x_1 - \cdots - a_n x_n \leq -c$) and equality constraints $a_1 x_1 + \cdots + a_n x_n = b$ (representable as two inequalities).

Definition (Satisfiability). A system of linear constraints over $\mathbb{Q}$ is satisfiable if there exists an assignment $\nu: \{x_1, \ldots, x_n\} \to \mathbb{Q}$ such that all constraints are simultaneously satisfied. Such an assignment is called a model or solution.

Example 1. Consider the system:

$$\begin{aligned} x + y &\leq 5 \\ 2x - y &\geq 1 \\ x - y &= 0 \end{aligned}$$

The assignment $\nu(x) = 2, \nu(y) = 2$ satisfies all three constraints, so this system is satisfiable. Geometrically, this corresponds to a point in $\mathbb{R}^2$ lying in the intersection of the half-planes defined by these inequalities.
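The claimed model can be checked mechanically with exact rational arithmetic; a minimal sketch using Python's `fractions` module:

```python
from fractions import Fraction

# Verify that nu(x) = 2, nu(y) = 2 satisfies every constraint of Example 1,
# using exact rational arithmetic rather than floating point.
x, y = Fraction(2), Fraction(2)
constraints = [
    x + y <= 5,       # x + y <= 5
    2 * x - y >= 1,   # 2x - y >= 1
    x - y == 0,       # x - y = 0
]
print(all(constraints))  # True: the system is satisfiable
```

Exact arithmetic matters here: rational satisfiability is a statement about $\mathbb{Q}$, and `Fraction` avoids any floating-point rounding.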

Connection to Classical SAT

The relationship between rational linear arithmetic satisfiability and classical Boolean SAT becomes clear when we consider how each problem generalizes the other. Classical SAT asks whether a Boolean formula in conjunctive normal form can be made true by some assignment to its variables. Each clause is a disjunction of literals, and we require all clauses to be true simultaneously.

Linear arithmetic over rationals provides a much richer constraint language. Indeed, we can encode Boolean variables using rational constraints. Let $x$ be a Boolean variable. We can represent $x = \text{true}$ by the constraint $x \geq 1$ and $x = \text{false}$ by $x \leq 0$, with the implicit constraint that $x \in \{0, 1\}$. However, the pure rational theory without integrality cannot enforce $x \in \{0, 1\}$; it only enforces $x \in \mathbb{Q}$.

This observation leads to an important distinction: LRA (Linear Rational Arithmetic) allows variables to take any rational value, while Boolean SAT restricts variables to $\{0, 1\}$. Although restricting values to a two-element domain might seem to make the problem easier, the computational properties differ dramatically: LRA satisfiability is solvable in polynomial time, while SAT is NP-complete.

Linear Programming as Decision and Optimization Framework

The Geometry of Feasible Regions

Linear programming studies the optimization of a linear objective function subject to linear constraints. The feasible region, the set of all points satisfying all constraints, is a convex polyhedron in $\mathbb{R}^n$ (called a polytope when bounded). For rational inputs, its vertices have rational coordinates.

Definition (Linear Program). A linear program (LP) is a problem of the form:

$$\begin{aligned} \max \quad & c^T x \\ \text{s.t.} \quad & Ax \leq b \\ & x \in \mathbb{Q}^n \end{aligned}$$

where $A \in \mathbb{Q}^{m \times n}$, $b \in \mathbb{Q}^m$, and $c \in \mathbb{Q}^n$. The decision version of LP asks whether there exists $x \in \mathbb{Q}^n$ satisfying $Ax \leq b$; this is precisely the satisfiability problem for linear constraints.

The geometry of the feasible region provides intuition. Consider two variables $x_1$ and $x_2$ with constraints:

$$\begin{aligned} x_1 + x_2 &\leq 10 \\ x_1 - x_2 &\leq 3 \\ -x_1 + x_2 &\leq 2 \\ x_1, x_2 &\geq 0 \end{aligned}$$

The feasible region is a convex polygon in $\mathbb{R}^2$. The simplex algorithm moves along edges of this polygon from vertex to vertex, seeking an optimal corner. Interior-point methods move through the interior of the polytope.
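Because the region is cut out by finitely many half-planes, its vertices can be enumerated directly: intersect each pair of boundary lines and keep the points that satisfy every constraint. A sketch in exact arithmetic for the five constraints above:

```python
from fractions import Fraction
from itertools import combinations

# Each constraint is (a1, a2, b), meaning a1*x1 + a2*x2 <= b.
cons = [(1, 1, 10), (1, -1, 3), (-1, 1, 2), (-1, 0, 0), (0, -1, 0)]

def intersect(c1, c2):
    """Point where both constraint boundaries hold with equality (Cramer's rule)."""
    a, b, e = c1
    c, d, f = c2
    det = a * d - b * c
    if det == 0:
        return None  # parallel boundary lines
    return (Fraction(e * d - b * f, det), Fraction(a * f - e * c, det))

def feasible(p):
    return all(a * p[0] + b * p[1] <= c for a, b, c in cons)

candidates = {intersect(c1, c2) for c1, c2 in combinations(cons, 2)}
vertices = sorted(p for p in candidates if p is not None and feasible(p))
print(vertices)  # five corners, including (13/2, 7/2)
```

Maximizing a linear objective over just these vertices solves the LP; the simplex method organizes this finite search so that in practice it visits only a few corners.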

Why Linear Programming Works Efficiently

The remarkable fact that linear programming is solvable in polynomial time contrasts sharply with integer linear programming. Several factors contribute to this tractability:

Convexity. The feasible region of a linear program is convex. This means that if we have two feasible points, the entire line segment between them is also feasible. Convexity enables efficient algorithms because local optima are global optima.

Polyhedral Structure. The feasible region is a polyhedron (possibly unbounded). The fundamental theorem of linear programming states that if an optimal solution exists and the polyhedron has at least one vertex, then some vertex (extreme point) is optimal. Since a polyhedron in $n$ dimensions defined by $m$ constraints has at most $\binom{m}{n}$ vertices, we have a finite (though potentially exponential) search space.

Duality Theory. Every linear program has a dual program. Weak duality states that any feasible solution to the dual provides an upper bound on the primal objective, while strong duality states that optimal primal and dual values are equal (when both are feasible). This theoretical framework enables powerful algorithms and provides optimality certificates.
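A matched primal/dual pair makes this concrete. On a small instance chosen here for illustration, exhibiting feasible points whose objective values coincide certifies that both are optimal:

```python
from fractions import Fraction as F

# Primal:  max 3*x1 + 2*x2   s.t.  x1 + x2 <= 4,  x1 <= 2,  x >= 0
# Dual:    min 4*y1 + 2*y2   s.t.  y1 + y2 >= 3,  y1 >= 2,  y >= 0
x = (F(2), F(2))   # primal-feasible point
y = (F(2), F(1))   # dual-feasible point

assert x[0] + x[1] <= 4 and x[0] <= 2 and min(x) >= 0   # primal feasibility
assert y[0] + y[1] >= 3 and y[0] >= 2 and min(y) >= 0   # dual feasibility

primal_value = 3 * x[0] + 2 * x[1]   # c^T x
dual_value = 4 * y[0] + 2 * y[1]     # b^T y
print(primal_value, dual_value)      # 10 10: equal values certify optimality
```

Weak duality alone guarantees `primal_value <= dual_value` for any such pair; equality is the optimality certificate promised by strong duality.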

Key Theoretical Results

Farkas’ Lemma and Rational Satisfiability

One of the most fundamental results connecting linear inequalities and feasibility is Farkas’ Lemma, which provides a certificate of infeasibility.

Theorem (Farkas’ Lemma). Exactly one of the following systems has a solution:

  1. $Ax = b, x \geq 0$ for some $x \in \mathbb{R}^n$
  2. $y^T A \leq 0, y^T b > 0$ for some $y \in \mathbb{R}^m$

In practical terms, Farkas’ Lemma tells us that either there exists a rational solution satisfying all constraints, or there exists a certificate (the vector $y$) proving that no such solution exists. This “either-or” characteristic is fundamental to linear programming: either we find a solution, or we can prove none exists.
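A tiny worked certificate, on a deliberately inconsistent system chosen for illustration:

```python
from fractions import Fraction as F

# System (1):  Ax = b, x >= 0, with the two equations
#   x1 + x2 = 1
#   x1 + x2 = 2      -- no x can satisfy both
A = [[F(1), F(1)], [F(1), F(1)]]
b = [F(1), F(2)]

# Farkas certificate for system (2):  y^T A <= 0 and y^T b > 0.
y = [F(-1), F(1)]

yTA = [sum(y[i] * A[i][j] for i in range(2)) for j in range(2)]
yTb = sum(y[i] * b[i] for i in range(2))
print(all(v <= 0 for v in yTA), yTb > 0)  # True True: a proof of infeasibility
```

The vector $y$ combines the equations as $-1 \cdot (\text{eq. 1}) + 1 \cdot (\text{eq. 2})$, yielding the contradiction $0 = 1$; this is exactly the kind of certificate LP solvers return when they report infeasibility.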

Theorem (Rational Satisfiability is in P). The decision problem “Is a system of linear inequalities over $\mathbb{Q}$ satisfiable?” can be solved in polynomial time.

This follows from the existence of polynomial-time algorithms for linear programming, such as the ellipsoid method (Khachiyan, 1979) with complexity $O(n^6 L^2)$ where $L$ is the bit-length of the input, or interior-point methods with complexity $O(n^3 L)$.

LP Relaxations and Integer Satisfiability

One of the most practically important theoretical connections involves LP relaxations. Given an integer linear program (ILP):

$$\begin{aligned} \max \quad & c^T x \\ \text{s.t.} \quad & Ax \leq b \\ & x \in \mathbb{Z}^n \end{aligned}$$

The LP relaxation removes the integrality constraint:

$$\begin{aligned} \max \quad & c^T x \\ \text{s.t.} \quad & Ax \leq b \\ & x \in \mathbb{Q}^n \end{aligned}$$

The relaxed problem is easier to solve (polynomial time), and its solution provides bounds on the integer problem. The integrality gap, the difference between the optimal LP value and the optimal ILP value, measures how much rounding hurts.

Example 2. Consider the ILP:

$$\begin{aligned} \max \quad & x_1 + x_2 \\ \text{s.t.} \quad & 2x_1 \leq 5 \\ & 2x_2 \leq 5 \\ & x_1, x_2 \in \mathbb{Z}_{\geq 0} \end{aligned}$$

The LP relaxation has optimal solution $x_1 = x_2 = 5/2$ with value 5. However, the integer optimum is $x_1 = x_2 = 2$ with value 4. The integrality gap is 1.
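For tiny instances the gap can be computed exactly by brute force. A sketch on another small instance (chosen here for illustration): maximize $x_1 + x_2$ subject to $2x_1 + 2x_2 \leq 3$ with $x_1, x_2 \geq 0$:

```python
from fractions import Fraction
from itertools import product

# max x1 + x2  s.t.  2*x1 + 2*x2 <= 3,  x1, x2 >= 0
# LP relaxation: the constraint is x1 + x2 <= 3/2, attained at e.g. (3/2, 0).
lp_opt = Fraction(3, 2)

# ILP optimum: brute force over the finite set of feasible integer points
# (the constraint forces 0 <= x1, x2 <= 1).
ilp_opt = max(x1 + x2 for x1, x2 in product(range(2), repeat=2)
              if 2 * x1 + 2 * x2 <= 3)

print(lp_opt, ilp_opt, lp_opt - ilp_opt)  # 3/2 1 1/2
```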

This gap motivates cutting plane methods and branch-and-bound algorithms for ILP, which iteratively refine the LP relaxation by adding valid inequalities until an integer solution is found.

Why Rationals and Integers Behave Differently

The fundamental difference in computational complexity between rational and integer satisfiability stems from a key topological property: the rational numbers are dense in the real numbers, while the integers are discrete.

Theorem (Fundamental Difference). Linear programming over $\mathbb{Q}$ (or $\mathbb{R}$) is in P. Integer linear programming is NP-complete.

This result, established by Karp in his famous 1972 paper, marks one of the original NP-complete problems. The proof reduces from SAT: given a Boolean formula in CNF, we can construct an ILP that has an integer solution if and only if the formula is satisfiable.

The key insight is that the “rounding” operation needed to go from rational to integer solutions is inherently combinatorial: it requires searching through exponentially many possibilities in the worst case.

Algorithmic Implications

How LP Solvers Exploit Rational Satisfiability

Modern LP solvers employ sophisticated algorithms that exploit the mathematical structure we have discussed:

The Simplex Method. Invented by Dantzig in 1947, the simplex method moves along edges of the feasible polytope from vertex to vertex. Although Klee and Minty (1972) exhibited instances on which it takes exponentially many steps, smoothed analysis explains why it performs well in practice. The simplex method does not directly decide satisfiability but finds optimal solutions; however, both infeasibility and unboundedness are detected during execution.

Interior-Point Methods. Following Karmarkar’s 1984 polynomial-time algorithm, interior-point methods traverse the interior of the polytope using Newton’s method with a barrier function. These methods achieve polynomial time complexity and handle large-scale problems efficiently.

The Ellipsoid Method. Though not practical for implementation, Khachiyan’s ellipsoid method provided the first proof that LP is in P. It uses a sequence of ellipsoids that contain the feasible region, shrinking exponentially until either a feasible point is found or the region is proven empty.

Complexity Classification

Theorem (Complexity of LP). Linear programming over rationals (or reals) belongs to the complexity class P.

More precisely, the decision version of LP is P-complete under log-space reductions. The optimization version is in FP (function polynomial time), meaning we can find the optimal value in polynomial time.

This places LP squarely in the class of efficiently solvable problems, alongside matrix multiplication and shortest path computation. The existence of polynomial-time algorithms for LP is one of the landmark results in optimization and theoretical computer science.

Integer Linear Programming and NP-Completeness

The Jump from Rationals to Integers

As noted, the transition from rational to integer constraints dramatically increases computational difficulty. Let us examine why this happens:

Theorem (ILP is NP-Complete). The decision problem “Is there an integer solution to $Ax \leq b$?” is NP-complete.

The proof involves reducing from SAT (for concreteness, from 3-SAT). Given a CNF formula with variables $x_1, \ldots, x_n$ and clauses $C_1, \ldots, C_m$, construct an ILP as follows. For each Boolean variable $x_i$, create an integer variable $y_i$ with $0 \leq y_i \leq 1$. For each clause $C_j$, add the constraint:

$$\sum_{i:\, x_i \in C_j} y_i + \sum_{i:\, \neg x_i \in C_j} (1 - y_i) \geq 1$$

This constraint is satisfied if and only if at least one literal in the clause is true. The ILP is satisfiable if and only if the original formula is.
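The clause constraints can be sketched directly. In the snippet below (representing literals as signed integers is a convention chosen here for brevity), brute-force enumeration confirms that each $\geq 1$ constraint holds exactly when its clause is true:

```python
from itertools import product

# A literal is +i for x_i, -i for (not x_i); a clause is a list of literals.
cnf = [[1, 2], [-1, 3], [-2, -3]]   # (x1 v x2) & (~x1 v x3) & (~x2 v ~x3)
n = 3

def clause_lhs(clause, y):
    # Left-hand side of the ILP constraint for one clause:
    # y_i for positive literals, (1 - y_i) for negative ones.
    return sum(y[abs(l) - 1] if l > 0 else 1 - y[abs(l) - 1] for l in clause)

def clause_true(clause, y):
    # Direct Boolean evaluation of the clause.
    return any(y[l - 1] == 1 if l > 0 else y[-l - 1] == 0 for l in clause)

# The ILP constraint holds iff the clause is true, on every 0/1 assignment.
for y in product((0, 1), repeat=n):
    assert all((clause_lhs(c, y) >= 1) == clause_true(c, y) for c in cnf)
print("encoding agrees on all", 2 ** n, "assignments")
```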

Practical Implications

Despite theoretical intractability, modern ILP solvers handle many practical instances through:

  • Branch-and-bound: Solve LP relaxations, branch on variables taking fractional values in the relaxation
  • Cutting planes: Add valid inequalities that separate fractional solutions from the integer hull
  • Preprocessing: Simplify instances through probing, coefficient reduction, and variable fixing
  • Heuristics: Use greedy, local search, or evolutionary methods to find good solutions quickly

The empirical success of ILP solvers on practical problems is one of the triumphs of applied algorithmics, even though worst-case complexity remains exponential.
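A minimal branch-and-bound sketch illustrates the idea. It uses a 0/1 knapsack instance (values chosen for illustration), where the LP-relaxation bound has a closed form: fill greedily by value-to-weight ratio and allow one fractional item:

```python
from fractions import Fraction as F

# 0/1 knapsack instance (illustrative), items sorted by value/weight ratio.
values = [60, 100, 120]
weights = [10, 20, 30]
capacity = 50

def lp_bound(i, cap):
    """LP-relaxation bound from item i on: greedy fill, one fractional item."""
    total = F(0)
    for j in range(i, len(values)):
        if cap <= 0:
            break
        take = min(F(weights[j]), F(cap))
        total += take * F(values[j], weights[j])
        cap -= take
    return total

best = 0
def branch(i, cap, val):
    global best
    best = max(best, val)
    if i == len(values) or val + lp_bound(i, cap) <= best:
        return  # prune: the relaxation bound cannot beat the incumbent
    if weights[i] <= cap:
        branch(i + 1, cap - weights[i], val + values[i])  # take item i
    branch(i + 1, cap, val)                               # skip item i

branch(0, capacity, 0)
print(best)  # 220
```

Production solvers replace the closed-form bound with a general LP solve at each node and tighten it with cutting planes; the pruning logic is the same.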

Connections to SMT Solvers

SMT and Linear Arithmetic

SMT (Satisfiability Modulo Theories) generalizes SAT by combining Boolean reasoning with theories such as linear arithmetic, arrays, uninterpreted functions, and bit-vectors. The LRA (linear rational arithmetic) theory is one of the most important and well-developed SMT theories.

Modern SMT solvers like Z3 (Microsoft Research), cvc5, and Yices implement sophisticated decision procedures for LRA. The core insight is that we can combine the efficiency of SAT solvers (propagation, conflict analysis, learning) with specialized theory solvers for linear arithmetic.

Definition (SMT with LRA). An LRA formula is a Boolean combination of linear constraints over rational variables. The SMT(LRA) decision problem asks whether such a formula is satisfiable with the variables ranging over $\mathbb{Q}$ (equivalently over $\mathbb{R}$: a system of linear constraints with rational coefficients has a real solution if and only if it has a rational one).

Algorithms for SMT(LRA)

SMT solvers employ several key techniques:

Simplex-Based Solving. The core theory solver for LRA is typically a variant of the simplex algorithm adapted for incremental use. Constraints can be asserted and retracted as the SAT solver explores different truth assignments, and the theory solver must efficiently detect and explain conflicts.

Conflict Analysis. When the theory solver detects unsatisfiability (e.g., a set of constraints that together imply a contradiction), it generates a conflict clause that the SAT solver learns from. This integration of theory reasoning with Boolean learning is crucial for efficiency.

Model Construction. When a satisfying assignment is found, the theory solver provides actual rational values for all variables, not just a Boolean assignment. This is essential for applications in verification and synthesis.

Practical Applications

SMT solvers with LRA support are foundational in:

  • Software verification: Checking array bounds, numerical assertions, and loop invariants
  • Synthesis: Automatically generating programs satisfying specifications
  • Automated theorem proving: Proving mathematical theorems involving linear arithmetic
  • Optimization: Solving optimization problems with both discrete and continuous components

Summary and Outlook

The study of satisfiability in rationals and linear programming reveals a beautiful theoretical landscape. We have seen that:

  • Linear programming over rationals is in P; its polynomial-time solvability was first established by Khachiyan's ellipsoid method
  • Integer linear programming is NP-complete; the jump from continuous to discrete dramatically increases complexity
  • LP relaxations provide both bounds for ILP and the foundation for practical ILP solvers
  • SMT solvers combine the strengths of SAT solving and linear programming, enabling reasoning about complex systems

These results have profound implications. The tractability of rational satisfiability enables efficient decision procedures for verification, synthesis, and optimization. The intractability of integer problems explains why approximate methods, heuristics, and specialized algorithms remain essential in practice.

The frontier of research continues to advance. Sparse LP algorithms, second-order cone programming, and semi-definite programming extend the linear framework. Meanwhile, SMT solvers push the boundaries of what can be decided automatically, bringing theoretical results into practical tools that engineers and mathematicians use daily.
