General number field sieve

In number theory, the general number field sieve (GNFS) is the most efficient classical algorithm known for factoring integers larger than $10100$ . Heuristically, its complexity for factoring an integer $n$ (consisting of $⌊log 2 n ⌋ + 1$ bits) is of the form

in O and L-notations.^[1] It is a generalization of the special number field sieve: while the latter can only factor numbers of a certain special form, the general number field sieve can factor any number apart from prime powers (which are trivial to factor by taking roots).

The principle of the number field sieve (both special and general) can be understood as an improvement to the simpler rational sieve or quadratic sieve. When using such algorithms to factor a large number $n$ , it is necessary to search for smooth numbers (i.e. numbers with small prime factors) of order $n 1/2$ . The size of these values is exponential in the size of $n$ (see below). The general number field sieve, on the other hand, manages to search for smooth numbers that are subexponential in the size of $n$ . Since these numbers are smaller, they are more likely to be smooth than the numbers inspected in previous algorithms. This is the key to the efficiency of the number field sieve. In order to achieve this speed-up, the number field sieve has to perform computations and factorizations in number fields. This results in many rather complicated aspects of the algorithm, as compared to the simpler rational sieve.

The size of the input to the algorithm is $log 2 n$ or the number of bits in the binary representation of $n$ . Any element of the order $n c$ for a constant $c$ is exponential in $log n$ . The running time of the number field sieve is super-polynomial but sub-exponential in the size of the input.

Improving polynomial choice[edit]

The choice of polynomial can dramatically affect the time to complete the remainder of the algorithm. The method of choosing polynomials based on the expansion of $n$ in base $m$ shown above is suboptimal in many practical situations, leading to the development of better methods.

One such method was suggested by Murphy and Brent;^[3] they introduce a two-part score for polynomials, based on the presence of roots modulo small primes and on the average value that the polynomial takes over the sieving area.

The best reported results^[4] were achieved by the method of Thorsten Kleinjung,^[5] which allows $g (x) = ax + b$ , and searches over $a$ composed of small prime factors congruent to 1 modulo 2 $d$ and over leading coefficients of $f$ which are divisible by 60.

NFS@Home

GGNFS

factor by gnfs

CADO-NFS

(which contains final-processing code, a polynomial selection optimized for smaller numbers and an implementation of the line sieve)

msieve

kmGNFS

Some implementations focus on a certain smaller class of numbers. These are known as special number field sieve techniques, such as used in the Cunningham project. A project called NFSNET ran from 2002^[6] through at least 2007. It used volunteer distributed computing on the Internet.^[7] Paul Leyland of the United Kingdom and Richard Wackerbarth of Texas were involved.^[8]

Until 2007, the gold-standard implementation was a suite of software developed and distributed by CWI in the Netherlands, which was available only under a relatively restrictive license. In 2007, Jason Papadopoulos developed a faster implementation of final processing as part of msieve, which is in the public domain. Both implementations feature the ability to be distributed among several nodes in a cluster with a sufficiently fast interconnect.

Polynomial selection is normally performed by GPL software written by Kleinjung, or by msieve, and lattice sieving by GPL software written by Franke and Kleinjung; these are distributed in GGNFS.

Special number field sieve

Matthew E. Briggs: An Introduction to the General Number Field Sieve, 1998