Processing math: 100%

10 June 2017

Cap Sets

Why it's difficult to avoid arithmetic progressions.

The following result was established in 2016:

Let A be a subset of Fn3 containing no three-term arithmetic progression. Then, |A|=o(2.8n).

This has been described as a big breakthrough. Yet, the proof is simple enough to fit in 2 pages.

The elements of Fn3 are vectors with n components, each component being an integer modulo 3. Vectors a1,a2,a3 are said to form a three-term arithmetic progression when a2a1=a3a2. Because we are working modulo 3, this is equivalent to a1+a2+a3=0. Thus, we can rephrase the result as follows.

Let A be a subset of Fn3 such that a1+a2+a30 for all distinct a1,a2,a3A. Then, |A|=o(2.8n).

If we were to not require a1,a2,a3 to be distinct, then the hypothesis would essentially be just ‘false’, because any nonempty set contains a trivial three-term arithmetic progression if we are allowed to pick the same element three times. An equivalent formulation is to say ‘not all equal’ instead of ‘distinct’, because a+a+b0 if ab.

Let A be a subset of Fn3 such that a1+a2+a30 for a1,a2,a3A not all equal. Then, |A|=o(2.8n).

The overall idea is to find lower and upper bounds for the dimension of some vector space of polynomials. We consider the set Mdn of monomials over n variables that have total degree d and all powers from {0,1,2}. The restriction on powers is not serious because x3=x by Fermat's little theorem. (One may think that x2=1, but that is true only for nonzero x.) Let us write Sdn for the vector space of polynomials written with monomials from Mdn. We have that dimSdn=|Mdn|; let us denote this quantity by md. Observe that m=3n; indeed, if we allow any total degree, then polynomials can represent any function Fn3F3, by combining indicator polynomials Ia(x)def=nk=1(1(xkak)2).

We define two subsets of Fn3: Xdef={a1+a2a1,a2A distinct}Ydef={a3a3A} These sets are disjoint, if A satisfies our hypothesis. We will consider the subspace V of the polynomials that vanish outside Y. We will derive a lower bound on dimV, by simply counting points outside Y; and we will derive an upper bound on dimV, using a lemma which says, roughly, that ‘any polynomial that is zero on all of X is zero on much of Y’. The reasoning will work for an arbitrary d; when comparing the upper with the lower bound, we shall pick for d a convenient value.

The bounds we aim to prove are the following: md3n+|A|dimV2md/2

Let us first see how we use these bounds, and we will soon return to how to prove them. By the correspondence xkx2k, we see that monomials of total degree d are in a one-to-one correspondence with monomials of total degree 2nd. Thus, md equals 3nm2nd1. It follows that |A|2md/2+m2nd1. If we pick d such that d/2=2nd1, we get |A|3m(2n1)/33m2n/3. Finally, one can show that m2n/3=o(2.8n).

Why is m2n/3=o(2.8n)? [toggle answer]

We can use a Chernoff bound. Several useful inequalities involving probabilities have the form Pr(XS)Ef(X) where X is a random variable, and f is a function such that f(x)[xS]. The notation [ϕ] stands for 1 when ϕ holds, and 0 otherwise. It is very easy to prove this: Pr(XS)=x[xS]Pr(X=x)xf(x)Pr(X=x)=Ef(X) We pick Sdef={xx0} and f(x)def=eαx. This tells us that Pr(X0)EeαX for any α0. Enough background – let us move back to m2n/3.

The number m2n/3 says in how many ways we can choose n numbers from the set {0,1,2} such that their sum is 2n/3. In other words, m2n/3=3nPr(Y1++Yn2n/3), where Y1,,Yn are i.i.d. random variables taking values (uniformly) in {0,1,2}. If we define Xkdef=Yk2/3 and we use the inequality from above, we can calculate Pr(ni=1Yi2n3)=Pr(ni=1Xi0)E(eαni=1Xi)=E(ni=1eαXi)=ni=1EeαXi=(e43α+e13α+e23α3)n Now we optimize for α0, and we get that m2n/3<2.75510462n.

Now let us get back to proving the bounds on dimV.

The lower bound is easy. The definition of V is {PSdnP(a)=0 for aFn3Y}. The dimension of Sdn is md, and each of the |Fn3Y| constraints reduces the dimension by at most 1. Done.

Why does adding a constraint P(a)=0 reduce the dimension by at most 1? [toggle answer]

Consider a basis P1,,Pk. Without loss of generality, we can assume that Pi(a){0,1} for all i. We partition P1,,Pk based on whether Pi(a) is 0 or 1: for Q1,,Qk1, we have Qi(a)=0 for all i; for R1,,Rk2, we have Ri(a)=1 for all i; and k=k1+k2. If k2=0, then the dimension is not reduced at all, as witnessed by the initial basis P1,,Pk. If k2>0, then the dimension is reduced by 1, as witnessed by the basis formed by Q1,,Qk1 together with R2R1,R3R1,,Rk2R1. [This is an instance of the rank-nullity theorem.]

The upper bound isn't quite so easy. My understanding is that the upper bound is the tear that led to the breakthrough. We'll do it in two steps, corresponding to these inequalities: dimV|Σ|2md/2 where Σ is a maximal support of a polynomial in V.

For the first inequality, we will show the contrapositive: if a polynomial PV has support Σ with |Σ|<dimV, then Σ is not maximal. We do this by finding another polynomial QV whose support is nonempty and disjoint from Σ. The support of P+Q will be a strict superset of Σ.

Why does Q exist? [toggle answer]

We reuse the argument from the previous gray box. We start with space V, and each additional constraint Q(a)=0 reduces the dimension by at most 1. Thus, the space {QVQ(a)=0 for aΣ} has positive dimension.

For the second inequality, we show that any PV has a support of size 2md/2. I'll start with an example. Take P(x)=x1x2x3+x21. This is an element of S33 because we use 3 variables and the total degree of each monomial is 3. The polynomial P(x+y) will have monomials that use both x-variables and y-variables, but still have total degree 3: P(x+y)={x1x2x3+x2x3y1+x1x3y2+x3y1y2+x1x2y3+x2y1y3+x1y2y3+y1y2y3+x21+2x1y1+y21

For each monomial, in addition to the total degree (the sum of all powers), we can define an x-degree (the sum of all powers on x-variables), and a y-degree (the sum of all powers on y-variables). Since the total degree is 3, it follows that the x-degree is 3/2 or the y-degree is 3/2 (or both). Let's put on the first line all those monomials with x-degree 3/2, and on the second line the rest: P(x+y)={x3y1y2+x2y1y3+x1y2y3+y1y2y3+2x1y1+y21+x1x2x3+x2x3y1+x1x3y2+x1x2y3+x21

Now we group monomials on the first line by x, and we group monomials on the second line by y. P(x+y)={x3y1y2+x2y1y3+x1(y2y3+2y1)+(y1y2y3+y21)+(x1x2x3+x21)+x2x3y1+x1x3y2+x1x2y3

Introducing some new notation, we can write the above as P(x+y)={x3Fx3(y)+x2Fx2(y)+x1Fx1(y)+F1(y)+G1(x)+y1Gy1(x)+y2Gy2(x)+y3Gy3(x)

Finally, we instantiate this equality for all (x,y)A2. For an example, let's say A={100,020}, where 100 is a compact notation for the point (1,0,0)F33. Then, [P(100+100)P(100+020)P(020+100)P(020+020)]=[00]x3[Fx3(100)Fx3(020)]+[02]x2[Fx2(100)Fx2(020)]+

The matrix corresponding to the term x3Fx3(y) was factored into a column vector corresponding to x3, and a row vector corresponding to Fx3(y).

Of course, we can do the same manipulations for any polynomial PSdn, obtaining P(x+y)=mMd/2nm(x)Fm(y)+mMd/2nm(y)Gm(x)

and we can instantiate this equation for all (x,y)A2 to obtain a matrix identity. If PV, which implies that P vanishes on X, then its matrix is diagonal. On the other hand, the matrix of each term m(x)Fm(y) has rank 1, and similarly the matrix of each term m(y)Gm(x). Because rank is subadditive, the matrix of a PV has rank 2md/2, which means that it has 2md/2 nonzero elements on the diagonal. In other words, P(a)=P(a+a)0 for 2md/2 points aA.

This concludes the proof.

This post follows [Ellenberg, Gijswijt, On large subsets of Fnq with no three-term arithmetic progression], which presents a slightly more general result, for Fnq rather than Fn3.

[Stefan Kiefer provided feedback on several drafts of this post. Thanks!]

No comments:

Post a Comment

Note: (1) You need to have third-party cookies enabled in order to comment on Blogger. (2) Better to copy your comment before hitting publish/preview. Blogger sometimes eats comments on the first try, but the second works. Crazy Blogger.