Toolbox

C1: The Augmented Indexing Problem

Reminder: This post contains 2106 words · 7 min read · by Xianbin

The augmented indexing problem is an invariant of the indexing problem, which has applications of lower bounds. This post is base on [1].

Definition

Alice has a string SS while Bob has an index jj, S<iS_{<i} and a check bit c{0,1}c\in \{0,1\}. The goal is to let Bob output SjS_j​. Namely, Bob needs to know the jj-th (index starts from 1) elements in SS without giving Alice the index (otherwise, it is trivial.)

Comparing to the indexing problem, now, Bob has a set of elements before the jj-th elements, i.e., S<jS_{<j}.

We use AInd to denote the augmented indexing problem. The goal is to output AInd(S,j,c):=xjc\text{AInd}(S,j, c):=x_j\oplus c.

Tools

Information Cost

Here, we need to consider how much information Bob learns from Alice and Alice learns from Bob. It is called internal information cost.

IC(Π)=I(Π:AB)+I(Π:BA)IC(\Pi) = I(\Pi: A \mid B) + I(\Pi:B\mid A)

On the distribution μ{0,1}n×[n]×{0,1}\mu \in \{0, 1\}^n \times [n] \times \{0, 1\}, we use ICA(Π)IC^A(\Pi) to denote the cost of Alice, i.e., I(Π:SS<J,J,C,R)I(\Pi: S \mid S_{<J}, J, C, R) and use ICB(Π)IC^B(\Pi) to denote the cost of Bob, i.e., I(Π:K,CS,R)I(\Pi:K, C\mid S, R) where RR is the public coin.

Average Argument

Lemma 1\textbf{Lemma 1}. Consider a function family f1,,fL:DR+f_1, \ldots, f_L: D\to R^+, and parameter c1,,cLc_1,\ldots,c_L,where L>0L>0 is an integer and DD is a finite domain. Let ZZ be a random variable over DD. Then, we have the following: i[L],E[fi(Z)]ci\forall i\in [L], \mathbb{E}[f_i(Z)] \leq c_i leads to there exists zDz\in D such that

i[L],fi(Z)Lci\forall i\in[L], f_i (Z) \leq L c_i

What does this lemma mean? It means that if a set of functions in a domain have a small average value, then there exists an element in the domain where all functions have a small value.

It is not hard to prove this tiny lemma. Let g(z):=fi(z)/big(z):=\sum f_i(z)/b_i, and we know that E[g(z)]L\mathbb{E}[g(z)]\leq L. Then, there must be a zz such that g(z)Lg(z) \leq L.

Quite simple but useful. We will see why we will need this lemma soon.

Theorem 1

Theorem 1\textbf{Theorem 1}. If Π\Pi is a randomized protocol for Aind\text{Aind} with two-sided error at most 1/log2n1/\log^2 n, either ICA(Π)=Ω(n)IC^A(\Pi) = \Omega(n), or ICB(Π)=Ω(1)IC^B(\Pi) = \Omega(1).

It means that either Alice reveals Ω(n)\Omega(n) bits of her input, or Bob reveals Ω(1)\Omega(1) bits of his input.

To be continued…

Reference

[1]. Chakrabarti, A., Cormode, G., Kondapally, R. and McGregor, A., 2013. Information cost tradeoffs for augmented index and streaming language recognition. SIAM Journal on Computing, 42(1), pp.61-83.