
%root: main.tex
%!TEX root = ./main.tex
%\onecolumn
\subsection{Reduced Polynomials and Equivalences}

We now introduce some terminology 
and develop a reduced form of lineage polynomials for a \abbrBIDB or \abbrTIDB.
Formally, a polynomial over $\vct{X}=(X_1,\dots,X_n)$ with individual degree $B <\infty$ 
has the following form (with coefficients $c_{\vct{d}}\in \semN$): 
\begin{equation}
  \label{eq:sop-form}
\poly\inparen{X_1,\dots,X_n}=\sum_{\vct{d}\in\{0,\ldots,B\}^n} c_{\vct{d}}\cdot \prod_{i=1}^n X_i^{d_i}.
\end{equation}
%where $c_{\vct{d}}\in \semN$.

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Definition}[Standard Monomial Basis]\label{def:smb}
The term $\prod_{i=1}^n X_i^{d_i}$ in \Cref{eq:sop-form} is a {\em monomial}. A polynomial $\poly\inparen{\vct{X}}$ is in standard monomial basis (\abbrSMB) when we keep only the terms with $c_{\vct{d}}\ne 0$ from \Cref{eq:sop-form}.
\end{Definition}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Unless otherwise noted, we consider all polynomials to be in \abbrSMB representation. 
When it is unclear, we use $\smbOf{\poly}$ to denote the \abbrSMB form of a polynomial $\poly$.
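For instance, assuming the usual expansion of products over sums, the polynomial $(X+Y)(X+Y)$ is not written in \abbrSMB, whereas its \abbrSMB form collects like monomials under a single coefficient:
\begin{equation*}
\smbOf{(X+Y)(X+Y)} = X^2 + 2XY + Y^2.
\end{equation*}
In terms of \Cref{eq:sop-form}, this illustrative case has $n = 2$, $B = 2$, and nonzero coefficients $c_{(2,0)} = 1$, $c_{(1,1)} = 2$, and $c_{(0,2)} = 1$.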

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Definition}[Degree]\label{def:degree}
The degree of polynomial $\poly(\vct{X})$ is the largest $\sum_{i=1}^n d_i$ such that $c_{(d_1,\dots,d_n)}\ne 0$. % maximum sum of exponents, over all monomials in $\smbOf{\poly(\vct{X})}$.
\end{Definition}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
As an example, the degree of the polynomial $X^2+2XY^2+Y^2$ is $3$.
Product terms in lineage arise only from join operations (\Cref{fig:nxDBSemantics}), so intuitively, the degree of a lineage polynomial is analogous to the largest number of joins needed to produce a result tuple.
%in any clause of the $\raPlus$ query that created it.
We call a polynomial $\poly\inparen{\vct{X}}$ a \emph{\bi-lineage polynomial} (resp., \emph{\ti-lineage polynomial}, or simply lineage polynomial), if there exists a $\raPlus$ query $\query$, \bi (\ti) $\pdb$, and result tuple $\tup$ such that $\poly\inparen{\vct{X}} = \apolyqdt\inparen{\vct{X}}.$
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

\begin{Definition}[Reduced \bi Polynomials]\label{def:reduced-bi-poly}
  Let $\poly(\vct{X})$ be a \bi-lineage polynomial.
  The reduced form $\rpoly(\vct{X})$ of $\poly(\vct{X})$ is the same as \Cref{def:reduced-poly} with the added constraint that all monomials with variables $X_{\block, i}, X_{\block, j}, i\neq j$ from the same block $\block$ are omitted.
\end{Definition}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%

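Consider a \abbrBIDB polynomial $\poly\inparen{\vct{X}} = X_{1, 1}X_{1, 2} + X_{1, 2}X_{2, 1}^2$.  The monomial $X_{1, 1}X_{1, 2}$ contains two distinct variables from block $1$ and is therefore omitted, while the exponent of $X_{2, 1}$ in the remaining monomial is reduced to $1$.  Hence, by \Cref{def:reduced-bi-poly}, we have that $\rpoly\inparen{\vct{X}} = X_{1, 2}X_{2, 1}$.  Next, we show why the reduced form is useful for our purposes.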
%%Removing this example to save space
\iffalse
\begin{Example}\label{example:qtilde}
Consider $\poly(X, Y) = (X + Y)(X + Y)$ where $X$ and $Y$ are from different blocks.  The expanded derivation for $\rpoly(X, Y)$ is
\begin{align*}
(&X^2 + 2XY + Y^2 \mod X^2 - X) \mod Y^2 - Y\\
= ~&X + 2XY + Y^2 \mod Y^2 - Y\\
= ~& X + 2XY + Y
\end{align*}
\end{Example}
\fi
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Lemma}\label{lem:exp-poly-rpoly}
Let $\pdb$ be a \abbrBIDB over $\numvar$ input tuples such that the probability distribution $\pdassign$ over $\{0,1\}^\numvar$ (the set of all worlds) is induced by the probability vector $\probAllTup = (\prob_1, \ldots, \prob_\numvar)$.  As in \Cref{lem:tidb-reduce-poly} for \abbrTIDB, for any \abbrBIDB-lineage polynomial $\poly(\vct{X})$ based on $\pdb$ and query $\query$, we have:
  % The expectation over possible worlds in $\poly(\vct{X})$ is equal to $\rpoly(\prob_1,\ldots, \prob_\numvar)$.
\begin{equation*}
\expct_{\vct{W}\sim \pdassign}\pbox{\poly(\vct{W})}  = \rpoly(\probAllTup).
\end{equation*}
\end{Lemma}
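As an illustrative check of \Cref{lem:exp-poly-rpoly} (a sketch, writing $\prob_{\block, i}$ for the marginal probability of the tuple annotated by $X_{\block, i}$, a notation assumed here only for this example), take $\poly\inparen{\vct{X}} = X_{1, 1}X_{1, 2} + X_{1, 2}X_{2, 1}^2$.
Every $W_{\block, i}$ is $0/1$-valued, so $W_{2, 1}^2 = W_{2, 1}$; tuples in the same block are disjoint, so $W_{1, 1}W_{1, 2} = 0$ in every world; and tuples from different blocks are independent.
Therefore
\begin{align*}
\expct_{\vct{W}\sim \pdassign}\pbox{\poly(\vct{W})}
&= \expct\pbox{W_{1, 1}W_{1, 2}} + \expct\pbox{W_{1, 2}W_{2, 1}^2}\\
&= 0 + \expct\pbox{W_{1, 2}}\cdot\expct\pbox{W_{2, 1}}\\
&= \prob_{1, 2}\cdot\prob_{2, 1},
\end{align*}
which agrees with evaluating $\rpoly\inparen{\vct{X}} = X_{1, 2}X_{2, 1}$ at these probabilities.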
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
Let $\abs{\poly}$ be the number of operators in $\poly$.

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Corollary}\label{cor:expct-sop}
If $\poly$ is a \bi-lineage polynomial already in \abbrSMB, then the expectation of $\poly$, i.e., $\expct\pbox{\poly} = \rpoly\left(\prob_1,\ldots, \prob_\numvar\right)$, can be computed in $\bigO{\abs{\poly}}$ time.
\end{Corollary}
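
One way to see why a single pass over $\poly$ can suffice is sketched below. The sketch assumes that each monomial is stored explicitly with its coefficient, and writes $\prob_{\block, i}$ for the probability attached to $X_{\block, i}$ (a notation assumed here only for illustration). Each monomial is either dropped, if it contains two distinct variables from the same block, or contributes its coefficient times the product of the probabilities of its distinct variables.
For example, for the hypothetical polynomial $\poly\inparen{\vct{X}} = 2X_{1, 1}^2X_{2, 1} + 3X_{1, 1}X_{1, 2}$, the second monomial is dropped and
\begin{equation*}
\expct\pbox{\poly} = 2\cdot\prob_{1, 1}\cdot\prob_{2, 1}.
\end{equation*}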


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%%% Local Variables:
%%% mode: latex
%%% TeX-master: "main"
%%% End: