
%root: main.tex
%!TEX root = ./main.tex
%\onecolumn
\subsection{Reduced Polynomials and Equivalences}

We now introduce some terminology for polynomials and develop a reduced form that yields a closed form for the polynomial's expectation over probability distributions derived from a \bi or \ti.
We will use $(X + Y)^2$ as a running example.

\begin{Definition}[Standard Monomial Basis]\label{def:smb}
A monomial is a product of variable terms, each raised to a non-negative integer power.
  A polynomial in \termSMB (\abbrSMB) has the form: $\sum_{i=1}^n c_i \cdot m_i$, where each $c_i \neq 0$ is an integer and each $m_i$ is a monomial and $m_i \neq m_j$ for $i \neq j$. The \abbrSMB of a polynomial $\poly$ is $\smbOf{\poly}$.
%  fully expanded out such that no product of sums exist and where each unique monomial appears exactly once.
\end{Definition}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

The \abbrSMB for the running example is $X^2 +2XY + Y^2$.  While $X^2 + XY + XY + Y^2$ is an expanded form of the expression, it is not the standard monomial basis since $XY$ appears more than once.

% \BG{Maybe inline degree?}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Definition}[Degree]\label{def:degree}
The degree of polynomial $\poly(\vct{X})$ is the maximum sum of exponents, over all monomials in $\smbOf{\poly(\vct{X})}$.
\end{Definition}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

The degree of the running example polynomial is $2$. 
Note that product terms can only arise as a consequence of join operations, so intuitively, the degree of a lineage polynomial is analogous to the largest number of joins in one clause of the UCQ query that created it.
In this paper we consider only finite degree polynomials.
%
% Throughout this paper, we also make the following \textit{assumption}.
%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% \begin{Assumption}\label{assump:poly-smb}
% All polynomials considered are in standard monomial basis, i.e., $\poly(\vct{X}) = \sum\limits_{\vct{d} \in \mathbb{N}^\numvar}q_d \cdot \prod\limits_{i = 1, d_i \geq 1}^{\numvar}X_i^{d_i}$, where $q_d$ is the coefficient for the monomial encoded in $\vct{d}$ and $d_i$ is the $i^{th}$ element of $\vct{d}$.
% \end{Assumption}
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%
We call a polynomial $\poly(\vct{X})$ a \emph{\bi-lineage polynomial} (resp., \emph{\ti-lineage polynomial}, or simply \emph{lineage polynomial}) if
%\AH{Why is it required for the tuple to be n-ary?  I think this slightly confuses me since we have n tuples.} 
% OK: agreed w/ AH, this can be treated as implicit
there exists a $\raPlus$ query $\query$, a \bi $\pxdb$ (resp., a \ti $\pxdb$, or an $\semNX$-PDB $\pxdb$), and a tuple $\tup$ such that $\poly(\vct{X}) = \query(\pxdb)(\tup)$. % Before proceeding, note that the following is assume that polynomials are  \bis (which subsume \tis as a special case).
As they are a special case of \bis, the following applies to \tis as well.
Recall that in a \bi $\pxdb$ with tuples $t_1, \ldots, t_n$, each input tuple $t_i$ is annotated with a unique variable $X_i$. 
Tuples of $\pxdb$ are partitioned into $\ell$ blocks $\block_1, \ldots, \block_\ell$ where tuple $t_i$ is associated with a probability $\prob_{\tup_i} = \pd[X_i = 1]$.
\footnote{
  Although it is customary to define a single independent, $[\abs{\block_i}+1]$-valued variable per block, we decompose it into $\abs{\block_i}$ correlated $\{0,1\}$-valued variables per block that can be used directly in polynomials (without an indicator function).  For $t_j \in b_i$, the event $(X_j = 1)$ corresponds to the event $(X_i = j)$ in the customary annotation scheme.
} 
Because blocks are independent and tuples from the same block are disjoint, $\prob$ and the blocks induce the probability distribution $\pd$ of $\pxdb$.
We will write a \bi-lineage polynomial $\poly(\vct{X})$ for a \bi with $\ell$ blocks as
$\poly(\vct{X}) = \poly(X_{\block_1, 1},\ldots, X_{\block_1, \abs{\block_1}},$ $\ldots, X_{\block_\ell, \abs{\block_\ell}})$, where $\abs{\block_i}$ denotes the size of $\block_i$, and $X_{\block_i, j}$ denotes the annotation of tuple $j$ residing in block $\block_i$ for $j$ in $[\abs{\block_i}]$.\footnote{Later on in the paper, especially in~\Cref{sec:algo}, we will overload notation and rename the variables as $X_1,\dots,X_n$, where $n=\sum_{i=1}^\ell \abs{b_i}$.}
%\SF{Where is $\block_{i, j}$ used? Is it $X_{\block_{1, 1}}$ or $X_{\block_1, 1}$ ?}
% and the probability distribution of $\pxdb$ is  uniquely determined based on a probability vector $\vct{p}$ that associates each tuple a probability
% variables are independent of each other (or disjoint if they are from the same block) and each variable $X$ is associated with a probability $\vct{p}(X) = \pd[X = 1]$. Thus, we are dealing with polynomials $\poly(\vct{X})$ that are annotations of a tuple in the result of a query $\query$ over a BIDB $\pxdb$ where $\vct{X}$ is the set of variables that occur in annotations of tuples of $\pxdb$.

% While the definition of polynomial $\poly(\vct{X})$ over a $\bi$ input doesn't change, we introduce an alternative notation which will come in handy.  Given $\ell$ blocks, we write $\poly(\vct{X})$ = $\poly(X_{\block_1, 1},\ldots, X_{\block_1, \abs{\block_1}},$ $\ldots, X_{\block_\ell, \abs{\block_\ell}})$, where $\abs{\block_i}$ denotes the size of $\block_i$, and $\block_{i, j}$ denotes tuple $j$ residing in block $i$ for $j$ in $[\abs{\block_i}]$.
% The number of tuples in the $\bi$ instance can be (trivially) computed as $\numvar = \sum\limits_{i = 1}^{\ell}\abs{\block_i}$ .


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Definition}[Modding with a set]\label{def:mod-set}
Let $S$ be a {\em set} of polynomials over $\vct{X}$. Then $\poly(\vct{X})\mod{S}$ is the polynomial obtained by taking the mod of $\poly(\vct{X})$ over {\em all} polynomials in $S$ (order does not matter).
\end{Definition}
For example, when $S_0=\inset{X^2-X, Y^2-Y}$, taking $2X^2 + 3XY - 2Y^2\mod S_0$ gives $2X+3XY-2Y$.
%
\begin{Definition}\label{def:mod-set-polys}
Given the set of BIDB variables $\inset{X_{b,i}}$, define

\setlength\parindent{0pt}
\vspace*{-3mm}
{\small
\begin{tabular}{@{}l l}
	\begin{minipage}[b]{0.45\linewidth}
		\centering
		\begin{equation*}
		\mathcal{B}=\comprehension{X_{b,i}\cdot X_{b,j}}{\text{ for every block } b \text{ and } i\ne j \in [~\abs{\block}~]},
		\end{equation*}
	\end{minipage}%
	\hspace{13mm}
	&
	\begin{minipage}[b]{0.45\linewidth}
		\centering
		\begin{equation*}
		\mathcal{T}=\comprehension{X_{b,i}^2-X_{b,i}}{\text{ for every block } b \text{ and } i \in [~\abs{\block}~]}
		\end{equation*}
	\end{minipage}
	\\
\end{tabular}
}
\end{Definition}
%
\begin{Definition}[Reduced \bi Polynomials]\label{def:reduced-bi-poly}
  Let $\poly(\vct{X})$ be a \bi-lineage polynomial.
  The reduced form $\rpoly(\vct{X})$ of $\poly(\vct{X})$ is:
\begin{equation*}
\rpoly(\vct{X}) = \smbOf{\poly(\vct{X})} \mod \inparen{\mathcal{T} \cup \mathcal{B}}%X_i^2 - X_i \mod X_{\block_s, t}X_{\block_s, u}
\end{equation*}
%for all $i$ in $[\numvar]$ and for all $s$ in $\ell$, such that for all $t, u$ in $[\abs{\block_s}]$, $t \neq u$.
\end{Definition}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%

Intuitively, in the reduced form, all exponents $e > 1$ are reduced to $e = 1$.  This is performed by $\text{mod } \mathcal T$.  To see why this is the case, consider the concrete example $7^2 \text{ mod } (7^2 - 7) = 49 \text{ mod } 42 = 7$, as desired.  To filter disallowed $\bi$ cross-terms, all monomials with multiple variables from the same block $\block$ are dropped by $\text{mod } \mathcal B$ (i.e., any monomial containing more than one tuple from a block has $0$ probability and can be ignored). 

For the special case of \tis, the second step is not necessary since every block contains a single tuple.
%Alternatively, one can think of $\rpoly$ as the \abbrSMB of $\poly(\vct{X})$ when the product operator is idempotent.
%
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% \begin{Definition}[$\rpoly(\vct{X})$] \label{def:qtilde}
% Define $\rpoly(X_1,\ldots, X_\numvar)$ as the reduced version of $\poly(X_1,\ldots, X_\numvar)$, of the form
% $\rpoly(X_1,\ldots, X_\numvar) = $

% \[\poly(X_1,\ldots, X_\numvar) \mod X_1^2-X_1\cdots\mod X_\numvar^2 - X_\numvar.\]
% \end{Definition}
% %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Example}\label{example:qtilde}
Consider $\poly(X, Y) = (X + Y)(X + Y)$ where $X$ and $Y$ are from different blocks.  The expanded derivation for $\rpoly(X, Y)$ is
\begin{align*}
(&X^2 + 2XY + Y^2 \mod X^2 - X) \mod Y^2 - Y\\
= ~&X + 2XY + Y^2 \mod Y^2 - Y\\
= ~& X + 2XY + Y
\end{align*}
\end{Example}
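
For contrast, the following sketch illustrates the role of $\mathcal{B}$. It assumes, purely for illustration, that $X_1$ and $X_2$ annotate two tuples of the \emph{same} block, and takes $\poly(X_1, X_2) = (X_1 + X_2)^2$. Reducing first by $\mathcal{T}$ and then by $\mathcal{B}$ gives
\begin{align*}
(&X_1^2 + 2X_1X_2 + X_2^2 \mod \mathcal{T}) \mod \mathcal{B}\\
= ~&X_1 + 2X_1X_2 + X_2 \mod \mathcal{B}\\
= ~&X_1 + X_2.
\end{align*}
The cross term $2X_1X_2$ vanishes because $X_1 \cdot X_2 \in \mathcal{B}$, matching the fact that two tuples of one block never occur together in a valid world.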
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%
% Intuitively, $\rpoly(\textbf{X})$ is the \abbrSMB form of $\poly(\textbf{X})$ such that if any $X_j$ term  has an exponent $e > 1$, it is reduced to $1$, i.e. $X_j^e\mapsto X_j$ for any $e > 1$.
%
%When considering $\bi$ input, it becomes necessary to redefine $\rpoly(\vct{X})$.
%
%\noindent The usefulness of this will reduction become clear in \Cref{lem:exp-poly-rpoly}.
%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Definition}[Valid Worlds]
For probability distribution $\probDist$ and its corresponding probability mass function $\probOf$, the set of valid worlds $\eta$ consists of all the worlds with probability value greater than $0$; i.e., for variable vector $\vct{W}$
\[
\eta = \{\vct{w}\suchthat \probOf[\vct{W} = \vct{w}] > 0\}
\]
\end{Definition}

%We state additional equivalences between $\poly(\vct{X})$ and $\rpoly(\vct{X})$ in~\Cref{app:subsec-pre-poly-rpoly} and~\Cref{app:subsec-prop-q-qtilde}.
Next, we show why the reduced form is useful for our purposes:
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%


%Define all variables $X_i$ in $\poly$ to be independent.

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Lemma}\label{lem:exp-poly-rpoly}
Let $\pxdb$ be a \bi over variables $\vct{X} = \{X_1, \ldots, X_\numvar\}$ and with probability distribution $\probDist$ produced by the tuple probability vector $\probAllTup = (\prob_1, \ldots, \prob_\numvar)$ over all $\vct{w}$ in $\eta$. For any \bi-lineage polynomial $\poly(\vct{X})$ based on $\pxdb$ and query $\query$ we have:
  % The expectation over possible worlds in $\poly(\vct{X})$ is equal to $\rpoly(\prob_1,\ldots, \prob_\numvar)$.
\begin{equation*}
\expct_{\vct{W}\sim \probDist}\pbox{\poly(\vct{W})}  = \rpoly(\probAllTup).
\end{equation*}
\end{Lemma}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

Note that in the preceding lemma, we have assigned $\probAllTup$ 
%(introduced in \Cref{subsec:def-data}) 
to the variables $\vct{X}$. Intuitively, \Cref{lem:exp-poly-rpoly} states that replacing each variable $X_i$ in the reduced form of a \bi-lineage polynomial with its probability $\prob_i$ and evaluating the resulting expression over $\mathbb{R}$ yields the expectation of the polynomial.
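
As a sanity check on the running example, assume (for illustration only) that $X$ and $Y$ belong to different blocks, and write $\prob_X$ and $\prob_Y$ for their probabilities and $W_X, W_Y$ for the corresponding $\{0,1\}$-valued random variables; these subscripted names are introduced here and are not part of the notation above. Since $W^2 = W$ for any $\{0,1\}$-valued $W$, and since variables from different blocks are independent, linearity of expectation gives
\begin{align*}
\expct\pbox{(W_X + W_Y)^2} &= \expct\pbox{W_X^2} + 2\,\expct\pbox{W_X W_Y} + \expct\pbox{W_Y^2}\\
&= \prob_X + 2\,\prob_X \prob_Y + \prob_Y,
\end{align*}
which is exactly the reduced form $X + 2XY + Y$ of \Cref{example:qtilde} evaluated at $(\prob_X, \prob_Y)$.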


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\begin{Corollary}\label{cor:expct-sop}
If $\poly$ is a \bi-lineage polynomial, then the expectation of $\poly$, i.e., $\expct\pbox{\poly} = \rpoly\left(\prob_1,\ldots, \prob_\numvar\right)$, can be computed in $O(\size\inparen{\smbOf{\poly}})$ time, where $\size\inparen{\poly}$ denotes the total number of multiplication/addition operators in $\poly$.
\end{Corollary}
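
One way to see why this bound is plausible is to apply linearity of expectation monomial by monomial. This is only a sketch, under the assumption that $\smbOf{\poly} = \sum_{i=1}^{n} c_i \cdot m_i$ is given explicitly as in \Cref{def:smb}:
\begin{align*}
\expct_{\vct{W}\sim \probDist}\pbox{\poly(\vct{W})} &= \sum_{i=1}^{n} c_i \cdot \expct_{\vct{W}\sim \probDist}\pbox{m_i(\vct{W})},\\
\expct_{\vct{W}\sim \probDist}\pbox{m_i(\vct{W})} &=
\begin{cases}
0 & \text{if } m_i \text{ contains two distinct variables from the same block,}\\
\prod_{X_j \text{ in } m_i} \prob_j & \text{otherwise.}
\end{cases}
\end{align*}
Each monomial therefore contributes a number of arithmetic operations proportional to its own size, and summing over all monomials stays linear in $\size\inparen{\smbOf{\poly}}$.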
%\AH{What if $\poly$ is not in \abbrSMB form?}


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%%% Local Variables:
%%% mode: latex
%%% TeX-master: "main"
%%% End: