We begin the analysis by showing that with high probability an estimate is approximately $\numWorldsP$, where $p$ is the probability measure for a given TIPD. Note that
\numWorldsP = \numWorldsSum\label{eq:mu}.
The first step is to show that the expectation of the estimate of a tuple $t$'s membership across all worlds is $\numWorldsSum$.
=&\expect{\sum_{\substack{j \in [B],\\
\wVec \in \pw~|~ \sketchHash{i}[\wVec] = j,\\
\wVec[w']\in \pw~|~ \sketchHash{i}[\wVec[w']] = j} } v_t[\wVec] \cdot s_i[\wVec] \cdot s_i[\wVec[w']]}\\
=&\multLineExpect\big[\sum_{\substack{j \in [B],\\
\wVec~|~\sketchHashParam{\wVec}= j,\\
\wVecPrime~|~\sketchHashParam{\wVecPrime} = j,\\
\wVec = \wVecPrime}} \kMapParam{\wVec} \cdot \sketchPolarParam{\wVec} \cdot \sketchPolarParam{\wVecPrime} + \nonumber \\
&\phantom{{}\kMapParam{\wVec}}\sum_{\substack{j \in [B], \\
\wVec~|~\sketchHashParam{\wVec} = j,\\
\wVecPrime ~|~ \sketchHashParam{\wVecPrime} = j,\\ \wVec \neq \wVecPrime}} \kMapParam{\wVec} \cdot \sketchPolarParam{\wVec} \cdot\sketchPolarParam{\wVecPrime}\big]\textit{(by linearity of expectation)}\\
=&\expect{\sum_{\substack{j \in [B],\\
\wVec~|~\sketchHashParam{\wVec}= j,\\
\wVecPrime~|~\sketchHashParam{\wVecPrime} = j,\\
\wVec = \wVecPrime}} \kMapParam{\wVec} \cdot \sketchPolarParam{\wVec} \cdot \sketchPolarParam{\wVecPrime}} \nonumber \\
&\phantom{{}\big[}\textit{(by uniform distribution in the second summation)}\\
=& \estExp \label{eq:estExpect}
For the next step, we show that the variance of an estimate is small.$$\varParam{\estimate}$$
&= \expect{\big(\estTwo\big)^2}\\
\wVec_1, \wVec_2,\\
\wVecPrime_1, \wVecPrime_2 \in \pw,\\
\sketchHashParam{\wVec_1} = \sketchHashParam{\wVecPrime_1},\\
\sketchHashParam{\wVec_2} = \sketchHashParam{\wVecPrime_2}
}}\kMapParam{\wVec_1} \cdot \kMapParam{\wVec_2}\cdot\sketchPolarParam{\wVec_1}\cdot\sketchPolarParam{\wVec_2}\cdot\sketchPolarParam{\wVecPrime_1}\cdot\sketchPolarParam{\wVecPrime_2} }\label{eq:var-sum-w}
Note that four-wise independence is assumed across all four random variables of \eqref{eq:var-sum-w}. Zooming in on the inner products of the $\sketchPolar$ functions,
\polarProdEq \label{eq:polar-product}
it can be seen that for $\wOne, \wOneP \in \pw$ and $\wTwo, \wTwoP \in \pw'$, all four random variables in \eqref{eq:polar-product} take their values from $\pw$, even though the summation ranges over two separate copies of $\pw$. Thus, there are five possible patterns of $\wVec$ variable combinations, namely:
&\distPattern{2}:&\forElems{\cTwo}& \textit{*} \\
&\distPattern{3}:&\forElems{\cThree}& \textit{*} \\
&\distPattern{4}:&\forElems{\cFour}& \textit{*}\\
$$\text{ }^*\textit{(and all variants of the respective pattern)}$$
Only the cases whose expectation is nonzero are of interest, since a case with expectation zero contributes nothing to the summation of \eqref{eq:var-sum-w}. In expectation we have that
\forAllW{\distPattern{1}}&\rightarrow\expect{%\sum_{\substack{\elems \\
%\st \cOne}}
\polarProdEq} = 1 \label{eq:polar-prod-all}\\
\forAllW{\distPattern{2}}&\rightarrow\expect{%\sum_{\substack{\elems \\
%\st \cTwo}}
\polarProdEq} = 1 \label{eq:polar-prod-two-and-two}\\
\forAllW{\distPattern{3}}&\rightarrow\expect{%\sum_{\substack{\elems \\
%\st \cThree}}
\polarProdEq} = 0 \nonumber \\
\forAllW{\distPattern{4}}&\rightarrow\expect{%\sum_{\substack{\elems \\
%\st \cFour}}
\polarProdEq} = 0 \nonumber \\
\forAllW{\distPattern{5}}&\rightarrow\expect{%\sum_{\substack{\elems \\
%\st \cFive}}
\polarProdEq} = 0 \nonumber
Only equations \eqref{eq:polar-prod-all} and \eqref{eq:polar-prod-two-and-two} influence the $\var$ computation.
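To illustrate why only patterns with paired arguments survive, the following sketch works through two representative cases. It assumes, as is common for sketches of this kind but not stated explicitly above, that each $\sketchPolarParam{\wVec}$ takes the values $+1$ and $-1$ with equal probability. The symbols $a$ and $b$ are placeholders for two distinct worlds and are introduced here only for illustration.
\begin{align*}
\expect{{\sketchPolarParam{a}}^{2} \cdot {\sketchPolarParam{b}}^{2}} &= \expect{1 \cdot 1} = 1, \\
\expect{{\sketchPolarParam{a}}^{3} \cdot {\sketchPolarParam{b}}} &= \expect{\sketchPolarParam{a}} \cdot \expect{\sketchPolarParam{b}} = 0 \cdot 0 = 0.
\end{align*}
The second line uses ${\sketchPolarParam{a}}^{2} = 1$ together with independence. Under the stated assumption, any pattern in which some world appears an odd number of times has expectation zero.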
Considering $\distPattern{1}$, the variance results in
For the distribution pattern $\cTwo$, we have three variants to consider.
&\vCase{1}:&\cTwo \\
When considered separately, the variants have the following $\var$.
\cTwo&= \variantOne \label{eq:variantOne}\\
\cTwoV{\wOne}{\wTwo}{\wOneP}{\wTwoP}&=\variantTwo \label{eq:variantTwo}\\
\big(\estExp\big)^2 = \distPatOne + \variantOne
With only \eqref{eq:variantTwo} and \eqref{eq:variantThree} remaining, we have
\varParam{\estimate} = \\
\variantTwo ~+ \\
Converting terms into their space requirements yields
&\variantTwo \Rightarrow\numWorldsP \cdot \big(\frac{\numWorlds}{\sketchCols} - 1\big)\label{eq:spaceOne}\\
&\variantThree \Rightarrow \numWorldsP \cdot \frac{\numWorldsP - 1}{\sketchCols}\label{eq:spaceTwo}
Summed, \eqref{eq:spaceOne} and \eqref{eq:spaceTwo} further reduce to
\frac{2^{2N}(\prob + \prob^2)}{\sketchCols} - \numWorlds(\frac{\prob}{\sketchCols} + \prob)\label{eq:variance}
Since $\prob^2 \leq \prob$ and the subtracted term in \eqref{eq:variance} is nonnegative, we then have
\varSym &< 2^{2N}\big(\frac{2\prob}{\sketchCols}\big) \\
\sd &<\sdEq\\
\sdRel& < \sqrt{\frac{2}{\sketchCols\prob}}.
Recall that $\sdRel = \frac{\sd}{\mu}$ where $\mu$ is defined as $\numWorldsP$ in \eqref{eq:mu}.
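As a quick numeric illustration of the relative bound, take the hypothetical values $\sketchCols = 1000$ and $\prob = 0.1$, chosen only for this example:
\[
\sdRel < \sqrt{\frac{2}{1000 \cdot 0.1}} = \sqrt{0.02} \approx 0.141 .
\]
Under these assumed values, the standard deviation of a single trial is below roughly $14\%$ of $\mu$. Because the bound scales with $1/\sqrt{\sketchCols}$, quadrupling $\sketchCols$ halves it.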
Since the sketch consists of multiple trials, it suffices that a single trial exceeds the error bound $\errB$ with probability smaller than one half: taking the median of all trials then yields, with high probability, an estimate within the error bound. Expressing the error relative to $\mu$ in Chebyshev's Inequality yields
Substituting $\mu\epsilon$ for $k\sd$ and solving for $\sketchCols$ results in
&k\cdot\sdEq = \mu\epsilon\\
&k = \frac{\mu\epsilon}{\sdEq}\\
&k = \frac{\mu\epsilon\sqrt{\sketchCols}}{\numWorlds \sqrt{2\prob}}\\
&k^2 = \frac{\mu^2\epsilon^2\sketchCols}{2^{2N}\cdot2\prob} = \frac{\prob\errB^2\sketchCols}{2}\\
&\chebyK\Rightarrow \sketchCols = \frac{6}{\epsilon^2\prob}
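As a sanity check on the final expression, consider the hypothetical values $\errB = 0.1$ and $\prob = 0.5$, assumed here purely for illustration. Substituting them into the expressions for $\sketchCols$ and $k^2$ derived above gives
\begin{align*}
\sketchCols &= \frac{6}{(0.1)^2 \cdot 0.5} = 1200, \\
k^2 &= \frac{0.5 \cdot (0.1)^2 \cdot 1200}{2} = 3,
\end{align*}
so Chebyshev's Inequality bounds the probability that a single trial deviates from $\mu$ by at least $\mu\errB$ by $\frac{1}{k^2} = \frac{1}{3}$, which is below the one-half threshold required by the median argument.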