Fields may be mistyped (typo, missing comma)
Comment text can be inlined into the file
-
+
Merge Data From Two Sources
@@ -164,10 +164,10 @@
Format alignment (GIS coordinates, $ vs €)
Precision alignment (State vs County)
-
+
JSON Shredding
@@ -178,10 +178,10 @@
Type alignment (Records with 'address' as an array)
Schema matching$^2$
-
+
@@ -368,6 +368,11 @@ Sampling (x10), 300, 242.5666234549135, 300, 119.61607021316885, 162.00108394436
$K$-Semirings
+
+ Provenance Semirings
+ T.J. Green & G. Karvounarakis & V. Tannen(PODS 2007)
+
+
@@ -442,11 +447,6 @@ Sampling (x10), 300, 242.5666234549135, 300, 119.61607021316885, 162.00108394436
-
- Provenance Semirings
- T.J. Green & G. Karvounarakis & V. Tannen(PODS 2007)
-
-
$$\left<\;\mathcal K,\;\oplus,\;\otimes,\;\mathbb 0,\;\mathbb 1\;\right>$$
@@ -513,14 +513,14 @@ Sampling (x10), 300, 242.5666234549135, 300, 119.61607021316885, 162.00108394436
- Extractors
+ Information Extractors
$$\mathcal K^W \rightarrow \mathcal K$$
(plug in any $K$-Semiring-compatible $\mathcal K$)
- Possible World Value
- - $\texttt{PW_i}(\vec k) \equiv \vec k_i$
+ - $\texttt{PW}_i(\vec k) \equiv \vec k_i$
- Certain Value
- $\mathcal C(\vec k) \equiv min(\vec k)$
- Possible Value
@@ -663,10 +663,10 @@ Sampling (x10), 300, 242.5666234549135, 300, 119.61607021316885, 162.00108394436
We can Approximate
- Soundness
- - $Q(\mathcal C(\mathcal D)) \geq \mathcal C(Q(\mathcal D))$
+ - $Q(\mathcal C(\mathcal D)) \leq \mathcal C(Q(\mathcal D))$
- We can efficiently compute a conservative approximation of $\mathcal C$.
-
- Completeness
+ - (Conditional) Completeness
- $Q(\mathcal C(\mathcal D)) = \mathcal C(Q(\mathcal D))$ ...if $Q$ is safe
@@ -680,7 +680,7 @@ Sampling (x10), 300, 242.5666234549135, 300, 119.61607021316885, 162.00108394436
- So then we implemented it...
+ We implemented it...
|