main
Boris Glavic 2023-06-28 12:20:46 +02:00
parent 041e54a872
commit 7e96f6d6b3
2 changed files with 12 additions and 2 deletions

Binary file not shown.

View File

@ -19,7 +19,17 @@
%% values in them; it is your responsibility as an author to replace
%% the commands and values with those provided to you when you
%% complete the rights form.
\setcopyright{none}
\copyrightyear{2023}
\acmYear{2023}
\setcopyright{acmlicensed}\acmConference[HILDA '23]{Workshop on
Human-In-the-Loop Data Analytics}{June 18, 2023}{Seattle, WA, USA}
\acmBooktitle{Workshop on Human-In-the-Loop Data Analytics (HILDA '23),
June 18, 2023, Seattle, WA, USA}
\acmPrice{15.00}
\acmDOI{10.1145/3597465.3605220}
\acmISBN{979-8-4007-0216-7/23/06}
%\setcopyright{none}
% \copyrightyear{2018}
% \acmYear{2018}
% \acmDOI{XXXXXXX.XXXXXXX}
@ -125,7 +135,7 @@
% Spreadsheets provide a convenient, friendly direct manipulation interface to datasets.
Efforts to scale spreadsheets either follow a `virtual` strategy that imposes a spreadsheet interface over an existing database engine or a `materialized' strategy based on re-engineering the spreadsheet engine.
Because database engines are not optimized for spreadsheet access patterns, the materialized approach has better performance.
However, the virtual approach offers several advantages that can not be easily replicated in the materialized approach, including the ability to re-apply user interactions to an updated dataset.
However, the virtual approach offers several advantages that can not be easily replicated in the materialized approach, including the ability to re-apply user interactions to an updated dataset.
We propose a hybrid approach, where patterns of user updates are indexed (as in the materialized approach) and overlaid on an existing dataset (as in the virtual approach).
We introduce the overlay update model, and outline strategies for efficiently accessing an overlay spreadsheet.
A key feature of our approach is storing updates generated by bulk operations (e.g., copy/paste) as ``patterns" that can be leveraged to reduce execution costs.