Reviewer 1

The paper presents an overlay approach to scaling spreadsheets, where patterns of user updates are indexed and overlaid on an existing dataset. The presented ideas are interesting, and the results are quite promising.

Experiments: Materialized strategies are more efficient than virtual ones, but the experiments show that DataSpread is less efficient than Vizier. Please clarify these results and discuss them in more detail.

Reviewer 2

The Overlay Spreadsheets approach addresses the very important problem of scaling spreadsheet interfaces. It combines ideas from virtual and materialized spreadsheet approaches to improve execution time. The paper is well-motivated and relevant to the HILDA community, and I think it would lead to a lot of interesting discussions on the pros and cons of virtual vs materialized approaches.

Thoughts for improvement:

While the paper is well-organized, it should be checked for typos.

Prior to presenting the formal data model in section 2, it might help to present a motivating example containing a workflow/queries to show concrete scenarios highlighting the pros and cons of each approach, and showing how the overlay approach addresses these shortcomings. This would build intuition before formalization, and make it easier to follow.

Reviewer 3

This paper presents a technique to scale spreadsheets to large-scale data by using a hybrid approach that combines the performance benefits of the materialized approach with the ability of the virtual overlay approach to reapply user interactions to updated datasets. The paper describes the technique for this hybrid approach and demonstrates that the implementation reduces execution cost compared to a material

STRENGTH

A strong technique based on a strong observation that bulk updates in a spreadsheet rely on expression patterns. Contains a small benchmark section, covering 4 cases, which shows that the technique introduced is faster than the materialized baseline.

(MINOR) WEAKNESSES

The intro of the paper speaks as if works like Wrangler support general spreadsheet features, even though that work leverages declarative transforms to support certain operations specific to data cleaning tasks (but not all spreadsheet operations). Similarly, it's unclear from the paper whether the implementation in Vizier supports general free-form spreadsheets or only a "spreadsheet mode for data frames".