Website/slides/talks/2017-3-ICDE-SmallData/Questions.rtf
2017-04-24 21:50:24 -04:00

195 lines
15 KiB
Plaintext

{\rtf1\ansi\ansicpg1252\cocoartf1504\cocoasubrtf820
{\fonttbl\f0\fswiss\fcharset0 Helvetica;}
{\colortbl;\red255\green255\blue255;\red0\green0\blue0;}
{\*\expandedcolortbl;;\cssrgb\c0\c0\c0;}
{\*\listtable{\list\listtemplateid1\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid1\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid1}
{\list\listtemplateid2\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid101\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid2}
{\list\listtemplateid3\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid201\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid202\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid3}
{\list\listtemplateid4\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid301\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid302\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid4}
{\list\listtemplateid5\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid401\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid402\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid5}
{\list\listtemplateid6\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid501\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid502\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid6}
{\list\listtemplateid7\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid601\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid602\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid7}
{\list\listtemplateid8\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid701\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid8}}
{\*\listoverridetable{\listoverride\listid1\listoverridecount0\ls1}{\listoverride\listid2\listoverridecount0\ls2}{\listoverride\listid3\listoverridecount0\ls3}{\listoverride\listid4\listoverridecount0\ls4}{\listoverride\listid5\listoverridecount0\ls5}{\listoverride\listid6\listoverridecount0\ls6}{\listoverride\listid7\listoverridecount0\ls7}{\listoverride\listid8\listoverridecount0\ls8}}
\margl1440\margr1440\vieww14160\viewh16500\viewkind0
\deftab720
\pard\pardeftab720\sl560\partightenfactor0
\f0\b\fs48 \cf2 \expnd0\expndtw0\kerning0
Discussion Topics\
\pard\pardeftab720\sl420\partightenfactor0
\fs36 \cf2 What is Small Data?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls1\ilvl0
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
The Big Data era was marked by changes in fundamental data management assumptions. Broadly, how are these assumptions changing now?\
\ls1\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Broadly, what are the challenges that you see arising from these assumptions changing?\
\ls1\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Broadly, what is "Small Data" to you?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Time Outline
\b0\fs26 \
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls2\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }1:30 -- start\
{\listtext \'95 }2:00 -- Motivation\
{\listtext \'95 }2:25 -- Challenges\
{\listtext \'95 }2:40 -- Proposed Solutions\cf2 \expnd0\expndtw0\kerning0
\uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Interfacing with Small Data (End ~2:10)\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls3\ilvl0
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 \strike \strikec2 {\listtext \'95 }\expnd0\expndtw0\kerning0
Who (or what) are the users targeted by (like being hunted by?) small data?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls3\ilvl0\cf2 \kerning1\expnd0\expndtw0 \strike0\striked0 {\listtext \'95 }What kinds of infrastructure is / will be required for accessing small data?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls3\ilvl1\cf2 {\listtext \uc0\u8259 }Is it even possible to provide the same degree of infrastructure support for small data as for big data?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls3\ilvl0\cf2 {\listtext \'95 }\expnd0\expndtw0\kerning0
What is/are the right languages/interfaces for interacting with small data?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls3\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Is SQL overkill? Is declarative programming the right model?\
\ls3\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Should we be working with Excel?\
\ls3\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Are ORMs the answer?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls3\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What are the human factors affecting "small data" management?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Making Data More Personal (End ~2:20)\
\pard\pardeftab720\sl300\partightenfactor0
\b0\fs26 \cf2 One implication of "Small Data" is data becoming more personal:\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What kinds of data can I collect about me and my life?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls4\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Relatedly, what kinds of data are already being collected about me and my life?\
\ls4\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
How can I leverage this data to improve my life / myself / the world around me?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
How can we help non-technical users to better discover (and leverage) data available about them?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls4\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Can I know/control what data is collected about me. With whom it is shared? (EU policies vs. US policies)\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What are the implications, whether good or bad, of more of "my" data being digitized?\
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
How do I maintain control over my data?\
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What can databases do to improve my control over my data?\
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Is it practical, or even possible to make "Big Data" into Small Data?\uc0\u8232 \
\pard\tx720\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Moving Data to the Edge (End ~2:30)\
\pard\tx720\pardeftab720\sl300\partightenfactor0
\b0\fs26 \cf2 Small data is a consequence of data processing moving to millions of edge devices\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Why is data processing moving out to edge devices?\
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What types of data processing happen at the edge?h\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls5\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Are there good models and/or benchmarks for these types of workloads?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What kinds of limitations or challenges does edge processing run into?\
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Are there ways in which edge processing is easier?\
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What tools are available for data processing out at the edge?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls5\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
What are the limitations of these tools, and how might those limitations be overcome?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Classically, the database community favors monolithic data management solutions. Is that the right choice for edge computing? Why so or why no?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Making Data More Accessible (End ~2:40)\
\pard\pardeftab720\sl300\partightenfactor0
\b0\fs26 \cf2 Another implication of "Small Data" is the opportunity for transparency into research, news, etc...\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What is the right way to make research, news, and other data summarized by humans accessible to a public that may want to re-use it?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
e.g., ProPublica/Open Govt approach: Dump & Document CSV files\
\ls6\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
e.g., Bloomberg/NYTimes approach: Interactive Visualizations\
\ls6\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
e.g., Jens's Janiform Documents\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What are good guidelines for transparency in data reporting?\
\ls6\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
How can databases support making data transparent and accessible?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }Data Discovery\
{\listtext \uc0\u8259 }Data Curation\
{\listtext \uc0\u8259 }Data Exposition\expnd0\expndtw0\kerning0
\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What types of tools exist to facilitate data discovery?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
What are the limitations of these tools and how might those limitations be overcome?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Open Data (End ~2:50)\
\pard\pardeftab720\sl300\partightenfactor0
\b0\fs26 \cf2 On the personal side of small data, governmental open data is giving us unprecedented insight into our communities\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls7\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
How are people making use of governmental open data?\
\ls7\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What are the challenges in getting governments to release open data, and how might those challenges be overcome?\
\ls7\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Some might argue that placing a barrier between datasets and the public is a good thing --- that non-statisticians shouldn't be trying to make claims based on what are likely to be messy and noisy datasets. Do you agree/disagree? and why?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls7\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Is this a challenge that can be overcome if we give the public the right tools?\
\ls7\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
On a related note, what happens when "Big Data" isn't enough? (i.e., when the open data is too sparse to make inferences about)\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls7\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Broadly, how can we make open data more accessible? (or more safely accessible?)\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls7\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Issue with anonymity of open data (see NYC taxi example)\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Closing\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls8\ilvl0
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
If you could get the database research community to work on one thing related to small data, what would that be?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 }