195 lines
15 KiB
Plaintext
195 lines
15 KiB
Plaintext
{\rtf1\ansi\ansicpg1252\cocoartf1504\cocoasubrtf820
|
|
{\fonttbl\f0\fswiss\fcharset0 Helvetica;}
|
|
{\colortbl;\red255\green255\blue255;\red0\green0\blue0;}
|
|
{\*\expandedcolortbl;;\cssrgb\c0\c0\c0;}
|
|
{\*\listtable{\list\listtemplateid1\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid1\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid1}
|
|
{\list\listtemplateid2\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid101\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid2}
|
|
{\list\listtemplateid3\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid201\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid202\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid3}
|
|
{\list\listtemplateid4\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid301\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid302\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid4}
|
|
{\list\listtemplateid5\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid401\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid402\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid5}
|
|
{\list\listtemplateid6\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid501\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid502\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid6}
|
|
{\list\listtemplateid7\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid601\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid602\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid7}
|
|
{\list\listtemplateid8\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid701\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid8}}
|
|
{\*\listoverridetable{\listoverride\listid1\listoverridecount0\ls1}{\listoverride\listid2\listoverridecount0\ls2}{\listoverride\listid3\listoverridecount0\ls3}{\listoverride\listid4\listoverridecount0\ls4}{\listoverride\listid5\listoverridecount0\ls5}{\listoverride\listid6\listoverridecount0\ls6}{\listoverride\listid7\listoverridecount0\ls7}{\listoverride\listid8\listoverridecount0\ls8}}
|
|
\margl1440\margr1440\vieww14160\viewh16500\viewkind0
|
|
\deftab720
|
|
\pard\pardeftab720\sl560\partightenfactor0
|
|
|
|
\f0\b\fs48 \cf2 \expnd0\expndtw0\kerning0
|
|
Discussion Topics\
|
|
\pard\pardeftab720\sl420\partightenfactor0
|
|
|
|
\fs36 \cf2 What is Small Data?\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls1\ilvl0
|
|
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
The Big Data era was marked by changes in fundamental data management assumptions. Broadly, how are these assumptions changing now?\
|
|
\ls1\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Broadly, what are the challenges that you see arising from these assumptions changing?\
|
|
\ls1\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Broadly, what is "Small Data" to you?\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
\cf2 \
|
|
\pard\pardeftab720\sl420\partightenfactor0
|
|
|
|
\b\fs36 \cf2 Time Outline
|
|
\b0\fs26 \
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls2\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }1:30 -- start\
|
|
{\listtext \'95 }2:00 -- Motivation\
|
|
{\listtext \'95 }2:25 -- Challenges\
|
|
{\listtext \'95 }2:40 -- Proposed Solutions\cf2 \expnd0\expndtw0\kerning0
|
|
\uc0\u8232 \
|
|
\pard\pardeftab720\sl420\partightenfactor0
|
|
|
|
\b\fs36 \cf2 Interfacing with Small Data (End ~2:10)\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls3\ilvl0
|
|
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 \strike \strikec2 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Who (or what) are the users targeted by (like being hunted by?) small data?\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls3\ilvl0\cf2 \kerning1\expnd0\expndtw0 \strike0\striked0 {\listtext \'95 }What kinds of infrastructure is / will be required for accessing small data?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls3\ilvl1\cf2 {\listtext \uc0\u8259 }Is it even possible to provide the same degree of infrastructure support for small data as for big data?\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls3\ilvl0\cf2 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What is/are the right languages/interfaces for interacting with small data?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls3\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
Is SQL overkill? Is declarative programming the right model?\
|
|
\ls3\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
Should we be working with Excel?\
|
|
\ls3\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
Are ORMs the answer?\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls3\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What are the human factors affecting "small data" management?\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
\cf2 \uc0\u8232 \
|
|
\pard\pardeftab720\sl420\partightenfactor0
|
|
|
|
\b\fs36 \cf2 Making Data More Personal (End ~2:20)\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
|
|
\b0\fs26 \cf2 One implication of "Small Data" is data becoming more personal:\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What kinds of data can I collect about me and my life?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls4\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
Relatedly, what kinds of data are already being collected about me and my life?\
|
|
\ls4\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
How can I leverage this data to improve my life / myself / the world around me?\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
How can we help non-technical users to better discover (and leverage) data available about them?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls4\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
Can I know/control what data is collected about me. With whom it is shared? (EU policies vs. US policies)\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What are the implications, whether good or bad, of more of "my" data being digitized?\
|
|
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
How do I maintain control over my data?\
|
|
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What can databases do to improve my control over my data?\
|
|
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Is it practical, or even possible to make "Big Data" into Small Data?\uc0\u8232 \
|
|
\pard\tx720\pardeftab720\sl420\partightenfactor0
|
|
|
|
\b\fs36 \cf2 Moving Data to the Edge (End ~2:30)\
|
|
\pard\tx720\pardeftab720\sl300\partightenfactor0
|
|
|
|
\b0\fs26 \cf2 Small data is a consequence of data processing moving to millions of edge devices\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Why is data processing moving out to edge devices?\
|
|
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What types of data processing happen at the edge?h\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls5\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
Are there good models and/or benchmarks for these types of workloads?\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What kinds of limitations or challenges does edge processing run into?\
|
|
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Are there ways in which edge processing is easier?\
|
|
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What tools are available for data processing out at the edge?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls5\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
What are the limitations of these tools, and how might those limitations be overcome?\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Classically, the database community favors monolithic data management solutions. Is that the right choice for edge computing? Why so or why no?\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
\cf2 \uc0\u8232 \
|
|
\pard\pardeftab720\sl420\partightenfactor0
|
|
|
|
\b\fs36 \cf2 Making Data More Accessible (End ~2:40)\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
|
|
\b0\fs26 \cf2 Another implication of "Small Data" is the opportunity for transparency into research, news, etc...\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What is the right way to make research, news, and other data summarized by humans accessible to a public that may want to re-use it?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
e.g., ProPublica/Open Govt approach: Dump & Document CSV files\
|
|
\ls6\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
e.g., Bloomberg/NYTimes approach: Interactive Visualizations\
|
|
\ls6\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
e.g., Jens's Janiform Documents\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What are good guidelines for transparency in data reporting?\
|
|
\ls6\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
How can databases support making data transparent and accessible?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }Data Discovery\
|
|
{\listtext \uc0\u8259 }Data Curation\
|
|
{\listtext \uc0\u8259 }Data Exposition\expnd0\expndtw0\kerning0
|
|
\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What types of tools exist to facilitate data discovery?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
What are the limitations of these tools and how might those limitations be overcome?\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
\cf2 \uc0\u8232 \
|
|
\pard\pardeftab720\sl420\partightenfactor0
|
|
|
|
\b\fs36 \cf2 Open Data (End ~2:50)\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
|
|
\b0\fs26 \cf2 On the personal side of small data, governmental open data is giving us unprecedented insight into our communities\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls7\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
How are people making use of governmental open data?\
|
|
\ls7\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
What are the challenges in getting governments to release open data, and how might those challenges be overcome?\
|
|
\ls7\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Some might argue that placing a barrier between datasets and the public is a good thing --- that non-statisticians shouldn't be trying to make claims based on what are likely to be messy and noisy datasets. Do you agree/disagree? and why?\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls7\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
Is this a challenge that can be overcome if we give the public the right tools?\
|
|
\ls7\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
On a related note, what happens when "Big Data" isn't enough? (i.e., when the open data is too sparse to make inferences about)\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls7\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
Broadly, how can we make open data more accessible? (or more safely accessible?)\
|
|
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
|
|
\ls7\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
|
|
Issue with anonymity of open data (see NYC taxi example)\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
\cf2 \uc0\u8232 \
|
|
\pard\pardeftab720\sl420\partightenfactor0
|
|
|
|
\b\fs36 \cf2 Closing\
|
|
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
|
|
\ls8\ilvl0
|
|
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
|
|
If you could get the database research community to work on one thing related to small data, what would that be?\
|
|
\pard\pardeftab720\sl300\partightenfactor0
|
|
\cf2 \uc0\u8232 } |