Merge branch 'master' of gitlab.odin.cse.buffalo.edu:odin-lab/Website

This commit is contained in:
Oliver Kennedy 2017-04-27 12:21:45 -04:00
commit 6769443708
19 changed files with 823 additions and 13 deletions

View file

@ -4,7 +4,7 @@
"role" : "PI",
"amount" : 523700,
"effort" : "50%",
"status" : "submitted",
"status" : "rejected",
"start" : "08/2017", "end" : "07/2021",
"type" : "grant",
"copis" : ["Barry Smith"],
@ -41,7 +41,12 @@
"urls" : {
"whitepaper" : "http://odin.cse.buffalo.edu/grants/2016-NSF-Vizier-Whitepaper.pdf",
"proposal" : "http://odin.cse.buffalo.edu/grants/2016-NSF-Vizier.pdf"
}
},
"supplements" : [
{ "type" : "NSF-REU",
"amount" : 24000
}
]
},
{ "title" : "DD: Success in Partnership: Computing for all of STEM",
"agency" : "NSF: DRL: STEM+C",

View file

@ -135,6 +135,7 @@
"fullname" : "The International Conference on Data Engineering",
"type" : "conference",
"selectivity" : {
"2017" : 0.18, // Executive Meeting: 63/356 accepted
"2013" : 0.19, // source: http://digitalpiglet.org/research/
"2010" : 0.21, // source: http://www.cs.duke.edu/~shivnath/publications.html
"2009" : 0.17, // source: http://www.cs.duke.edu/~shivnath/publications.html

View file

@ -0,0 +1,195 @@
{\rtf1\ansi\ansicpg1252\cocoartf1504\cocoasubrtf820
{\fonttbl\f0\fswiss\fcharset0 Helvetica;}
{\colortbl;\red255\green255\blue255;\red0\green0\blue0;}
{\*\expandedcolortbl;;\cssrgb\c0\c0\c0;}
{\*\listtable{\list\listtemplateid1\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid1\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid1}
{\list\listtemplateid2\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid101\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid2}
{\list\listtemplateid3\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid201\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid202\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid3}
{\list\listtemplateid4\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid301\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid302\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid4}
{\list\listtemplateid5\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid401\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid402\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid5}
{\list\listtemplateid6\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid501\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid502\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid6}
{\list\listtemplateid7\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid601\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{hyphen\}}{\leveltext\leveltemplateid602\'01\uc0\u8259 ;}{\levelnumbers;}\fi-360\li1440\lin1440 }{\listname ;}\listid7}
{\list\listtemplateid8\listhybrid{\listlevel\levelnfc23\levelnfcn23\leveljc0\leveljcn0\levelfollow0\levelstartat1\levelspace360\levelindent0{\*\levelmarker \{disc\}}{\leveltext\leveltemplateid701\'01\uc0\u8226 ;}{\levelnumbers;}\fi-360\li720\lin720 }{\listname ;}\listid8}}
{\*\listoverridetable{\listoverride\listid1\listoverridecount0\ls1}{\listoverride\listid2\listoverridecount0\ls2}{\listoverride\listid3\listoverridecount0\ls3}{\listoverride\listid4\listoverridecount0\ls4}{\listoverride\listid5\listoverridecount0\ls5}{\listoverride\listid6\listoverridecount0\ls6}{\listoverride\listid7\listoverridecount0\ls7}{\listoverride\listid8\listoverridecount0\ls8}}
\margl1440\margr1440\vieww14160\viewh16500\viewkind0
\deftab720
\pard\pardeftab720\sl560\partightenfactor0
\f0\b\fs48 \cf2 \expnd0\expndtw0\kerning0
Discussion Topics\
\pard\pardeftab720\sl420\partightenfactor0
\fs36 \cf2 What is Small Data?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls1\ilvl0
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
The Big Data era was marked by changes in fundamental data management assumptions. Broadly, how are these assumptions changing now?\
\ls1\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Broadly, what are the challenges that you see arising from these assumptions changing?\
\ls1\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Broadly, what is "Small Data" to you?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Time Outline
\b0\fs26 \
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls2\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }1:30 -- start\
{\listtext \'95 }2:00 -- Motivation\
{\listtext \'95 }2:25 -- Challenges\
{\listtext \'95 }2:40 -- Proposed Solutions\cf2 \expnd0\expndtw0\kerning0
\uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Interfacing with Small Data (End ~2:10)\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls3\ilvl0
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 \strike \strikec2 {\listtext \'95 }\expnd0\expndtw0\kerning0
Who (or what) are the users targeted by (like being hunted by?) small data?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls3\ilvl0\cf2 \kerning1\expnd0\expndtw0 \strike0\striked0 {\listtext \'95 }What kinds of infrastructure is / will be required for accessing small data?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls3\ilvl1\cf2 {\listtext \uc0\u8259 }Is it even possible to provide the same degree of infrastructure support for small data as for big data?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls3\ilvl0\cf2 {\listtext \'95 }\expnd0\expndtw0\kerning0
What is/are the right languages/interfaces for interacting with small data?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls3\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Is SQL overkill? Is declarative programming the right model?\
\ls3\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Should we be working with Excel?\
\ls3\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Are ORMs the answer?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls3\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What are the human factors affecting "small data" management?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Making Data More Personal (End ~2:20)\
\pard\pardeftab720\sl300\partightenfactor0
\b0\fs26 \cf2 One implication of "Small Data" is data becoming more personal:\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What kinds of data can I collect about me and my life?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls4\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Relatedly, what kinds of data are already being collected about me and my life?\
\ls4\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
How can I leverage this data to improve my life / myself / the world around me?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
How can we help non-technical users to better discover (and leverage) data available about them?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls4\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Can I know/control what data is collected about me. With whom it is shared? (EU policies vs. US policies)\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls4\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What are the implications, whether good or bad, of more of "my" data being digitized?\
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
How do I maintain control over my data?\
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What can databases do to improve my control over my data?\
\ls4\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Is it practical, or even possible to make "Big Data" into Small Data?\uc0\u8232 \
\pard\tx720\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Moving Data to the Edge (End ~2:30)\
\pard\tx720\pardeftab720\sl300\partightenfactor0
\b0\fs26 \cf2 Small data is a consequence of data processing moving to millions of edge devices\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Why is data processing moving out to edge devices?\
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What types of data processing happen at the edge?h\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls5\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Are there good models and/or benchmarks for these types of workloads?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What kinds of limitations or challenges does edge processing run into?\
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Are there ways in which edge processing is easier?\
\ls5\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What tools are available for data processing out at the edge?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls5\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
What are the limitations of these tools, and how might those limitations be overcome?\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls5\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Classically, the database community favors monolithic data management solutions. Is that the right choice for edge computing? Why so or why no?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Making Data More Accessible (End ~2:40)\
\pard\pardeftab720\sl300\partightenfactor0
\b0\fs26 \cf2 Another implication of "Small Data" is the opportunity for transparency into research, news, etc...\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What is the right way to make research, news, and other data summarized by humans accessible to a public that may want to re-use it?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
e.g., ProPublica/Open Govt approach: Dump & Document CSV files\
\ls6\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
e.g., Bloomberg/NYTimes approach: Interactive Visualizations\
\ls6\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
e.g., Jens's Janiform Documents\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What are good guidelines for transparency in data reporting?\
\ls6\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
How can databases support making data transparent and accessible?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }Data Discovery\
{\listtext \uc0\u8259 }Data Curation\
{\listtext \uc0\u8259 }Data Exposition\expnd0\expndtw0\kerning0
\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls6\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What types of tools exist to facilitate data discovery?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls6\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
What are the limitations of these tools and how might those limitations be overcome?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Open Data (End ~2:50)\
\pard\pardeftab720\sl300\partightenfactor0
\b0\fs26 \cf2 On the personal side of small data, governmental open data is giving us unprecedented insight into our communities\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls7\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
How are people making use of governmental open data?\
\ls7\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
What are the challenges in getting governments to release open data, and how might those challenges be overcome?\
\ls7\ilvl0\kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Some might argue that placing a barrier between datasets and the public is a good thing --- that non-statisticians shouldn't be trying to make claims based on what are likely to be messy and noisy datasets. Do you agree/disagree? and why?\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls7\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Is this a challenge that can be overcome if we give the public the right tools?\
\ls7\ilvl1\kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
On a related note, what happens when "Big Data" isn't enough? (i.e., when the open data is too sparse to make inferences about)\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls7\ilvl0\cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
Broadly, how can we make open data more accessible? (or more safely accessible?)\
\pard\tx940\tx1440\pardeftab720\li1440\fi-1440\sl300\partightenfactor0
\ls7\ilvl1\cf2 \kerning1\expnd0\expndtw0 {\listtext \uc0\u8259 }\expnd0\expndtw0\kerning0
Issue with anonymity of open data (see NYC taxi example)\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 \
\pard\pardeftab720\sl420\partightenfactor0
\b\fs36 \cf2 Closing\
\pard\tx220\tx720\pardeftab720\li720\fi-720\sl300\partightenfactor0
\ls8\ilvl0
\b0\fs26 \cf2 \kerning1\expnd0\expndtw0 {\listtext \'95 }\expnd0\expndtw0\kerning0
If you could get the database research community to work on one thing related to small data, what would that be?\
\pard\pardeftab720\sl300\partightenfactor0
\cf2 \uc0\u8232 }

Binary file not shown.

After

Width:  |  Height:  |  Size: 244 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 189 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 6.9 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 167 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 76 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 64 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 6.3 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 10 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 78 KiB

View file

@ -0,0 +1,235 @@
<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8">
<title>Small Data</title>
<meta name="description" content="Small Data">
<meta name="author" content="Oliver Kennedy">
<meta name="apple-mobile-web-app-capable" content="yes" />
<meta name="apple-mobile-web-app-status-bar-style" content="black-translucent" />
<meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1.0, user-scalable=no, minimal-ui">
<link rel="stylesheet" href="../reveal.js-3.1.0/css/reveal.css">
<link rel="stylesheet" href="ubodin.css" id="theme">
<!-- Code syntax highlighting -->
<link rel="stylesheet" href="../reveal.js-3.1.0/lib/css/zenburn.css">
<style>
table.panelists td.name {
text-align: right;
}
table.panelists td.affiliation {
font-size: smaller;
text-decoration: italic;
text-align: left;
}
</style>
<!-- Printing and PDF exports -->
<script>
var link = document.createElement( 'link' );
link.rel = 'stylesheet';
link.type = 'text/css';
link.href = window.location.search.match( /print-pdf/gi ) ? '../reveal.js-3.1.0/css/print/pdf.css' : '../reveal.js-3.1.0/css/print/paper.css';
document.getElementsByTagName( 'head' )[0].appendChild( link );
</script>
<!--[if lt IE 9]>
<script src="../reveal.js-3.1.0/lib/js/html5shiv.js"></script>
<![endif]-->
</head>
<body>
<div class="reveal">
<!-- Any section element inside of this container is displayed as a slide -->
<div class="slides">
<section>
<section>
<h1>Small Data</h1>
</section>
<section>
<table class="panelists">
<tr>
<td class="name">D. Richard Hipp</td>
<td class="affiliation"><i>Hipp, Wyrick &amp; Company, Inc.</i></td>
</tr>
<tr>
<td class="name">Stratos Idreos</td>
<td class="affiliation"><i>Harvard</i></td>
</tr>
<tr>
<td class="name">Amélie Marian</td>
<td class="affiliation"><i>Rutgers</i></td>
</tr>
<tr>
<td class="name">Arnab Nandi</td>
<td class="affiliation"><i>Ohio State University</i></td>
</tr>
<tr>
<td class="name">Carmela Troncoso</td>
<td class="affiliation"><i>IMDEA Software Institute</i></td>
</tr>
<tr>
<td class="name">Eugene Wu</td>
<td class="affiliation"><i>Columbia University</i></td>
</tr>
<tr>
<td class="name" style="padding-top: 40px">Oliver Kennedy</td>
<td class="affiliation"><i>University at Buffalo</i></td>
</tr>
</table>
</section>
</section>
<section>
<section>
<img src="graphics/phasechange.png" height="400px">
<aside class="notes">
Data management goes through phase changes where classical assumptions break, leading to a change in the landscape of data management systems.
</aside>
<attribution>study.com</attribution>
</section>
<section>
<img src="graphics/largedisk.jpg" height="400px">
<aside class="notes">For example, a decade ago disks started getting cheap</aside>
<attribution>digitalmofo.com<attribution>
</section>
<section>
<img src="graphics/hoarder.jpg" height="400px">
<aside class="notes">People started hoarding data</aside>
<attribution>wkbr.com</attribution>
</section>
<section>
<img src="graphics/jonesey.jpg" height="400px">
<aside class="notes">... and realized they could pick signals out of noise.</aside>
<attribution>The Hunt for Red October</attribution>
</section>
<section>
<img src="graphics/5_vs.png" height="400px">
<aside class="notes">Which leads to a number of problems...</aside>
<attribution>bigdata.black</attribution>
</section>
</section>
<section>
<section>
<h2>We're at the cusp of a new phase change.</h2>
</section>
<section>
<h3>Jens Dittrich: "The Case for <b>Small Data</b>"</h3>
<img src="graphics/Jens-BigData.png">
<aside class="notes">We shamelessly stole the name from Jens, who unfortunately wasn't able to join the panel.</aside>
</section>
</section>
<section>
<section>
<h2>Ground Rules</h2>
<p class="fragment">Join the conversation!</p>
<p class="fragment">Not being recorded, so speak your mind!</p>
<h2 class="fragment">Shameless Plugs</h2>
<p class="fragment"><a href="http://pocketdata.info">http://pocketdata.info</a><br/>A collection of datasets, papers, etc... for small data research</p>
</section>
</section>
<section>
<section>
<table class="panelists">
<tr>
<td class="name">D. Richard Hipp</td>
<td class="affiliation"><i>Hipp, Wyrick &amp; Company, Inc.</i></td>
</tr>
<tr>
<td class="name">Stratos Idreos</td>
<td class="affiliation"><i>Harvard</i></td>
</tr>
<tr>
<td class="name">Amélie Marian</td>
<td class="affiliation"><i>Rutgers</i></td>
</tr>
<tr>
<td class="name">Arnab Nandi</td>
<td class="affiliation"><i>Ohio State University</i></td>
</tr>
<tr>
<td class="name">Carmela Troncoso</td>
<td class="affiliation"><i>IMDEA Software Institute</i></td>
</tr>
<tr>
<td class="name">Eugene Wu</td>
<td class="affiliation"><i>Columbia University</i></td>
</tr>
<tr>
<td class="name" style="padding-top: 40px">Oliver Kennedy</td>
<td class="affiliation"><i>University at Buffalo</i></td>
</tr>
</table>
<!--
<p style="font-size: 12pt">Continue the discussion. Start at <a href="http://pocketdata.info">http://pocketdata.info</a></p>
-->
</section>
<section>
<img src="graphics/bigsmalldata.png" height="600px">
</section>
<section>
<img src="graphics/artisinal.JPG" height="600px">
</section>
</section>
</div></div>
<script src="../reveal.js-3.1.0/lib/js/head.min.js"></script>
<script src="../reveal.js-3.1.0/js/reveal.js"></script>
<script>
// Full list of configuration options available at:
// https://github.com/hakimel/../reveal.js#configuration
Reveal.initialize({
controls: false,
progress: true,
history: true,
center: true,
slideNumber: true,
transition: 'fade', // none/fade/slide/convex/concave/zoom
// Optional ../reveal.js plugins
dependencies: [
{ src: '../reveal.js-3.1.0/lib/js/classList.js', condition: function() { return !document.body.classList; } },
{ src: '../reveal.js-3.1.0/plugin/math/math.js',
condition: function() { return true; },
mathjax: '../reveal.js-3.1.0/js/MathJax.js'
},
{ src: '../reveal.js-3.1.0/plugin/markdown/marked.js', condition: function() { return !!document.querySelector( '[data-markdown]' ); } },
{ src: '../reveal.js-3.1.0/plugin/markdown/markdown.js', condition: function() { return !!document.querySelector( '[data-markdown]' ); } },
{ src: '../reveal.js-3.1.0/plugin/highlight/highlight.js', async: true, condition: function() { return !!document.querySelector( 'pre code' ); }, callback: function() { hljs.initHighlightingOnLoad(); } },
{ src: '../reveal.js-3.1.0/plugin/zoom-js/zoom.js', async: true },
{ src: '../reveal.js-3.1.0/plugin/notes/notes.js', async: true }
]
});
</script>
</body>
</html>

View file

@ -0,0 +1,366 @@
@font-face {
font-family: 'News Cycle';
font-style: normal;
font-weight: 400;
src: local('News Cycle'), local('NewsCycle'), url(../reveal.js-3.1.0/fonts/9Xe8dq6pQDsPyVH2D3tMQsDdSZkkecOE1hvV7ZHvhyU.ttf) format('truetype');
}
@font-face {
font-family: 'News Cycle';
font-style: normal;
font-weight: 700;
src: local('News Cycle Bold'), local('NewsCycle-Bold'), url(../reveal.js-3.1.0/fonts/G28Ny31cr5orMqEQy6ljt8BaWKZ57bY3RXgXH6dOjZ0.ttf) format('truetype');
}
@font-face {
font-family: 'Lato';
font-style: normal;
font-weight: 400;
src: local('Lato Regular'), local('Lato-Regular'), url(../reveal.js-3.1.0/fonts/1EqTbJWOZQBfhZ0e3RL9uvesZW2xOQ-xsNqO47m55DA.ttf) format('truetype');
}
@font-face {
font-family: 'Lato';
font-style: normal;
font-weight: 700;
src: local('Lato Bold'), local('Lato-Bold'), url(../reveal.js-3.1.0/fonts/MZ1aViPqjfvZwVD_tzjjkwLUuEpTyoUstqEm5AMlJo4.ttf) format('truetype');
}
@font-face {
font-family: 'Lato';
font-style: italic;
font-weight: 400;
src: local('Lato Italic'), local('Lato-Italic'), url(../reveal.js-3.1.0/fonts/61V2bQZoWB5DkWAUJStypevvDin1pK8aKteLpeZ5c0A.ttf) format('truetype');
}
@font-face {
font-family: 'Lato';
font-style: italic;
font-weight: 700;
src: local('Lato Bold Italic'), local('Lato-BoldItalic'), url(../reveal.js-3.1.0/fonts/HkF_qI1x_noxlxhrhMQYECZ2oysoEQEeKwjgmXLRnTc.ttf) format('truetype');
}
/**@import url(https://fonts.googleapis.com/css?family=News+Cycle:400,700);
@import url(https://fonts.googleapis.com/css?family=Lato:400,700,400italic,700italic);
**/
/**
* A simple theme for reveal.js presentations, similar
* to the default theme. The accent color is darkblue.
*
* This theme is Copyright (C) 2012 Owen Versteeg, https://github.com/StereotypicalApps. It is MIT licensed.
* reveal.js is Copyright (C) 2011-2012 Hakim El Hattab, http://hakim.se
*/
/*********************************************
* GLOBAL STYLES
*********************************************/
body {
background: #fff;
background-color: #fff; }
*:fullscreen, *:-webkit-full-screen, *:-moz-full-screen {
background-color: #fff;
}
.reveal {
font-family: 'Lato', sans-serif;
font-size: 36px;
font-weight: normal;
color: #000; }
::selection {
color: #fff;
background: rgba(0, 0, 0, 0.99);
text-shadow: none; }
.reveal .slides > section, .reveal .slides > section > section {
line-height: 1.3;
font-weight: inherit; }
/*********************************************
* STATIC HEADER/FOOTER
*********************************************/
.reveal .header {
position: absolute;
top: 0px;
left: 0px;
right: 0px;
height: 25px;
text-align: center;
padding-left: 15px;
padding-right: 15px;
padding-bottom: 10px;
padding-top: 15px;
background-color: #041a9b;
color: white;
font-size: 0.5em;
z-index: 100;
}
.reveal .footer {
position: absolute;
bottom: 0px;
left: 0px;
right: 0px;
height: 40px;
text-align: center;
padding-left: 15px;
padding-right: 15px;
padding-bottom: 10px;
padding-top: 20px;
background-color: #041a9b;
color: white;
font-size: 0.5em;
z-index: 100;
}
/*********************************************
* HEADERS
*********************************************/
.reveal h1, .reveal h2, .reveal h3, .reveal h4, .reveal h5, .reveal h6, .reveal .headertext {
margin: 0 0 20px 0;
color: #000;
font-family: 'News Cycle', Impact, sans-serif;
font-weight: normal;
line-height: 1.2;
letter-spacing: normal;
text-transform: none;
text-shadow: none;
word-wrap: break-word; }
.reveal h1 {
font-size: 3.77em; }
.reveal h2 {
font-size: 2.11em; }
.reveal h3 {
font-size: 1.55em; }
.reveal h4 {
font-size: 1em; }
.reveal h1 {
text-shadow: none; }
/*********************************************
* OTHER
*********************************************/
.reveal p {
margin: 20px 0;
line-height: 1.3; }
.reveal attribution {
font-size: 0.5em;
position: absolute;
right: -10px;
bottom: -10px;
text-align: right;
}
/* Ensure certain elements are never larger than the slide itself */
.reveal img, .reveal video, .reveal iframe {
max-width: 95%;
max-height: 95%; }
.reveal strong, .reveal b {
font-weight: bold; }
.reveal em {
font-style: italic; }
.reveal ol, .reveal dl, .reveal ul {
display: inline-block;
text-align: left;
margin: 0 0 0 1em; }
.reveal ol {
list-style-type: decimal; }
.reveal ul {
list-style-type: disc; }
.reveal ul > li {
margin-top: 20px; }
.reveal ul ul {
list-style-type: square; }
.reveal ul ul ul {
list-style-type: circle; }
.reveal ul ul, .reveal ul ol, .reveal ol ol, .reveal ol ul {
display: block;
margin-left: 40px; }
.reveal dt {
font-weight: bold; }
.reveal dd {
margin-left: 40px; }
.reveal q, .reveal blockquote {
quotes: none; }
.reveal blockquote {
display: block;
position: relative;
width: 70%;
margin: 20px auto;
padding: 5px;
font-style: italic;
background: rgba(255, 255, 255, 0.05);
box-shadow: 0px 0px 2px rgba(0, 0, 0, 0.2); }
.reveal blockquote p:first-child, .reveal blockquote p:last-child {
display: inline-block; }
.reveal q {
font-style: italic; }
.reveal pre {
display: block;
position: relative;
width: 90%;
margin: 20px auto;
text-align: left;
font-size: 0.55em;
font-family: monospace;
line-height: 1.2em;
word-wrap: break-word;
box-shadow: 0px 0px 6px rgba(0, 0, 0, 0.3); }
.reveal code {
font-family: monospace;
}
.reveal pre code {
display: block;
padding: 5px;
overflow: auto;
max-height: 400px;
word-wrap: normal;
background: #3F3F3F;
color: #DCDCDC; }
.reveal table {
margin: auto;
border-collapse: collapse;
border-spacing: 0; }
.reveal table th {
font-weight: bold;
border-bottom: 1px solid; }
.reveal table th, .reveal table td {
text-align: center;
padding: 0.2em 0.5em 0.2em 0.5em;}
.reveal table th[align="left"], .reveal table td[align="left"] {
text-align: left; }
.reveal table th[align="right"], .reveal table td[align="right"] {
text-align: right; }
.reveal table tr:last-child td {
border-bottom: none; }
.reveal sup {
vertical-align: super; }
.reveal sub {
vertical-align: sub; }
.reveal small, .reveal .smalltext {
display: inline-block;
font-size: 0.6em;
line-height: 1.2em;
vertical-align: top; }
.reveal small * {
vertical-align: top; }
/*********************************************
* LINKS
*********************************************/
.reveal a {
color: #00008B;
text-decoration: none;
-webkit-transition: color 0.15s ease;
-moz-transition: color 0.15s ease;
transition: color 0.15s ease; }
.reveal a:hover {
color: #0000f1;
text-shadow: none;
border: none; }
.reveal .roll span:after {
color: #fff;
background: #00003f; }
/*********************************************
* IMAGES
*********************************************/
.reveal section img {
margin: 15px 0px;
background: rgba(255, 255, 255, 0.12);
}
.reveal section img.bordered
{
border: 4px solid #000;
box-shadow: 0 0 10px rgba(0, 0, 0, 0.15);
}
.reveal a img {
-webkit-transition: all 0.15s linear;
-moz-transition: all 0.15s linear;
transition: all 0.15s linear; }
.reveal a:hover img {
background: rgba(255, 255, 255, 0.2);
border-color: #00008B;
box-shadow: 0 0 20px rgba(0, 0, 0, 0.55); }
/*********************************************
* NAVIGATION CONTROLS
*********************************************/
.reveal .controls div.navigate-left, .reveal .controls div.navigate-left.enabled {
border-right-color: #00008B; }
.reveal .controls div.navigate-right, .reveal .controls div.navigate-right.enabled {
border-left-color: #00008B; }
.reveal .controls div.navigate-up, .reveal .controls div.navigate-up.enabled {
border-bottom-color: #00008B; }
.reveal .controls div.navigate-down, .reveal .controls div.navigate-down.enabled {
border-top-color: #00008B; }
.reveal .controls div.navigate-left.enabled:hover {
border-right-color: #0000f1; }
.reveal .controls div.navigate-right.enabled:hover {
border-left-color: #0000f1; }
.reveal .controls div.navigate-up.enabled:hover {
border-bottom-color: #0000f1; }
.reveal .controls div.navigate-down.enabled:hover {
border-top-color: #0000f1; }
/*********************************************
* PROGRESS BAR
*********************************************/
.reveal .progress {
background: rgba(0, 0, 0, 0.2); }
.reveal .progress span {
background: #00008B;
-webkit-transition: width 800ms cubic-bezier(0.26, 0.86, 0.44, 0.985);
-moz-transition: width 800ms cubic-bezier(0.26, 0.86, 0.44, 0.985);
transition: width 800ms cubic-bezier(0.26, 0.86, 0.44, 0.985); }
/*********************************************
* SLIDE NUMBER
*********************************************/
.reveal .slide-number {
color: #00008B; }

View file

@ -106,25 +106,33 @@ classContent:
meta:
slides: slides/2017-04-06-Views.pdf
- date: Apr. 11
topic: Theory of Transactions (Schedule Equivalence, Serializability)
topic: Theory of Transactions (Schedule Equivalence, Serializability, Locking)
meta:
slides: slides/2017-04-11-TransactionCorrectness.pdf
- date: Apr. 13
topic: Transaction Control Protocols (Locking, Lock Granularity)
topic: Transaction Control Protocols (Optimistic Concurrency)
meta:
slides: slides/2017-04-13-OptimisticConcurrency.pdf
- date: Apr. 18
topic: Transaction Control Protocols (OCC, MVCC)
topic: Logging and Aries
meta:
slides: slides/2017-04-18-Logging.pdf
- date: Apr. 20
topic: Project 3 overview
meta:
slides: slides/2017-04-20-Checkpoint3.pdf
- date: Apr. 25
topic: Logging and Logs as a Physical Layout
- date: Apr. 27
topic: Logging (Write-Ahead Logging, Undo Logging, ARIES)
- date: May. 2
topic: Parallel and Distributed Algorithms
- date: May. 4
- date: Apr. 27
topic: Data Warehousing
- date: May. 2
topic: Probabilistic Databases
- date: May. 4
topic: Spark
- date: May. 9
topic: TBD
- date: May. 11
topic: Final Exam Review
- date: May. 11
topic: <b>Final Exam</b>
---
<h1 style="text-align: center;"><%= title %></h1>
<p style="text-align: justify;">Data Management Systems (including Relational Databases, Non-Relational Databases, and NoSQL storage systems) form the basis of the Big Data Economy we now live in.  A data management system is responsible for storing data, enabling efficient access to that data, as well as mediating concurrent modifications.  This class approaches the challenges of designing a data management system from a standpoint that is both principled and practical.  The course revolves around a term-long programming assignment, in which you will build a system that answers SQL queries efficiently.  Course lectures will focus on the conceptual basis for this system, and will discuss how the techniques you learn generalize (e.g., to the use of NoSQL systems)</p>
@ -164,7 +172,7 @@ In this course, you will learn...
<li>50% exams
<ul>
<li>25% Midterm on Mar.30 (in class)</li>
<li>25% Comprehensive Final on Thu May 14 (4:00-6:30)</li>
<li>25% Comprehensive Final on Thu May 11(in class)</li>
</ul>
</li>
<li>50% projects