class: middle, center, title-slide
.footnote[Tim Head for the Binder team, Wild Tree Tech, JupyterCon 2018]
.footnote[By Fæ, CC BY-SA 3.0, Link]
???
Earliest libraries were archives of clay tablets. Appeared in Mesopotamia about 2600BC. Making clay tablets was hard work, as you could imagine. Recording information is something humans have been doing for a very long time. It is a valuable thing to do! Information is power.
???
Things have changed a bit since then. We invented paper and the printing press. At first access was restricted, you had to pay or prove you were a real scholar to get access. The ideas of the modern public library only started around 1850 or so. Now spreading information via books and documents has become easy! We can do this at scale!
???
Modern libraries are modern!
???
But you hardly ever go there anymore, these days a large part of the work of a library is to broker access to digital information and to curate that information.
.larger[
The third law states that all forces between two objects exist in equal magnitude and opposite direction: if one object A exerts a force
$F_A$ on a second object B, then B simultaneously exerts a force$F_B$ on A, and the two forces are equal in magnitude and opposite in direction. ]
or
.middle.larger[ $$ F_A = -F_B $$ ]
.footnote[From https://en.wikipedia.org/wiki/Newton%27s_laws_of_motion, August 2018]
???
Today experiments can't be described on a static piece of paper anymore. It is like using text to describe Newton's laws instead of using equations. Text takes a lot more space and is less precise than using maths. Today's experiments involve software which is essential to understanding the work that happened. We need to communicate more than just text.
We need a new medium.
paper == static
.footnote[https://www.theatlantic.com/science/archive/2018/04/the-scientific-paper-is-obsolete/556676/]
???
With just the static paper it isn't actually well defined what they did. This means that we can't reproduce their work because we don't actually know what it is they did.
???
We need the code and the environment in which that code was run in order to have a full definition of what it is they did. Now we can start discussing about reproducing their work.
class: middle, center
class: middle, center
???
This means that producing information is basically free now, but successfully consuming it has never been harder. It is very likely that computer programs played a role in generating that information, so you need to have access to them as well as the text itself
class: middle, center
.footnote[From https://xkcd.com/1987/]
Your IT department tightly controls what can be installed, there are approved tools that you shall use.
???
Conversations go something like:
A: Could we upgrade our scikit-learn version? They fixed several bugs that we have been working around for the last 6 months.
B: Any new versions need to be audited first.
A: Ok, well, I guess we keep working around the issues then ...
Anything goes, all the modern tools, all the time. This is the frontier!
???
You have discussions like:
A: I tried to run your script to generate the charts of our monthly sales numbers. It complains about not finding the Shedazzle shell??
B: Ah yeah, Shedazzle is the latest in AI powered productivity shell, everyone is using it now, you should also change. Bash really hurts your productivity.
A: Ah ok, so ... uhm I guess I'll try installing Shedazzle then ...
B: Make sure to install the latest beta, the current release is a bit flakey.
A: Ok ... all I wanted to do is make a few charts ...
.footnote[Credit: ENI Norge]
???
Package up everything together with your code. Libraries, dependencies, everything. Makes a huge bundle.
Technically speaking this is sending a VM or a container image.
???
Deliver the instructions for assembling the kitchen sink approach.
Technically speaking this is like sending a Dockerfile instead of the built container image.
class: middle, center
.larger[Crafting a good Dockerfile requires significant expertise.]
These are the six lines you need to install one package:
RUN apt-get update && \
apt-get install --yes --no-install-recommends \
* less && \
apt-get purge && \
apt-get clean && \
rm -rf /var/lib/apt/lists/*
.larger[Crafting a good Dockerfile requires significant expertise.]
Nothing suspicious to see ...
class: middle, center
class: middle, center
Mimics what a human would do:
$ git clone https://github.com/davidmascharka/tbd-nets
--
Analyse repository:
Mimics what a human would do:
$ git clone https://github.com/davidmascharka/tbd-nets
Analyse repository and install dependencies:
$ conda install -f environment.yml
--
Start Jupyter notebook:
$ jupyter notebook
It can parse many different files that specify what dependencies to install.
This means that you can keep working the way you have always been working,
and benefit from repo2docker
from day one.
Supported configuration files: .larger[ .left-column[
requirements.txt
environment.yml
apt.txt
REQUIRE
- and more! ] .right-column[
install.R
runtime.txt
postBuild
Dockerfile
] ]
Hi Tim,
thanks for helping out on our project. To run things
you need to install Docker, repo2docker and then run
repo2docker https://github.com/davidmascharka/tbd-nets
It produces a lot of output but at the end there is
a URL that you need to paste into your browser. That
will show a Jupyter notebook.
Tina
.larger[ Can we make it even easier?
Maybe just a link people can click? ]
class: middle, center
???
Combine repo2docker
with JupyterHub to build images for any git repository
ondemand.
We call it BinderHub.
.center[
.larger[+
repo2docker
+]
A public BinderHub operated by the Binder team.
Also support RShiny, Bokeh apps, appmode, ...
Also support RShiny, Bokeh apps, appmode, ...
class: middle, center
- and many more who aren't captured in GitHub history
- Join us at https://github.com/jupyterhub/binder
- Chat with us https://gitter.im/jupyterhub/binder
- Become part of the community!
class: middle, center
In the last 80 days we have had users from almost everywhere on the planet.
???
Some countries we are missing: Cuba, North Korea, Chad, Central African Republic, Western Sahara, Mali, Guinea-Bissau, Eritrea. (Is Spitzbergen a country?)
About 1.2 million Binders launched since 1st January 2018.
Over 50000 Binders are launched every week.
Daily launches:
Most of the interesting users are in the "long tail", this makes it hard to put them on a slide.
???
https://github.com/wichit2s/programmingfundamentals/ - University course in Thailand
https://github.com/msereiko/coding_workshop - NYC coding for students
https://github.com/AndeanROAD/PythonISYA - Andean Regional Office of Astronomy for Development
https://github.com/Coleridge-Initiative - http://coleridgeinitiative.org/
https://github.com/fboylu/binder - Microsoft internal training event, you've made it when MS asks for your help right?
https://github.com/fonnesbeck/cqs_machine_learning
https://github.com/story645/EAS213 For the CUNY-CREST summer high school research experience (HIRES), 26 NYC students spent about a week learning Python for earth and atmospheric science." HIRES is the High School Initiative in Remote Sensing of the Earth Systems Engineering and Sciences
.footnote[https://www.theatlantic.com/science/archive/2018/04/the-scientific-paper-is-obsolete/556676/]
To communicate the ideas of today we need to go beyond "paper".
With Binder the sharing of computational work has left the clay tablet era.
Libraries have always been about curating and spreading knowledge!
-- .larger[ Let's put a BinderHub in every library. ]
class: middle, center
class: bottom, center