poisson_sampling: blog post on Poisson-based subsampling and bootstrapping of large data sets: http://blog.cloudera.com/blog/2013/02/how-to-resample-from-a-large-data-set-in-parallel-with-r-on-hadoop
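
As a rough illustration of the general idea behind Poisson-based resampling (not necessarily the exact procedure in the linked post, which uses R on Hadoop), each record can independently draw a Poisson-distributed count and be emitted that many times. Because no record needs to know about any other, the loop can run as a per-record map step. The names `poisson_draw`, `poisson_resample`, and `rate` are hypothetical and chosen only for this sketch, which uses the Python standard library.

```python
import math
import random


def poisson_draw(rng, lam):
    """Draw one Poisson(lam) variate using Knuth's multiplication method.

    Suitable for small lam (such as lam <= 1, as used here).
    """
    threshold = math.exp(-lam)
    count = 0
    product = rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def poisson_resample(records, rate, rng):
    """Emit each record k times, where k ~ Poisson(rate), independently.

    rate = 1.0 approximates a bootstrap resample (expected size len(records));
    rate < 1.0 gives a with-replacement subsample of expected size
    rate * len(records).
    """
    sample = []
    for record in records:
        sample.extend([record] * poisson_draw(rng, rate))
    return sample


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, an assumption for reproducibility
    data = list(range(10000))
    bootstrap = poisson_resample(data, 1.0, rng)
    subsample = poisson_resample(data, 0.1, rng)
    print(len(bootstrap), len(subsample))
```

The resample size is itself random: with `rate=1.0` it is only approximately the size of the input, which is the usual trade-off for avoiding a global pass over the data.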