46 Handbook of Big Data
8. Apache Hadoop. Hadoop, 2009.
9. Ryan Hafen, Luke Gosink, Jason McDermott, Karin Rodland, Kerstin Kleese-Van Dam,
and William S. Cleveland. Trelliscope: A system for detailed visualization in the deep
analysis of large complex data. In IEEE Symposium on Large-Scale Data Analysis and
Visualization (LDAV ), pp. 105–112. IEEE, Atlanta, GA, 2013.
10. Michael J. Kane. Scatter matrix concordance: A diagnostic for regressions on subsets
of data. Statistical Analysis and Data Mining: The ASA Data Science Journal, 2015.
11. Ariel Kleiner, Ameet Talwalkar, Purnamrita Sarkar, and Michael I. Jordan. A scalable
bootstrap for massive data. Journal of the Royal Statistical Society: Series B (Statistical
Methodology), 2014.
12. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation
for Statistical Computing, Vienna, Austria, 2012.
13. Steven L. Scott, Alexander W. Blocker, Fernando V. Bonassi, Hugh A. Chipman,
Edward I. George, and Robert E. McCulloch. Bayes and big data: The consensus Monte
Carlo algorithm. In EFaB@Bayes 250 Conference, volume 16, 2013.
14. Konstantin Shvachko, Hairong Kuang, Sanjay Radia, and Robert Chansler. The Hadoop
distributed file system. In IEEE 26th Symposium on Mass Storage Systems and Tech-
nologies, pp. 1–10. IEEE, Incline Village, NV, 2010.
15. Luke Tierney, Anthony Rossini, Na Li, and Han Sevcikova. SNOW: Simple Network of
Workstations. R package version 0.3-13, 2013.
16. Edward R. Tufte. Visual Explanations: Images and Quantities, Evidence and Narrative,
volume 36. Graphics Press, Cheshire, CT, 1997.
17. John W. Tukey. Exploratory Data Analysis. 1977.
18. John W. Tukey and Paul A. Tukey. Computer graphics and exploratory data analysis:
An introduction. The Collected Works of John W. Tukey: Graphics: 1965–1985, 5:419,
1988.
19. Shivaram Venkataraman. SparkR: R frontend for Spark. R package version 0.1, 2013.
20. Hadley Wickham. The split-apply-combine strategy for data analysis. Journal of Sta-
tistical Software, 40(1):1–29, 2011.
21. Leland Wilkinson, Anushka Anand, and Robert L. Grossman. Graph-theoretic scagnos-
tics. In INFOVIS, volume 5, p. 21, 2005.
22. Leland Wilkinson and Graham Wills. Scagnostics distributions. Journal of Computa-
tional and Graphical Statistics, 17(2):473–491, 2008.
23. Matei Zaharia, Mosharaf Chowdhury, Michael J. Franklin, Scott Shenker, and Ion
Stoica. Spark: Cluster computing with working sets. In Proceedings of the 2nd USENIX
Conference on Hot Topics in Cloud Computing, pp. 10–10, 2010.