Chapter 12 References and resources
12.1 Online resources
12.1.1 Getting started with sparklyr
- The Getting Started chapter of the Mastering Spark with R book
- The Prerequisites appendix of the Mastering Spark with R book
- RStudio’s spark website
- Overview of the dplyr syntax
- The Data transformation chapter of R for Data Science
12.1.2 DBI, Spark SQL, Hive
- Spark SQL, Built-in Functions
- Documentation on Hive Operators and User-Defined Functions website
- The DBI package on CRAN
- The Introduction to DBI page of Databases using R
- The Overview page of dbplyr documentation
12.1.3 Docker
- Get started with Docker
- The Rocker Project - Docker Containers for the R Environment
12.1.4 Spark API, Java, Scala and friends
- Spark Scala API documentation
- Spark Java API documentation
- Wikipedia’s article on Method Chaining
- Stackoverflow discussion of reflection
12.1.5 Apache Arrow
- Homepage of Apache Arrow
- Arrow C++ library installation guide
- R package arrow on CRAN
- R package arrow on GitHub
12.2 Physical Books
- Spark: The Definitive Guide by Matei Zaharia, Bill Chambers
- Mastering Spark with R by Edgar Ruiz, Kevin Kuo, Javier Luraschi