As we saw in the previous chapter, file formats such as Parquet already use the columnar format; the same benefits are being realized with Spark using a columnar format in memory. Some of the benefits are as follows:
- Denser storage
- Compatibility with external already columnar formats, such as Parquet TensorFlow