Reading and Writing Parquet Files in Spark
Parquest file is one of the most common file format used in data engineering because of its performance. It is an Open Source Column oriented file format where data is stored by columns and not by rows. And Parquet file automatically uses compression algorithems to reduce the file size and faster data performance. In this … Read more