Caching is a technique of storing intermediate results in memory or disk. Computing the whole data again is not needed if we are using it again in further data processing.
In SPARK we do cache the DataFrame so we can use the result in next tranforma...