pyspark.sql.DataFrame.persist¶
-
DataFrame.
persist
(storageLevel=StorageLevel(True, True, False, False, 1))[source]¶ Sets the storage level to persist the contents of the
DataFrame
across operations after the first time it is computed. This can only be used to assign a new storage level if theDataFrame
does not have a storage level set yet. If no storage level is specified defaults to (MEMORY_AND_DISK).Note
The default storage level has changed to MEMORY_AND_DISK to match Scala in 2.0.
New in version 1.3.