Actually run the workload.
Validate that the data set has a correct schema and fix if necessary.
Validate that the data set has a correct schema and fix if necessary. This is to solve issues such as the KMeans load-from-disk pathway returning a DataFrame with all the rows as StringType instead of DoubleType.