Utils¶

snap_ml_spark.Utils.dump_to_snap_format(X, y, filename, transpose=False, implicit_vals=False)¶

Non-distributed data writing to snap format

Parameters

X (numpy array or sparse matrix) – The data used for training or inference.
y (numpy array) – The labels of the samples in X.
filename (str) – The file where X and y will be stored in snap format.
transpose (bool , default : False) – If transpose is True, X will be stored in transposed format.

snap_ml_spark.Utils.read_from_snap_format(filename)¶

Non-distributed data loading from snap format

Parameters: filename (str) – The file where the data resides.
Returns: X, y – Returns two datasets. X : the data used for training or inference y : the labels of the samples in X.
Return type: numpy array or sparse matrix, numpy array