pydit.statistics.profile_dataframe_statistics.profile_dataframe

pydit.statistics.profile_dataframe_statistics.profile_dataframe(obj, return_dict=False, unique_min=10)[source]

Create a summary of a DataFrame with various statistics.

Returns a DataFrame or a dict with common statistics to profile the data. In particular it focuses on unique (cardinality) blanks, nulls, and datetimes.

Parameters:
  • obj (pandas.DataFrame) – DataFrame to profile.

  • return_dict (bool, optional, default=False) – If True, return a dict instead of a DataFrame.

  • unique_min (int, optional, default=10)

Returns:

DataFrame with various statistics.

Return type:

DataFrame