Data profiling
There is a handy function in Optimus called profile
that returns useful stats about our dataset. Let's see how to use it:
df.profile(bins=5)
This code will return a dictionary:
{'columns': {'id': {'stats': {'match': 504, Â Â Â Â 'missing': 0, Â Â Â Â 'mismatch': 0, Â Â Â Â 'profiler_dtype': {'dtype': 'int', 'categorical': True}, Â Â Â Â 'frequency': [{'value': 1, 'count': 1}, Â Â Â Â {'value': 332, 'count': 1}, Â Â Â Â {'value': 345, 'count': 1}, Â Â Â Â {'value': 344, 'count': 1}, Â Â Â Â {'value': 343, 'count': 1}], Â Â Â Â 'count_uniques': 504}, Â Â 'dtype': 'int64...