toad.detector module

Command-line tools for detecting (profiling) CSV data

Team: ESC

Examples

python detector.py -i xxx.csv -o report.csv

toad.detector.getTopValues(series, top=5, reverse=False)

Get the top or bottom n values of a series, ranked by frequency

Parameters
  • series (Series) – data series

  • top (number) – how many top/bottom values to return

  • reverse (bool) – if True, return the bottom n values instead of the top n

Returns

Series of the top/bottom n values with their percentages. Each entry is a 'value:percent' string, or None when fewer than n values are available.

Return type

Series
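The return format described above can be illustrated with a minimal pandas-only sketch. It is not toad's actual code: the helper name top_values_sketch, the index labels ('top1', 'bottom1', ...) and the two-decimal percentage format are assumptions made for illustration.

```python
import pandas as pd

def top_values_sketch(series, top=5, reverse=False):
    """Hypothetical stand-in for getTopValues (assumed behavior, not toad's source)."""
    counts = series.value_counts()        # distinct values, most frequent first
    shares = counts / series.size         # share of all rows in the series
    pairs = list(zip(counts.index, shares))
    if reverse:
        pairs.reverse()                   # least frequent first
    label = "bottom" if reverse else "top"
    # Pad with None when the series has fewer than `top` distinct values.
    values = [
        "{}:{:.2%}".format(*pairs[i]) if i < len(pairs) else None
        for i in range(top)
    ]
    index = [label + str(i + 1) for i in range(top)]  # assumed labels
    return pd.Series(values, index=index, dtype=object)

s = pd.Series(["a", "a", "a", "b", "b", "c"])
top_result = top_values_sketch(s, top=4)
bottom_result = top_values_sketch(s, top=2, reverse=True)
print(top_result)
```

With only three distinct values in the sample series, the fourth entry is None, which matches the [‘value:percent’, None] shape in the Returns description.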

toad.detector.getDescribe(series, percentiles=[0.25, 0.5, 0.75])

Get descriptive statistics of a series

Parameters
  • series (Series) – data series

  • percentiles – the percentiles to include in the output

Returns

descriptive statistics of the data, including mean, std, min, max and the requested percentiles

Return type

Series
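As a rough illustration of the output described above, the following sketch builds a comparable result from pandas' own Series.describe. The helper name describe_sketch is hypothetical, and dropping the count entry is an assumption based on the Returns description, which lists only mean, std, min, max and percentiles.

```python
import pandas as pd

def describe_sketch(series, percentiles=(0.25, 0.5, 0.75)):
    """Hypothetical stand-in for getDescribe (assumed behavior, not toad's source)."""
    stats = series.describe(percentiles=list(percentiles))
    # Assumption: the count entry is left out, since the documented
    # output mentions only mean, std, min, max and percentiles.
    return stats.drop("count")

s = pd.Series([1, 2, 3, 4, 5])
stats = describe_sketch(s)
print(stats)
```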

toad.detector.countBlank(series, blanks=[])

Count the number and percentage of blank values in a series

Parameters
  • series (Series) – data series

  • blanks (list) – list of values to treat as blank

Returns

the number of blanks (number) and the percentage of blank values (str)

Return type

(number, str)
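The pair of return values can be sketched with plain pandas as follows. The helper name count_blank_sketch is hypothetical, and treating missing values (NaN/None) as blank in addition to the entries of blanks is an assumption, as is the two-decimal percentage format.

```python
import pandas as pd

def count_blank_sketch(series, blanks=()):
    """Hypothetical stand-in for countBlank (assumed behavior, not toad's source)."""
    is_blank = series.isna()                      # missing values count as blank
    if len(blanks) > 0:
        # Also flag any value explicitly listed as blank.
        is_blank = is_blank | series.isin(list(blanks))
    n = int(is_blank.sum())
    return n, "{:.2%}".format(is_blank.mean())

s = pd.Series(["x", "", None, "N/A", "y"])
default_result = count_blank_sketch(s)
custom_result = count_blank_sketch(s, blanks=["", "N/A"])
print(default_result, custom_result)
```

Passing blanks is useful for CSV exports where missing data is encoded as placeholder strings rather than as empty cells.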

toad.detector.isNumeric(series)

Check whether the series' data type is numeric

Parameters

series (Series) – data series

Returns

bool
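One way such a check can be written is to inspect the dtype kind code of the series. This is a sketch under assumptions: the helper name is_numeric_sketch is hypothetical, and counting only integer, float and complex kinds as numeric (so that bool and string columns are not) is an illustrative choice.

```python
import pandas as pd

def is_numeric_sketch(series):
    """Hypothetical stand-in for isNumeric (assumed behavior, not toad's source)."""
    # dtype.kind is a one-letter code: 'i' integer, 'f' float, 'c' complex.
    return series.dtype.kind in "ifc"

ints = pd.Series([1, 2, 3])
floats = pd.Series([0.5, 1.5])
digit_strings = pd.Series(["1", "2", "3"])  # digits stored as text

print(is_numeric_sketch(ints), is_numeric_sketch(floats), is_numeric_sketch(digit_strings))
```

Note that a column of digits read in as text is not numeric under this check; it would need to be converted first, for example with pd.to_numeric.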

toad.detector.detect(dataframe)

Detect (profile) the data in a DataFrame

Parameters

dataframe (DataFrame) – the data to be detected

Returns

the detection report

Return type

DataFrame
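To give a feel for what a detection report might look like, here is a simplified pandas-only sketch that produces one row per input column. It is not toad's implementation: the helper name detect_sketch and the report columns (type, size, missing, unique) are assumptions chosen for illustration, and the real report may contain different or additional fields.

```python
import pandas as pd

def detect_sketch(dataframe):
    """Hypothetical, simplified stand-in for detect (not toad's source)."""
    names, rows = [], []
    for name, series in dataframe.items():
        names.append(name)
        rows.append({
            "type": str(series.dtype),                          # column dtype
            "size": series.size,                                # number of rows
            "missing": "{:.2%}".format(series.isna().mean()),   # share of NaN/None
            "unique": series.nunique(),                         # distinct non-missing values
        })
    # One report row per input column, indexed by column name.
    return pd.DataFrame(rows, index=names)

df = pd.DataFrame({
    "age": [25, 32, None, 41],
    "city": ["NY", "LA", "NY", None],
})
report = detect_sketch(df)
print(report)
```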