toad.detector module

Command line tools for detecting csv data

Team: ESC

Examples

python detector.py -i xxx.csv -o report.csv

toad.detector.getTopValues(series, top=5, reverse=False)[source]

Get top/bottom n values

Parameters:
  • series (Series) – data series
  • top (number) – number of top/bottom n values
  • reverse (bool) – it will return bottom n values if True is given
Returns:

Series of top/bottom n values and percentage. [‘value:percent’, None]

Return type:

Series

toad.detector.getDescribe(series, percentiles=[0.25, 0.5, 0.75])[source]

Get describe of series

Parameters:
  • series (Series) – data series
  • percentiles – the percentiles to include in the output
Returns:

the describe of data include mean, std, min, max and percentiles

Return type:

Series

toad.detector.countBlank(series, blanks=[])[source]

Count number and percentage of blank values in series

Parameters:
  • series (Series) – data series
  • blanks (list) – list of blank values
Returns:

number of blanks str: the percentage of blank values

Return type:

number

toad.detector.isNumeric(series)[source]

Check if the series’s type is numeric

Parameters:series (Series) – data series
Returns:bool
toad.detector.detect(dataframe)[source]

Detect data

Parameters:dataframe (DataFrame) – data that will be detected
Returns:report of detecting
Return type:DataFrame