eland.DataFrame.describe

DataFrame.describe() → pandas.core.frame.DataFrame

Generate descriptive statistics that summarize the central tendency, dispersion and shape of a dataset’s distribution, excluding NaN values.

Analyzes both numeric and object series, as well as DataFrame column sets of mixed data types. The output varies depending on what is provided.

TODO - add additional arguments (currently only numeric values are supported)

Returns
pandas.DataFrame:

Summary statistics for each column

Examples

>>> import eland as ed
>>> df = ed.DataFrame('localhost', 'flights', columns=['AvgTicketPrice', 'FlightDelayMin'])
>>> df.describe()  # ignoring percentiles  # doctest: +SKIP
       AvgTicketPrice  FlightDelayMin
count    13059.000000    13059.000000
mean       628.253689       47.335171
std        266.386661       96.743006
min        100.020531        0.000000
...
...
...
max       1199.729004      360.000000
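
To make the non-percentile rows of the summary concrete, here is a minimal standard-library sketch of how count, mean, std, min and max can be computed for a single numeric column while skipping NaN values. It is an illustration only, not eland's implementation: `describe_column` and the `delays` values are hypothetical, and the sample standard deviation (n - 1 denominator) follows the pandas convention; the source does not state how eland computes std.

```python
import math
import statistics


def describe_column(values):
    """Summarize one numeric column, skipping NaN entries (hypothetical helper)."""
    # Drop NaN values first, mirroring the "excluding NaN values" rule.
    clean = [v for v in values if not math.isnan(v)]
    return {
        "count": float(len(clean)),
        "mean": statistics.fmean(clean),
        # Sample standard deviation (n - 1 denominator), assumed from pandas.
        "std": statistics.stdev(clean),
        "min": min(clean),
        "max": max(clean),
    }


# Made-up delay values, not taken from the flights index.
delays = [0.0, 30.0, float("nan"), 60.0, 90.0]
summary = describe_column(delays)

# Print the rows in a layout similar to the describe() output.
for name, value in summary.items():
    print(f"{name:<6}{value:>14.6f}")
```

The NaN entry is excluded from every statistic, so count is 4 rather than 5.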