eland.DataFrame.describe#

DataFrame.describe() DataFrame#

Generate descriptive statistics that summarize the central tendency, dispersion and shape of a dataset’s distribution, excluding NaN values.

Analyzes both numeric and object series, as well as DataFrame column sets of mixed data types. The output will vary depending on what is provided. Refer to the notes below for more detail.

TODO - add additional arguments (current only numeric values supported)

Returns#

pandas.Dataframe:

Summary information

See Also#

pandas.DataFrame.describe

Examples#

>>> df = ed.DataFrame('http://localhost:9200', 'flights', columns=['AvgTicketPrice', 'FlightDelayMin']) # ignoring percentiles
>>> df.describe() 
       AvgTicketPrice  FlightDelayMin
count    13059.000000    13059.000000
mean       628.253689       47.335171
std        266.386661       96.743006
min        100.020531        0.000000
...
...
...
max       1199.729004      360.000000