pandas.DataFrame supported APIs

The following tables list both implemented and unimplemented methods. If you need an operation that is listed as not implemented, feel free to open an issue at http://github.com/elastic/eland, or give a thumbs up to an existing issue. Contributions are also welcome!

In the DataFrame method table below, the first column contains the method name and the second column flags whether Eland implements that method: Y stands for yes, N stands for no.

The first table below is a prioritised list of methods, taken from https://github.com/adgirish/kaggleScape/blob/master/results/annotResults.csv and ordered by descending count.

=======================  =====  =================
Method                   Count  Implemented (y/n)
=======================  =====  =================
pd.read_csv              1422   y
pd.DataFrame             886    y
df.append                792    n
df.mean                  783    y
df.head                  783    y
df.drop                  761    y
df.sum                   755    y
df.to_csv                693    y
df.get                   669    y
df.mode                  653    n
df.astype                649    n
df.sub                   637    n
pd.concat                582    n
df.apply                 577    n
df.groupby               557    n
df.join                  544    n
df.fillna                543    n
df.max                   508    y
df.reset_index           434    n
pd.unique                433    n
df.le                    405    n
df.count                 399    y
pd.value_counts          397    y
df.sort_values           390    n
df.transform             387    n
df.merge                 376    n
df.add                   346    n
df.isnull                338    n
df.min                   321    y
df.copy                  314    n
df.replace               300    n
df.std                   261    n
df.hist                  246    y
df.filter                234    n
df.describe              220    y
df.ne                    218    n
df.corr                  217    n
df.median                217    n
df.items                 212    n
pd.to_datetime           204    n
df.isin                  203    n
df.dropna                195    n
pd.get_dummies           190    n
df.rename                185    n
df.info                  180    y
df.set_index             166    n
df.keys                  159    y
df.sample                155    n
df.agg                   140    y
df.where                 138    n
df.boxplot               134    n
df.clip                  116    n
df.round                 116    n
df.abs                   101    n
df.stack                 97     n
df.tail                  94     y
df.update                92     n
df.iterrows              90     n
df.transpose             87     n
df.any                   85     n
df.pipe                  80     n
pd.eval                  73     n
df.eval                  73     n
pd.read_json             72     n
df.nunique               70     y
df.pivot                 70     n
df.select                68     n
df.as_matrix             67     n
df.notnull               66     n
df.cumsum                66     n
df.prod                  64     n
df.unstack               64     n
df.drop_duplicates       63     n
df.div                   63     n
pd.crosstab              59     n
df.select_dtypes         57     y
df.pow                   56     n
df.sort_index            56     n
df.product               52     n
df.isna                  51     n
df.dot                   46     n
pd.cut                   45     n
df.bool                  44     n
df.to_dict               44     n
df.diff                  44     n
df.insert                44     n
df.pop                   44     n
df.query                 43     y
df.var                   43     n
df.__init__              41     y
pd.to_numeric            39     n
df.squeeze               39     n
df.ge                    37     n
df.quantile              37     n
df.reindex               37     n
df.rolling               35     n
pd.factorize             32     n
pd.melt                  31     n
df.melt                  31     n
df.rank                  31     n
pd.read_table            30     n
pd.pivot_table           30     n
df.idxmax                30     n
pd.test                  29     n
df.iteritems             29     n
df.shift                 28     n
df.mul                   28     n
pd.qcut                  25     n
df.set_value             25     n
df.all                   24     n
df.skew                  24     n
df.aggregate             23     y
pd.match                 22     n
df.nlargest              22     n
df.multiply              21     n
df.set_axis              19     n
df.eq                    18     n
df.resample              18     n
pd.read_sql              17     n
df.duplicated            16     n
pd.date_range            16     n
df.interpolate           15     n
df.memory_usage          15     n
df.divide                14     n
df.cov                   13     n
df.assign                12     n
df.subtract              12     n
pd.read_pickle           11     n
df.applymap              11     n
df.first                 11     n
df.kurt                  10     n
df.truncate              10     n
df.get_value             9      n
pd.read_hdf              9      n
df.to_html               9      y
pd.read_sql_query        9      n
df.take                  8      n
df.to_pickle             7      n
df.itertuples            7      n
df.to_string             7      y
df.last                  7      n
df.sem                   7      n
pd.to_pickle             7      n
df.to_json               7      n
df.idxmin                7      n
df.xs                    6      n
df.combine               6      n
pd.rolling_mean          6      n
df.to_period             6      n
df.convert_objects       5      n
df.mask                  4      n
df.pct_change            4      n
df.add_prefix            4      n
pd.read_excel            4      n
pd.rolling_std           3      n
df.to_records            3      n
df.corrwith              3      n
df.swapaxes              3      n
df.__iter__              3      n
df.to_sql                3      n
pd.read_feather          3      n
df.to_feather            3      n
df.__len__               3      n
df.kurtosis              3      n
df.mod                   2      n
df.to_sparse             2      n
df.get_values            2      n
df.__eq__                2      n
pd.bdate_range           2      n
df.get_dtype_counts      2      n
df.combine_first         2      n
df._get_numeric_data     2      n
df.nsmallest             2      n
pd.scatter_matrix        2      n
df.rename_axis           2      n
df.__setstate__          2      n
df.cumprod               2      n
df.__getstate__          2      n
df.equals                2      n
df.__getitem__           2      y
df.clip_upper            2      n
df.floordiv              2      n
df.to_excel              2      n
df.reindex_axis          1      n
pd.to_timedelta          1      n
df.ewm                   1      n
df.tz_localize           1      n
df.tz_convert            1      n
df.to_hdf                1      n
df.lookup                1      n
pd.merge_ordered         1      n
df.swaplevel             1      n
df.first_valid_index     1      n
df.lt                    1      n
df.add_suffix            1      n
pd.rolling_median        1      n
df.to_dense              1      n
df.mad                   1      n
df.align                 1      n
df.__copy__              1      n
pd.set_eng_float_format  1      n
=======================  =====  =================
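
The counts and y/n flags in the table above can be combined to see which missing methods would matter most. The following is a minimal sketch, not part of Eland: it assumes that a higher count means a higher priority and that "y" means implemented. The names `ROWS`, `missing_by_priority` and `weighted_coverage` are hypothetical, and only a handful of rows are copied from the table for illustration.

```python
# Illustrative subset of the table above: (method, count, implemented flag).
ROWS = [
    ("pd.read_csv", 1422, "y"),
    ("df.append", 792, "n"),
    ("df.mean", 783, "y"),
    ("df.mode", 653, "n"),
    ("df.astype", 649, "n"),
    ("df.max", 508, "y"),
]


def missing_by_priority(rows):
    """Return (method, count) pairs flagged "n", highest count first."""
    missing = [(name, count) for name, count, flag in rows if flag.lower() == "n"]
    return sorted(missing, key=lambda item: item[1], reverse=True)


def weighted_coverage(rows):
    """Return the share of the total count that belongs to methods flagged "y"."""
    total = sum(count for _, count, _ in rows)
    covered = sum(count for _, count, flag in rows if flag.lower() == "y")
    return covered / total if total else 0.0


if __name__ == "__main__":
    for name, count in missing_by_priority(ROWS):
        print(f"{name}: {count}")
    print(f"count-weighted coverage: {weighted_coverage(ROWS):.1%}")
```

Weighting by count rather than counting methods gives a different picture from the raw y/n tally: a few heavily used methods can account for most of the weight even when most rows are flagged "n".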

=================  ===============================  ================================
DataFrame method   Eland implementation? (Y/N/P/D)  Notes for current implementation
=================  ===============================  ================================
T                  N
abs                N
add                N
add_prefix         N
add_suffix         N
agg, aggregate     Y
align              N
all                N
any                N
append             N
apply              N                                See agg
applymap           N
as_blocks          N
as_matrix          N
asfreq             N
asof               N
assign             N
astype             N
at                 N
at_time            N
axes               N
between_time       N
bfill              N
blocks             N
bool               N
boxplot            N
clip               N
clip_lower         N
clip_upper         N
combine            N
combine_first      N
compound           N
consolidate        N
convert_objects    N
copy               N
corr               N
corrwith           N
count              Y
cov                N
cummax             N
cummin             N
cumprod            N
cumsum             N
describe           Y
diff               N
div                N
divide             N
dot                N
drop               Y
drop_duplicates    N
dropna             N
dtypes             Y
duplicated         N
empty              Y
eq                 N
equals             N
eval               N
ewm                N
expanding          N
ffill              N
fillna             N
filter             N
first              N
first_valid_index  N
floordiv           N
from_csv           N
from_dict          N
from_items         N
from_records       N
ftypes             N
ge                 N
get                Y
get_dtype_counts   N
get_ftype_counts   N
get_value          N
get_values         N
groupby            N
gt                 N
head               Y
hist               Y
iat                N
idxmax             N
idxmin             N
iloc               N
infer_objects      N
info               Y
insert             N
interpolate        N
is_copy            N
isin               N
isna               N
isnull             N
items              N
iteritems          N
iterrows           N
itertuples         N
ix                 N
join               N
keys               Y
kurt               N
kurtosis           N
last               N
last_valid_index   N
le                 N
loc                N
lookup             N
lt                 N
mad                N
mask               N
max                Y
mean               Y
median             N
melt               N
memory_usage       N
merge              N
min                Y
mod                N
mode               N
mul                N
multiply           N
ndim               N
ne                 N
nlargest           N
notna              N
notnull            N
nsmallest          N
nunique            Y
pct_change         N
pipe               N
pivot              N
pivot_table        N
plot               N
pop                N
pow                N
prod               N
product            N
quantile           N
query              Y
radd               N
rank               N
rdiv               N
reindex            N
reindex_axis       N
reindex_like       N
rename             N
rename_axis        N
reorder_levels     N
replace            N
resample           N
reset_index        N
rfloordiv          N
rmod               N
rmul               N
rolling            N
round              N
rpow               N
rsub               N
rtruediv           N
sample             N
select             N
select_dtypes      Y
sem                N
set_axis           N
set_index          N
set_value          N
shape              Y
shift              N
size               N
skew               N
slice_shift        N
sort_index         N
sort_values        N
sortlevel          N
squeeze            N
stack              N
std                N
style              N
sub                N
subtract           N
sum                Y
swapaxes           N
swaplevel          N
tail               Y
take               N
to_clipboard       N
to_csv             Y
to_dense           N
to_dict            N
to_excel           N
to_feather         N
to_gbq             N
to_hdf             N
to_html            Y
to_json            N
to_latex           N
to_msgpack         N
to_panel           N
to_parquet         N
to_period          N
to_pickle          N
to_records         N
to_sparse          N
to_sql             N
to_stata           N
to_string          Y                                Default sets max_rows=60
to_timestamp       N
to_xarray          N
transform          N
transpose          N
truediv            N
truncate           N
tshift             N
tz_convert         N
tz_localize        N
unstack            N
update             N
values             N
var                N
where              N
xs                 N                                Deprecated in pandas
=================  ===============================  ================================