eland.Index#

class eland.Index(query_compiler: QueryCompiler, es_index_field: Optional[str] = None)#

The index for an eland.DataFrame.

TODO - This currently has very different behaviour than pandas.Index

Currently, the index is a field that exists in every document in an Elasticsearch index. For slicing and sorting operations it must be a docvalues field. By default _id is used, which can’t be used for range queries and is inefficient for sorting:

https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-id-field.html (The value of the _id field is also accessible in aggregations or for sorting, but doing so is discouraged as it requires to load a lot of data in memory. In case sorting or aggregating on the _id field is required, it is advised to duplicate the content of the _id field in another field that has doc_values enabled.)

__init__(query_compiler: QueryCompiler, es_index_field: Optional[str] = None)#

Methods

__init__(query_compiler[, es_index_field])

es_info(buf)

Attributes

ID_INDEX_FIELD

ID_SORT_FIELD

es_index_field

is_source_field

sort_field