[FEATURE REQUEST]: Please consider extending indexing support to non - spark implementation of structured storage, specifically, stand - alone Java and Rust implementation of Parquet / Delta Lake #514

MironAtHome · 2021-12-03T20:31:37Z

Feature requested
Similar to Access database, an SQL Server engine counterpart, running in process of application, please consider extending indexing support to Spark counterpart, currently provided as libraries, over delta lake. Of those two notable implementations are Java DSR/DSW ( stand - alone reader and writer ):
https://github.com/delta-io/connectors
and Rust delta lake implementation:
https://github.com/delta-io/delta-rs

With proliferation and success of such Data Warehousing implementations as Snowflake, the ETL pipeline of data warehouse increasingly relies on external storage as staging area, mostly, for cost saving features.
Extending support to indexing over external delta lake storage would significantly strengthen appeal of Azure Storage solutions to practically all BI implementations, without exceptions, providing nice financial boost to entire platform as well as competitive edge with regards to query performance.

As a type of user = data engineer, data scientist, developer etc., I want desired feature, in order to goal.

Acceptance criteria

Describe what you consider (e.g., functionality requirement) has to be true to mark this feature as complete

Acceptance criteria 1
Acceptance criteria 2
...

Success criteria

Define any measurements that need to be performed

Measurement 1 (e.g., measure the index overhead)
Measurement 2 (e.g., measure the execution time for TPC-H Q1)
UX improvement 1 (e.g., it should take one command to do this)

Additional context

Add any other context or screenshots about the feature request here.

MironAtHome added enhancement New feature or request untriaged This is the default tag for a newly created issue labels Dec 3, 2021

dai-chen mentioned this issue Jan 28, 2022

[FEATURE REQUEST]: Integration with Presto/Trino query engine #526

Open

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE REQUEST]: Please consider extending indexing support to non - spark implementation of structured storage, specifically, stand - alone Java and Rust implementation of Parquet / Delta Lake #514

[FEATURE REQUEST]: Please consider extending indexing support to non - spark implementation of structured storage, specifically, stand - alone Java and Rust implementation of Parquet / Delta Lake #514

MironAtHome commented Dec 3, 2021 •

edited

Loading

[FEATURE REQUEST]: Please consider extending indexing support to non - spark implementation of structured storage, specifically, stand - alone Java and Rust implementation of Parquet / Delta Lake #514

[FEATURE REQUEST]: Please consider extending indexing support to non - spark implementation of structured storage, specifically, stand - alone Java and Rust implementation of Parquet / Delta Lake #514

Comments

MironAtHome commented Dec 3, 2021 • edited Loading

MironAtHome commented Dec 3, 2021 •

edited

Loading