Thanks for following up and for the pointer to Data Catalog. I will take a look at it.

Since the Apache Hive metastore is already included in Data Catalog, what I need is just a PrestoSQL cluster as an alternative to Apache Spark. PrestoSQL is an excellent SQL query engine for:
* interactive queries from workloads like dashboards via Superset
* ad hoc queries from Superset or notebooks
* ETL tasks to move data among different data stores with transaction support.

I’m happy to present it back once I have enough experience with Data Catalog.

On Aug 10, 2020, at 10:09 AM, Juana Nakfour <jnakfour@redhat.com> wrote:

Hi Ke,
We do have an ODH component that covers some of Presto's functionality, Data Catalog: https://opendatahub.io/news/2019-12-15/data-catalog-in-odh.html We are working on migrating Data Catalog to ODH 0.7+. If you are interested in contributing Presto to ODH, you are welcome to present it to the community and tell us how it compares to Data Catalog.


On Fri, Aug 7, 2020 at 11:24 AM Ke Zhu - kzhu@us.ibm.com <kzhu@us.ibm.com> wrote:
Hello all! I’m new to OpenDataHub and I’ve seen a list of familiar OSS projects on the website: https://opendatahub.io/docs.html

But I didn’t see PrestoSQL (https://prestosql.io/). I wonder if there’s any interest in including it in OpenDataHub, since it’s very popular and capable for my team’s daily work, especially dashboard use cases.

I’m also happy to contribute code back.