Saturday, June 10, 2023
HomeBig DataExtending Databricks Unity Catalog with an Open Apache Hive Metastore API

Extending Databricks Unity Catalog with an Open Apache Hive Metastore API


As we speak, we’re excited to announce the preview of a Hive Metastore (HMS) interface for Databricks Unity Catalog, which permits any software program appropriate with Apache Hive to hook up with Unity Catalog. Apache Hive is essentially the most extensively supported catalog interface within the {industry}, usable in nearly each main computing platform. This function lets organizations centralize their information administration, discovery and governance in Unity Catalog and hook up with it from a variety of computing platforms, together with Amazon Elastic MapReduce (EMR), open supply Apache Spark, Amazon Athena, Presto, Trino, and others. It additionally ensures constant information governance throughout these platforms.

To hitch this preview, contact your Databricks consultant.

Hive Metastore interface for Unity Catalog
Hive Metastore interface for Unity Catalog

In right this moment’s quickly evolving information administration panorama, many organizations run a number of computing platforms and face the problem of constantly implementing information discovery and governance throughout them. This typically requires information groups to juggle a number of information catalogs and governance instruments, resulting in excessive operational overhead and difficulties in information discovery, entry administration, and auditing.

Databricks Unity Catalog is a unified governance answer for information, analytics and AI with easy options to find information, handle permissions, audit accesses, observe information lineage and high quality, and share information throughout organizations. With the HMS interface, now you can join any software program that helps the industry-standard Apache Hive API to Unity, tremendously simplifying governance throughout computing platforms.

On this weblog, we’ll discover some great benefits of this function and the way it can improve your information administration practices.

Why we constructed a Hive Metastore interface for Unity Catalog

Openness

In a various information ecosystem, openness performs a vital position in seamless information integration and collaboration. Apache Hive is essentially the most extensively supported catalog API within the {industry}. Unity Catalog’s open HMS interface aligns with our open lakehouse platform technique and gives a unified and standardized strategy for accessing enterprise information and avoiding vendor lock-in. By utilizing Unity Catalog with this open interface, organizations can simplify their information administration structure and guarantee it might help each present and future instruments.

Constant Information Governance

Managing information throughout a number of platforms poses a big problem in sustaining constant governance. The HMS interface in Unity Catalog successfully tackles this problem, by extending enterprise-grade governance offered by Unity Catalog to a various vary of compute platforms. This integration ensures constant information compliance, enhances safety measures, enforces strong entry controls, and facilitates centralized auditing.

Simple Path to Modernize Legacy Workloads

For patrons who’ve lengthy standing legacy workloads operating on completely different computing platforms, the HMS interface integration permits them to centralize metadata and entry controls in Unity Catalog, encompassing each their legacy and Databricks workloads. This integration provides a simple path for migrating these workloads operating on platforms equivalent to Amazon EMR, whereas guaranteeing consistency all through the migration course of.

Price Optimization

The HMS interface for Unity Catalog brings important value optimization advantages to organizations. Historically, organizations needed to make investments important time and assets into managing a number of catalogs, which not solely incurred extra prices but in addition launched complexity and potential inconsistencies. Syncing information and insurance policies between a number of catalogs is fragile and error-prone. This integration eliminates the necessity to handle, keep, and synchronize separate information catalogs.

Conclusion

The HMS interface for Unity Catalog brings collectively the perfect of each worlds—open interoperability and enterprise-grade governance. By connecting Unity Catalog with numerous compute platforms, organizations can unlock enhanced information accessibility, improved governance, scalability, value optimization, interoperability, and future-proofing, and eradicate the excessive operational value of managing a number of information platforms right this moment. This lets organizations give attention to taking advantage of their information belongings as a substitute.

Contact your Databricks Consultant to affix this thrilling preview!

Additionally, don’t miss extra thrilling updates about information and AI governance and register for our periods on the Information and AI Summit.



Supply hyperlink

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -
Google search engine

Most Popular

Recent Comments