Databricks introduces OpenSharing, a new standard for AI sharing

Just as Databricks developed open standards in 2021 to enable enterprises to securely share data with internal and external partners, the vendor introduced OpenSharing to enable organizations to share their AI assets.

Five years ago, Databricks built Delta Sharing, a subproject within the open source Delta Lake project, to enable enterprises to share and collaborate on data without moving or duplicating it.

OpenSharing, released on June 10 and available on GitHub, is hosted by the Linux Foundation and is an extension of Delta Sharing. New open source standards enable organizations to further collaborate across platforms, departments, and partners by sharing AI models, agent skills with expertise and workflows, and unstructured data.

Additionally, OpenSharing expands collaboration by adding support for platforms that connect to the Apache Iceberg REST catalog. This enables new cross-organizational sharing and adds partnerships with on-premises storage partners to enable non-moveable sharing of on-premises data and AI assets.

“OpenSharing represents a shift from simple data exchange to a unified, managed interface for AI and data stacks,” William McKnight, president of McKnight Consulting, told TechTarget. “Beyond traditional tables… this framework provides a blueprint for studying and extending how autonomous agents interact with distributed data, which could be extremely important for data sharing.”

Stephen Catanzano, an analyst at Omdia, a division of Informa TechTarget, similarly called OpenSharing an important development.

“OpenSharing is a solid development in the AI infrastructure space,” he told TechTarget. “This is especially important because it extends secure zero-copy sharing beyond structured data to include agent skills and AI models, assets that are becoming critical in the agent era. Until now, organizations have not had a standardized way to share these AI components across platforms.”

San Francisco-based Databricks pioneered the data lakehouse format for storing data. Like many data management vendors, Databricks has had machine learning capabilities since its founding in 2013, but in recent years has focused much of its product development on enabling customers to build generative and agent AI capabilities.