Special thanks to Phillip Jones, Senior Product Manager, and Harshal Brahmbhatt, Systems Engineer from Cloudflare, for their contributions to this blog.
Organizations across industries want to share their data and AI assets in a single, unified way, regardless of clouds or regions. However, many organizations still struggle to share data with customers, teams and partners, facing platform compatibility issues and limitations, high egress costs, and a lack of governance and security. Databricks and the Linux Foundation developed Delta Sharing as the first open approach for secure data sharing. Customers have been using Delta Sharing to easily and securely share data across platforms, clouds and regions, without the need for replication.
Today, we’re excited to announce that Delta Sharing with Cloudflare R2 integration is in Public Preview, helping customers who share data across clouds and regions save on egress costs. Databricks now supports Delta Sharing from Cloudflare R2, Cloudflare’s zero-egress, distributed object storage offering. Joint customers can now take advantage of zero egress fees without costly replication across regions and with no vendor lock-in.
Strategic partnership with Cloudflare
Databricks partnered with Cloudflare to help organizations share their data with customers and partners in a single unified way, regardless of cloud or region. Cloudflare R2 is a zero-egress distributed storage service offered by Cloudflare that enables customers to share the most up-to-date datasets with their partners, suppliers, and lines of business without compromising security and privacy.
Matthew Prince, co-founder and CEO of Cloudflare, explained the value of the partnership: “The combination of Cloudflare’s vast global network and zero egress storage, together with Databricks’ powerful sharing and processing capabilities, will give our joint customers the fastest, most secure, and most affordable data sharing capabilities across the globe.”
Using Delta Sharing with Cloudflare R2, customers are now in control of where to move and use their data and AI (live datasets, models, and notebooks), sharing the latest across platforms, clouds and regions with no need for replication, zero egress costs, no vendor lock-in, and without compromising on security and governance.
“Delta Sharing provides the first open protocol for sharing data across various computing platforms, clouds and regions. We’re excited about how this will push open interchange forward and help all of our customers collaborate more easily,” explained Matei Zaharia, Co-Founder and CTO at Databricks, about the partnership with Cloudflare.
Allium saves up to $645K per year using Delta Sharing and Cloudflare R2
In the last 15 years, the financial industry has been transformed by the introduction of blockchain technology and the use of cryptocurrency across industries. This evolution has generated an ever-increasing amount of transactional data from public blockchains, available for investors and traders to gain critical, real-time insights.
Allium is a Databricks customer that provides a simple data platform with fast and accurate blockchain data. They help customers ranging from financial institutions to crypto-native companies unlock the full power of their data. Allium offers a dedicated data infrastructure and products including managed blockchain databases, enriched data schemas, and real-time notification capabilities. They are a leader in this space, serving 15 blockchains, including EVMs and Bitcoin, 100+ schemas, and 250+ TB of data to power all kinds of crypto applications, from accounting and auditing for traders to wash trading filtering for NFT marketplaces. Allium meets their customers wherever they are, in their own data environment, resulting in more than 1 PB of data transfer monthly in the last quarter, and this number continues to surge following the recent crypto recovery fueled by ETF optimism.
While the massive increase in data transfer volumes has contributed to Allium’s rapid business growth, it has also added a significant challenge to its bottom line: how to build a cost-efficient data storage and sharing solution that meets its customers’ needs. Specifically, how can they share data with their customers in any location, across clouds and regions, and minimize expensive data egress costs from cloud vendors?
Before adopting the joint solution of Delta Sharing with Cloudflare R2, Allium had implemented other platforms but found them prohibitively expensive, with estimated costs reaching $53.8K monthly for 1 petabyte of data egress, totaling approximately $645K annually.
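The annual figure follows directly from the monthly estimate. Here is a minimal sketch of that arithmetic. The function names are illustrative, and counting 1 PB as 1,000,000 GB is an assumption rather than something stated above.

```python
# Figures quoted above: about $53.8K per month for 1 PB of egress.
MONTHLY_EGRESS_COST_USD = 53_800
MONTHLY_EGRESS_GB = 1_000_000  # assumption: 1 PB counted as 1,000,000 GB


def annual_cost(monthly_cost_usd: float) -> float:
    """Scale a flat monthly cost to twelve months."""
    return monthly_cost_usd * 12


def implied_rate_per_gb(monthly_cost_usd: float, monthly_gb: float) -> float:
    """Average price per GB implied by a monthly bill and a monthly volume."""
    return monthly_cost_usd / monthly_gb


print(annual_cost(MONTHLY_EGRESS_COST_USD))  # 645600, i.e. roughly $645K
print(implied_rate_per_gb(MONTHLY_EGRESS_COST_USD, MONTHLY_EGRESS_GB))
```

The implied per-GB rate is only an average over the quoted volume; actual cloud pricing is usually tiered.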
“We initially leveraged Snowflake’s replication system but it lacked control and was expensive. In Snowflake, serving data to different regions requires us to replicate data to that region, so it automatically incurs a lot of storage costs as well as some egress costs. This expense increases exponentially for any operational schema change, which happens frequently at our scale,” explains Ethan Chan, Co-Founder and CEO of Allium.
The combination of Delta Sharing with Cloudflare R2 has provided Allium with a cost-effective and secure data sharing solution, with no need for costly and complex replications or vendor lock-in. Allium is now in control of where they move and use their data with Delta Sharing’s multicloud support and has consolidated its cloud storage with Cloudflare R2 to build its next-generation data sharing platform.
Chan explains, “Combining both Delta Sharing and Cloudflare R2 together allows us to deliver data to our customers reliably and cost-effectively. We deliver the highest quality blockchain data to our customers in their preferred environment, while minimizing our storage and egress costs, saving up to $645K per year. Plus, this gives us both the control and security to scale our offerings sustainably.”
Allium uses this integration to maximize their cost savings (see diagram below) by persisting the blockchain data using Delta UniForm (Delta Lake Universal Format), a seamless way of unifying Parquet table formats without creating additional copies. Allium enables Apache Iceberg and Delta connectors that read the data stored in Cloudflare R2. They also implement Delta Sharing to seamlessly and securely share their data across regions and platforms, all with zero egress costs for outbound transfers.

Allium also recently expanded its product line to share its Ethereum Realtime Data, now listed on Databricks Marketplace. This dataset helps users across the cryptocurrency space by sharing valuable insights about Ethereum’s dynamics. Available for purchase, it includes a range of details about Ethereum’s blockchain, including smart contracts, NFT and decentralized finance (DeFi) markets, and more.
Key industry use cases
Another type of customer that can benefit from using Delta Sharing and Cloudflare R2 is a data aggregator using the common ‘hub and spoke’ architectural pattern. A data aggregator specializes in collecting and merging data from various sources into a unified, cohesive dataset, then sharing it with clients across different regions, clouds, and platforms. A ‘hub and spoke’ data sharing scenario is one-to-many: one organization shares with many clients. However, these organizations face a common challenge: how to scale data sharing in a cost-effective and predictable way. Ideally, they benefit from economies of scale, so that as their number of clients increases, the sharing cost increases only marginally. In addition, they do not want to depend on their clients adopting data replication for cost savings; they want to be in control of managing their own costs in a predictable manner.
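To make the scaling concern concrete, here is a minimal sketch of a provider's monthly sharing cost in a hub-and-spoke setup. Every number in it (per-client volume, the $0.09/GB metered egress price, the $150 fixed storage cost) is a hypothetical input chosen for illustration, not a figure from this post.

```python
def monthly_sharing_cost(num_clients: int, gb_per_client: float,
                         egress_price_per_gb: float,
                         fixed_storage_cost: float = 0.0) -> float:
    """Provider-side monthly cost: fixed storage plus egress for every client."""
    return fixed_storage_cost + num_clients * gb_per_client * egress_price_per_gb


# Hypothetical inputs, for illustration only.
GB_PER_CLIENT = 1_000

for clients in (1, 10, 100):
    # Metered egress: cost grows linearly with the number of clients.
    metered = monthly_sharing_cost(clients, GB_PER_CLIENT, egress_price_per_gb=0.09)
    # Zero egress: only the fixed storage cost remains, whatever the client count.
    zero_egress = monthly_sharing_cost(clients, GB_PER_CLIENT,
                                       egress_price_per_gb=0.0,
                                       fixed_storage_cost=150.0)
    print(f"{clients:>3} clients: metered ${metered:,.2f}, zero-egress ${zero_egress:,.2f}")
```

With metered egress, the bill tracks the number of spokes. With zero egress, it stays flat, which is the predictability the aggregator is looking for.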
Industries that typically use data aggregators include financial services, healthcare and life sciences, and media and entertainment. Sharing data helps drive critical business needs such as decision-making, market analysis, research, and supporting overall business operations. For example, data aggregators play a crucial role in powering various financial applications and services, such as budgeting apps, investment platforms, lending solutions, and more, by securely accessing and analyzing users’ financial information. See the table below for some industry-specific use cases.
| Industry | Data Aggregator Use Case | Use Case Details |
|---|---|---|
| Media and Entertainment | Content Archiving | Aggregators can be used to archive content systematically, making it easier for media companies to share their content with partners and customers, who can access and repurpose historical content for new audiences or platforms. |
| Financial Services | Credit Scoring and Risk Assessment | Data aggregators provide insights into users’ financial behavior, such as spending patterns, income levels, and debt obligations. This information is shared and can be used by lenders and financial institutions to assess credit risk and help them make lending decisions based on overall credit scores. |
| Healthcare and Life Sciences | Commercial Effectiveness | Healthcare data aggregators can provide clinical prescription data to hospitals, healthcare providers, pharmaceutical companies, and research institutions for analysis and use in many different ways. This could include identifying new markets to enter, measuring sales channel dynamics, or analyzing buying patterns in retail pharmacies or hospitals. |
Calculate savings and when to implement a joint solution
Cloud egress costs generally scale proportionally with the volume of data queried from the data share. The diagram below shows that as the number of queries (and volume of data) increases, so does the egress cost. Customers can use this approach to compare different storage solutions and quantify the cost-benefit of using Cloudflare R2, which does not introduce any egress cost. As the diagram below highlights, Cloudflare R2 can lead to significant savings relative to other cloud storage solutions.
For example, based on standard pricing assumptions, the analysis below indicates that data assets whose data transfer activity exceeds 26% across different clouds or 85% across regions on a monthly basis can benefit from significant monthly savings on both storage and egress costs.1
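One way to reason about such a threshold is a simple break-even model. This is a sketch under stated assumptions, not the analysis behind the figures above: the per-GB prices are assumed list prices, and the model's structure is a guess. Per the footnote, it assumes 10% of the data is refreshed monthly and that the replica in R2 sits alongside the original copy.

```python
def break_even_transfer_fraction(egress_price_per_gb: float,
                                 replica_storage_price_per_gb: float,
                                 refresh_fraction: float) -> float:
    """Monthly transfer fraction above which a zero-egress replica is cheaper.

    Per GB of dataset, per month (assumed model):
      direct sharing cost  = transfer_fraction * egress_price
      replica sharing cost = replica_storage_price + refresh_fraction * egress_price
    Setting the two equal and solving for transfer_fraction gives the result.
    """
    replica_cost = replica_storage_price_per_gb + refresh_fraction * egress_price_per_gb
    return replica_cost / egress_price_per_gb


# Assumed prices in USD per GB (illustrative, not taken from this post).
R2_STORAGE_PER_GB_MONTH = 0.015
CROSS_CLOUD_EGRESS_PER_GB = 0.09
CROSS_REGION_EGRESS_PER_GB = 0.02
REFRESH_FRACTION = 0.10  # from the footnote: 10% of the data refreshed monthly

cross_cloud = break_even_transfer_fraction(
    CROSS_CLOUD_EGRESS_PER_GB, R2_STORAGE_PER_GB_MONTH, REFRESH_FRACTION)
cross_region = break_even_transfer_fraction(
    CROSS_REGION_EGRESS_PER_GB, R2_STORAGE_PER_GB_MONTH, REFRESH_FRACTION)

print(f"cross-cloud break-even:  {cross_cloud:.1%}")   # about 27%
print(f"cross-region break-even: {cross_region:.1%}")  # about 85%
```

Under these assumed prices the thresholds land close to the figures quoted above. Because the actual pricing inputs are not given here, treat the match as illustrative and substitute your own prices.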

Test drive Delta Sharing and Cloudflare R2
Delta Sharing and Cloudflare R2 are now available in Public Preview. To implement the joint solution, you do not have to migrate all your data to Cloudflare R2 (see related blog, Architecting Global Data Collaboration with Delta Sharing). You only need to replicate the shared data once to R2, in three easy steps (see the diagram below):
- Add Cloudflare R2 as an external storage location
- Create new tables, volumes, or ML models in Cloudflare R2, and sync data incrementally using Deep Clone
- Create a Delta Share, as usual, on the R2 table
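The three steps above can be sketched as SQL statements assembled in Python. This is an illustration under assumptions, not the documented procedure. The helper and all of its argument values are hypothetical. The `r2://bucket@account-id.r2.cloudflarestorage.com` URL form and the exact statement syntax are assumptions to check against the technical documentation. A storage credential for R2 is assumed to exist already.

```python
def r2_sharing_statements(bucket: str, account_id: str, credential: str,
                          source_table: str, r2_table: str, table_path: str,
                          share: str, location_name: str = "r2_location") -> list[str]:
    """Build the SQL for the three steps; nothing is executed here."""
    # Assumed R2 URL format; confirm against the documentation.
    r2_url = f"r2://{bucket}@{account_id}.r2.cloudflarestorage.com"
    return [
        # Step 1: register R2 as an external storage location.
        f"CREATE EXTERNAL LOCATION IF NOT EXISTS {location_name} "
        f"URL '{r2_url}' WITH (STORAGE CREDENTIAL {credential})",
        # Step 2: deep clone the source table into R2; rerunning the same
        # statement is intended to sync changes incrementally.
        f"CREATE OR REPLACE TABLE {r2_table} DEEP CLONE {source_table} "
        f"LOCATION '{r2_url}/{table_path}'",
        # Step 3: create a share and add the R2-backed table to it.
        f"CREATE SHARE IF NOT EXISTS {share}",
        f"ALTER SHARE {share} ADD TABLE {r2_table}",
    ]


# Hypothetical names throughout.
statements = r2_sharing_statements(
    bucket="my-bucket", account_id="my-account-id", credential="r2_credential",
    source_table="main.sales.orders", r2_table="main.sales.orders_r2",
    table_path="orders", share="orders_share",
)
for stmt in statements:
    print(stmt)  # in a Databricks notebook, each could be passed to spark.sql(stmt)
```

Keeping statement construction separate from execution makes it easy to review the generated SQL before running it against a workspace.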

Refer to the technical documentation for more details. You can also provide feedback to our team at [email protected].
Using Delta Sharing with Cloudflare R2, you can now benefit from a new way to share data and AI across platforms, clouds and regions, with zero egress costs, no vendor lock-in, and without compromising on security and governance.
1 The cost savings calculation was based on the assumption that 10% of the data is refreshed monthly, and data is replicated to Cloudflare R2 for sharing purposes while keeping the original copy in S3.
