UPDATED 09:00 EDT / FEBRUARY 17 2021

BIG DATA

Databricks cozies up to Google Cloud with BigQuery integration and first-ever container support

Data analytics unicorn Databricks Inc. and Google LLC’s cloud today unit announced a partnership that will integrate Databricks’ distributed query platform with Google’s BigQuery data warehouse.

The partnership will leverage the Google Kubernetes Engine to enable Databricks to work in a containerized cloud environment for the first time. Databricks users will be able to create a “lakehouse” – the company’s term for a hybrid data lake and data warehouse — that can be used for data engineering, data science, machine learning and analytics simultaneously on Google Cloud’s network.

Databricks, which announced a massive $1 billion funding round two weeks ago, already has partnerships with cloud giants Amazon Web Services Inc. and Microsoft Corp. It has made a point of working with each partner to leverage the advantages of its platform. In the case of Google, the focus is on machine learning and deep learning tools and Google’s authorship of Kubernetes, an orchestrator for the portable and integrated software wrappers called containers.

The arrangement will enable customers of both companies to provision the Databricks platform rapidly from the Google Cloud Marketplace with simplified procurement and provisioning, elastic scalability and pricing, unified billing, single-sign-on security and data protection controls that Databricks said are appropriate for the most highly regulated industries.

BigQuery integration lets customers extend their existing lakehouse capabilities to include Google Cloud-native services and use Google BigQuery for analytics. Customers can also leverage Google’s extensive machine learning and deep learning libraries to build artificial intelligence models that span data lakes, data warehouses and multiple business intelligence tools.

The companies will provide pre-built connectors between Databricks and BigQuery, Google Cloud storage and Pub/Sub real-time message service. The integration will also enable data workflows created in Databricks to be trained with Google’s AI Platform data science and machine learning services and deployed with the cloud vendor’s AI Platform Prediction model hosting.

Databricks also said it will support the Google Kubernetes Engine, a secure, managed Kubernetes service, enabling Databricks to use the cloud provider’s managed services for security, network policy and compute as well as to speed the release of new features.

Among the joint ecosystem partners that have committed to integrate with Databricks on Google Cloud are Collibra NV, Confluent Inc., Fishtown Analytics LLC, Fivetran Inc., Immuta Inc., Informatica Corp., Infoworks.io Inc., MongoDB Inc., Privacera Inc., Qlik Inc., Tableau Software Inc., Cognizant Technology Solutions Corp. and Trifacta Inc.

Photo: Robert Hof/SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU