Google Cloud Is Attempting To Make Data ‘limitless

Google Cloud has released a preview of BigLake, its data lake storage engine, as part of its goal to eliminate all “data restrictions” and dissolve the barrier between data lakes and warehouses.  During a pre-briefing, Google explained that users  don’t  have to copy the data, move the data across your object stores, like in Google […]
SIA Team
April 6, 2022

Google Cloud has released a preview of BigLake, its data lake storage engine, as part of its goal to eliminate all “data restrictions” and dissolve the barrier between data lakes and warehouses. 

During a pre-briefing, Google explained that users  don’t  have to copy the data, move the data across your object stores, like in Google Cloud Storage, S3, or Azure in a multi-cloud environment, and you get a single place to access all of your data.

Additionally, it can support any open file formats such as Parquet, as well as open-source processing engines such as Apache Spark or Beam, and a variety of table formats such as Delta and Iceberg. 

Also, Google announced the formation of the Data Cloud Alliance, which includes Confluent, Databricks, Dataiku, Deloitte, Elastic, Fivetran, MongoDB, Neo4j, Redis, and Starburst as founding partners. 

The members of the alliance will provide infrastructure, APIs, and integration support to enable data portability and accessibility across numerous platforms and products in a variety of scenarios. To improve data portability, they will also collaborate on new, common industry data models, processes, and platform interfaces.

Vertex AI Workbench, which has been designed to be directly integrated with a full suite of AI and data products, including BigQuery, Serverless Spark, and Dataproc, was introduced as part of the tech giant’s Data Cloud Summit to bring data and ML systems into a single interface so teams can have common tool sets across data analytics, data science, and machine learning.