Tobiko Data raises $21.8M to transform data transformation


Big data transformation startup Tobiko Data Inc. revealed today it has closed on two separate funding rounds, including a $17.3 million Series A round that followed an initial $4.5 million seed raise.

The Series A was led by Theory Ventures and saw participation from 20Sales, plus the chief executive officers of Fivetran Inc., Sutro Labs Inc. and MotherDuck Corp., while Unusual Ventures led the seed round. The total raised across the two rounds was $21.8 million, and the announcement coincides with the launch of Tobiko's managed cloud and enterprise platforms, which are both generally available starting today.

Tobiko Data has created a data transformation platform that’s built on top of the open-source SQLMesh and SQLGlot projects. It’s designed to help companies easily build more efficient and scalable data pipelines for artificial intelligence and other workloads.

SQLMesh is a tool that applies DevOps best practices to data pipelines written in SQL or Python. It helps data teams run, test and deploy data transformations efficiently and safely with features like virtual environments, automatic DAG generation and smart change categorization. Data scientists and analysts can create data pipelines using the familiar Structured Query Language, with minimal involvement of data engineers, reducing much of the manual toil involved with such tasks.
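To illustrate what "automatic DAG generation" means in practice, here is a minimal sketch in Python. It is not SQLMesh's implementation: the model names and queries are hypothetical, and a regular expression stands in for real SQL parsing, which this toy approach cannot replace for subqueries, CTEs or quoted identifiers.

```python
import re

# Hypothetical models: each name maps to the SQL query that defines it.
MODELS = {
    "clean_orders": "SELECT id, customer_id, amount FROM raw_orders WHERE amount > 0",
    "customer_dim": "SELECT id, name FROM raw_customers",
    "revenue_by_customer": (
        "SELECT c.name, SUM(o.amount) AS revenue "
        "FROM clean_orders AS o JOIN customer_dim AS c ON o.customer_id = c.id "
        "GROUP BY c.name"
    ),
}

# Naive assumption: every table reference directly follows FROM or JOIN.
TABLE_REF = re.compile(r"\b(?:FROM|JOIN)\s+([A-Za-z_][A-Za-z0-9_.]*)", re.IGNORECASE)


def build_dag(models):
    """Map each model to the sorted list of tables its query reads from."""
    return {name: sorted(set(TABLE_REF.findall(sql))) for name, sql in models.items()}


if __name__ == "__main__":
    for model, parents in build_dag(MODELS).items():
        print(model, "<-", parents)
```

The point of the sketch is that the dependency graph is derived from the queries themselves, so nobody has to declare by hand which model depends on which.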

Meanwhile, SQLGlot is a key component of SQLMesh that acts as an SQL parser, transpiler and translator, with support for 24 different SQL dialects.

According to Tobiko Data, many enterprise data teams believe they are shackled by the lack of scalability offered by existing data transformation tools, which still require too much manual work and suffer from inefficient compute.

Co-founder and CEO Tyson Mao said existing data transformation products are fundamentally broken because they’re unable to make sense of SQL. Because of this, he said they blindly execute scripts without any comprehension of the underlying changes that result.

Mao said this is a problem because a single change in a query can “affect billions of rows of data.” He added that some companies “spend millions every year on unnecessary rebuilds of the warehouse when only a small precise change is needed.”

The startup offers an alternative that avoids unnecessary rebuilds of the data warehouse, which waste costly compute cycles and can take hours or even days, depending on the size of the company and how much data it owns. Tobiko simplifies data pipeline creation because its platform has a better understanding of SQL semantics, which means it will only execute the necessary downstream changes, rather than completely rebuilding the data warehouse each time a new data pipeline is added.
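The idea of executing only the necessary downstream changes can be sketched as a graph traversal. The example below is an illustration under stated assumptions, not Tobiko's code: the pipeline and the `models_to_rebuild` helper are hypothetical, and the sketch treats every change as affecting all downstream models, whereas the article describes a platform that also categorizes changes by their semantics.

```python
from collections import deque

# Hypothetical pipeline: each model maps to the models it reads from.
DEPENDENCIES = {
    "raw_orders": [],
    "raw_customers": [],
    "clean_orders": ["raw_orders"],
    "customer_dim": ["raw_customers"],
    "revenue_by_customer": ["clean_orders", "customer_dim"],
    "daily_revenue": ["clean_orders"],
}


def models_to_rebuild(dependencies, changed):
    """Return the changed model plus every model downstream of it."""
    # Invert the edges so we can walk from a model to its consumers.
    children = {name: [] for name in dependencies}
    for name, parents in dependencies.items():
        for parent in parents:
            children[parent].append(name)

    # Breadth-first walk from the changed model through its consumers.
    affected = {changed}
    queue = deque([changed])
    while queue:
        current = queue.popleft()
        for child in children[current]:
            if child not in affected:
                affected.add(child)
                queue.append(child)
    return affected


if __name__ == "__main__":
    print(sorted(models_to_rebuild(DEPENDENCIES, "customer_dim")))
```

In this hypothetical pipeline, a change to `customer_dim` touches two of six models, and the other four are left alone instead of being rebuilt.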

It also supports virtual data environments that can understand and remember every version of every data pipeline, so it can avoid duplicative computation. “Tobiko eliminates redundant computations, which can become extremely costly,” Mao said.
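One way to picture how remembering every version of a pipeline avoids duplicate computation is content-addressed caching. The sketch below is an assumption-laden illustration rather than a description of Tobiko's internals: the `SnapshotStore` class, its methods and the whitespace-only normalization are all hypothetical, and a real tool would compare parsed SQL rather than raw text.

```python
import hashlib


class SnapshotStore:
    """Toy store that materializes each distinct model version only once."""

    def __init__(self):
        self._tables = {}  # fingerprint -> materialized result
        self.builds = 0    # how many times compute() actually ran

    @staticmethod
    def fingerprint(query, parent_fingerprints):
        # Naive assumption: collapsing whitespace is enough to detect
        # identical queries. Parent versions are part of the identity too.
        normalized = " ".join(query.split())
        payload = normalized + "|" + "|".join(sorted(parent_fingerprints))
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def materialize(self, query, parent_fingerprints, compute):
        """Run compute() only if this exact version has not been built."""
        key = self.fingerprint(query, parent_fingerprints)
        if key not in self._tables:
            self._tables[key] = compute()
            self.builds += 1
        return key


if __name__ == "__main__":
    store = SnapshotStore()
    query = "SELECT id, amount FROM raw_orders"
    # Two environments are just name -> fingerprint mappings.
    prod = {"clean_orders": store.materialize(query, [], lambda: [(1, 10)])}
    dev = {"clean_orders": store.materialize(query, [], lambda: [(1, 10)])}
    print(prod == dev, store.builds)
```

Because an environment here is only a set of pointers to fingerprints, a development environment that has not changed a model reuses the production build at no extra compute cost.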

The startup said the data integration company Fivetran has already overhauled its approach to data transformation after implementing the platform, boosting the efficiency and productivity of its data analysts by 150%. Fivetran CEO George Fraser said his team was beyond impressed with the capabilities of SQLMesh.

“My team was afraid of making changes, of testing, of wasting money, so much so we made our own forked version of dbt,” Fraser explained. “Even then, we were still scared. We were trying to figure out what’s missing! But after using SQLMesh across everyone on our team, not only are we not afraid, we’re excited.”

According to Fraser, the most important impact of using Tobiko Data’s platform is the philosophical shift in his team’s mindset. Because the team knows it won’t be wasting any money when it makes changes to its underlying data, it can quickly test new data pipelines and quickly undo something if any problems occur. “Everyone on our team has instantly evolved,” he said. “We finally have a transformation tool that can evolve with us.”

Tobiko Data said it will use the funds from the two rounds to continue developing SQLMesh and SQLGlot. The company plans to showcase its platform at upcoming events, including the Snowflake Summit on June 6 and Databricks Data + AI Summit on June 10.

Image: SiliconANGLE/Microsoft Designer
