Cookie Notice
This site uses cookies for performance, analytics, personalization and advertising purposes.
For more information about how we use cookies please see our Cookie Policy.
Manage Consent Preferences
These cookies are essential in order to enable you to move around the website and use its features, such as accessing secure areas of the website.
These are analytics cookies that allow us to collect information about how visitors use a website, for instance which pages visitors go to most often, and if they get error messages from web pages. This helps us to improve the way the website works and allows us to test different ideas on the site.
These cookies allow our website to properly function and in particular will allow you to use its more personal features.
These cookies are used by third parties to build a profile of your interests and show you relevant adverts on other sites. You should check the relevant third party website for more information and how to opt out, as described below.
Fully managed in the cloud
Self-managed anywhere
Use the input above to search.
Here are some suggestions:
Trino Summit is a two-day virtual conference on the 11th and 12th of December 2024. It's an event that brings together engineers, analysts, data scientists, and anyone interested in using or contributing to Trino.
Learn moreShowing 9 of 9 results
It seems like only yesterday that Trino celebrated being around for a decade. Born out of Facebook to address the need for improved performance...
Imagine that your desk resembled the above image. Now you need to find all the invoices for a particular month to calculate your average...
Apache Iceberg is becoming increasingly popular and is turning into the de facto standard for table formats. The first-ever Iceberg Summit was held this...
Data migration, pivotal in the big data digital transformation era, involves the strategic transfer of data across systems. It's not just about moving data;...
A bird and a bunny walk into a bar... The bird says, “I’m the Python dataframe library with tons of optionality”. The bunny says,...
Delta Lake was initially developed by Databricks and by 2019 evolved to become an open source project. Since then, they’ve created a few key...
Let’s take a quick tour of the DataFrame API implementation with Python that runs the code ultimately as SQL on Starburst Galaxy. You’ll see the rich API that is available to data engineers who prefer to write programs over SQL.
If you’re a data engineer tasked with building and managing data pipelines, Starburst Galaxy enables you to build a data pipeline workflow using modern data lakes and SQL. This approach offers both simplicity and power. What might have required a complex, user defined function (UDF) in Python using other systems can be accomplished with the accessibility and universality of SQL alongside the ease and cost effectiveness of the data lake.
Materialized views have become available in Starburst Galaxy for catalogs using Great Lakes connectivity. For folks who are NOT already using Starburst Galaxy — come sign up — it’s FREE — especially if you want to exercise the content in this blog post.
© Starburst Data, Inc. Starburst and Starburst Data are registered trademarks of Starburst Data, Inc. All rights reserved. Presto®, the Presto logo, Delta Lake, and the Delta Lake logo are trademarks of LF Projects, LLC
Up to $500 in usage credits included