Cookie Notice
This site uses cookies for performance, analytics, personalization and advertising purposes.
For more information about how we use cookies please see our Cookie Policy.
Manage Consent Preferences
These cookies are essential in order to enable you to move around the website and use its features, such as accessing secure areas of the website.
These are analytics cookies that allow us to collect information about how visitors use a website, for instance which pages visitors go to most often, and if they get error messages from web pages. This helps us to improve the way the website works and allows us to test different ideas on the site.
These cookies allow our website to properly function and in particular will allow you to use its more personal features.
These cookies are used by third parties to build a profile of your interests and show you relevant adverts on other sites. You should check the relevant third party website for more information and how to opt out, as described below.
Fully managed in the cloud
Self-managed anywhere
Use the input above to search.
Here are some suggestions:
Trino Summit is a two-day virtual conference on the 11th and 12th of December 2024. It's an event that brings together engineers, analysts, data scientists, and anyone interested in using or contributing to Trino.
Learn moreShowing 73 results
This article defines what data lakes are, why they are important, and how they compare to other big data storage technologies, including: Databases Data...
Table maintenance is necessary for Apache Iceberg tables in order to keep your data optimized and performant. That extra effort is worth the reward...
Generative AI took the world by storm in 2023 and there has been a tremendous amount of hype around the possibilities it brings to...
Of all the seventy-plus speakers at the festival, there was one presentation that I found to be particularly interesting – and not because the speaker also happens to be our customer. That presentation was from Lutz Künneke, Director of Engineering, and Isa Inalcik, Senior Data Engineer, at BestSecret, a leading European online destination for off-price fashion based near Munich, Germany. As Künneke got to the stage, the first words out of his mouth were: “We are moving off of Snowflake.”
In a new report Cloud Data Warehouse vs. Cloud Data Lakehouse: A Snowflake vs. Starburst TCO and Performance Comparison, published by GigaOm, concluded that a Starburst lakehouse architecture could achieve superior price-performance and significantly faster time-to-insight at a much lower total cost of ownership (TCO).
Discover how Starburst’s nanoblock indexing accelerates data lake analytics, optimizing queries, and reducing data reads. Try it in Starburst Galaxy for accelerated performance!
Data analytics certification program to learn about topics such as data lakes and data lakehouses, and modern table formats like Apache Iceberg.
ETL operates as the engine behind the data pipeline process, moving data from a raw state to a consumable one. Let’s unpack the way in which this typically operates in a modern data lake or data lakehouse. Later, we’ll take a tour to see how Starburst Galaxy fits in this picture and how it can be used to construct the Land, Structure and Consume layers typical of a modern data lake.
With the Looker and Starburst Galaxy integration, teams can now extend Looker beyond data in Google Cloud services like BigQuery to other cloud data sources – including data in AWS and Azure. This means that Looker can now support customers with multi-cloud environments.
Of all the choices a startup has to make in its early stages, deciding on the right data analytics architecture might not seem critical,...
A data lake analytics platform is needed in order to bridge the gap between what can be a large number of analytical AI tools with data lakes, lakehouses, legacy systems and other technologies in the ecosystem.
The number of unique data vendors has grown, tripling in the past decade (from about 50 to close to 150 today), driven in a large part by massive data stack investments, which total about $245 billion between 2012 to 2021.
In our last post, we discussed two methods for running geospatial analysis with Trino and the Hive connector and explored a few optimization techniques...
You’ve hired pedigreed data scientists and engineers, invested in shiny new software, and perhaps even reorganized your entire business, all in the hopes of...
Technology vendors have long peddled a version of nirvana where all of a company’s data would be centralized in one location. The “single source...
Accessing data in cloud storage has been an ongoing challenge for analysts, data engineers, and organizations as a whole. Additional work is required to...
Most organizations have data and continue to generate and collect it on a daily basis, but have a far more difficult time in getting...
Par Martial Coiffe & Victor Coustenoble 2022 nous a confirmé que l’architecture data demeure au cœur des préoccupations des entreprises et organisations en France,...
The shift to cloud-based software-as-a-service platforms is accelerating in just about every tech industry. So it wasn’t much of a surprise to the analytics...
This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...
After years of building enterprise data warehouses, at first glance, a data lake architecture may appear to be similar to a data warehouse. After...
The increasing popularity of data lakes isn't surprising anyone in the analytics space. The appeal of importing data from multiple sources into a single...
This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...
Last week in San Francisco was one for the Trino history books. After three years of planning, rescheduling, planning, and rescheduling some more, Starburst...
This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...
A data lakehouse combines the principles of a data lake and a data warehouse to include the best of both worlds. Data lakehouses are...
This post is part of the Iceberg blog series. Read the entire series: Introduction to Apache Iceberg in Trino Iceberg Partitioning and Performance Optimizations...
Data lakes have amazing attributes. For one, it enables us to handle vast, complex datasets. Data lakes offer an up-to-date stream of data that...
Data lakes deliver unprecedented agility A data lake is an essential tool for big data analytics. A key advantage of developing a data lake...
Starburst has played a key role in the Trino community for a long time now. We contribute to the success of Trino every day....
AWS S3 has become one of the most widely used storage platforms in the world. Companies store a variety of data on S3 from...
Data virtualization revolutionized the data infrastructure space by serving data consumers directly on top of data stores, without the need to move data elsewhere....
In the big data analytics world, enabling analytics on unstructured text is a powerful capability. For that reason, it would be of use that...
After Covid-19, many business executives faced one of the toughest leadership tests to turn this challenge into an amazing opportunity. What did the business...
As organizations strive to become more agile, there has been a mass movement jumping headfirst into what is called a security data lake. Gartner...
When optimizing your analytics database performance, one of the most important decisions is to choose how data is stored and accessed. There are two...
Data lakes enable the implemention of a wide range of solutions, including raw data collection, flexible data access for users, and building fast and...
The glory days of SIEM are over. Security teams are not only measured by their ability to collect as much data as possible, but...
One of the true pillars of the tech revolution, PostgreSQL is an OLTP database designed primarily to handle transactional workloads. The technology has been...
I’m excited to announce the acquisition of Varada, a data analytics accelerator, based out of Tel Aviv, Israel. Varada offers a data lake analytics...
Before I joined Starburst, I worked in the AdTech industry where companies buy and sell user data for online targeting advertisement campaigns or ML/AI-based...
Best-in-class organizations need fast, reliable data analytics that enable business leadership to identify patterns and key insights that will help them predict the best...
Recently, I had the pleasure of chatting with Ravit Jain on his show “The Ravit Show” to discuss the evolution of Trino and where...
This blog was co-authored by Claudius Li, Product Manager at Starburst, and Joe Lodin, Information Engineer at Starburst. Starburst recently donated the Delta Lake...
So why use a big data SQL query engine? Well, have you suffered from the following problems with processing and analyzing big data via...
Starburst released the 2021 State of Data market research report, conducted by Enterprise Management Associates (EMA), in collaboration with Red Hat, early last year....
I think of Starburst Stargate as the Lord of the Rings feature. Or the galactic empire feature. In a prior blog post, I introduced...
As companies shift their analytical ecosystems from on-premise to cloud and try to avoid “data lock-in”, we’re noticing some very interesting data patterns. This...
I’m one of those strange people who has always enjoyed doing performance testing. The thought of spinning up lots of machines to do my...
The idea of a single source of truth has been around since the beginning of big data. However, over the years, through the data...
Many data and analytics practitioners have heard about this socio-technical paradigm shift, Data Mesh, and would like to learn more. But before describing what...
Analysts are often tasked with deriving insights for business units where the data can span multiple locations. This is increasingly true today when the...
At our Datanova for Data Scientists conference on July 14, I held a discussion with Dain Sundstrom and David Philips, CTOs of Starburst, about...
As companies shift their analytical ecosystems from on-premise to cloud and try to avoid “data lock-in”, we’re noticing some very interesting data patterns. This...
Microsoft has migrated thousands of customers to its Azure cloud platform and has quickly become the second most popular cloud provider. Companies have easily...
Trino on ice I: A gentle introduction to Iceberg Trino on ice II: In-place table evolution and cloud compatibility with Iceberg Trino on ice...
At Starburst, we believe in building optionality into your data architecture & strategy. To us, optionality means building for flexibility so that you don’t...
Trino on ice I: A gentle introduction to Iceberg Trino on ice II: In-place table evolution and cloud compatibility with Iceberg Trino on ice...
Trino on ice I: A gentle introduction to Iceberg Trino on ice II: In-place table evolution and cloud compatibility with Iceberg Trino on ice...
Trino on ice I: A gentle introduction to Iceberg Trino on ice II: In-place table evolution and cloud compatibility with Iceberg Trino on ice...
After a decade of running Hive queries on their data lakes, many companies are astonished at the speeds in which they are able to...
Datanova is just next week. More than 2,000 data and analytics leaders will join us to learn more about how to unlock the value...
Datanova 2021 is going to have plenty of panels and informative content for anyone interested in the future of big data management. We're also...
In today’s data architecture economy, there are no shortages of options when it comes to choosing various distributions and deployment strategies for a given...
One of the things that really drew me to and got me excited about Trino over 4 years ago was that it wasn’t tied...
A few days ago I read a Gartner report stating that data scientists spend 23% of their time on data collection and preparation. I...
Our customer base has been growing quickly, and we’re excited to share a case study highlighting one of our largest clients, a telecommunications...
Nirvana - a state of perfect happiness; an ideal or idyllic place. In big data “Nirvana” is a wishlist of items: The ability to...
TL;DR - There is now Starburst Enterprise Databricks Delta Lake compatibility. Delta Lake The big data ecosystem has many components but the one...
We recently invited 451 Research VP, Matt Aslett to share his thoughts and observations on the practice of separating the storage and computation of...
It seems like migrating to the cloud has dominated the news and a lot of companies are shuttering their data centers and letting cloud...
With Amazon’s Simple Storage Service (Amazon S3), the object storage solution from Amazon Web Services (AWS), you can build a scalable, cost-efficient data lake...
© Starburst Data, Inc. Starburst and Starburst Data are registered trademarks of Starburst Data, Inc. All rights reserved. Presto®, the Presto logo, Delta Lake, and the Delta Lake logo are trademarks of LF Projects, LLC
Up to $500 in usage credits included