What is a data lake?
November 12, 2024
This article defines what data lakes are, why they are important, and how they compare to other big data storage technologies, including: Databases, Data...
How to identify and manage data silos
November 6, 2024
Data silos occur when the logic from one part of the business does not match the logic of another. This hinders organizations from making...
How AI and data analytics drive demand for relevant business data
October 23, 2024
Data is everywhere, but unlocking its value is often a complex task. Today, the emergence of AI makes this important job even more critical,...
What is Trino?
September 25, 2024
Trino is more popular than ever, but what is it? Let’s start with a definition. Trino is a massively parallel processing, distributed SQL query...
Apache Iceberg vs Delta Lake: What are the differences?
September 16, 2024
The cloud data lakehouse is gaining momentum, driven by the evolution of table formats like Apache Iceberg, Delta Lake, and Hudi. With improved transactional...
Compute cost best practices: How to optimize data costs across all architectures
August 27, 2024
Rising compute costs are often the unwanted price of success in the data world. Usage-based pricing models mean that the more data you use,...
What is the best way to query Apache Iceberg tables?
August 21, 2024
The summer of 2024 has ignited a revolution in the data lake space. As the dust settles, one thing is clear: Apache Iceberg has...
GenAI drives adoption of open data architecture
August 13, 2024
Generative AI (GenAI) continues to reshape industries. It has the potential to revolutionize employee productivity, drive innovation, and enhance operational efficiencies using data. To...
Solving capacity management problems for Trino clusters, and how Starburst Galaxy makes it easy
August 8, 2024
Solving capacity management issues for Trino clusters is a complicated problem. Although Trino is powerful, using that power effectively often requires a manual understanding...
Using Trino for Your Data Transformations
August 7, 2024
While you may be used to performing data transformations via Python or Scala, the truth is that you can actually achieve the same results...
Transitioning from Hadoop to a modern lakehouse: The Dell Data Lakehouse powered by Starburst
August 2, 2024
In today's fast-paced data-driven world, staying ahead means constantly evolving your data infrastructure. Hadoop has been a cornerstone of big data processing; however, the...
Building an open and interoperable data lakehouse with Starburst Galaxy
July 31, 2024
Organizations are seeking an efficient, scalable, and secure way to manage and analyze their data. One way to achieve this is to migrate from...