×

Starburst Strategy blog

Search blog

Filter by

Reset filters

Subscribe

Showing 12 of 229 results

3 Data Ingestion Best Practices: The Trends to Drive Success

July 11, 2024

It’s time to talk about data pipelines, specifically data ingestion best practices. Typically, a data engineering pipeline has three stages:  Data Ingestion Data Transformation...

What’s next for Trino

July 3, 2024

It seems like only yesterday that Trino celebrated being around for a decade. Born out of Facebook to address the need for improved performance...

Transitioning from Hadoop to modern lakehouses

June 20, 2024

As organizations strive to harness the full potential of their data, the limitations of legacy Hadoop systems become increasingly apparent. Hadoop's architecture has been...

Real-world insights from Asurion: Data quality in practice

June 13, 2024

Why should organizations pay attention to data quality is the heart of the question. Moreover, how does big data and inconsistent data impact data-driven...

Why Apache Iceberg will accelerate competition for compute engines

June 13, 2024

Apache Iceberg emerged last week triumphant, having won the race to become king of the data lakehouse.  In many ways, this was a long...

Advanced Data Management: Trino, Hadoop, and AWS for a Robust Lakehouse

June 12, 2024

Apache Hadoop revolutionized enterprise data management by offering an open-source alternative to expensive proprietary data systems. Companies could process massive datasets using the commodity...

Snowflake, Databricks, Tabular, Iceberg, what does it all mean?

June 11, 2024

What happened last week? Snowflake Summit ran from Tuesday (June 4, 2024) through Thursday. This year, the conference was overshadowed by two significant announcements:...

Why partitioning matters: 3 Best practices to improve performance

June 5, 2024

Imagine that your desk resembled the above image. Now you need to find all the invoices for a particular month to calculate your average...

Enhancing Apache Hadoop Data Management with Trino and Starburst

June 1, 2024

For almost two decades, companies have built big data processing architectures based on the Hadoop ecosystem. To extend the Hadoop project beyond its core...

What’s the difference between Apache Parquet vs AVRO

May 30, 2024

The modern data lakehouse combines Apache Iceberg’s open table format, Trino’s open-source SQL query engine, and commodity object storage. Open file formats also influence...

What’s the difference between batch vs streaming data processing

May 29, 2024

Batch and streaming data processing are techniques companies use to analyze data from very different sources. Although it dates back to the era of...

Top 5 reasons to not adopt Apache Iceberg

May 22, 2024

Apache Iceberg is becoming increasingly popular and is turning into the de facto standard for table formats. The first-ever Iceberg Summit was held this...

1 2 3 20

Subscribe

Start Free with
Starburst Galaxy

Up to $500 in usage credits included

  • Query your data lake fast with Starburst's best-in-class MPP SQL query engine
  • Get up and running in less than 5 minutes
  • Easily deploy clusters in AWS, Azure and Google Cloud
For more deployment options:
Download Starburst Enterprise

Please fill in all required fields and ensure you are using a valid email address.

s