Showing 12 of 229 results
![](https://www.starburst.io/wp-content/uploads/2024/07/Data-Ingestion-Best-Practices.png)
3 Data Ingestion Best Practices: The Trends to Drive Success
July 11, 2024
It’s time to talk about data pipelines, specifically data ingestion best practices. Typically, a data engineering pipeline has three stages: Data Ingestion Data Transformation...
![](https://www.starburst.io/wp-content/uploads/2024/07/Whats-next-for-Trino-.png)
What’s next for Trino
July 3, 2024
It seems like only yesterday that Trino celebrated being around for a decade. Born out of Facebook to address the need for improved performance...
![](https://www.starburst.io/wp-content/uploads/2024/06/Transitioning-from-Hadoop-to-modern-lakehouses.png)
Transitioning from Hadoop to modern lakehouses
June 20, 2024
As organizations strive to harness the full potential of their data, the limitations of legacy Hadoop systems become increasingly apparent. Hadoop's architecture has been...
![](https://www.starburst.io/wp-content/uploads/2024/06/Asurion-data-quality-blog-2.png)
Real-world insights from Asurion: Data quality in practice
June 13, 2024
Why should organizations pay attention to data quality is the heart of the question. Moreover, how does big data and inconsistent data impact data-driven...
![](https://www.starburst.io/wp-content/uploads/2024/06/apache-iceberg-compute-engines.png)
Why Apache Iceberg will accelerate competition for compute engines
June 13, 2024
Apache Iceberg emerged last week triumphant, having won the race to become king of the data lakehouse. In many ways, this was a long...
![](https://www.starburst.io/wp-content/uploads/2024/07/aws-hadoop.png)
Advanced Data Management: Trino, Hadoop, and AWS for a Robust Lakehouse
June 12, 2024
Apache Hadoop revolutionized enterprise data management by offering an open-source alternative to expensive proprietary data systems. Companies could process massive datasets using the commodity...
![](https://www.starburst.io/wp-content/uploads/2024/06/snowflake-databricks-tabular-iceberg.png)
Snowflake, Databricks, Tabular, Iceberg, what does it all mean?
June 11, 2024
What happened last week? Snowflake Summit ran from Tuesday (June 4, 2024) through Thursday. This year, the conference was overshadowed by two significant announcements:...
![](https://www.starburst.io/wp-content/uploads/2024/06/iceberg-partitioning.png)
Why partitioning matters: 3 Best practices to improve performance
June 5, 2024
Imagine that your desk resembled the above image. Now you need to find all the invoices for a particular month to calculate your average...
![](https://www.starburst.io/wp-content/uploads/2024/07/hadoop-data-management.png)
Enhancing Apache Hadoop Data Management with Trino and Starburst
June 1, 2024
For almost two decades, companies have built big data processing architectures based on the Hadoop ecosystem. To extend the Hadoop project beyond its core...
![](https://www.starburst.io/wp-content/uploads/2024/06/parquet-vs-avro.png)
What’s the difference between Apache Parquet vs AVRO
May 30, 2024
The modern data lakehouse combines Apache Iceberg’s open table format, Trino’s open-source SQL query engine, and commodity object storage. Open file formats also influence...
![](https://www.starburst.io/wp-content/uploads/2024/06/Batch-vs-Streaming-Data-Processing.png)
What’s the difference between batch vs streaming data processing
May 29, 2024
Batch and streaming data processing are techniques companies use to analyze data from very different sources. Although it dates back to the era of...
![](https://www.starburst.io/wp-content/uploads/2024/05/iceberg-tables.png)
Top 5 reasons to not adopt Apache Iceberg
May 22, 2024
Apache Iceberg is becoming increasingly popular and is turning into the de facto standard for table formats. The first-ever Iceberg Summit was held this...