×

Tag: SQL

Showing 37 results

5 ways to simplify ETL using SQL

5 ways to simplify ETL using SQL

October 31, 2024

Using SQL for ETL offers many advantages. To help understand how it's best to look at the ETL process more broadly. Data pipelines use...

Run optimized geospatial queries with Trino

Run optimized geospatial queries with Trino

March 23, 2023

The Trino open source distributed query engine is known as a choice for running ad-hoc analysis where there’s no need to model the data and...

Lie #3 — You’re ready for the AI + ML deep end

Lie #3 — You’re ready for the AI + ML deep end

February 3, 2023

You’ve hired pedigreed data scientists and engineers, invested in shiny new software, and perhaps even reorganized your entire business, all in the hopes of...

Trino for Large-Scale ETL @ Lyft

Trino for Large-Scale ETL @ Lyft

January 25, 2023

Lyft operates one of the largest transportation networks in the world. A business like ours depends on data on so many levels. Data relating...

Over 80 Data & Analytics Statistics, Data, Trends, and Facts

Over 80 Data & Analytics Statistics, Data, Trends, and Facts

December 28, 2022

Most organizations have data and continue to generate and collect it on a daily basis, but have a far more difficult time in getting...

Reliving the Hype: Highlights from Trino Summit 2022

Reliving the Hype: Highlights from Trino Summit 2022

November 18, 2022

Last week in San Francisco was one for the Trino history books. After three years of planning, rescheduling, planning, and rescheduling some more, Starburst...

Introducing Full Query Passthrough For Faster Query Federation

Introducing Full Query Passthrough For Faster Query Federation

November 15, 2022

Best-in-class SQL query functionality has always been and remains a fundamental principle that defines Starburst’s query engine. With the recent implementation of full query...

Second Edition of Trino: The Definitive Guide

Second Edition of Trino: The Definitive Guide

October 5, 2022

Starburst has played a key role in the Trino community for a long time now. We contribute  to the success of Trino every day....

4 Key Things You Should Know About Indexing

4 Key Things You Should Know About Indexing

September 22, 2022

Data indexing radically accelerates query run time and concurrency without the need for massive compute resources. But before expecting indexing to solve all your...

The Difference Between Micro-Partitioning vs. Indexing and a Better Way

The Difference Between Micro-Partitioning vs. Indexing and a Better Way

September 8, 2022

When optimizing your analytics database performance, one of the most important decisions is to choose how data is stored and accessed. There are two...

Scaling Up: When to Migrate from PostgreSQL to a Data Lake

Scaling Up: When to Migrate from PostgreSQL to a Data Lake

July 13, 2022

One of the true pillars of the tech revolution, PostgreSQL is an OLTP database designed primarily to handle transactional workloads. The technology has been...

Confessions of a Space Quest League Advocate

Confessions of a Space Quest League Advocate

July 6, 2022

Mission 2 Wrap and Mission 3 Launch We all know at least one pandemic puzzler, a devoted crossworder, or a religious wordler who finds...

Employee Perspective: Accelerating Data-Driven Insights in AdTech

Employee Perspective: Accelerating Data-Driven Insights in AdTech

June 16, 2022

Before I joined Starburst, I worked in the AdTech industry where companies buy and sell user data for online targeting advertisement campaigns or ML/AI-based...

The Benefit Of Using An Externally-Audited Data Analytics Solution

The Benefit Of Using An Externally-Audited Data Analytics Solution

June 2, 2022

As a business begins to see the challenges of distributed data access, the selection of a query engine becomes critical for business operations.  For...

The Past, Present, and Future of Trino

The Past, Present, and Future of Trino

May 24, 2022

Recently, I had the pleasure of chatting with Ravit Jain on his show “The Ravit Show” to discuss the evolution of Trino and where...

ETL vs Interactive Queries: The Case for Both

ETL vs Interactive Queries: The Case for Both

May 5, 2022

This is Part 1 of a 2-part blog about how Trino can support both interactive and batch use cases.  In Part 1, we will...

Faster Query Processing: CPU Time

Faster Query Processing: CPU Time

March 25, 2022

A key engineering responsibility at Starburst is on performance enhancements. One is to reduce the amount of time that a CPU has to work...

The Benefits of a Big Data SQL Query Engine

The Benefits of a Big Data SQL Query Engine

February 16, 2022

So why use a big data SQL query engine? Well, have you suffered from the following problems with processing and analyzing big data via...

Top 6 Reasons to Migrate to the Cloud

Top 6 Reasons to Migrate to the Cloud

January 25, 2022

Starburst released the 2021 State of Data market research report, conducted by Enterprise Management Associates (EMA), in collaboration with Red Hat, early last year....

The Right Way to Query Across Data Sources in Tableau (or, The Cross-Database Join Is Not Always Your Friend)

The Right Way to Query Across Data Sources in Tableau (or, The Cross-Database Join Is Not Always Your Friend)

January 13, 2022

Summary Use the right tool for the right job. Not doing so means the difference between your Tableau viz rendering in seconds vs. minutes...

Achieving Lightning-Fast Analytics on the Salesforce Customer 360

Achieving Lightning-Fast Analytics on the Salesforce Customer 360

January 6, 2022

Over the past twenty or so years, companies have experienced a Cambrian explosion of where their customer data resides.Cloud and on-premises enterprise applications aim...

Enabling Data Sovereignty with Starburst Stargate

Enabling Data Sovereignty with Starburst Stargate

December 29, 2021

In the data analytics and compliance world, data sovereignty is a concept that has our attention. Policy makers suggest that the best way to...

The Analytics Engine for Distributed Data

The Analytics Engine for Distributed Data

October 1, 2021

The idea of a single source of truth has been around since the beginning of big data. However, over the years, through the data...

Dynamic Filtering: Supporting High Speed Access to Data

Dynamic Filtering: Supporting High Speed Access to Data

September 20, 2021

Analysts are often tasked with deriving insights for business units where the data can span multiple locations.  This is increasingly true today when the...

How Assurance Unlocked More Business Value with Starburst

How Assurance Unlocked More Business Value with Starburst

September 9, 2021

By leveraging Starburst, Assurance was able to improve conversion rates, reduce costs, and enable robust modeling. Read the full case study here. ...

Accelerating Data Science with Trino

Accelerating Data Science with Trino

August 31, 2021

At our Datanova for Data Scientists conference on July 14, I held a discussion with Dain Sundstrom and David Philips, CTOs of Starburst, about...

Why Performance Matters: Parquet, Delta Lake, Dynamic Filtering

Why Performance Matters: Parquet, Delta Lake, Dynamic Filtering

August 26, 2021

My fascination with SQL query performance started quite some time ago and I contributed a paper on efficient processing of data warehousing during my...

Kafka and Starburst: 3 Considerations for Accelerating Time to Value

Kafka and Starburst: 3 Considerations for Accelerating Time to Value

July 27, 2021

What is Kafka? Apache Kafka was created at LinkedIn and open sourced into the Apache Software foundation in early 2011. Kafka was developed to...

Data Federation and Data Virtualization Never Worked in the Past But Now it’s Different

Data Federation and Data Virtualization Never Worked in the Past But Now it’s Different

July 13, 2021

Thirty years ago it was already commonplace for large businesses to have hundreds --- even thousands of different database instances managing data from the...

The State of Data Analysts

The State of Data Analysts

June 28, 2021

The world of data analysis is constantly changing and evolving, and sometimes it can be hard to keep up with. I had the pleasure...

Query Federation Made Simple at Comcast

Query Federation Made Simple at Comcast

June 24, 2021

The media and telecommunications provider now known as Comcast began as a regional operator with just five channels and 12,000 customers. Today, Comcast has...

Data Mesh: The Answer to the Data Warehouse Hypocrisy

Data Mesh: The Answer to the Data Warehouse Hypocrisy

March 25, 2021

Note: I start this piece with some technical background that has nothing to do with the data mesh, and is only relevant to data...

Top 10 Reasons to Migrate from EMR Trino to Starburst Enterprise

Top 10 Reasons to Migrate from EMR Trino to Starburst Enterprise

November 13, 2020

In today’s data architecture economy, there are no shortages of options when it comes to choosing various distributions and deployment strategies for a given...

The Death of Apache Drill

The Death of Apache Drill

August 6, 2020

One of the things that really drew me to and got me excited about Trino over 4 years ago was that it wasn’t tied...

Presto & Data Science: Getting Data Into the Hands of Data Scientists (Faster)

Presto & Data Science: Getting Data Into the Hands of Data Scientists (Faster)

June 26, 2020

A few days ago I read a Gartner report stating that data scientists spend 23% of their time on data collection and preparation. I...

Presto on Kubernetes

Presto on Kubernetes

August 2, 2019

Kubernetes (K8s) eases the burden and complexity of configuring, deploying, managing, and monitoring containerized applications. We are excited to announce the availability and support...

The 4 Stages to Big Data Nirvana (In the Cloud)

The 4 Stages to Big Data Nirvana (In the Cloud)

July 18, 2019

Nirvana - a state of perfect happiness; an ideal or idyllic place.  In big data “Nirvana” is a wishlist of items: The ability to...

Start for Free with Starburst Galaxy

Up to $500 in usage credits included

You will need a valid email in order to activate your free trial.

Please fill in all required fields and ensure you are using a valid email address.

Start Free with
Starburst Galaxy

Up to $500 in usage credits included

  • Query your data lake fast with Starburst's best-in-class MPP SQL query engine
  • Get up and running in less than 5 minutes
  • Easily deploy clusters in AWS, Azure and Google Cloud
For more deployment options:
Download Starburst Enterprise

Please fill in all required fields and ensure you are using a valid email address.

s