×

Tag: Analytics Engineer

Showing 57 results

Fueling Trino large-scale geospatial analysis with Starburst Warp Speed

Fueling Trino large-scale geospatial analysis with Starburst Warp Speed

March 27, 2023

In our last post, we discussed two methods for running geospatial analysis with Trino and the Hive connector and explored a few optimization techniques...

Run optimized geospatial queries with Trino

Run optimized geospatial queries with Trino

March 23, 2023

The Trino open source distributed query engine is known as a choice for running ad-hoc analysis where there’s no need to model the data and...

Has the notion of a single data source for Financial Services run its course?

Has the notion of a single data source for Financial Services run its course?

January 20, 2023

More than any other industry, Financial Services is likely to only partially realize the elusive utopian state of 'the single source of truth' for...

Building a federated data lakehouse with Starburst Galaxy

Building a federated data lakehouse with Starburst Galaxy

January 11, 2023

We are eleven days into the new year, and I have spent the past two weeks exerting unreasonable amounts of effort trying to make...

6 Reasons to Attend Datanova 2023: For Data Rebels

6 Reasons to Attend Datanova 2023: For Data Rebels

January 5, 2023

Over the past few weeks, we’ve shared a few examples of what it means to be a data rebel. Hopefully you’ve recognized yourself in...

Over 80 Data & Analytics Statistics, Data, Trends, and Facts

Over 80 Data & Analytics Statistics, Data, Trends, and Facts

December 28, 2022

Most organizations have data and continue to generate and collect it on a daily basis, but have a far more difficult time in getting...

Tableau Cloud + Starburst: New Connector Supports Shift to Cloud-based SaaS

Tableau Cloud + Starburst: New Connector Supports Shift to Cloud-based SaaS

December 19, 2022

The shift to cloud-based software-as-a-service platforms is accelerating in just about every tech industry. So it wasn’t much of a surprise to the analytics...

What Are The Different Types Of Data Products

What Are The Different Types Of Data Products

December 16, 2022

As we’ve gone from Data Mesh theory to practice, organizations have been shifting their focus towards the central tenet of Data Mesh — building...

Building lakehouse with dbt and Trino

Building lakehouse with dbt and Trino

November 30, 2022

In this series, we demonstrate how to build data pipelines using dbt and Trino with data directly from your operational systems. They can use...

Reliving the Hype: Highlights from Trino Summit 2022

Reliving the Hype: Highlights from Trino Summit 2022

November 18, 2022

Last week in San Francisco was one for the Trino history books. After three years of planning, rescheduling, planning, and rescheduling some more, Starburst...

Explore A New Way Of Utilizing A Data Lakehouse

Explore A New Way Of Utilizing A Data Lakehouse

November 10, 2022

A data lakehouse combines the principles of a data lake and a data warehouse to include the best of both worlds. Data lakehouses are...

Join the Team: Realizing the Promise Of Big Data

Join the Team: Realizing the Promise Of Big Data

November 4, 2022

I have been in and around data since my days with Microsoft Access, Excel, and SQL Server circa 2000, and was fortunate to witness...

Countdown to Trino Summit 2022

Countdown to Trino Summit 2022

November 1, 2022

It’s finally here! We are closing in on the final countdown to Trino Summit 2022, and I can feel myself getting more excited with...

Build a Data Lakehouse Reporting Structure with dbt and Starburst Galaxy

Build a Data Lakehouse Reporting Structure with dbt and Starburst Galaxy

October 18, 2022

Since my first introduction to dbt, I was intrigued to say the least. Working as a data engineer, I was attempting to manage complicated...

Accenture Master Class: How to Adopt a Data Product Mindset

Accenture Master Class: How to Adopt a Data Product Mindset

October 17, 2022

Since Datanova: The Data Mesh Summit and our in-person executive discussions on data products and Data Mesh, we’ve been validating the data product approach...

Second Edition of Trino: The Definitive Guide

Second Edition of Trino: The Definitive Guide

October 5, 2022

Starburst has played a key role in the Trino community for a long time now. We contribute  to the success of Trino every day....

Building Reporting Structures on S3 using Starburst Galaxy and Apache Iceberg

Building Reporting Structures on S3 using Starburst Galaxy and Apache Iceberg

October 4, 2022

AWS S3 has become one of the most widely used storage platforms in the world. Companies store a variety of data on S3 from...

The Data Virtualization Evolution is Just Beginning

The Data Virtualization Evolution is Just Beginning

October 4, 2022

Data virtualization revolutionized the data infrastructure space by serving data consumers directly on top of data stores, without the need to move data elsewhere....

Accenture Master Class: Creating Data Products

Accenture Master Class: Creating Data Products

September 27, 2022

Since Datanova: The Data Mesh Summit and our in-person executive discussions on data products and Data Mesh, we’ve been validating the data product approach...

4 Key Things You Should Know About Indexing

4 Key Things You Should Know About Indexing

September 22, 2022

Data indexing radically accelerates query run time and concurrency without the need for massive compute resources. But before expecting indexing to solve all your...

Accenture Master Class: Why Organizations Should Create Data Products

Accenture Master Class: Why Organizations Should Create Data Products

September 6, 2022

Since Datanova: The Data Mesh Summit and our in-person executive discussions on data products and Data Mesh, we’ve been validating the data product approach...

Near Real-Time Ingestion For Trino

Near Real-Time Ingestion For Trino

August 4, 2022

It is quite popular in today's data climate for modern data architectures to have some sort of batch processing system to move data into...

A Better Solution For Managing and Maintaining Data Pipelines, Now In Public Preview

A Better Solution For Managing and Maintaining Data Pipelines, Now In Public Preview

July 6, 2022

Customers who want a single, super fast and easy-to-use solution for both interactive and longer-running data pipeline queries now have a solution: take advantage...

Confessions of a Space Quest League Advocate

Confessions of a Space Quest League Advocate

July 6, 2022

Mission 2 Wrap and Mission 3 Launch We all know at least one pandemic puzzler, a devoted crossworder, or a religious wordler who finds...

Employee Perspective: Accelerating Data-Driven Insights in AdTech

Employee Perspective: Accelerating Data-Driven Insights in AdTech

June 16, 2022

Before I joined Starburst, I worked in the AdTech industry where companies buy and sell user data for online targeting advertisement campaigns or ML/AI-based...

Transforming Your Data Pipelines with Starburst

Transforming Your Data Pipelines with Starburst

June 9, 2022

Current State of ETL/ELT Extract-transform-load, more commonly known by its street name “ETL”, has been around since the early days of computing. Bringing together...

The Past, Present, and Future of Trino

The Past, Present, and Future of Trino

May 24, 2022

Recently, I had the pleasure of chatting with Ravit Jain on his show “The Ravit Show” to discuss the evolution of Trino and where...

Part 2: How to Run Batch Processes Using Starburst Galaxy

Part 2: How to Run Batch Processes Using Starburst Galaxy

May 19, 2022

This is Part 2 of a 2-part blog about how Trino can support both interactive and batch use cases. In Part 1, we explored...

ETL vs Interactive Queries: The Case for Both

ETL vs Interactive Queries: The Case for Both

May 5, 2022

This is Part 1 of a 2-part blog about how Trino can support both interactive and batch use cases.  In Part 1, we will...

Enter the Starburst Space Quest League for a Chance to Win Big!

Enter the Starburst Space Quest League for a Chance to Win Big!

April 18, 2022

Calling all data pros! Are you ready for a $20k payday? Yes, you heard it right – you could be walking away with $20,000...

Faster Query Processing: CPU Time

Faster Query Processing: CPU Time

March 25, 2022

A key engineering responsibility at Starburst is on performance enhancements. One is to reduce the amount of time that a CPU has to work...

Gartner® Report: Are Data Fabric and Data Mesh the Same or Different?

Gartner® Report: Are Data Fabric and Data Mesh the Same or Different?

March 7, 2022

Data Fabric and Data Mesh continue to sustain legions of hype and debate. Data and analytics leaders are longing for a new roadmap, beyond...

The Benefits of a Big Data SQL Query Engine

The Benefits of a Big Data SQL Query Engine

February 16, 2022

So why use a big data SQL query engine? Well, have you suffered from the following problems with processing and analyzing big data via...

6 Reasons to Attend Datanova 2022: #4, Accenture Master Class

6 Reasons to Attend Datanova 2022: #4, Accenture Master Class

January 19, 2022

So far, we’ve highlighted a few reasons why you should attend Datanova: The Data Mesh Summit: The Woz and Justin Borgman. The next reason...

The Right Way to Query Across Data Sources in Tableau (or, The Cross-Database Join Is Not Always Your Friend)

The Right Way to Query Across Data Sources in Tableau (or, The Cross-Database Join Is Not Always Your Friend)

January 13, 2022

Summary Use the right tool for the right job. Not doing so means the difference between your Tableau viz rendering in seconds vs. minutes...

Achieving Lightning-Fast Analytics on the Salesforce Customer 360

Achieving Lightning-Fast Analytics on the Salesforce Customer 360

January 6, 2022

Over the past twenty or so years, companies have experienced a Cambrian explosion of where their customer data resides.Cloud and on-premises enterprise applications aim...

Data warehouse vs Lake vs Lakehouse architecture

Data warehouse vs Lake vs Lakehouse architecture

December 6, 2021

As companies shift their analytical ecosystems from on-premise to cloud and try to avoid “data lock-in”, we’re noticing some very interesting data patterns. This...

Tableau is Just Better with Starburst

Tableau is Just Better with Starburst

November 15, 2021

I’m one of those strange people who has always enjoyed doing performance testing. The thought of spinning up lots of machines to do my...

Data Mesh: Data as a Product

Data Mesh: Data as a Product

October 21, 2021

Data Mesh is based on four central concepts, the second of which is data as a product. In this blog, we’ll explore what that...

Data Mesh Architecture: Domain-oriented Ownership

Data Mesh Architecture: Domain-oriented Ownership

October 14, 2021

A data mesh architecture is based on four central concepts, the first of which is domain-oriented ownership.  In this blog, we’ll explore what that...

The Intelligent Edge

The Intelligent Edge

September 13, 2021

Today’s digital world is an expanding frontier of emerging technologies. There are endless innovations, inspired by data, informed by data, enabled by data, and...

How Assurance Unlocked More Business Value with Starburst

How Assurance Unlocked More Business Value with Starburst

September 9, 2021

By leveraging Starburst, Assurance was able to improve conversion rates, reduce costs, and enable robust modeling. Read the full case study here. ...

Why Performance Matters: Parquet, Delta Lake, Dynamic Filtering

Why Performance Matters: Parquet, Delta Lake, Dynamic Filtering

August 26, 2021

My fascination with SQL query performance started quite some time ago and I contributed a paper on efficient processing of data warehousing during my...

Hybrid Distributed Data Store and RDBMS

Hybrid Distributed Data Store and RDBMS

August 12, 2021

As companies shift their analytical ecosystems from on-premise to cloud and try to avoid “data lock-in”, we’re noticing some very interesting data patterns. This...

Kafka and Starburst: 3 Considerations for Accelerating Time to Value

Kafka and Starburst: 3 Considerations for Accelerating Time to Value

July 27, 2021

What is Kafka? Apache Kafka was created at LinkedIn and open sourced into the Apache Software foundation in early 2011. Kafka was developed to...

Query Federation Made Simple at Comcast

Query Federation Made Simple at Comcast

June 24, 2021

The media and telecommunications provider now known as Comcast began as a regional operator with just five channels and 12,000 customers. Today, Comcast has...

Managing Secrets in Trino

Managing Secrets in Trino

June 3, 2021

Most companies want to follow good security practices. With the number of security breaches coming out daily, it almost feels like a matter of...

Starburst Elements: Start Fast with Starburst Galaxy

Starburst Elements: Start Fast with Starburst Galaxy

May 20, 2021

This is the fourth episode in our video series, Starburst Elements, focused around anything and everything Starburst. In this episode, our Product Manager Vishal...

Starburst Elements: Introduction to Starburst Galaxy

Starburst Elements: Introduction to Starburst Galaxy

May 7, 2021

This is the third episode in our video series, Starburst Elements, focused around anything and everything Starburst. In this episode, our Product Manager Vishal...

Data Mesh: The Answer to the Data Warehouse Hypocrisy

Data Mesh: The Answer to the Data Warehouse Hypocrisy

March 25, 2021

Note: I start this piece with some technical background that has nothing to do with the data mesh, and is only relevant to data...

A Gentle Introduction to the Hive Connector

A Gentle Introduction to the Hive Connector

February 12, 2021

One of the most confusing aspects when starting with the Hive connector comes from the complex Hive model and overlapping use cases of this...

Reasons to Attend Datanova 2021: # 6, Technical Training from the Creators of Trino

Reasons to Attend Datanova 2021: # 6, Technical Training from the Creators of Trino

February 5, 2021

We love data engineers at Starburst. They are our people, even when their Starburst Data equivalents try to trick Marketing into pronouncing the data...

Top 10 Reasons to Migrate from EMR Trino to Starburst Enterprise

Top 10 Reasons to Migrate from EMR Trino to Starburst Enterprise

November 13, 2020

In today’s data architecture economy, there are no shortages of options when it comes to choosing various distributions and deployment strategies for a given...

Presto & Data Science: Getting Data Into the Hands of Data Scientists (Faster)

Presto & Data Science: Getting Data Into the Hands of Data Scientists (Faster)

June 26, 2020

A few days ago I read a Gartner report stating that data scientists spend 23% of their time on data collection and preparation. I...

How a Telecommunications Giant Established Universal Data Access

How a Telecommunications Giant Established Universal Data Access

April 3, 2020

  Our customer base has been growing quickly, and we’re excited to share a case study highlighting one of our largest clients, a telecommunications...

More Secure, More Connected: Starburst Presto Updated to 323e

More Secure, More Connected: Starburst Presto Updated to 323e

November 21, 2019

Starburst Presto 323e is the now our most exciting and feature rich release by Starburst to date. When we founded Starburst, our vision was to...

Presto on Kubernetes

Presto on Kubernetes

August 2, 2019

Kubernetes (K8s) eases the burden and complexity of configuring, deploying, managing, and monitoring containerized applications. We are excited to announce the availability and support...

Start Free with
Starburst Galaxy

Up to $500 in usage credits included

  • Query your data lake fast with Starburst's best-in-class MPP SQL query engine
  • Get up and running in less than 5 minutes
  • Easily deploy clusters in AWS, Azure and Google Cloud
For more deployment options:
Download Starburst Enterprise

Please fill in all required fields and ensure you are using a valid email address.

s