

What is Starburst Galaxy?
Starburst Galaxy is a price-performant, fully managed, multi-cloud data and analytics platform powered by Trino, a leading open-source distributed MPP SQL query engine. Starburst Galaxy is used for both interactive ad-hoc analytics and long-running workloads like batch and ETL/ELT, and offers high scalability and query completion rates even as the amount of data, query volume, and query complexity increase. The service runs federated queries across data lakes, cloud data warehouses, on-premises databases, and relational data management systems like PostgreSQL and MySQL. Galaxy also supports fault-tolerant execution, smart indexing and caching, Data Products, and universal search and schema discovery.
What is Amazon Athena?
Amazon Athena, available in serverless and dedicated versions, is a query service that analyzes data in Amazon Web Services (primarily Amazon S3) using standard SQL for ad-hoc analytics. Amazon Athena serverless has no infrastructure for customers to manage, and customers pay only for the queries they run. Amazon Athena was originally built on a fork of Presto (PrestoDB version 0.217, released in January 2019).
Starburst is a Leader in Enterprise Big Data Analytics
Don’t take our word for it. Starburst is named #1 for Quality of Support and Ease of Use in G2 Crowd’s Grid Report based on real customer reviews. Additionally, customers said this about Starburst:
- 100% of users rated Starburst 4+ stars
- 100% of users believe Starburst is headed in the right direction
- 96% meets users' requirements
- 93% of users would recommend Starburst

Simplicity
Going beyond key platform governance and management capabilities, a modern data and analytics platform empowers data teams with easy-to-use functionality that increases productivity without adding complexity. It allows businesses to use a range of existing investments in just a few clicks. It enables data teams to easily discover, create, govern, analyze and share federated data products from distributed data sets across the organization.
Automated AWS compute plane set-up
Automated data maintenance
Multi-cloud platform
Built-in data security
Data Products
Automated cluster management
Built-in real-time usage monitoring
Built-in query scheduler
Built-in Natural Language Processing
Automated data lake optimization
Predictable pricing
Comparison based on publicly available information as of July 8, 2024.
* In preview. Contact us to learn more.
Access
True data access empowers organizations with the ability to use all their data, no matter where it lives, across data lakes, data warehouses, and databases while having confidence in security and governance controls. True access is about meeting business needs on time while adhering to regulatory data sovereignty requirements. Your open lakehouse should free your data sources for analytics and AI, not confine them in another way.
Cloud data federation
On-premises data federation
AWS service account
Time-based policies
RBAC
ABAC
Column/Row masking
SSO via AWS IAM, Okta, Azure AD, and Google
Universal Search and schema discovery
Uses Trino connectors for federation
In-platform universal search and schema discovery
Optimized first-party connectors: parallelism, cached views, dynamic filtering, security, and authentication
Query sharing
Data Products sharing
Data profiling
Data lineage
Streaming ingest
Scalability
Internet scale matters in an internet-powered world, but not every workload needs that power and performance. Your open data lakehouse powers modern data and analytics and puts control of performance and costs in your hands. It ensures high-performance scalability is available at the click of a button or automatically when you need it most while optimizing price-to-performance for all analytics workloads. It also instills confidence that queries will execute as scheduled, even at high concurrency.
Works with S3 Express One Zone
Ad-hoc and interactive queries
Results and repeated subquery caching
High concurrency
Control over concurrency and prioritization
Fault-tolerant execution
Built-in data catalog
Autoscales by adding more nodes per cluster
Customizable scaling for cost and performance optimization
Consistently executes long-running batch queries
Smart indexing and caching
Fine-grained resource management
Optionality
Open file and table formats are table stakes in providing optionality. Your open lakehouse goes beyond the fundamentals to ensure your business has full control over your data by accessing data where it lives across hybrid and multi-cloud data architectures, by allowing choice in cloud providers, security, and BI tools, and ensuring expert Trino support is available if and when your teams need it most.
Open-source Trino query engine
Supports popular open file formats
Supports Python
Supports hybrid and cloud data architectures
Supports data catalogs beyond AWS Glue
Runs on multiple clouds
Expert in-house Trino support
Natively run SQL on Iceberg, Delta Lake, Hudi, and Hive table formats
In-platform capability to migrate Hive to Delta or Iceberg tables

Start for Free with Starburst Galaxy
Up to $500 in usage credits included
Discover
Easily search across data sources and clouds to find the data you need.
Govern
Streamline data governance with built-in RBAC and ABAC.
Analyze
Run internet-scale workloads with the power of Trino.
Fast
Accelerate queries with smart indexing and caching technologies like Warp Speed.
More Deployment Options