10X faster queries
30% reduced infrastructure costs
$8 million in projected savings
Americas
Healthcare & Life Science
Hadoop
Enterprise
1000+
Providing users with one endpoint is so much easier. They can use the same familiar tools, but everything is happening faster.
Mike Prior
Principal IO Engineer
Information technology service provider Optum is dedicated to shaping a healthcare system that gives patients a complete view of their health, providing them with personalized insights that lead to improved outcomes. A subsidiary of the UnitedHealth Group, Optum uses technology to connect the brightest people, places and ideas across the healthcare ecosystem. The company’s mission depends in part on providing its analysts with fast, secure access to data. Initially, Optum’s data lake architecture couldn’t support its needs at scale. Tired of poor query performance and inefficient resource utilization, Optum’s Advanced Research & Analytics group deployed Starburst Enterprise to improve data access, accelerate time to insight, maintain strong security, and reduce costs.
The majority of Optum’s data resides in a petabyte-scale Hadoop data lake, while the rest is siloed in SAS, Microsoft SQL Server, Teradata, and Postgres databases. More than 10,000 users need fast access to this distributed data. Previously, Optum relied on Hive and Spark SQL for analytics. “Our data lake backbone was on a traditional Hadoop infrastructure,” explains Optum Principal IO Engineer Mike Prior. “While that approach had its day, it’s not flexible. We needed to scale out and separate our compute from our storage without moving the data.”
Querying two different databases required copying data from one to another or engaging in an expensive ETL operation, and queries were taking far too long. “Our goal was to have faster access to data,” Prior notes. “If an analyst wants to run an ad hoc query, they want the response in seconds, not minutes.”
Finally, as a healthcare technology solutions provider, Optum must appropriately limit access to sensitive Protected Health Information (PHI), and the company wanted a solution that could support these requirements at scale.
After evaluating various solutions, Optum chose Starburst Enterprise. Starburst can be deployed on-prem or in the cloud, and Optum ultimately chose the former, running on Hadoop. Today, the company depends on Starburst Enterprise as a high-performance, distributed query engine that gives its users a single point of secure access to all of its data, allowing Optum to query data where it resides.
Starburst, built on the open-source project Trino, allows users to join data from different sources in a single query. This simplifies analysts’ work, as they only need to write queries in Trino’s SQL dialect. Business intelligence analysts and data scientists don’t need to learn new dialects or techniques, and they have enhanced access to the data they need to generate or discover insights.
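To illustrate what joining different data sources can look like, here is a minimal sketch in Python that assembles a federated query in Trino SQL. Trino addresses tables as catalog.schema.table, so a single statement can reference tables that live in different connected systems. The catalog, schema, table, and column names below (hive, postgresql, claims, members, and so on) are hypothetical placeholders and are not taken from Optum's environment; the helper functions are likewise illustrative.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a fully qualified Trino table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def build_federated_query() -> str:
    """Build one SQL statement that joins tables from two catalogs.

    All names are hypothetical placeholders, not Optum's actual schema.
    """
    claims = qualified("hive", "lake", "claims")             # data lake table
    members = qualified("postgresql", "public", "members")   # relational table
    return (
        "SELECT m.region, count(*) AS claim_count\n"
        f"FROM {claims} AS c\n"
        f"JOIN {members} AS m ON c.member_id = m.member_id\n"
        "GROUP BY m.region"
    )


if __name__ == "__main__":
    print(build_federated_query())
```

The resulting string could be submitted through whichever Trino-compatible client an analyst already uses. The point of the sketch is only that both sources appear in one statement, with no prior copy step between them.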
At Optum, the benefits of Starburst Enterprise include:
Accelerated Ad Hoc Queries
Improved performance was essential for Optum. Ad hoc queries with Starburst run 10X faster than with Hive and 2X to 3X faster than with Spark, and the platform is consistently fast across different jobs. One user attested that queries that would have taken upwards of five minutes in the company’s previous Hive-based environment now finish in under 10 seconds with Starburst, a speedup of at least 30X for those queries.
30% Drop in Infrastructure Resources
In addition to improving query performance by 10X, Starburst reduced resource utilization by 30%. One of the primary advantages of Starburst Enterprise is that it separates storage from compute, allowing companies to dial up compute as the need arises, and not pay for resources they aren’t actively using. With Starburst, Optum’s data largely remains in its data lake, with their Starburst cluster tuned to the needs of different groups. “We’re able to spin up and spin down workers as needed,” explains Prior, “and we use autoscaling to cover peak demand.”
Improved Consistency & Increased Utilization
Prior and his team have found that Starburst Enterprise is a more reliable query engine when faced with larger workloads. Like many other customers, Optum has seen more and more analysts adopting the platform over time. There’s no mystery here: when a solution works quickly and effectively, users come to depend on it. Plus, it’s simpler to work through Starburst Enterprise. “Providing users with one endpoint is so much easier,” Prior says. “They can use the same familiar tools, but everything is happening faster.”
Global Security Management
At Optum, certain users or business groups may need access to PHI, while others should not be able to see patient data. Managing permissions and access policies is essential to Optum’s business, and Starburst Enterprise makes this process painless and seamless for Prior and his colleagues. “We just want one place to configure security for all data access and Starburst Enterprise allows us to do that,” he notes.
No More ETL or Data Duplication
Extracting data from one source, transforming this data to make it compatible, and then loading it into another warehouse or data lake is an expensive and time-consuming operation. With Starburst Enterprise, if an analyst needs to query data residing in two siloed databases, ETL is no longer necessary. Starburst allows analysts to quickly and easily query data where it lives. This reduces data duplication as well, since nothing needs to be copied from one silo to another in the first place.
Savings, Success, and Flexibility
Although Starburst Enterprise is currently deployed on-prem, Prior appreciates the fact that the platform can be run in the cloud, and that Optum can make this transition without any disruptive changes. He’s open to deploying in the cloud at some point in the future, and possibly establishing a multi-cluster, on-prem-to-cloud connection. Overall, the advantages highlighted above, including accelerated queries and enhanced resource utilization, are part of a larger impact on the business as a whole. Optum has seen improved customer retention and satisfaction, and the company is anticipating $8 million in savings over the long run thanks to analytics insights uncovered with the help of Starburst Enterprise.
More resources: Optum webinar
© Starburst Data, Inc. Starburst and Starburst Data are registered trademarks of Starburst Data, Inc. All rights reserved. Presto®, the Presto logo, Delta Lake, and the Delta Lake logo are trademarks of LF Projects, LLC