On-demand Webinar

Trino
and
Starburst
Training
Series

This 5-part training series takes you through designing high-performance data lake table structures and exploring Apache Iceberg features.  We cover additional performance benefits of materialized views and leveraging our Warp Speed acceleration engine. Finally, we focus on data engineering pipeline options of SQL and Python.

You will receive the session recordings when you sign up for the series! Details below. 

Access the series on-demand!

Session 1: Creating & querying data lake tables

  • Session details:
    • Leverage Starburst Galaxy to create a catalog and schema aligned to an AWS S3 object store.
    • Construct external tables to existing datasets.
    • Utilize optimized columnar file formats to create new tables.
    • Design partitioned tables to improve performance.
    • Federate data lake queries to join with traditional data sources. 

Session 2: Modern table formats & Apache Iceberg 

  • Session details:
    • Move beyond Hive to understand the benefits of modern table formats such as Apache Iceberg. 
    • Run ACID compliant transactions to modify your data and understand how this creates new table versions.
    • Explore time-travel queries and table rollbacks.
    • Modify the partitioning strategy without rebuilding your table.

Session 3: Data pipelines, views & data products

  • Session details:
    • Learn how to create views and materialized view in Starburst Galaxy.
    • Understand the refresh options for materialized views.
    • Define popular strategies to include ETL/ELT, CDC, and SCD.
    • Build a SQL-based pipeline to spans the reference architecture’s land, structure, and consume layers. 
    • Create and secure granular data products for your downstream consumers.

Session 4: Experience Warp Speed in action (Demo only)

  • Session details:
    • Understand the architecture of this autonomous data lake acceleration technology.
    • Query tables on standard and accelerated clusters to showcase the performance gains from the smart indexing and caching that occurs.
    • Explore the web UI’s visualizations on how beneficial Warp Speed was for your queries.

Session 5: Transformation processing with PyStarburst

  • Session details:
    • Differentiate SQL-based data engineering from programming-based.
    • Understand how PyStarburst implements lazy execution with Starburst Galaxy.
    • Explore the DataFrame API.
    • Write Python code to perform analytical questions and transformation processing.

Meet the speaker:

Lester Martin

Lester Martin

Educational Engineer at Starburst

Cookie Notice

This site uses cookies for performance, analytics, personalization and advertising purposes. For more information about how we use cookies please see our Cookie Policy.

Manage Consent Preferences

Essential/Strictly Necessary Cookies

Required

These cookies are essential in order to enable you to move around the website and use its features, such as accessing secure areas of the website.

Analytical/Performance Cookies

These are analytics cookies that allow us to collect information about how visitors use a website, for instance which pages visitors go to most often, and if they get error messages from web pages.

Functional/Preference Cookies

These cookies allow our website to properly function and in particular will allow you to use its more personal features.

Targeting/Advertising Cookies

These cookies are used by third parties to build a profile of your interests and show you relevant adverts on other sites.