Last Updated: 2024-01-15
Google BigQuery is a fully managed, serverless data warehouse and analytics platform offered by Google Cloud. It is designed for large-scale data analytics and processing using SQL-like queries.
Like other cloud data warehouses, BigQuery lets you analyze massive datasets in real time by running SQL-like queries without having to manage the underlying infrastructure.
In this tutorial, you will learn how to configure a Google BigQuery catalog in Starburst Galaxy.
Once you've completed this tutorial, you will be able to configure a Google BigQuery catalog in Starburst Galaxy.
Starburst tutorials are designed to get you up and running quickly by providing bite-sized, hands-on educational resources. Each tutorial explores a single feature or topic through a series of guided, step-by-step instructions.
As you navigate through the tutorial you should follow along using your own Starburst Galaxy account. This will help consolidate the learning process by mixing theory and practice.
Now it's time to pivot over to Starburst Galaxy to begin the process of connecting your BigQuery data source.
Starburst Galaxy separates users by role. Configuring a new catalog will require access to a role with appropriate privileges. Today, you'll be using the accountadmin role.
This is a quick step, but an important one.
Sign in to Starburst Galaxy in the usual way. If you have not already set up an account, create one before continuing.
Your current role is listed in the top right-hand corner of the screen.
Adding a new BigQuery catalog follows the same process as adding other data sources in Starburst Galaxy.
The steps below will show you how to start the process of configuring a new catalog.
Create a new catalog for your BigQuery data source.
Starburst Galaxy allows the creation of catalogs for a number of different data sources. In this case, you are going to create a new catalog in the BigQuery category.
The new catalog needs both a name and description. This ensures that you can find it later.
When you connect Starburst Galaxy to a new data source, you must authenticate. This helps ensure that you are connecting the right data source and that you have the appropriate permissions.
For BigQuery catalogs, Starburst Galaxy authenticates using a JSON key. This is the only authentication method available.
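Before pasting a JSON key into the catalog form, it can help to check that the key text is well formed. The following is a minimal sketch, assuming the key is a Google Cloud service account key, which normally carries the fields `type`, `project_id`, `private_key`, and `client_email`. The function name `check_service_account_key` and the sample values are hypothetical and are not part of Starburst Galaxy.

```python
import json

# Fields normally present in a Google Cloud service account key file.
REQUIRED_FIELDS = ("type", "project_id", "private_key", "client_email")


def check_service_account_key(raw_json: str) -> list[str]:
    """Return a list of problems found in the key text; an empty list means none were found."""
    try:
        key = json.loads(raw_json)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(key, dict):
        return ["top-level JSON value is not an object"]

    # Report every required field that is absent or empty.
    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS if not key.get(name)]

    # A service account key declares its own type.
    if key.get("type") and key["type"] != "service_account":
        problems.append(f"unexpected type: {key['type']}")
    return problems


if __name__ == "__main__":
    # Placeholder values for illustration only; a real key comes from Google Cloud.
    sample = json.dumps({
        "type": "service_account",
        "project_id": "example-project",
        "private_key": "PLACEHOLDER",
        "client_email": "example@example-project.iam.gserviceaccount.com",
    })
    print(check_service_account_key(sample))
```

This check only looks at the shape of the key. It does not confirm that the service account has the permissions BigQuery requires, which is what the connection test in Starburst Galaxy is for.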
Every new catalog connection includes a test before you connect it. This helps to ensure that you have input the correct credentials and allows you to quickly fix any problems before actually connecting.
You're almost there! Time to test the connection and then complete the process of creating your new BigQuery catalog.
Starburst Galaxy offers several ways to configure access controls for your catalog.
Select the appropriate read access for your situation.
The catalog is now created, but you cannot use it until you add it to a cluster. Clusters can include many catalogs. You can add the new BigQuery catalog to an existing cluster now, or do so at a later stage.
You can add a catalog to a cluster at any time.
Congratulations! You have reached the end of this tutorial, and the end of this stage of your journey.
You're all set! Now you can query the data in your BigQuery data source.
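As a first query, a quick preview of a table is often enough to confirm that the catalog works. The sketch below builds such a statement, assuming tables are addressed as `catalog.schema.table` and that each BigQuery dataset appears as a schema in the catalog. The helper names and the catalog, schema, and table names are hypothetical; substitute the names from your own account.

```python
def quote_identifier(name: str) -> str:
    """Wrap an identifier in double quotes, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def preview_query(catalog: str, schema: str, table: str, limit: int = 10) -> str:
    """Build a SELECT statement that previews rows from catalog.schema.table."""
    if limit < 1:
        raise ValueError("limit must be a positive integer")
    target = ".".join(quote_identifier(part) for part in (catalog, schema, table))
    return f"SELECT * FROM {target} LIMIT {limit}"


if __name__ == "__main__":
    # Hypothetical names for illustration only.
    print(preview_query("my_bigquery", "sales", "orders", limit=5))
```

You can paste the resulting statement into the Starburst Galaxy query editor once the catalog has been added to a running cluster.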
At Starburst, we believe in continuous learning. This tutorial provides the foundation for further training available on this platform, and you can return to it as many times as you like. Future tutorials will make use of the concepts used here.
Starburst has lots of other tutorials to help you get up and running quickly. Each one breaks down an individual problem and guides you to a solution using a step-by-step approach to learning.
Visit the Tutorials section to view the full list of tutorials and keep moving forward on your journey!