Data Catalog
source link: https://cloud.google.com/data-catalog
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
Data Catalog
A fully managed and highly scalable data discovery and metadata management service.
New customers get $300 in free credits to spend on Google Cloud during the Free Trial. All customers get up to 1 MiB of business or ingested metadata storage and 1 million API calls, free of charge.
-
action/check_circle_24px Created with Sketch.
Pinpoint your data with a simple but powerful faceted-search interface
-
action/check_circle_24px Created with Sketch.
Sync technical metadata automatically and create schematized tags for business metadata
-
action/check_circle_24px Created with Sketch.
Tag sensitive data automatically, through Cloud Data Loss Prevention (DLP) integration
-
action/check_circle_24px Created with Sketch.
Get access immediately then scale without infrastructure to set up or manage
Benefits
Simplifies data discovery at any scale
Empower any user on the team to find or tag data with a powerful UI, built with the same search technology as Gmail, or via API access. Data Catalog is fully managed, so you can start and scale effortlessly.
Offers a unified view of all datasets
Understand your data assets in Google Cloud and beyond. Integrations with BigQuery, Pub/Sub, Cloud Storage, and many connectors provide a unified view and tagging mechanism for technical and business metadata.
Key features
Key features
Serverless
Fully managed and scalable metadata management service; requires no infrastructure to set up or manage, allowing you to focus on your business.
Metadata as a service
Metadata management service for cataloging data assets via custom APIs and the UI, thereby providing a unified view of data wherever it is.
Central catalog
A flexible and powerful cataloging system for capturing both technical metadata (automatically) as well as business metadata (tags) in a structured format.
Documentation
Documentation
Quickstart for tagging datasets
Make a BigQuery dataset, create a tag template with a schema, look up the Data Catalog entry for your table, and attach the tag to your table.
How to search with Data Catalog
Use Data Catalog to perform a search of data assets, such as datasets, tables, views, and Pub/Sub topics in your Google Cloud projects.
Restricting access with BigQuery column-level security
This page explains how to use BigQuery column-level security to restrict access to BigQuery data at the column level.
Access on-premises metadata connectors on GitHub
Commons code used by the Data Catalog connectors and links for the connectors sample code.
Not seeing what you’re looking for?
Use cases
Use cases
While you can use the Data Catalog API to create your own connectors for ingesting metadata from a data source of your choice, we provide you with “ready to use” open-source connectors for ingesting metadata from a number of common data sources like MySQL, PostgreSQL, Hive, Teradata, Oracle, SQL Server, Redshift, and more. Once in Data Catalog, all assets can be searched for and tagged.
Integrate your on-premises RDBMS metadata
Code samples and best practices for ingesting metadata from MySQL, PostgreSQL, Teradata, Oracle, SQLServer, Redshift, Vertica, and Greenplum.
Keep up with your on-premises Hive Server
Code samples and best practices for ingesting metadata from an on-premises Hive server into Data Catalog.
Live sync your on-premises Hive Server metadata changes
Code samples and best practices for how to incrementally ingest metadata changes from an on-premises Hive server into Data Catalog.
The Data Catalog API can be used to ingest metadata from any business intelligence asset. For Looker and Tableau we have open-sourced ready-to-use connectors so they're discoverable and can be tagged directly in Data Catalog.
Integrate metadata from Looker
Code samples and best practices for ingesting metadata from Looker assets.
Integrate metadata from Tableau
Code samples and best practices for ingesting metadata from Tableau assets
All features
All features
Pricing
Pricing
Pricing for Data Catalog is split between metadata storage and API calls—both on a consumption basis. Metadata storage includes any new metadata stored in Data Catalog, including:
• Business metadata, such as Data Catalog tag templates and tags
• Cloud Storage filesets schemas attached to Pub/Sub topics
• Custom types metadata stored in Data Catalog, etc.
Metadata storage does not include the technical metadata stored by other Google Cloud services, for example, dataset table and column names stored in BigQuery. Detailed pricing and examples for both metadata storage and API calls may be found in the Data Catalog documentation.
Partners
Partners and integrations
Our strategic partnerships help build a strong ecosystem and allow customers to have a unified data discovery experience for hybrid cloud scenarios, using their platform of choice.
Take the next step
Start building on Google Cloud with $300 in free credits and 20+ always free products.
-
Need help getting started?
-
Work with a trusted partner
-
Continue browsing
Recommend
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK