Knowledge Catalog adds unstructured data profiling via Gemini
Knowledge Catalog can now profile unstructured data, such as PDFs in Cloud Storage, using Vertex AI Gemini models. This feature extracts semantic insights and relationships, enhancing understanding of diverse data sources. It is currently in Preview and accessible only through the Dataplex REST API.
Features (1) ›
- Knowledge Catalog
Knowledge Catalog now supports data profile scans for unstructured data (such as PDFs in Cloud Storage) on existing BigQuery object tables. This feature uses Vertex AI Gemini models to extract semantic insights, including entities and relationships, from unstructured content. Note: Data profile scans for unstructured data are currently available in Preview using the Dataplex REST API only. The cloud console and gcloud workflows are not supported for this feature. For more information, see About unstructured data insights and Use data profile for unstructured data .
https://docs.cloud.google.com/release-notes#June_11_2026
