DataPrincipal Daily - June 9th, 2026
Streaming SQL is teaching its catalog to talk to agents, table formats are growing vector types, and the orchestration and vector-database lines spent the week shipping quiet, useful fixes.
⚡ TL;DR
Materialize v26.27.0 turns on a query tool in its MCP Server for Agents by default and auto-routes queries to the right data product clusters.
Apache Hudi 1.2 adds a native VECTOR type, BLOB and VARIANT support, and Lance file format integration for AI workloads on the lakehouse.
Prefect 3.7.4 starts emitting lifecycle events for variables, flows, blocks, and automations, plus a Redis-backed cleanup queue.
Weaviate v1.38.0 takes HFresh to GA and previews multi-tenant Namespaces and a secure MCP server.
Dagster 1.13.8 patches a SQL injection in its ClickHouse integration, and Milvus 2.6.18 ships nullable vectors.
🌊 Streaming & Real-Time
Materialize v26.27.0 turns on agent query tools by default
Materialize’s weekly v26.27.0 release (early June) flips the MCP Server for Agents so its query tool is enabled by default, letting an agent join across data products without manual setup. The server now also routes queries to the appropriate data product catalog clusters and surfaces hydration readiness. The release also cleans up default EXPLAIN output for joins and cross joins and fixes filter pushdown with timestamp arithmetic, JSON CASE handling, LATERAL subquery results, and an SSH tunnel failure against HTTPS schema registries. Materialize is wiring agent access straight into the catalog.
🏗️ Table Formats & Lakehouse
Apache Hudi 1.2 adds a native VECTOR type and Lance format
Apache Hudi 1.2 (June 7) brings first-class support for embeddings through a native VECTOR type with built-in similarity search on Hudi tables, alongside BLOB and VARIANT handling for binary objects, documents, and semi-structured records. The release pairs that with Lance file format integration for vector and multimodal storage, Flink improvements (Record Level Index support, dynamic bucket scaling, and a FLIP-27 based source), and cloud-native multi-writer concurrency through native storage-based locking across major cloud providers. Putting vectors and the records they describe in a single table format is an interesting concept, since AI feature data does not want to live in a separate vector store forever.
🔧 Orchestration & Tooling
Prefect 3.7.4 emits lifecycle events for core objects
Prefect 3.7.4 (June 5) starts emitting created, updated, and deleted lifecycle events for domain objects, including variables, flows, block documents, and automations, which gives automations a clean hook into the platform’s own state changes. The release also adds a Redis-backed cleanup queue with optimistic transactions, a reconciler and dispatcher for delivering worker messages over WebSocket with lease expiry and retries, and fixes for deployment initialization parsing and service recovery after a Redis disruption. Lifecycle events on metadata objects, not just on runs, enable a team to automate governance responses.
Dagster 1.13.8 patches a ClickHouse SQL injection
Dagster 1.13.8 (June 4) fixes a SQL injection in its ClickHouse integration related to dynamic partition keys and adds dedicated tags and icons for Microsoft Fabric and OneLake, while the UI’s “Docs” tab becomes “Components.” BigQuery, Snowflake, and DuckDB managers now skip writing empty DataFrames to tables, special characters are allowed in team owners, and a clutch of dbt and backfill display bugs got cleared.
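For readers less familiar with this class of bug, here is a minimal sketch of how a partition key can become an injection vector when it is interpolated into SQL text, and how a bound parameter avoids it. It uses Python's built-in sqlite3 rather than ClickHouse, and the table, function names, and key values are all hypothetical; it is an illustration of the general pattern, not Dagster's actual code or fix.

```python
import sqlite3


def make_db() -> sqlite3.Connection:
    """Build a throwaway in-memory table keyed by a partition string (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE events (partition_key TEXT, value INTEGER)")
    conn.executemany(
        "INSERT INTO events VALUES (?, ?)",
        [("2026-06-01", 1), ("2026-06-02", 2), ("2026-06-03", 3)],
    )
    return conn


def count_unsafe(conn: sqlite3.Connection, partition_key: str) -> int:
    # Vulnerable pattern: the key is pasted into the SQL text,
    # so a quote inside the key can change the query's meaning.
    sql = f"SELECT COUNT(*) FROM events WHERE partition_key = '{partition_key}'"
    return conn.execute(sql).fetchone()[0]


def count_safe(conn: sqlite3.Connection, partition_key: str) -> int:
    # Safer pattern: the key travels as a bound parameter and is
    # always treated as a value, never as SQL.
    sql = "SELECT COUNT(*) FROM events WHERE partition_key = ?"
    return conn.execute(sql, (partition_key,)).fetchone()[0]


# A user-supplied "partition key" that closes the quote and widens the filter.
malicious_key = "x' OR '1'='1"

if __name__ == "__main__":
    conn = make_db()
    print(count_unsafe(conn, malicious_key))  # matches every row
    print(count_safe(conn, malicious_key))    # matches no rows
```

Dynamic partition keys are created at runtime, so they deserve the same handling as any other user-supplied input.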
📐 Governance & AI Infrastructure
OpenMetadata 1.13.0 makes MCP a first-class service
OpenMetadata 1.13.0 (June 8) promotes MCP to a first-class service category with execution logs and OAuth support, adds typed glossary relations (relatedTo, synonym, antonym, broader, narrower), and ships an Ontology Explorer at /governance/ontology with graph visualization. The release also brings live indexing with a dead-letter queue, tunable search index settings, storage auto-classification for containers, and nine new connectors, including Microsoft Fabric, Google Drive, Pub/Sub, and QuestDB. Treating MCP as a governed service is a necessary step for data governance, and it’s good to see it addressed.
Weaviate v1.38.0 takes HFresh to GA and previews Namespaces
Weaviate v1.38.0 (June 5) moves HFresh to general availability with memory, disk-write, and allocation reductions, and previews Namespaces for data and control-plane isolation on multi-tenant shared clusters with RBAC integration and cascading delete. The preview list also covers nested object filtering inside indexed JSON, runtime reindexing of property index types, and a secure MCP server that lets agents read and write to Weaviate natively.
💎 Gems & Tools
Milvus 2.6.18 ships nullable vectors
Milvus 2.6.18 (June 5) lets vector fields accept NULL without placeholder data and adds element-level search on individual entries inside Struct Array fields. HTTP/2 support for the REST server and better QueryNode scheduling under load round it out.
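To make the nullable-vector idea concrete, here is a small pure-Python sketch of a similarity search over rows where some embeddings are missing. It does not use the Milvus client or its API; the row layout and the `cosine` and `top_k` helpers are hypothetical, and it assumes rows without a vector are simply left out of the ranking, which may differ from how Milvus handles them.

```python
import math
from typing import List, Optional, Sequence, Tuple

# Hypothetical row shape: (row id, embedding or None when no vector exists yet).
Row = Tuple[str, Optional[Sequence[float]]]


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(rows: List[Row], query: Sequence[float], k: int = 2) -> List[str]:
    """Rank rows by similarity to the query, skipping rows whose vector is None."""
    scored = [(cosine(vec, query), row_id) for row_id, vec in rows if vec is not None]
    scored.sort(reverse=True)
    return [row_id for _, row_id in scored[:k]]


rows: List[Row] = [
    ("a", [1.0, 0.0]),
    ("b", None),  # no embedding yet, and no zero-vector placeholder needed
    ("c", [0.0, 1.0]),
    ("d", [0.9, 0.1]),
]

if __name__ == "__main__":
    print(top_k(rows, [1.0, 0.0], k=2))
```

Without NULL support, row "b" would need a dummy vector, which could then surface in results as a false match.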


