OpenLineage 1.49, RisingWave 3.0, MotherDuck Flights
OpenLineage 1.49.0 adds a Cassandra JDBC extractor and dbt lineage fixes, RisingWave 3.0 defaults to DataFusion for batch, and MotherDuck previews Flights

DataPrincipal Daily - June 11th, 2026
The lineage standard and the streaming engines both shipped real releases this week, while the table-format vendors stayed quiet ahead of next week’s Databricks summit.
⚡ TL;DR
OpenLineage 1.49.0 (June 10) adds a Cassandra JDBC extractor and fixes column lineage across dbt, BigQuery, Dataproc, Spark, and Snowflake.
RisingWave 3.0.0 (June 11) makes the DataFusion engine the default for batch queries and adds an exactly-once Delta Lake sink.
MotherDuck previewed Flights, a Python ingest runtime that any MCP agent can drive, and took its Dives data apps to general availability.
Dagster published a five-level orchestration maturity model arguing the L2 to L3 jump is the move to asset-aware platforms.
🏗️ Platforms & Architecture
OpenLineage 1.49.0 adds a Cassandra extractor and fixes dbt column lineage
OpenLineage 1.49.0 shipped on June 10, a week after the 1.48.0 release. It adds a CassandraJdbcExtractor that parses Cassandra JDBC URLs to the driver specification so Cassandra reads and writes appear in lineage graphs, and it clears a run of connector bugs: missing column lineage in the dbt structured-log processor under --consume-structured-logs, malformed BigQuery job IDs in the dbt adapter that dropped project and location, GCP Dataproc job IDs that broke on retried jobs, weak lakehouse detection in Spark config, and lost Snowflake column lineage when identifiers were quoted.
RisingWave 3.0.0 makes DataFusion the default batch engine
RisingWave shipped its 3.0.0 major release on June 11, turning on the Apache DataFusion engine by default for batch queries and adding an exactly-once Delta Lake sink alongside a new HTTP sink. The release also wires in Iceberg source ingestion metrics, extends Postgres CDC to vector types, improves SQL Server CDC, and moves the codebase onto Hyper 1.0 and Tonic and Prost 0.14. One breaking change aligns the hash of jsonbb string references with plain string slices, which anyone pinning RisingWave should read before upgrading.
🔧 Tools & Products
MotherDuck previews Flights, an agent-driven ingest runtime
MotherDuck released Flights in preview on June 10, a Python runtime for building and operating ingestion pipelines that connects to MotherDuck through the DuckDB Python client. The pipelines can be built in SQL or the UI and operated by any MCP-capable agent, with the company naming Claude, ChatGPT, and Cursor, and Flights stays limited to Business and Enterprise plans during preview. The same June 10 release took Dives, MotherDuck’s interactive data apps, to general availability on all plans.
📐 Practices & Governance
Dagster publishes a five-level orchestration maturity model
Dagster posted an Orchestration Maturity Model on June 9 that grades data platforms from L1 to L5 and places most teams at L2, where a central scheduler runs jobs and reports whether code ran but not whether the data is healthy. It frames L3 as the point where the platform treats data assets as the unit of work, measuring asset health rather than job status and triggering on data readiness rather than the clock. The higher levels layer on freshness targets and self-service across teams.
💎 Gems & Tools
Ibis A portable Python dataframe API that compiles the same code to more than 20 backends including DuckDB, Polars, Spark, Trino, and ClickHouse. Worth a look when you want to write analytics once and move it across engines without a rewrite.
Marquez The LFAI and Data reference server for OpenLineage, collecting run, job, and dataset metadata behind a REST API. A useful companion if today’s OpenLineage release has you wiring lineage into a real catalog.

