Global Lineage View
The Global Lineage page provides a comprehensive visualization of all data products and their relationships within a space.
Accessing Global Lineage
Navigate to Lineage in the main sidebar to open the space-wide view.
URL: /lineage
Graph Overview
The global view displays every data product in the space as a node, with all lineage relationships shown as connecting edges. The graph automatically arranges these elements using a hierarchical layout that flows from sources to consumers.
Layout Algorithm
The graph uses topological sorting to determine positioning. It identifies root products (those with no upstream sources) and places them at the far left (Level 0). Subsequent products are assigned to levels based on their dependency depth, with nodes at each level centered vertically. Edges are routed using a step-style for a clean visual flow.
Visual Hierarchy
The layout provides a clear visual hierarchy: source data sits at the far left, transformation layers occupy the middle, and consumer endpoints like dashboards and exports are positioned at the far right.
Searching and Locating
Node Search
To find specific products in a large graph, use the search bar in the graph header. Typing a product name filters results in real-time; selecting a match centers the viewport on that product.
Keyboard Navigation
You can cycle through multiple search matches using the Enter key, with a counter showing your position in the results (e.g., "1 of 5").
Viewport Centering
When you locate a product, the graph automatically centers on it and adjusts the zoom level for optimal visibility. The selected node is highlighted to make it stand out visually against the rest of the graph.
dbt-Style Selectors
For advanced navigation, the graph supports dbt-style selector syntax in the selector input field.
Syntax Patterns
You can select a single product by name (product_name), or include its context: +product_name selects the product and all its upstream sources, while product_name+ includes all downstream consumers. To see the full lineage, use +product_name+. You can also limit depth, such as +2,product_name for two levels of upstream dependencies, or product_name,+1 for one level of downstream.
Using Selectors
Enter a selector to filter the graph; the view updates to show only matching nodes and their relevant connections. Press the Clear (X) button to return to the full graph view.
Examples
Entering +customer_orders+ shows customer_orders along with all its sources and consumers. Using +2,daily_sales focuses on daily_sales and its upstream dependencies up to two levels deep.
If you're familiar with dbt's --select syntax, you'll feel right at home with these selectors.
Interactive Highlighting
Path Focus
Hovering over any node or edge highlights the connected path. In-path elements remain fully visible, while the rest of the graph dims to 25% opacity in grayscale. This immediate visual focus makes it easy to trace specific pipelines even within complex graphs.
Node Interaction
Clicking Nodes
Clicking any node triggers a fast, single-page navigation to that product's detail page. You can use your browser's back button to return to the lineage view preserving your state.
Node Information
Nodes display the product name and a color-coded type icon, providing a quick visual reference for the product's role in the pipeline.
Orphan Products
Orphan Cluster
Products with no lineage connections are grouped in a separate "Orphan Cluster" positioned below the main graph. This area uses a grid-based layout to clearly separate these disconnected items from the main pipeline.
Why Orphans Exist
A product might appear here if its lineage hasn't been curated yet, if it's a standalone data source, or if it was newly created. To integrate an orphan, open its detail page and add connections via the Lineage tab.
Graph Controls
Navigation Controls
Controls in the graph corner allow you to Zoom In and Zoom Out to adjust the view, Fit View to automatically frame all nodes, and Reset to return to the default zoom and position.
Minimap
A minimap in the corner provides an overview of the entire graph, showing your current viewport position. You can click or drag within the minimap to quickly pan across large graphs.
Performance Considerations
Large Spaces
For spaces with many products, the initial load may take a moment. Using search and selectors to focus on relevant areas, or filtering by tags and types, can help manage complexity.
Browser Performance
If the graph feels slow, try reducing the number of visible nodes using selectors, closing other processor-intensive browser tabs, or refreshing the page to reset the graph state.
Use Cases
Pipeline Architecture Review
The global view is ideal for reviewing overall data architecture. It helps identify data silos (unconnected clusters), locate bottleneck products with excessive downstream consumers, and spot missing connections in your pipeline.
Onboarding
New team members can use the global view to understand major data flows, identify key source and consumer products, and use selectors to explore specific pipelines without getting lost.
Impact Assessment
Before making changes, find the product in the global view and use +product_name+ to visualize its full impact. This helps identify all affected downstream consumers so you can plan communication and coordination effectively.
Best Practices
Regular Review
Periodically review the global lineage to ensure connections are up to date, identify orphan products that require curation, and spot any unexpected patterns or gaps in your data architecture.
Collaborative Curation
Lineage is most valuable when complete. Encourage teams to maintain their products' lineage, include lineage review in onboarding processes, and use the global view in governance meetings to drive visibility and accountability.