Integration Overview
Qarion connects to your existing data infrastructure to synchronize metadata, manage access, and track lineage.
Supported Platforms
Supported Platforms
Data Warehouses
Qarion supports major cloud data warehouses including Snowflake, BigQuery, Databricks, and Redshift. For all these platforms, Qarion provides full metadata synchronization, quality checks, and lineage discovery by parsing SQL history and view definitions.
Databases
You can connect to standard relational databases such as PostgreSQL, MySQL, and SQL Server. These integrations support metadata synchronization and quality checks but do not currently support automated lineage discovery.
Transformation Tools
Qarion integrates with transformation engines like dbt and Airflow. The dbt integration synchornizes metadata and lineage from your dbt project, while Airflow integration captures lineage from DAG executions.
Cloud Storage
For object storage, Qarion connects to AWS S3, Azure Blob Storage, and Google Cloud Storage. These integrations primarily support metadata synchronization to inventory your data files.
Integration Components
Metadata Sync
The system automatically imports your database structure, including schemas, tables, views, and columns. It captures technical details such as data types, primary keys, foreign key relationships, and existing descriptions.
Quality Checks
You can run validations directly against your source data. This includes standard checks for Freshness (when was data last updated) and Volume (row counts), as well as Custom SQL checks for domain-specific validation rules.
Lineage Discovery
Qarion traces data flow by analyzing various sources: it parses SQL in view definitions and query logs, interprets dbt model dependencies, and extracts metadata from ETL job configurations.
Getting Started
Quick Setup Process
To get started, navigate to Source Systems and register a new platform. You will need to configure secure connection details, test the connectivity, and run an initial sync to import your metadata. Finally, map the imported assets to your Data Products to begin governance.
Prerequisites
Before connecting, ensure network access is configured (firewalls, VPNs), a service account with read permissions is available, and any necessary IP whitelisting or SSL certificates are in place.
Platform-Specific Guides
Detailed setup instructions are available for specific platforms:
- Snowflake Integration — Connect to Snowflake data warehouse
- BigQuery Integration — Connect to Google BigQuery
- dbt Integration — Import dbt project metadata and lineage
General Connection Settings
Common Parameters
Most integrations require standard connection details: the Host address or endpoint, the specific Port for the service, the default Database or catalog name, and optionally a default Schema.
Authentication Methods
Supported authentication varies by platform. Username/Password is common for standard databases. Key Pair authentication is supported for Snowflake and Databricks. Service Accounts are used for Google Cloud services (BigQuery, GCS), and OAuth or API Tokens are available for platforms like Snowflake, dbt Cloud, and Airflow.
Security Best Practices
Service Accounts
Always create dedicated service accounts for Qarion (e.g., qarion_sync_service) rather than using personal credentials. This ensures continuity and allows you to grant only the minimum required permissions.
Credential Storage
Credentials in Qarion are encrypted at rest. Access is strictly controlled—only administrators can view or edit connection details. You should rotate these credentials periodically according to your security policies.
Network Security
Secure your connections by using SSL/TLS encryption. Where possible, restrict access to specific IP ranges to further lock down your database endpoints.
Troubleshooting
Connection Failures
If a connection fails, first check network connectivity (can you ping the host?). Then verify firewall rules and security groups, ensure credentials are valid, and check that SSL/TLS certificates are correctly configured.
Sync Issues
Common causes:
- Permission denied on specific schemas
- Timeout on large metadata sets
- Rate limiting from platform
Missing Metadata
If tables or columns are missing:
- Verify schema access permissions
- Check if objects are excluded by filters
- Trigger a manual resync
Related Documentation
- Source Systems — General source system management
- Data Lineage — Understanding data flow