Dagster logo

Dagster

by Elementl · Since 2018
No reviews yet
ActiveAvailable globallyCloud
Quick facts
VendorElementl
Year launched2018
StatusActive
Location2 Embarcadero Ctr San Francisco, CA 94111
Countries servedGlobal
Languages9
Integrations5+
Free tier
Free trial
Contact sales

About Dagster

Dagster is a data orchestrator platform from Elementl that helps users build, schedule, and monitor reliable data pipelines. It combines data orchestration, data catalog, and data quality so teams can efficiently manage their data workflows. Designed for AI and modern data platforms, Dagster supports integrated observability to provide insights into the data processes. With its flexible architecture, users can adapt it to their specific requirements, enabling effective collaboration among team members. Users can easily navigate data dependencies and ensure data integrity throughout the pipeline. Key capabilities: data orchestration data catalog data quality integrated observability flexible architecture Best for: teams that need to build and manage complex data pipelines effectively.

Dagster, developed by Elementl, represents a significant evolution in the orchestration of data and AI pipelines by shifting from a traditional task-based mindset to an asset-oriented paradigm. Unlike legacy orchestrators that focus on executing isolated tasks, Dagster treats data assets—such as tables, files, or machine learning models—as first-class citizens, enabling more meaningful monitoring, lineage tracking, and quality control. This approach is reflected throughout its architecture, making it especially powerful for organizations aiming to build robust, transparent, and scalable data platforms. At the heart of this experience is Dagit, the intuitive user interface that not only provides visually rich asset catalogs and lineage graphs but also supports detailed run views, Gantt charts, and configuration editors. Whether reviewing job runs, inspecting event logs, or managing sensors and schedules, Dagit ensures that both developers and less technical stakeholders can navigate and utilize the platform effectively. Functionally, Dagster stands out with its built-in data quality features such as freshness checks, asset health validations, and integration with dbt for transformation testing.

Pros & Cons

What users like
  • +Unified Control Plane: Centralized platform for building, scaling, and observing AI and data pipelines.
  • +Data-Aware Orchestration: Models data assets with built-in lineage, metadata, and quality checks.
  • +Developer-Friendly: Supports local testing, branch deployments, and reusable components.
  • +Integrated Catalog: Offers real-time insights, asset health monitoring, and cost transparency.
  • +Flexible Integrations: Works with S3, Snowflake, PowerBI, dbt, and more.
  • +Enterprise-Ready: Scales across teams with governance and observability features.
  • +Fast Setup: Declarative workflows and CI/CD-native architecture streamline deployment.
  • +Strong Community & Resources: Includes tutorials, webinars, and Dagster University for learning.
What users flag
  • Learning Curve: Asset-based orchestration and declarative design may require onboarding time.
  • Newer Ecosystem: Compared to legacy tools like Airflow, Dagster’s plugin ecosystem is still growing.
  • Limited Non-Python Support: Primarily built for Python-based workflows.

Features

Key features

Asset-Oriented Orchestration
Models data assets (tables, files, ML models) as first-class citizens, enabling built-in catalog, lineage, and cost insights. 📊
Software Engineering Best Practices for Data
Built for local testing, branch deployments, and reusable components to improve developer experience. 💻
Unified Control Plane
Provides platform-wide visibility, governance, and quality assurance across teams, eliminating data silos. 🤝
Integrated Data Quality & Observability
Automatically tracks data health, lineage, and issues, ensuring data integrity and transparency. ✅
Flexible Integrations
Connects seamlessly with any data stack, from S3 to Snowflake, PowerBI, and various modern data tools. 🔗
AI & Machine Learning Pipeline Support
Optimized for building, scaling, and observing complex AI and ML workflows. 🤖

Additional features

Data Orchestration
Core capability for building, scaling, and observing AI and data pipelines.
Data Catalog
Built-in catalog for data assets, tables, files, ML models, and notebooks.
Data Quality
Features for monitoring data quality, freshness, and asset health, including asset checks and metadata-bound checks.
Cost Insights
Provides insights into the cost of materializing individual assets and longitudinal reporting on metadata.
Components
Reusable components to focus on business logic and reduce boilerplate.
Integrations
Built-in integrations with a wide range of tools (e.g., S3, Snowflake, PowerBI, dbt, Airbyte, Fivetran, Tableau, Looker, Sigma, Spark, Pandas, Atlan).
Enterprise Features
Includes features like role-based access management, component-level isolation, and SAML-based SSO.
Workflows (ETL/ELT Pipelines)
Supports the creation and management of ETL/ELT data pipelines.
Workflows (AI & Machine Learning)
Optimized for building and orchestrating AI and machine learning workflows.
Data Modernization
Tools to facilitate the modernization of data platforms.
Data Products
Capabilities for building and shipping data products faster.
Local Testing
Allows developers to test pipelines locally in any dev stage.
Branch Deployments
Supports automatic deployment to staging environments for confident shipping.
Platform-wide Visibility
Provides a unified view across teams for collaboration and governance.
Data Lineage
Automatically tracks and documents the complete lineage of data assets.
Metadata & Source Observability
Centralizes metadata with built-in observability and diagnostics.
Automated Tracking & Auditing
Automatically tracks, documents, and audits datasets for integrity and compliance.
Declarative Automation
Uses new AutomationCondition APIs for automated materialization and pipeline control.
Self-Service Capabilities
Enables teams to focus on business logic by providing reusable components.
Monitoring & Optimization
Tools to automate, monitor, and optimize data pipelines.
Testability
Built with software engineering best practices for testing code in any stage, not just production.
Debugging Tools
Features for debugging runs, querying logs, and viewing Gantt charts for diagnostics.
Alerting System
Can trigger Slack or email notifications on run failure/success, schedule/sensor tick failure.
Run Queuing
Manages the queuing and launching of runs and backfill jobs.
Scheduler Daemon
Manages schedules, sensors, and run queuing, and performs periodic tasks.
Sensor Daemon
Creates runs based on external state changes (e.g., new file, external system issues).
Run Monitoring Daemon
Handles run worker failures.
Run Launchers
Supports various execution engines like Kubernetes, Amazon ECS, and custom launchers.
Asset Versioning and Caching
Manages versions of code and data, with caching capabilities.
Data Quality Checks
Specific checks associated with assets to ensure expectations are met.
Declarative Programming Model
Defines data assets and their computation through a declarative approach.
Type Safety
Strong typing system to validate inputs and outputs of each operation (op).
Unified Codebase
Facilitates a single, unified codebase for data platforms.
ETL Course
Offers a free, comprehensive seven-lesson course on ETL implementation with Dagster.
Open-Source Project
Available as an open-source library for building data systems.

Pricing

Free trial
Free version
Request a quote
Promo Offer

Monthly plans

Solo Plan

USD 10

Countries & Languages

Global
Countries served
9
Interface languages
10
Billing currencies

Interface languages

EnglishSpanishFrenchGermanItalianPortugueseChineseJapaneseKorean

Billing currencies

🇺🇸USD🇪🇺EUR🇬🇧GBP🇯🇵JPY🇦🇺AUD🇨🇦CAD🇨🇭CHF🇨🇳CNY🇭🇰HKD🇳🇿NZD

No reviews yet

Be the first to drop a review

Alternatives to Dagster

Instabug logo

Instabug

Instabug is a mobile observability platform from Luciq that changes app quality into business outcomes…

Datadog logo

Datadog

Datadog is an observability and security platform from Datadog that lets users see inside any…

C

Calyptia Core

Calyptia Core is a data observability platform from Calyptia that helps organizations monitor and manage…

EV Observe logo

EV Observe

[API Error: HTTPSConnectionPool(host='api.openai.com', port=44]

Hybrid Cloud Observability logo

Hybrid Cloud Observability

SolarWinds Observability Self-Hosted is a hybrid cloud observability platform from SolarWinds that helps manage the…

E

Edge Delta

Edge Delta is a observability software from Edge Delta that focuses on AI agent connectivity…

Often compared with Dagster

Compare any two tools →
Instabug logo
Instabug
Observability
0.0
Datadog logo
Datadog
Observability
0.0
C
Calyptia Core
Observability
0.0
EV Observe logo
EV Observe
Observability
0.0