Dagster

by Elementl · Since 2018

No reviews yet

ActiveAvailable globallyCloud

Quick facts

VendorElementl

Year launched2018

StatusActive

Location2 Embarcadero Ctr San Francisco, CA 94111

Countries servedGlobal

Languages9

Integrations5+

Free tierN/A

Free trialN/A

Contact salesN/A

About Dagster

Dagster is a data orchestrator platform from Elementl that helps users build, schedule, and monitor reliable data pipelines. It combines data orchestration, data catalog, and data quality so teams can efficiently manage their data workflows. Designed for AI and modern data platforms, Dagster supports integrated observability to provide insights into the data processes. With its flexible architecture, users can adapt it to their specific requirements, enabling effective collaboration among team members. Users can easily navigate data dependencies and ensure data integrity throughout the pipeline. Key capabilities: data orchestration data catalog data quality integrated observability flexible architecture Best for: teams that need to build and manage complex data pipelines effectively.

Dagster, developed by Elementl, represents a significant evolution in the orchestration of data and AI pipelines by shifting from a traditional task-based mindset to an asset-oriented paradigm. Unlike legacy orchestrators that focus on executing isolated tasks, Dagster treats data assets—such as tables, files, or machine learning models—as first-class citizens, enabling more meaningful monitoring, lineage tracking, and quality control. This approach is reflected throughout its architecture, making it especially powerful for organizations aiming to build robust, transparent, and scalable data platforms. At the heart of this experience is Dagit, the intuitive user interface that not only provides visually rich asset catalogs and lineage graphs but also supports detailed run views, Gantt charts, and configuration editors. Whether reviewing job runs, inspecting event logs, or managing sensors and schedules, Dagit ensures that both developers and less technical stakeholders can navigate and utilize the platform effectively. Functionally, Dagster stands out with its built-in data quality features such as freshness checks, asset health validations, and integration with dbt for transformation testing.

Pros & Cons

Pros

Unified Control Plane: Centralized platform for building, scaling, and observing AI and data pipelines.
Data-Aware Orchestration: Models data assets with built-in lineage, metadata, and quality checks.
Developer-Friendly: Supports local testing, branch deployments, and reusable components.
Integrated Catalog: Offers real-time insights, asset health monitoring, and cost transparency.
Flexible Integrations: Works with S3, Snowflake, PowerBI, dbt, and more.
Enterprise-Ready: Scales across teams with governance and observability features.
Fast Setup: Declarative workflows and CI/CD-native architecture streamline deployment.
Strong Community & Resources: Includes tutorials, webinars, and Dagster University for learning.

Cons

Learning Curve: Asset-based orchestration and declarative design may require onboarding time.
Newer Ecosystem: Compared to legacy tools like Airflow, Dagster’s plugin ecosystem is still growing.
Limited Non-Python Support: Primarily built for Python-based workflows.

Features

Key features

Asset-Oriented Orchestration

Models data assets (tables, files, ML models) as first-class citizens, enabling built-in catalog, lineage, and cost insights. 📊

Software Engineering Best Practices for Data

Built for local testing, branch deployments, and reusable components to improve developer experience. 💻

Unified Control Plane

Provides platform-wide visibility, governance, and quality assurance across teams, eliminating data silos. 🤝

Integrated Data Quality & Observability

Automatically tracks data health, lineage, and issues, ensuring data integrity and transparency. ✅

Flexible Integrations

Connects seamlessly with any data stack, from S3 to Snowflake, PowerBI, and various modern data tools. 🔗

AI & Machine Learning Pipeline Support

Optimized for building, scaling, and observing complex AI and ML workflows. 🤖

Additional features

Data Orchestration

Core capability for building, scaling, and observing AI and data pipelines.

Data Catalog

Built-in catalog for data assets, tables, files, ML models, and notebooks.

Data Quality

Features for monitoring data quality, freshness, and asset health, including asset checks and metadata-bound checks.

Cost Insights

Provides insights into the cost of materializing individual assets and longitudinal reporting on metadata.

Components

Reusable components to focus on business logic and reduce boilerplate.

Integrations

Built-in integrations with a wide range of tools (e.g., S3, Snowflake, PowerBI, dbt, Airbyte, Fivetran, Tableau, Looker, Sigma, Spark, Pandas, Atlan).

Enterprise Features

Includes features like role-based access management, component-level isolation, and SAML-based SSO.

Workflows (ETL/ELT Pipelines)

Supports the creation and management of ETL/ELT data pipelines.

Workflows (AI & Machine Learning)

Optimized for building and orchestrating AI and machine learning workflows.

Data Modernization

Tools to facilitate the modernization of data platforms.

Data Products

Capabilities for building and shipping data products faster.

Local Testing

Allows developers to test pipelines locally in any dev stage.

Branch Deployments

Supports automatic deployment to staging environments for confident shipping.

Platform-wide Visibility

Provides a unified view across teams for collaboration and governance.

Data Lineage

Automatically tracks and documents the complete lineage of data assets.

Metadata & Source Observability

Centralizes metadata with built-in observability and diagnostics.

Automated Tracking & Auditing

Automatically tracks, documents, and audits datasets for integrity and compliance.

Declarative Automation

Uses new AutomationCondition APIs for automated materialization and pipeline control.

Self-Service Capabilities

Enables teams to focus on business logic by providing reusable components.

Monitoring & Optimization

Tools to automate, monitor, and optimize data pipelines.

Testability

Built with software engineering best practices for testing code in any stage, not just production.

Debugging Tools

Features for debugging runs, querying logs, and viewing Gantt charts for diagnostics.

Alerting System

Can trigger Slack or email notifications on run failure/success, schedule/sensor tick failure.

Run Queuing

Manages the queuing and launching of runs and backfill jobs.

Scheduler Daemon

Manages schedules, sensors, and run queuing, and performs periodic tasks.

Sensor Daemon

Creates runs based on external state changes (e.g., new file, external system issues).

Run Monitoring Daemon

Handles run worker failures.

Run Launchers

Supports various execution engines like Kubernetes, Amazon ECS, and custom launchers.

Asset Versioning and Caching

Manages versions of code and data, with caching capabilities.

Data Quality Checks

Specific checks associated with assets to ensure expectations are met.

Declarative Programming Model

Defines data assets and their computation through a declarative approach.

Type Safety

Strong typing system to validate inputs and outputs of each operation (op).

Unified Codebase

Facilitates a single, unified codebase for data platforms.

ETL Course

Offers a free, comprehensive seven-lesson course on ETL implementation with Dagster.

Open-Source Project

Available as an open-source library for building data systems.

Pricing

Free trial

Free version

Request a quote

Promo Offer

Monthly plans

Solo Plan

USD 10/mo

billed monthly

Countries & Languages

Global

Countries served

Interface languages

Billing currencies

Interface languages

EnglishSpanishFrenchGermanItalianPortugueseChineseJapaneseKorean

Billing currencies

🇺🇸USD🇪🇺EUR🇬🇧GBP🇯🇵JPY🇦🇺AUD🇨🇦CAD🇨🇭CHF🇨🇳CNY🇭🇰HKD🇳🇿NZD

Reviews

No reviews yet

Be the first to drop a review

Alternatives to Dagster

Instabug

Instabug is a mobile observability platform from Luciq that changes app quality into business outcomes…

Datadog

Datadog is an observability and security platform from Datadog that lets users see inside any…

OpManager Nexus

OpManager Nexus by ManageEngine is an advanced observability software platform designed to provide centralized monitoring…

SolarWinds User Experience Monitoring

SolarWinds User Experience Monitoring is a solution that combines real user monitoring (RUM) and synthetic…

Calyptia Core

Calyptia Core is a data observability platform from Calyptia that helps organizations monitor and manage…

EV Observe

[API Error: HTTPSConnectionPool(host='api.openai.com', port=44]

Spot something wrong or outdated?

Suggest a correction — a reviewer verifies every change.

About Dagster

Dagster Details

Vendor

Elementl

Year Launched

2018

Location

2 Embarcadero Ctr San Francisco, CA 94111

Deployment

cloud

Training Options

documentation, videos, demo, community

Countries Served

All Countries

Languages

English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Korean

Users

Data Engineers, Machine Learning Engineers, Data Scientists, Data Platform Engineers, DevOps Engineers, Analytics Engineers, AI/ML Ops Specialists, Software Engineers working on data infrastructure, Data Architects, Technical Product Managers

Industries Served

Finance, Software & Technology, Retail & E-commerce, Life Sciences & Biotech, Government & Public Sector, Media & Entertainment, Telecommunications, Healthcare, Education, Energy & Utilities

Dagster's In-App Market Place

Does Dagster have an in-app market place?

Yes

How many Mini-Apps in the marketplace?

Mini Apps

N/A

Pricing Options