Scaling Analysis Without Scaling the Team

October 20, 2025

Learn how to maximize the impact of your data stack with a lean team—keeping your analysts at the heart of every decision.

At Dagster, we’ve always believed in building the tools we need ourselves. That’s how Dagster began. We saw that existing data platforms weren’t powerful enough to support a company without a dedicated team of experts.

Internally, we’ve always run a lean data team. In fact, our internal Dagster platform, managing over a thousand assets and several thousand materializations each day, is maintained by a single dedicated data engineer (you can see our internal Dagster platform).

More recently, I joined as Dagster’s first dedicated data analyst. The team was thrilled to have someone who could dig deeper into our data platform. But we quickly realized that Dagster alone didn’t give me the same ability to scale my analytical workflows.

The Problem

As I settled into the role, it quickly became clear just how much of a backlog was waiting. Our data platform had scaled impressively in terms of infrastructure, but that didn’t necessarily make human analysis any easier.

At the same time, something remarkable was happening in AI. Large language models weren’t just getting smarter; their ability to process massive amounts of context meant they could finally hold the entire shape of a data platform in their “mind” at once.

For the first time, it seemed possible to build an assistant that truly understood our data environment, rather than one that simply guessed at SQL snippets.

So we started experimenting. What if analysis could scale the same way Dagster scales pipelines? What if I could create a context-aware copilot for data—one that everyone at the company could use? That prototype became Compass.

How We Use Compass

As Compass’s first user, I’ve seen firsthand how the process evolved. In the beginning, most of its answers needed to be double-checked manually. That experience convinced us to focus more on building around a centralized context store.

Instead of having Compass infer the important parts of our data stack, I could now curate everything myself: the tables that should be used and the business rules that govern them.

This kind of institutional knowledge is critical, yet it rarely fits neatly into traditional database systems. It’s where analysts bring the most value. With Compass, though, I was able to design workflows that felt like an extension of my own intuition.
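To make the idea of a curated context store concrete, here is a minimal sketch in Python. It is not Compass's actual data model; the class names, the `accounts` table, and the example rule are all hypothetical. It only illustrates the two things described above: a list of approved tables and the business rules attached to them, flattened into text an assistant could be given.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class TableEntry:
    """A table the analyst has approved for use (hypothetical schema)."""
    name: str
    description: str


@dataclass(frozen=True)
class BusinessRule:
    """A plain-language rule governing how one table should be queried."""
    table: str
    rule: str


@dataclass
class ContextStore:
    """Illustrative in-memory store; an assumption, not Compass's implementation."""
    tables: dict[str, TableEntry] = field(default_factory=dict)
    rules: list[BusinessRule] = field(default_factory=list)

    def add_table(self, name: str, description: str) -> None:
        self.tables[name] = TableEntry(name, description)

    def add_rule(self, table: str, rule: str) -> None:
        # Rules may only reference tables that were curated first.
        if table not in self.tables:
            raise KeyError(f"unknown table: {table}")
        self.rules.append(BusinessRule(table, rule))

    def render(self) -> str:
        """Flatten the curated context into text an assistant could be given."""
        lines = []
        for entry in self.tables.values():
            lines.append(f"TABLE {entry.name}: {entry.description}")
            for r in self.rules:
                if r.table == entry.name:
                    lines.append(f"  RULE: {r.rule}")
        return "\n".join(lines)


# Made-up example content, for illustration only.
store = ContextStore()
store.add_table("accounts", "One row per customer account")
store.add_rule("accounts", "Exclude internal test accounts from revenue metrics")
print(store.render())
```

Keeping rules tied to explicitly curated tables is the design choice that matters here: the assistant is handed what the analyst approved rather than left to infer it.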

As Compass matured, it began supporting more of my workflows, and the number of use cases it could handle continued to grow:

Conversations with the Data

The most common scenario I see now is someone doing exploratory analysis on their own. This might start with a single question or evolve into a long back-and-forth as users test and refine a hypothesis.

It’s not uncommon to browse the Compass channel and find threads thirty messages deep, each one a conversation that might once have taken up my entire day.

Multi-User Conversations

One major advantage of chat-based data tools like Compass, compared to traditional dashboards, is how naturally they support collaborative, data-driven discussions. A user can start a conversation with Compass and, once they uncover something interesting, bring in other team members.

That might mean pulling me in to double-check a SQL query, or inviting a sales manager to talk through a trend.

Having these conversations within the same tool everyone already uses makes collaboration far smoother than asking people to switch between platforms or deal with extra logins.

Expertise and Context

Every Compass conversation is backed by the centralized context store, which means every question is grounded in the company’s institutional knowledge. Over time, that knowledge base reinforces itself.

For example, a user might add their own insight such as noting that “there should be a custom Salesforce field that maps to customer service tier.” Compass routes that contribution back to me so I can review and approve it before it becomes part of the shared context store.

Similarly, if Compass encounters a question it can’t answer, I’m automatically notified. That ensures I’m involved when necessary but not bogged down by questions the platform can already handle on its own.
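The review loop described in this section can be sketched roughly as follows. This is an illustration under stated assumptions, not Compass's API: `ReviewQueue`, `Contribution`, and the inbox list are hypothetical names, and the analyst notification is modeled as a plain list of strings.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    PENDING = "pending"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class Contribution:
    author: str
    note: str
    status: Status = Status.PENDING


@dataclass
class ReviewQueue:
    """Hypothetical review loop; all names are illustrative assumptions."""
    shared_context: list[str] = field(default_factory=list)
    pending: list[Contribution] = field(default_factory=list)
    analyst_inbox: list[str] = field(default_factory=list)

    def submit(self, author: str, note: str) -> Contribution:
        # A user's insight is queued and the analyst is told about it;
        # nothing reaches the shared context yet.
        contribution = Contribution(author, note)
        self.pending.append(contribution)
        self.analyst_inbox.append(f"Review requested: {note}")
        return contribution

    def review(self, contribution: Contribution, approve: bool) -> None:
        # Only an explicit approval promotes the note into shared context.
        self.pending.remove(contribution)
        if approve:
            contribution.status = Status.APPROVED
            self.shared_context.append(contribution.note)
        else:
            contribution.status = Status.REJECTED

    def report_unanswered(self, question: str) -> None:
        # Questions the assistant cannot answer are escalated to the analyst.
        self.analyst_inbox.append(f"Unanswered: {question}")


queue = ReviewQueue()
suggestion = queue.submit(
    "user_a",
    "There should be a custom Salesforce field that maps to customer service tier",
)
queue.review(suggestion, approve=True)
queue.report_unanswered("A question the assistant could not answer")
print(queue.shared_context)
```

The point of the sketch is the gate: a suggestion and an unanswered question both reach the analyst, but only an approved suggestion changes what everyone else's questions are grounded in.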

Prototyping and Onboarding

Once Compass became the de facto expert on our company’s data, nearly every data-related project started with it. Many Dagster team members now use Compass to write and refine queries before using them elsewhere. Even new dashboards and features typically begin life as Compass-powered prototypes.

Scheduling and Integration

Because Compass is built on top of Slack’s API, it can take advantage of everything Slack offers for normal conversations. You can schedule reports, request recurring analyses, or even tie insights directly to upcoming meetings and events—all without leaving the chat.
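As one illustration of what scheduling through Slack can look like, the sketch below builds the arguments for Slack's `chat.scheduleMessage` Web API method, which takes a `channel`, a `text`, and a `post_at` Unix timestamp. The post does not say which Slack features Compass uses internally, so treat this as an assumption; the helper names and the channel ID are hypothetical, and no network call is made.

```python
from datetime import datetime, timedelta, timezone


def next_weekday_at(now: datetime, weekday: int, hour: int) -> datetime:
    """Return the next `weekday` (Monday=0) at `hour`:00, strictly after `now`."""
    candidate = now.replace(hour=hour, minute=0, second=0, microsecond=0)
    candidate += timedelta(days=(weekday - now.weekday()) % 7)
    if candidate <= now:
        # Today's slot has already passed, so move to next week.
        candidate += timedelta(days=7)
    return candidate


def build_scheduled_report(channel: str, text: str, now: datetime) -> dict:
    """Build arguments for one scheduled post on the next Monday at 09:00."""
    post_at = next_weekday_at(now, weekday=0, hour=9)
    return {"channel": channel, "text": text, "post_at": int(post_at.timestamp())}


# Hypothetical channel ID and report text, for illustration only.
now = datetime.now(timezone.utc)
payload = build_scheduled_report("C0000000000", "Weekly usage report", now)
print(payload)
```

A single scheduled message is a one-off, so a recurring report would need to be rescheduled after each post or driven by a separate scheduler.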

Lessons Learned

Our goal in designing a data copilot was the same as our goal with Dagster itself. We’ve always understood the importance of data practitioners, and we never wanted to remove the need for an analyst. Instead, we wanted to scale their workflows.

If anything, I now feel more connected to the data than I did before Compass. Back then, I was constantly jumping between requests, trying to keep up. Now, everything is centralized. I have standardized workflows that ensure people get accurate answers, and I can spend more time on meaningful analysis.

The more we’ve built Compass, the more we’ve relied on it. Along the way, a few lessons have stood out:

  • Scale without scaling the team. Compass allows me to focus on higher-order work instead of drowning in one-off requests.
  • Conversations beat dashboards. The fastest path to insight often isn’t another chart; it’s a question and a thoughtful back-and-forth.
  • Organizational memory matters. Compass remembers the reasoning behind analyses, so we’re never starting from scratch.
  • Humans and AI are stronger together. Compass provides technical leverage, and I bring nuance and understanding. Neither is enough on its own, but together they’re transformative.

Why It Matters

Traditionally, growing data needs have meant growing the data team. Compass broke that equation. It lets us scale our ability to answer questions and generate insights without increasing headcount at the same pace.

Compass began as our internal data copilot, built to solve our own bottlenecks. Today, it’s becoming much more than that. As we bring other organizations online with Compass, we’re seeing how the right combination of people, platforms, and AI can transform data teams.
