Skip to content See Autonomous Agents In Action
CASE STUDIES

A Global Financial Services Leader Builds the Foundation for Agentic Data Operations

financial services

The Situation

When one of the world’s leading global financial institutions set out to modernize their hybrid Snowflake, Databricks, and on-premises data infrastructure, moving to a medallion architecture across Databricks, Kafka, and Airflow, they quickly ran into a problem every large enterprise faces at this stage: the data environment had outpaced the team’s ability to monitor it manually.

Data quality checks were largely reactive. Issues surfaced after they’d already propagated downstream. Engineers described the experience as “seeing the vehicle with no wheels.” By the time they spotted a problem, something had already broken. With hundreds of tables spread across cloud and on-premise systems, plans to implement thousands of validation rules, and strict regulatory requirements (including mandatory audit trails and compliance controls), the team needed a fundamentally different approach.

The question wasn’t whether to automate, it was which platform could scale to match their environment without requiring years of configuration work upfront.

The Decision

The team selected Anomalo and deployed it in three phases designed to accelerate time-to-value while managing the complexity of their hybrid environment.

  • Phase 1 — Foundation: They started with Anomalo’s native Snowflake app, configuring critical transaction and reference data tables with AI-based anomaly detection alongside custom validation rules. No rules library to build from scratch. Anomalo’s profiling engine learned what normal looked like for their data automatically.
  • Phase 2 — Scale: Using Anomalo’s API, they integrated data quality checks directly into their pipelines, creating automated quality gates inside their medallion architecture. Anomalo’s AI assistant (AIDA) accelerated rule creation and helped surface profiling insights faster than manual analysis would have allowed.
  • Phase 3 — Hybrid coverage: The team transitioned to a containerized deployment on OpenShift, enabling a single Anomalo instance to monitor both cloud and on-premise systems simultaneously, eliminating fragmented coverage and providing unified visibility across their full data estate.

Thirty days from kickoff to live in Snowflake, in a highly regulated environment with multiple security approval gates. That timeline surprised the team.

What Changed

  • End-to-end ETL visibility. Before Anomalo, the team had no reliable way to confirm whether ETL processes had completed successfully and produced quality data. After deployment, they could see correlated signals across both source systems and Anomalo. If a pipeline failed in Snowflake, a corresponding alert appeared in Anomalo. Engineers stopped chasing process failures blind and started resolving them with full context.
  • Automated quality gates in production pipelines. Using Anomalo’s API integration, data quality checks became part of the pipeline itself, rather than a downstream audit step. Bad records were automatically quarantined before reaching downstream consumers, and clean data continued flowing without manual intervention.
  • Regulatory compliance infrastructure, built in. Anomalo’s audit trails, automated documentation of quality checks, and integration with their metadata management platform gave the compliance team what they needed without building a parallel documentation process by hand.
  • A platform the team could actually expand. The shift to containerized deployment didn’t just solve the hybrid coverage problem, it created a replicable pattern for connecting additional on-premise systems over time. This gave the data governance program a clear path to scale.

Looking Ahead

With production deployment established and monitoring coverage expanding, the team is now exploring Anomalo’s proactive Insights capabilities, moving from reactive quality monitoring toward autonomous surfacing of meaningful changes across their most critical datasets. The same foundation built for compliance and pipeline reliability becomes, over time, the infrastructure for a self-driving data operation.

The bottom line: In a highly regulated, hybrid-cloud environment where manual quality processes couldn’t keep pace with the scale of transformation, Anomalo delivered automated monitoring, end-to-end pipeline visibility, and the compliance controls to support enterprise data governance. They did it in 30 days, and without rebuilding their data stack.

Request a Demo Contact Us