Databand.ai
Active

Machine learning development orchestration platform

AI Enablers, Development Platform, Monitor
Artificial Intelligence Machine Learning

Business Overview

Data engineers manage the backbone for modern data products. They build the foundations for all data science and analytics efforts. But for the average data engineer, it’s a challenge to make sure jobs are running successfully and data is up to quality standards. For companies whose revenue and operations depend on accurate, on-time data flows, that’s a huge problem.

We built Databand to help data engineers monitor and manage their DAGs. Databand is the only observability solution plugged into the open source ecosystem of solutions that all leading teams are using. Because we are so integrated, we can provide a deeper understanding of how your infra is performing, how much it’s costing you, and how accurate the data is so that you can unlock data engineering productivity.

Founded
December 2017
Employees
17
Business model
B2B
Offering type
Software
Funding stage
Seed
Sectors
AI Enablers, Development Platform, Monitor, Technologies

Founders

Databand.ai
Victor Shafran
CPO
AI Expert

Clients and Case Studies

OptimalIQ
Pagaya
Orbotech

AI Technology Stack

AI Description

Databand.ai platform orchestrates ML creation and data processing within organizations and provides visibility to data scientists and engineers involved in the process. The platform streamlines the integration, productization, and testing of ML pipelines, thus enabling the different stakeholders to work together on ML projects in an efficient, frictionless, way.

 

These are some recent contributions we’ve made to Apache Airflow:

InProcess Executor

Together with our friends from Polidea we created a new executor useful for debugging and DAG development purposes. This executor executes single task instance at time and is able to work with SQLite and sensors.

Scheduler Optimizations

Working with Polidea, we’ve made major progress in optimizing Airflow scheduler performance. In total, tests are showing 10x faster query performance with over 2000 fewer queries by count. See the list below for some of the optimizations that have been pushed (and counting):

[AIRFLOW-6856] Bulk fetch paused_dag_ids
[AIRFLOW-6857] Bulk sync DAGs
[AIRFLOW-6862] Do not check the freshness of fresh DAG
[AIRFLOW-6869] Bulk fetch DAGRuns for _process_task_instances
[AIRFLOW-6881] Bulk fetch DAGRun for create_dag_run
[AIRFLOW-6887] Do not check the state of fresh DAGRun

AI employees
3
AI application
Vertical AI
AI types
Artificial Intelligence, Machine Learning
AI tools
Azure Machine Learning, Databricks, Google Cloud, mlFlow, Spark

Similar Startups

Related Investors

Sign up for free

  • Save profiles to your list
  • View all funding rounds
  • Advanced sorting filter access
  • Search investment rounds
  • Search exit events

Sign up for Pro

  • Export lists (CSV, Excel) of startups and investors
  • View newly added AI startups
  • View investor emails
  • View investment funds and rounds sources
  • Advanced filters, sorting and search
  • Starting at just $15 per month