Grafana Labs is a developer of a performance monitoring platform that assists organizations in achieving their monitoring, visualization, and observability objectives. The company offers an open and composable platform centered around Grafana, an open-source software designed for effective monitoring, metric analytics, and visualization. Grafana Labs provides commercial products such as Grafana Enterprise, which caters to the needs of large organizations with additional features and support, and Grafana Cloud, a hosted solution that integrates Prometheus and Graphite for metrics alongside Loki for log management. Additionally, the Grafana Accelerator Program fosters early-stage companies and projects within the Grafana ecosystem by offering resources such as free cloud accounts, subscriptions, cash grants, and access to core developers, thereby promoting innovation and collaboration in the field.
dbt Labs, founded in 2016 and based in Philadelphia, operates a software-as-a-service platform that specializes in open-source data engineering. The company's primary offering, the data build tool (dbt), empowers data analysts and engineers to organize, cleanse, and transform data for analysis using SQL. This platform facilitates data modeling, event tracking, and KPI measurement, enabling users to build efficient data transformation workflows. In addition to its core software, dbt Labs provides training and consulting services in data analytics, supporting organizations in leveraging their data more effectively. Through its mission to enhance the creation and dissemination of organizational knowledge, dbt Labs positions itself as a vital resource for businesses seeking to improve their data analytics capabilities.
Developer of an open-source platform designed to build bridges across the data analytics ecosystem. The company's platform connects data, hardware, and developers and builds multi-language across platform tools for accelerated data interchange and in-memory processing, enabling developers, data scientists, and data engineers to improve the toolchains and interoperability standards for data access, preparation, cleaning, analytics, and feature engineering.
Starburst Data is a data access and analytics company that develops an SQL query engine. It provides fast and interactive enterprise-ready distribution, consisting of additional tooling and configurations, enabling data analysts to run fast analytic queries against various data sources ranging in size from gigabytes to petabytes.
The company was founded in 2017 and is headquartered in Boston, Massachusetts. Starburst Data is a data access and analytics company developing an SQL-on-Anything analytics platform.
Dune Analytics AS is a data analytics platform focused on blockchain research, particularly for the Ethereum network. Founded in 2018 and based in Oslo, Norway, the company offers tools that enable users to query, extract, and visualize extensive data from the Ethereum blockchain. Its community-driven approach allows users to collaborate and share insights, making it a valuable resource for researchers, developers, and analysts in the blockchain space.
Airbyte is an open-source data integration platform that facilitates the synchronization of data from various applications, APIs, and databases to data warehouses. The platform enables users to automate data pipelines using pre-built or custom connectors, allowing for efficient data consolidation and analytics. By providing a flexible and user-friendly solution, Airbyte helps businesses effectively gather and manage their data, even from users who employ ad-blocking tools. This functionality supports a wide range of data-driven decision-making processes across different industries.
Cockroach Labs, Inc. specializes in developing open-source database software, notably CockroachDB, which is a distributed SQL database tailored for modern cloud applications. Founded in 2015 and headquartered in New York, the company also has an office in Cambridge, Massachusetts. CockroachDB is designed to scale horizontally and can withstand various types of failures, including those at the disk, machine, rack, and datacenter levels, while maintaining minimal latency and no need for manual intervention. It supports strongly-consistent ACID transactions and offers an SQL API for data management. The software is suitable for diverse clients, ranging from startups to Fortune 500 companies, and is utilized for enterprise-grade disaster recovery services. CockroachDB combines the rich features of SQL with the scalability typically associated with NoSQL databases, enabling developers to innovate rapidly without compromising on consistency. The platform is trusted by major enterprises across various sectors, including finance, retail, and media.
Dagster Labs develops an open-source Python library known as Dagster, which serves as a data orchestration platform for building modern data applications. The platform facilitates collaboration among infrastructure engineers, data engineers, and data scientists, enabling them to efficiently process and produce trusted data. Dagster offers a user-friendly programming model for constructing data pipelines and workflows, designed to integrate smoothly with existing tools and systems. Its incrementally-adoptable architecture allows clients to leverage their current infrastructure while enhancing their data management capabilities.
ClickHouse is an open-source, column-oriented online analytical processing (OLAP) database management system that enables users to generate real-time analytical reports using SQL queries. The system is designed for high-performance query processing, allowing enterprises to handle large volumes of data efficiently. By reducing storage requirements and processing significant amounts of data within short timeframes, ClickHouse offers a secure, reliable, and scalable solution for data management. This capability is particularly beneficial for organizations seeking to optimize their data analytics and streamline data processing workflows.
Domino Data Lab, Inc. offers an enterprise data science platform that supports both on-premise and cloud-based solutions for data analysis applications. The company's flagship products include Domino Cloud, a managed cloud infrastructure for running and scaling data models, and Domino On-Premise, which facilitates tracking, sharing, and auditing of analyses. Domino Data Lab serves a diverse range of industries, including financial services, insurance, media and technology, health and life sciences, manufacturing, retail, ecommerce, and consumer products. The platform is designed to empower data science teams, allowing organizations to manage and scale their data science efforts effectively. Notable clients like Allstate, Dell Technologies, and Bayer utilize Domino to enhance collaboration, accelerate research, and deliver impactful models. Founded in 2012 and based in San Francisco, California, Domino Data Lab was formerly known as Cerebro, Inc. and adopted its current name in February 2015.
Toro Data Labs, Inc., founded in 2018 and located in Daly City, California, specializes in data quality monitoring solutions. The company offers an application designed to help organizations ensure the integrity and reliability of their data. By leveraging their expertise in monitoring extensive data catalogs, Toro aims to provide effective tools for a broader audience, addressing the increasing need for data quality management in various industries.
Grafana Labs is a developer of a performance monitoring platform that assists organizations in achieving their monitoring, visualization, and observability objectives. The company offers an open and composable platform centered around Grafana, an open-source software designed for effective monitoring, metric analytics, and visualization. Grafana Labs provides commercial products such as Grafana Enterprise, which caters to the needs of large organizations with additional features and support, and Grafana Cloud, a hosted solution that integrates Prometheus and Graphite for metrics alongside Loki for log management. Additionally, the Grafana Accelerator Program fosters early-stage companies and projects within the Grafana ecosystem by offering resources such as free cloud accounts, subscriptions, cash grants, and access to core developers, thereby promoting innovation and collaboration in the field.
Unsupervised is an AI-based that finds hidden insights in complex data. It built the first AI-based on unsupervised learning that automatically discovers the most important patterns in data without requiring human guidance or supervision.
It was founded in 2017 and is headquartered in Boulder, Colorado.
Scale AI, Inc. is a data platform that specializes in providing high-quality training and validation data for artificial intelligence applications. Founded in 2016 and based in San Francisco, the company offers a range of annotation tools, including platforms for 3D sensor data, images, videos, text, and documents. Its products support various AI tasks such as content moderation, transcription, and data comparison, and are utilized across multiple sectors including autonomous vehicles, retail, conversational AI, and robotics. Scale AI's technology enables companies to streamline their machine learning processes, allowing teams to concentrate on developing innovative models rather than the labor-intensive task of data labeling. The company serves notable clients in the technology and transportation industries, enhancing the efficiency and effectiveness of AI development.
Starburst Data is a data access and analytics company that develops an SQL query engine. It provides fast and interactive enterprise-ready distribution, consisting of additional tooling and configurations, enabling data analysts to run fast analytic queries against various data sources ranging in size from gigabytes to petabytes.
The company was founded in 2017 and is headquartered in Boston, Massachusetts. Starburst Data is a data access and analytics company developing an SQL-on-Anything analytics platform.
PingCAP, Inc., founded in 2015 and based in Beijing, China, develops TiDB, an open-source distributed Hybrid Transactional/Analytical Processing (HTAP) database. TiDB is designed to provide a comprehensive solution for both online transactional processing (OLTP) and online analytical processing (OLAP), offering infinite horizontal scalability, strong consistency, and high availability. The database is cloud-native and features SQL compatibility, allowing for seamless integration with existing systems. By focusing on open-source technology, PingCAP aims to provide a robust platform for users seeking efficient and stable database management for their online transactions and analytical needs.
Haystack is a developer of business software aimed at enhancing organizational connectivity and communication. Its platform serves as a centralized hub where employees can access information about colleagues and teams, facilitating seamless collaboration. By integrating data from various enterprise systems, Haystack allows organizations to create unified profiles for individuals, teams, and projects. This approach not only improves alignment and engagement within large companies but also aids in scaling organizational culture and accelerating the onboarding process for new hires. The team at Haystack comprises professionals with backgrounds from leading technology firms, and the company has secured over $8 million in funding from notable investors.
Operator of an open banking platform intended to provide customized and automated financial services to clients. The company's platform offers a standardized bank data interface that is based on the concept of data sharing with user consent and provides tools to financial institutions to distribute their products on third-party platforms such as retailers and marketplaces, enabling clients to securely access and manage their bank account data.
Starburst Data is a data access and analytics company that develops an SQL query engine. It provides fast and interactive enterprise-ready distribution, consisting of additional tooling and configurations, enabling data analysts to run fast analytic queries against various data sources ranging in size from gigabytes to petabytes.
The company was founded in 2017 and is headquartered in Boston, Massachusetts. Starburst Data is a data access and analytics company developing an SQL-on-Anything analytics platform.
Domino Data Lab, Inc. offers an enterprise data science platform that supports both on-premise and cloud-based solutions for data analysis applications. The company's flagship products include Domino Cloud, a managed cloud infrastructure for running and scaling data models, and Domino On-Premise, which facilitates tracking, sharing, and auditing of analyses. Domino Data Lab serves a diverse range of industries, including financial services, insurance, media and technology, health and life sciences, manufacturing, retail, ecommerce, and consumer products. The platform is designed to empower data science teams, allowing organizations to manage and scale their data science efforts effectively. Notable clients like Allstate, Dell Technologies, and Bayer utilize Domino to enhance collaboration, accelerate research, and deliver impactful models. Founded in 2012 and based in San Francisco, California, Domino Data Lab was formerly known as Cerebro, Inc. and adopted its current name in February 2015.
Confluent, Inc. develops a data platform that focuses on real-time data integration, stream processing, and analytics for various sectors, including organizations and government entities. The company's core offering is based on Apache Kafka, an open-source technology that serves as a scalable and fault-tolerant messaging system, allowing businesses to collect and utilize data from diverse sources such as user activities, application metrics, and logs. Confluent also provides several additional solutions, including Confluent Cloud, a managed streaming service that simplifies the development of streaming applications; Confluent Operator, which facilitates the management of Apache Kafka on Kubernetes; KSQL, an open-source streaming SQL engine for interactive queries; and ksqlDB, an event streaming database. Founded in 2014 and headquartered in Mountain View, California, Confluent has expanded its presence with multiple offices across the United States and internationally in locations such as the United Kingdom, Germany, Australia, and Singapore.
Databricks Inc. offers a unified data analytics cloud platform focused on simplifying data engineering and collaborative data science. The company's platform facilitates data integration, real-time experimentation, and the deployment of production applications for developers and data scientists. Key products include Databricks, a cloud-based data processing solution, Databricks Delta for unified data management, MLflow for managing the machine learning lifecycle, and Delta Lake for handling batch and streaming data. Databricks serves a diverse range of industries, including advertising, healthcare, financial services, and manufacturing. The company, founded in 2013 and headquartered in San Francisco with additional offices in London, Amsterdam, and Bengaluru, has formed strategic partnerships with various technology firms. Additionally, Databricks Ventures invests in companies that align with its vision for data, analytics, and artificial intelligence, particularly through initiatives like the Lakehouse Fund, which supports the development of the lakehouse architecture.
Scale AI, Inc. is a data platform that specializes in providing high-quality training and validation data for artificial intelligence applications. Founded in 2016 and based in San Francisco, the company offers a range of annotation tools, including platforms for 3D sensor data, images, videos, text, and documents. Its products support various AI tasks such as content moderation, transcription, and data comparison, and are utilized across multiple sectors including autonomous vehicles, retail, conversational AI, and robotics. Scale AI's technology enables companies to streamline their machine learning processes, allowing teams to concentrate on developing innovative models rather than the labor-intensive task of data labeling. The company serves notable clients in the technology and transportation industries, enhancing the efficiency and effectiveness of AI development.
Kyligence Inc. is a data intelligence company established in 2016 and headquartered in Shanghai, China. It specializes in providing an intelligent data platform designed to simplify big data analytics across both on-premises and cloud environments. The company’s primary offerings include the Kyligence Analytics Platform, which enables sub-second query responses on large datasets, and Apache Kylin, an open-source OLAP engine that facilitates interactive analytics on petabyte-scale data using SQL interfaces. Additionally, Kyligence provides a range of services such as intelligent diagnosis and optimization through its KyBot tool, along with IT consulting and website management solutions. By focusing on Big Data technologies and innovation, Kyligence aims to enhance productivity for business users, analysts, and engineers, enabling them to effectively manage and derive insights from extensive datasets.
Databricks Inc. offers a unified data analytics cloud platform focused on simplifying data engineering and collaborative data science. The company's platform facilitates data integration, real-time experimentation, and the deployment of production applications for developers and data scientists. Key products include Databricks, a cloud-based data processing solution, Databricks Delta for unified data management, MLflow for managing the machine learning lifecycle, and Delta Lake for handling batch and streaming data. Databricks serves a diverse range of industries, including advertising, healthcare, financial services, and manufacturing. The company, founded in 2013 and headquartered in San Francisco with additional offices in London, Amsterdam, and Bengaluru, has formed strategic partnerships with various technology firms. Additionally, Databricks Ventures invests in companies that align with its vision for data, analytics, and artificial intelligence, particularly through initiatives like the Lakehouse Fund, which supports the development of the lakehouse architecture.
Domino Data Lab, Inc. offers an enterprise data science platform that supports both on-premise and cloud-based solutions for data analysis applications. The company's flagship products include Domino Cloud, a managed cloud infrastructure for running and scaling data models, and Domino On-Premise, which facilitates tracking, sharing, and auditing of analyses. Domino Data Lab serves a diverse range of industries, including financial services, insurance, media and technology, health and life sciences, manufacturing, retail, ecommerce, and consumer products. The platform is designed to empower data science teams, allowing organizations to manage and scale their data science efforts effectively. Notable clients like Allstate, Dell Technologies, and Bayer utilize Domino to enhance collaboration, accelerate research, and deliver impactful models. Founded in 2012 and based in San Francisco, California, Domino Data Lab was formerly known as Cerebro, Inc. and adopted its current name in February 2015.
Domino Data Lab, Inc. offers an enterprise data science platform that supports both on-premise and cloud-based solutions for data analysis applications. The company's flagship products include Domino Cloud, a managed cloud infrastructure for running and scaling data models, and Domino On-Premise, which facilitates tracking, sharing, and auditing of analyses. Domino Data Lab serves a diverse range of industries, including financial services, insurance, media and technology, health and life sciences, manufacturing, retail, ecommerce, and consumer products. The platform is designed to empower data science teams, allowing organizations to manage and scale their data science efforts effectively. Notable clients like Allstate, Dell Technologies, and Bayer utilize Domino to enhance collaboration, accelerate research, and deliver impactful models. Founded in 2012 and based in San Francisco, California, Domino Data Lab was formerly known as Cerebro, Inc. and adopted its current name in February 2015.