Astronomer
Series D in 2025
Astronomer is a data engineering software company that builds a managed Apache Airflow platform. Founded in 2014 and based in Cincinnati, Astronomer offers Astro, a data orchestration platform powered by Apache Airflow that enables data teams to build, run, and observe data pipelines-as-code. The platform is designed to deploy, manage, and scale distributed Airflow workflows with features such as isolated resource allocation and controlled user access, supporting data engineers, data scientists, and data analysts in delivering trusted analytics. Astronomer's solutions help organizations increase data availability and visibility by making pipelines observable and governance-friendly across environments.
Structify
Seed Round in 2025
Structify is a technology company that specializes in data extraction and structuring. It offers a platform that enables businesses to create custom datasets by specifying the data format, selecting the source(s), and assigning agents to search and extract the data. The company's in-house model ensures the quality and accuracy of the extracted data across various use cases. Structify has found applications in finance, construction, and corporate technology, helping teams extract and organize data from diverse sources such as pitch decks, geotechnical documents, and real-time chain of command data.
Charta Health is a technology company specializing in medical billing optimization. It employs artificial intelligence to scrutinize patient charts before claims are submitted, ensuring all potential revenue opportunities are captured, claim denials are minimized, and billing standards are consistently met. The platform offers real-time analytics, audits, and autonomous chart reviews, empowering healthcare professionals to enhance administrative efficiency and patient care.
Hightouch
Series C in 2025
Hightouch operates a customer data platform that synchronizes data between various marketing and operational tools. It connects businesses' existing data warehouses with applications like CRM systems, email platforms, and advertising networks for personalized marketing campaigns and enhanced customer engagement.
Cleanlab specializes in enhancing the reliability of artificial intelligence systems by focusing on data-centric approaches. Its platform improves dataset quality, identifies low-quality outputs, determines root causes, enhances response accuracy, and applies guardrails to enable safe, accurate, and scalable AI deployment.
Polars is a rust based dataframe framework for data scientists and engineers that enables data processing at any scale.
Cleanlab
Seed Round in 2023
Cleanlab specializes in enhancing the reliability of artificial intelligence systems by focusing on data-centric approaches. Its platform improves dataset quality, identifies low-quality outputs, determines root causes, enhances response accuracy, and applies guardrails to enable safe, accurate, and scalable AI deployment.
Hightouch
Series B in 2023
Hightouch operates a customer data platform that synchronizes data between various marketing and operational tools. It connects businesses' existing data warehouses with applications like CRM systems, email platforms, and advertising networks for personalized marketing campaigns and enhanced customer engagement.
Rubber Ducky Labs
Seed Round in 2023
Rubber Ducky Labs develops an AI-driven metadata enrichment platform that enhances product discovery for e-commerce teams. The tools provide AI-powered tagging that supports multi-modal inputs and require no technical expertise, enabling online retailers to improve personalization, search, marketing, and SEO through enriched catalog data. The company focuses on recommender-system capabilities to optimize how products are surfaced and recommended to shoppers, helping retailers refine catalog quality and discoverability across channels.
Granica is a company that specializes in enhancing the efficiency and safety of artificial intelligence applications through its AI Data Readiness Platform. This platform assists data engineers and leaders in creating AI-ready data, which is crucial for optimizing machine learning models and analytics. By improving the efficiency of tabular and natural language processing training data, Granica significantly reduces storage costs and enhances query performance. The company focuses on mitigating risks associated with sensitive data leakage and harmful content, thereby ensuring privacy and safety in AI inputs and outputs. Granica curates and refines training datasets to select impactful samples, ultimately boosting the performance and reliability of AI models. Targeting enterprises in data-intensive sectors such as financial services, retail, geo-spatial intelligence, and autonomous vehicles, Granica enables clients to leverage AI more effectively, enhancing trust and improving business outcomes.
Nightfall
Series B in 2022
Nightfall offers a cloud data protection platform and developer platform that uses machine learning to identify and classify sensitive business data across SaaS apps, APIs, and data infrastructure. It detects exposures, automates data hygiene, and enforces workflows such as quarantines, deletions, and alerts with no end-user disruption. The platform exposes cloud-hosted APIs and SDKs for developers to integrate data classification and protection capabilities into applications and data pipelines, helping organizations save time and keep business information safe.
Tecton offers an enterprise-grade feature store, empowering businesses to harness machine learning effectively. Its platform addresses unique ML data requirements, enabling teams to build, serve, and scale features swiftly and reliably.
Hightouch
Series B in 2021
Hightouch operates a customer data platform that synchronizes data between various marketing and operational tools. It connects businesses' existing data warehouses with applications like CRM systems, email platforms, and advertising networks for personalized marketing campaigns and enhanced customer engagement.
LeapYear
Venture Round in 2021
LeapYear Technologies, Inc., established in 2014 and headquartered in Berkeley, California, specializes in developing a platform that enables enterprises to securely extract value from their most sensitive data. The platform employs differential privacy, a mathematically proven standard, to protect data while allowing it to be leveraged for machine learning purposes. This allows domain experts, data scientists, and partner organizations to create automated machine learning applications and APIs, monetizing previously inaccessible or restricted data sources. LeapYear has collaborated with major companies across various sectors, including financial services, healthcare, technology, and insurance, to combine, analyze, and securely utilize previously siloed information.
Hightouch
Series A in 2021
Hightouch operates a customer data platform that synchronizes data between various marketing and operational tools. It connects businesses' existing data warehouses with applications like CRM systems, email platforms, and advertising networks for personalized marketing campaigns and enhanced customer engagement.
Nebulous
Seed Round in 2020
Nebulous, Inc. is a technology company focused on developing blockchain hardware and software infrastructure for the decentralized internet. Founded in 2014 and based in Boston, Massachusetts, the company is known for its flagship product, Sia, a decentralized cloud storage platform that utilizes blockchain technology to create a data storage marketplace. Additionally, Nebulous offers Skynet, a decentralized content delivery network (CDN) and file-sharing platform tailored for developers, along with SiaStream, which provides affordable storage solutions for media files, enabling fast streaming. As a core contributor to the open-source Skynet project, Nebulous plays a significant role in advancing the decentralized web ecosystem.
Astronomer
Series A in 2020
Astronomer is a data engineering software company that builds a managed Apache Airflow platform. Founded in 2014 and based in Cincinnati, Astronomer offers Astro, a data orchestration platform powered by Apache Airflow that enables data teams to build, run, and observe data pipelines-as-code. The platform is designed to deploy, manage, and scale distributed Airflow workflows with features such as isolated resource allocation and controlled user access, supporting data engineers, data scientists, and data analysts in delivering trusted analytics. Astronomer's solutions help organizations increase data availability and visibility by making pipelines observable and governance-friendly across environments.
Hazelcast
Series D in 2020
Hazelcast, Inc. is a technology company that specializes in developing an open-source clustering and scalable data distribution platform tailored for Java applications. Founded in 2008 and headquartered in Palo Alto, California, with an additional research and development office in Istanbul, Turkey, Hazelcast offers a suite of products including an in-memory data grid solution, a stream processing engine called Hazelcast Jet, and a web-based management tool for monitoring and managing clusters. The company's platform integrates high-performance stream processing with rapid data management, enabling businesses to effectively handle transactional, operational, and analytical workloads. Organizations worldwide leverage Hazelcast's solutions to simplify real-time architectures, enhance business-critical processes, and support AI and machine learning deployments, ultimately driving efficiency and revenue while maintaining a low total cost of ownership.
Verana Health
Series D in 2020
Verana Health, Inc. is a digital health company based in San Francisco, California, that specializes in transforming healthcare data into actionable insights. Founded in 2008 and formerly known as Digisight Technologies, the company connects healthcare providers with patients through innovative solutions. Its flagship product, SightBook, is a mobile application that enables users to test their vision and share results with their physicians in real-time, facilitating timely appointments and treatments. Verana Health manages data from over 20,000 healthcare providers and 70 electronic health record systems, leveraging its AI-enhanced platform, VeraQ™, to create a robust healthcare data ecosystem. By utilizing advanced analytics on curated datasets, Verana Health supports life sciences partners in improving clinical research and enhancing patient care outcomes.
Reonomy specializes in commercial real estate data and analytics. It uses big data, partnerships, and machine learning to provide comprehensive property intelligence, enabling professionals to discover new opportunities and make informed decisions.
Nightfall
Series A in 2019
Nightfall offers a cloud data protection platform and developer platform that uses machine learning to identify and classify sensitive business data across SaaS apps, APIs, and data infrastructure. It detects exposures, automates data hygiene, and enforces workflows such as quarantines, deletions, and alerts with no end-user disruption. The platform exposes cloud-hosted APIs and SDKs for developers to integrate data classification and protection capabilities into applications and data pipelines, helping organizations save time and keep business information safe.
Astronomer
Seed Round in 2019
Astronomer is a data engineering software company that builds a managed Apache Airflow platform. Founded in 2014 and based in Cincinnati, Astronomer offers Astro, a data orchestration platform powered by Apache Airflow that enables data teams to build, run, and observe data pipelines-as-code. The platform is designed to deploy, manage, and scale distributed Airflow workflows with features such as isolated resource allocation and controlled user access, supporting data engineers, data scientists, and data analysts in delivering trusted analytics. Astronomer's solutions help organizations increase data availability and visibility by making pipelines observable and governance-friendly across environments.
Kantar
Acquisition in 2019
Kantar, a division of WPP, is a global leader in data-driven insights and consulting. With a team of 26,500 employees across 95 countries, Kantar unites diverse expertise in market research, analytics, and consulting to provide clients with comprehensive business insights. Its services span the entire consumer cycle, aiding clients in strategic decision-making. Kantar Health, a division of Kantar, specializes in data, analytics, and research for the life sciences industry, helping clients bring safe and effective treatments to patients worldwide.
Nebulous
Seed Round in 2019
Nebulous, Inc. is a technology company focused on developing blockchain hardware and software infrastructure for the decentralized internet. Founded in 2014 and based in Boston, Massachusetts, the company is known for its flagship product, Sia, a decentralized cloud storage platform that utilizes blockchain technology to create a data storage marketplace. Additionally, Nebulous offers Skynet, a decentralized content delivery network (CDN) and file-sharing platform tailored for developers, along with SiaStream, which provides affordable storage solutions for media files, enabling fast streaming. As a core contributor to the open-source Skynet project, Nebulous plays a significant role in advancing the decentralized web ecosystem.
Hazelcast
Series D in 2019
Hazelcast, Inc. is a technology company that specializes in developing an open-source clustering and scalable data distribution platform tailored for Java applications. Founded in 2008 and headquartered in Palo Alto, California, with an additional research and development office in Istanbul, Turkey, Hazelcast offers a suite of products including an in-memory data grid solution, a stream processing engine called Hazelcast Jet, and a web-based management tool for monitoring and managing clusters. The company's platform integrates high-performance stream processing with rapid data management, enabling businesses to effectively handle transactional, operational, and analytical workloads. Organizations worldwide leverage Hazelcast's solutions to simplify real-time architectures, enhance business-critical processes, and support AI and machine learning deployments, ultimately driving efficiency and revenue while maintaining a low total cost of ownership.
LeapYear Technologies, Inc., established in 2014 and headquartered in Berkeley, California, specializes in developing a platform that enables enterprises to securely extract value from their most sensitive data. The platform employs differential privacy, a mathematically proven standard, to protect data while allowing it to be leveraged for machine learning purposes. This allows domain experts, data scientists, and partner organizations to create automated machine learning applications and APIs, monetizing previously inaccessible or restricted data sources. LeapYear has collaborated with major companies across various sectors, including financial services, healthcare, technology, and insurance, to combine, analyze, and securely utilize previously siloed information.
Brillio
Acquisition in 2019
Brillio is a technology consulting and services company that helps organizations accelerate digital transformation. It provides services across digital experiences, product engineering, data analytics, digital front office, big data and analytics, digital infrastructure, and related areas. The company serves clients across banking and finance, energy and utilities, consumer packaged goods, retail, technology, and media and entertainment industries worldwide. Founded in 2004 and based in Santa Clara, California, Brillio focuses on helping organizations design, build, and scale digital platforms, products, and services enabled by data and emerging technologies.
CoinAlpha
Seed Round in 2018
CoinAlpha, Inc. is a FinTech software engineering firm based in Mountain View, California, founded in 2017. The company specializes in developing technologies that enable traders, developers, and fintech firms to create contract-based financial products. CoinAlpha is known for its open-source initiatives, including the Basket Protocol for Ethereum tokens and the Fund Protocol for fund administration. The company has also created Hummingbot, a leading open-source software for building market-making and arbitrage bots, which empowers users to manage token liquidity effectively. Additionally, CoinAlpha operates Hummingbot Miner, a liquidity mining platform that allows users to earn rewards by running market-making bots on specific trading pairs. The concept of liquidity mining was first introduced by CoinAlpha in a 2019 whitepaper. The founders bring extensive experience in investment banking and trading, complemented by engineering and product leadership backgrounds in notable Silicon Valley companies. CoinAlpha has successfully raised $14.8 million in funding from various investors.
SigFig is a wealth management company that focuses on providing personalized investment advice to investors of all wealth levels. Founded in 2007 and headquartered in San Francisco, the company leverages a blend of design, data science, and technology to empower users with the information and guidance necessary for achieving their financial goals. Through partnerships with major financial institutions, SigFig enhances the investment management process by offering accessible and affordable guidance tailored to individual needs. Its enterprise technology is designed to be secure, scalable, and compliant, allowing partners to accelerate their time to market. Additionally, SigFig's innovation initiatives support product development and client engagement, further solidifying its commitment to improving the investment experience for both investors and financial advisors.
Reonomy specializes in commercial real estate data and analytics. It uses big data, partnerships, and machine learning to provide comprehensive property intelligence, enabling professionals to discover new opportunities and make informed decisions.
LeapYear Technologies, Inc., established in 2014 and headquartered in Berkeley, California, specializes in developing a platform that enables enterprises to securely extract value from their most sensitive data. The platform employs differential privacy, a mathematically proven standard, to protect data while allowing it to be leveraged for machine learning purposes. This allows domain experts, data scientists, and partner organizations to create automated machine learning applications and APIs, monetizing previously inaccessible or restricted data sources. LeapYear has collaborated with major companies across various sectors, including financial services, healthcare, technology, and insurance, to combine, analyze, and securely utilize previously siloed information.
Trooly, Inc. is a San Francisco-based company, incorporated in 2013, that operates an online platform aimed at enhancing peer-to-peer social and business interactions. As a subsidiary of Airbnb, Trooly specializes in providing Instant Trust services designed to verify, screen, and predict the trustworthiness of relationships in various contexts. By employing advanced machine learning technology, Trooly synthesizes digital footprints in real time, allowing businesses to obtain insights with greater efficacy than traditional background checks and credit scores. This system requires minimal customer data input, making it both accessible and cost-effective, with results typically delivered in around 30 seconds. Trooly’s services serve a diverse clientele, including financial institutions, peer-to-peer marketplaces, marketers, and employers, fostering better risk assessment and customer relationship management. The company operates as a consumer reporting agency, ensuring compliance with applicable privacy and data protection laws while balancing transparency with respect for individual privacy.
SigFig is a wealth management company that focuses on providing personalized investment advice to investors of all wealth levels. Founded in 2007 and headquartered in San Francisco, the company leverages a blend of design, data science, and technology to empower users with the information and guidance necessary for achieving their financial goals. Through partnerships with major financial institutions, SigFig enhances the investment management process by offering accessible and affordable guidance tailored to individual needs. Its enterprise technology is designed to be secure, scalable, and compliant, allowing partners to accelerate their time to market. Additionally, SigFig's innovation initiatives support product development and client engagement, further solidifying its commitment to improving the investment experience for both investors and financial advisors.
Reonomy specializes in commercial real estate data and analytics. It uses big data, partnerships, and machine learning to provide comprehensive property intelligence, enabling professionals to discover new opportunities and make informed decisions.
Hazelcast
Series B in 2014
Hazelcast, Inc. is a technology company that specializes in developing an open-source clustering and scalable data distribution platform tailored for Java applications. Founded in 2008 and headquartered in Palo Alto, California, with an additional research and development office in Istanbul, Turkey, Hazelcast offers a suite of products including an in-memory data grid solution, a stream processing engine called Hazelcast Jet, and a web-based management tool for monitoring and managing clusters. The company's platform integrates high-performance stream processing with rapid data management, enabling businesses to effectively handle transactional, operational, and analytical workloads. Organizations worldwide leverage Hazelcast's solutions to simplify real-time architectures, enhance business-critical processes, and support AI and machine learning deployments, ultimately driving efficiency and revenue while maintaining a low total cost of ownership.
Numerator
Series B in 2014
Numerator is a market intelligence firm that combines omnichannel marketing, merchandising, and sales data to simplify strategic decision-making for brand, retail, and agency clients. It uniquely links consumer purchase behavior to influencing factors using the InfoScout OmniPanel, which has captured over 500 million receipts. Numerator serves industry leaders like Nike, Unilever, and Procter & Gamble with real-time path-to-purchase data.
SigFig is a wealth management company that focuses on providing personalized investment advice to investors of all wealth levels. Founded in 2007 and headquartered in San Francisco, the company leverages a blend of design, data science, and technology to empower users with the information and guidance necessary for achieving their financial goals. Through partnerships with major financial institutions, SigFig enhances the investment management process by offering accessible and affordable guidance tailored to individual needs. Its enterprise technology is designed to be secure, scalable, and compliant, allowing partners to accelerate their time to market. Additionally, SigFig's innovation initiatives support product development and client engagement, further solidifying its commitment to improving the investment experience for both investors and financial advisors.
Driven, Inc. specializes in Big Data application development and performance management solutions for a global clientele. The company offers its flagship product, Driven, which is an application performance management solution designed to support DevOps teams in debugging, monitoring, and managing Big Data applications. Driven is compatible with major processing frameworks such as Apache Hive, MapReduce, Cascading, Scalding, and Apache Spark, providing organizations with essential visibility and insights to enhance application performance. Additionally, Driven, Inc. offers Cascading, a platform for building Big Data applications on Apache Hadoop, alongside various solutions for performance monitoring, troubleshooting, and analytics. The company also provides training services to ensure effective use of its offerings. Founded in 2008 and headquartered in San Francisco, California, Driven, Inc. operates as a subsidiary of Xplenty Ltd. Its diverse user base spans multiple industries, including social media, retail, financial services, telecommunications, bioinformatics, and geospatial sectors.
Hazelcast
Series A in 2013
Hazelcast, Inc. is a technology company that specializes in developing an open-source clustering and scalable data distribution platform tailored for Java applications. Founded in 2008 and headquartered in Palo Alto, California, with an additional research and development office in Istanbul, Turkey, Hazelcast offers a suite of products including an in-memory data grid solution, a stream processing engine called Hazelcast Jet, and a web-based management tool for monitoring and managing clusters. The company's platform integrates high-performance stream processing with rapid data management, enabling businesses to effectively handle transactional, operational, and analytical workloads. Organizations worldwide leverage Hazelcast's solutions to simplify real-time architectures, enhance business-critical processes, and support AI and machine learning deployments, ultimately driving efficiency and revenue while maintaining a low total cost of ownership.
SigFig is a wealth management company that focuses on providing personalized investment advice to investors of all wealth levels. Founded in 2007 and headquartered in San Francisco, the company leverages a blend of design, data science, and technology to empower users with the information and guidance necessary for achieving their financial goals. Through partnerships with major financial institutions, SigFig enhances the investment management process by offering accessible and affordable guidance tailored to individual needs. Its enterprise technology is designed to be secure, scalable, and compliant, allowing partners to accelerate their time to market. Additionally, SigFig's innovation initiatives support product development and client engagement, further solidifying its commitment to improving the investment experience for both investors and financial advisors.
Numerator
Series A in 2013
Numerator is a market intelligence firm that combines omnichannel marketing, merchandising, and sales data to simplify strategic decision-making for brand, retail, and agency clients. It uniquely links consumer purchase behavior to influencing factors using the InfoScout OmniPanel, which has captured over 500 million receipts. Numerator serves industry leaders like Nike, Unilever, and Procter & Gamble with real-time path-to-purchase data.
Helomics Corporation
Series D in 2010
Helomics Corporation is a personalized healthcare company that develops diagnostic and tumor profiling solutions to guide cancer treatment decisions. Its Precision Cellular Analytical Platform analyzes cell cycle and proliferation data over a multiweek timeframe to support physicians in selecting effective therapies. The company offers products including ChemoFx for gynecologic cancers, BioSpeciFx biomarker tests to evaluate tumor biology and potential drug response or prognosis, GeneFx Colon for stage II colon cancer, and GeneFx Lung for early stage non-small cell lung cancer. It also provides tumor profiling services such as bioinformatics, contract research and development, and biorepository and banking. Helomics collaborates with industry partners to advance personalized oncology insights. The company was founded in 1995 and is based in Pittsburgh, Pennsylvania.
MyEdu is a student academic platform designed to support college students in their educational journey and career readiness. By leveraging a comprehensive database of academic information sourced from universities across the United States, MyEdu provides students with easy-to-use web applications that facilitate the creation of personalized graduation plans and professional profiles. The platform empowers students to make informed decisions at critical academic milestones, such as choosing majors, planning degrees, and selecting courses, while also offering targeted job and internship opportunities. MyEdu’s extensive repository includes course details, professor ratings, grade histories, and student reviews, which collectively enhance the decision-making process. By focusing on the unique needs of students and collaborating closely with universities, MyEdu aims to transform the college experience and help students achieve their academic and career goals.
vAuto
Venture Round in 2007
vAuto, a brand of Cox Automotive since 2010, specializes in providing a comprehensive suite of inventory management solutions tailored for automotive dealers. The company leverages advanced retail and wholesale market data along with data science-driven insights to facilitate informed decision-making throughout the inventory lifecycle, from sourcing to selling. By offering end-to-end solutions, vAuto enables clients to enhance the efficiency and profitability of their operations. Additionally, vAuto's innovations, such as ProfitTime GPS, utilize artificial intelligence and machine learning to further support dealers in optimizing their performance. The company is dedicated to helping automotive dealers improve their business practices through insightful data and tailored support.
AthenaHQ is a technology company that offers a platform to help businesses understand and optimize their brand's visibility in AI-driven search results, particularly in generative engines like ChatGPT. The platform provides real-time monitoring of brand mentions, identifies content gaps where AI lacks knowledge about the business, and pinpoints websites cited by chatbots, enabling clients to enhance their brand's presence and improve its visibility on search platforms.