LlamaIndex
Corporate Round in 2025
LlamaIndex is a developer of a central interface toolkit that enhances large language model (LLM) applications by integrating external data. The company provides a flexible data framework that allows businesses to connect various custom data sources, including unstructured, structured, and semi-structured data. This capability enables the seamless development of powerful end-user applications, facilitating the effective utilization of LLMs in diverse business contexts.
Fennel AI
Acquisition in 2025
Whether it is recommending videos to watch on TikTok, things to buy on Amazon, or recommending jobs to apply for on LinkedIn, recommendation engines are the drivers of the modern digital economy. However, the technology to power these recommendations has only been available to a select few big tech companies so far. We, at Fennel AI, are an ex-Facebook/Google team that’s on a mission to enable all the companies in the world to harness this technology and build delightful products for their customers.
Gable is a business-to-business data infrastructure software-as-a-service company that provides a platform aimed at enhancing collaboration through the writing and execution of data contracts. This platform facilitates effective communication between data providers and consumers, promoting improved data quality on a larger scale. Additionally, Gable offers a management platform focused on enhancing data visibility and governance. By employing artificial intelligence, Gable's platform scans code to identify data creation points, monitor its movement, and assess its effects prior to deployment. This comprehensive approach helps organizations prevent disruptions, fosters collaboration, and ensures compliance with relevant standards.
Blade Bridge
Acquisition in 2025
Blade Bridge offers a suite of tools that accelerate data projects for SI's and product vendors.
Twelve Labs
Series A in 2024
Twelve Labs is a developer of a multimodal artificial intelligence platform that focuses on enhancing video understanding capabilities for businesses and developers. The platform enables users to analyze and interpret video content through various modalities, including visual and auditory data. It offers APIs for functions such as search, generation, and classification, allowing clients to create embeddings from videos to support a range of applications. Twelve Labs' technology is applicable across various industries, including technology, media, entertainment, and security, facilitating the development of intelligent video applications that can be deployed on any cloud and customized with specific data to meet user needs.
Koantek
Venture Round in 2024
Koantek is an IT consulting firm specializing in data-driven solutions. It assists businesses in maximizing their data and AI investments by providing advisory and implementation services in data strategy, migration, engineering, advanced analytics, and cloud infrastructure automation. By simplifying data complexities, Koantek helps clients accelerate business growth.
SuperAnnotate
Series B in 2024
SuperAnnotate is a leading platform that specializes in creating high-quality training datasets for various applications in artificial intelligence, including generative AI, computer vision, and natural language processing. The company provides advanced annotation software designed to facilitate the annotation, training, and automation of machine learning pipelines. With a focus on scalability, SuperAnnotate offers tools for image, video, text, and lidar annotation, along with features that enhance collaboration and quality management. Additionally, it integrates programming software development kits to streamline the data organization and annotation process. By enabling machine learning teams to develop and manage accurate datasets efficiently, SuperAnnotate significantly accelerates the creation of successful machine learning models.
Galileo.ai is a technology company specializing in AI application management. It offers a suite of services including development, testing, monitoring, and security for AI applications. The core product is an AI platform designed to enhance machine learning processes by automatically identifying errors and data gaps, thereby improving efficiencies, reducing costs, and mitigating biases across various industries such as healthcare, finance, and insurance.
Braintrust Data
Series A in 2024
Braintrust offers an AI stack to simplify the process from evaluations to data management, ensuring seamless integration into any business. Braintrust simplifies the evaluation process by facilitating easy scoring, logging, and visualization of outputs. Users can investigate failures, monitor performance trends, and address queries like identifying regressions or assessing new models. The platform offers a Prompt Playground, allowing users to compare multiple prompts, benchmarks, and input/output pairs across runs, enabling both ephemeral tinkering and experiment evaluation on large datasets. In Continuous Integration, Braintrust seamlessly integrates into workflows, enabling progress tracking on the main branch and automatic comparison of new experiments with live versions before deployment. With a focus on Datasets, the platform enables the effortless capture of rated examples from staging and production, incorporating them into versioned "golden" datasets stored in the cloud. This ensures the evolution of datasets without jeopardizing evaluations dependent on them.
Voyage AI
Series A in 2024
Voyage AI specializes in creating state-of-the-art embedding models and rerankers, designed to enhance the quality and efficiency of unstructured data search and retrieval, particularly in retrieval-augmented generation (RAG) systems. Led by top-tier researchers, the company's offerings outperform competitors in terms of accuracy, speed, and cost-effectiveness, and provide flexible deployment options. Voyage AI caters to specific domains such as code, finance, and law, delivering tailored, high-precision models. Additionally, it offers customized solutions and bespoke models to address unique business needs, enabling clients to improve their AI-driven processes and data analysis capabilities.
Cube Dev, Inc., established in 2016 and headquartered in San Francisco, specializes in designing and developing open-source analytical API platforms. The company offers tools for building visualization-agnostic user interfaces, analytical API servers, and supports modern data stores. Its flagship product, Cube Cloud, provides managed infrastructure, query inspection and tracing, pre-aggregation management, and monitoring capabilities. Cube Dev's platform enables businesses to build internal business intelligence tools and add customer-facing analytics to existing applications, handling large datasets securely and efficiently.
XponentL Data
Seed Round in 2024
XponentL Data is a technology company that specializes in data and artificial intelligence (AI) platforms. Its core business is to streamline the process of data-driven decision-making by automating data collection, preparation, and analysis. The company's platform bridges the gap between data producers and consumers, delivering data products that offer insights, fuel AI, and capture value. This enables businesses to extract knowledge from their data and make more informed decisions, ultimately increasing productivity and fostering creativity.
Fireworks AI
Series A in 2024
Fireworks AI is a company that provides a generative AI platform designed to facilitate rapid product iteration and optimize operational costs. Its platform allows developers and businesses to run, fine-tune, and share large language models (LLMs) efficiently. By focusing on high-speed collaboration and service, Fireworks AI enables users to effectively address product challenges and scale their operations more swiftly. The company's offerings are geared towards enhancing the development process, making it easier for organizations to innovate and adapt in a competitive landscape.
Lilac AI
Acquisition in 2024
Lilac AI provides data science tools for improving the quality of data for generative AI applications and language models (LLMs).
Unstructured
Series B in 2024
Unstructured is a developer of an open-source data transformation platform that simplifies the process of converting raw data, such as PDFs and Microsoft Office documents, into formats compatible with language models. The platform supports over 25 file types, including PDF, DOC, and PPTX, and offers connectors to various systems like SharePoint, S3, and Databricks. By facilitating effortless data extraction and integration into AI workflows, Unstructured enhances the accessibility of human-generated information, ensuring that it is readily available for generative AI systems. Its modular architecture allows users to incorporate any third-party model, making it a versatile solution for preprocessing natural language data for machine learning applications.
Mistral AI_
Venture Round in 2024
Mistral AI is a company dedicated to developing advanced artificial intelligence solutions, with a particular emphasis on creating open-source large language models. The company focuses on producing customizable and compute-efficient AI models that support various applications, including natural language processing and complex problem-solving. Mistral AI aims to advance the field of artificial intelligence by fostering a community-driven development approach, allowing businesses and organizations to utilize powerful AI technologies without the burden of building and maintaining their own models. Through its innovative offerings, Mistral AI seeks to drive efficiency, innovation, and informed decision-making across diverse sectors.
Adaptive ML
Series A in 2024
Adaptive ML is a technology company that specializes in developing a Large Language Model (LLM) platform. This platform enables businesses to train and deploy language models, with a unique feature of incorporating user feedback to enhance model performance. By leveraging company data, user interactions, and feedback, Adaptive ML's platform generates AI models that continuously learn and improve, helping businesses achieve superior performance without requiring expertise in complex techniques like reinforcement learning.
Entrada
Seed Round in 2024
Entrada recognizes the transformative impact of data on business agility and innovation. Founded to empower Databricks users to unlock their data's full potential, Entrada offers expert services in modernizing data platforms to drive business objectives and monetization of data-centric services. As a trusted partner, Entrada is dedicated to ensuring businesses can fully harness their data for great decision-making and excellent customer experiences.
Glean is a company that specializes in AI-based search engine software aimed at enhancing data accessibility within enterprises. Its primary offering, the Workplace Search platform, integrates with various internal data sources and applications to facilitate efficient information retrieval. By leveraging advanced search technologies, retrieval augmented generation, and large language models, Glean's software delivers personalized answers to employees' queries, enabling them to quickly locate relevant data points based on specific keywords. This innovative approach helps organizations streamline their operations by ensuring that employees can easily access the information they need to make informed decisions.
Einblick
Acquisition in 2024
Einblick is a visual data computing platform that enhances organizations' ability to analyze data, forecast outcomes, and make informed decisions. The platform uniquely integrates the computational capabilities of traditional data science notebooks with modern canvas-based collaboration tools, creating a user-friendly graphical environment for building and deploying models. This touch-enabled interface allows data scientists to efficiently construct high-performance models and present them interactively to decision-makers. Einblick's clientele includes prominent organizations such as a major German luxury car brand, DARPA, and a significant internet service provider, highlighting its diverse application across various industries.
Anomalo is a developer of an artificial intelligence data validation tool that enables users to continuously inspect and validate data entering their warehouses. The company’s solution automatically detects and explains issues in enterprise data, facilitating seamless integration with data warehouses. By employing automated machine learning technology, Anomalo’s tool allows organizations to validate and document their data with minimal configuration, eliminating the need for users to write any code. This innovation helps companies maintain data integrity and improve the reliability of their data-driven decisions.
Mistral AI_
Series A in 2023
Mistral AI is a company dedicated to developing advanced artificial intelligence solutions, with a particular emphasis on creating open-source large language models. The company focuses on producing customizable and compute-efficient AI models that support various applications, including natural language processing and complex problem-solving. Mistral AI aims to advance the field of artificial intelligence by fostering a community-driven development approach, allowing businesses and organizations to utilize powerful AI technologies without the burden of building and maintaining their own models. Through its innovative offerings, Mistral AI seeks to drive efficiency, innovation, and informed decision-making across diverse sectors.
Arcion
Acquisition in 2023
Arcion is a cloud-native data mobility platform that specializes in high-performance, real-time data pipelines. The company offers an autonomous migration and cloud-neutral database replication solution, allowing businesses to efficiently migrate database updates to streaming data pipelines. This capability provides a consistent, real-time view of customer data and business intelligence across various applications and business units. By streamlining data migration processes, Arcion helps organizations reduce migration and licensing costs while enhancing the productivity of their engineering teams.
Prophecy.io
Series B in 2023
Prophecy.io offers a data transformation copilot that assists users in developing, deploying, and monitoring data pipelines across cloud platforms. The platform integrates AI and a visual interface to enhance productivity for various data users, enabling them to manage complex data workflows with ease.
Cleanlab is a technology company that specializes in enhancing the quality of datasets used by businesses for analytics and machine learning tasks. It offers a comprehensive platform centered around Data-Centric Artificial Intelligence (DCAI), which addresses the entire data quality pipeline within a single framework. This platform enables companies to diagnose and rectify issues in their datasets, thereby improving the reliability of their machine learning models.
DigPath
Non Equity Assistance in 2023
We are an innovative AI-driven digital pathology company, dedicated to shaping the future of diagnostics and research, with a mission to empower accurate diagnoses and elevate patient care globally.
Neon is a cloud-native, fully managed Postgres as a service. By separating storage from computing, Neon offers autoscaling, branching, and bottomless storage to give developers a simple, reliable, and powerful experience. Neon aims to provide a highly performant and cost-effective database infrastructure by leveraging cloud-native technologies and innovative architectural features.
Hightouch
Series B in 2023
Hightouch is a customer data platform that specializes in data synchronization between various marketing and operational tools. The company enables businesses to connect their existing data warehouses with applications such as customer relationship management systems, email marketing platforms, and advertising networks. By providing a user-friendly interface, Hightouch simplifies data integration and management, allowing organizations to access and utilize their data effectively without duplication. This approach helps maintain data integrity and security, while empowering clients to leverage their data for personalized marketing campaigns. Hightouch's platform features audience segmentation, data transformation, and automated workflows, which assist businesses in optimizing their marketing strategies and operational processes.
MosaicML
Acquisition in 2023
MosaicML is a company focused on creating an efficient infrastructure for training large language models and improving the overall efficiency of neural networks. It develops software and artificial intelligence training algorithms that enhance the training process by utilizing algorithmic techniques such as sparsity and network pruning. These innovations allow users to effectively and securely train large-scale AI models on their proprietary data, while also optimizing for speed, quality, and cost. MosaicML aims to streamline the machine learning model recomposition process, making it easier for organizations to harness the power of AI in their operations.
Snowplow
Venture Round in 2023
Snowplow is an enterprise-grade event analytics platform that specializes in behavioral data management. It empowers data teams by providing tools to track, contextualize, validate, and model customer interactions on websites and applications. The platform integrates web analytics with various third-party data sources, allowing businesses to gain comprehensive insights into customer behavior. Snowplow's solutions facilitate customer journey analytics, marketing attribution, product analytics, and paywall optimization, addressing complex data challenges and enhancing overall data-driven decision-making for organizations.
Lovelytics
Venture Round in 2023
Lovelytics is a data, AI, and analytics consulting company that specializes in transforming data into actionable insights for leading organizations. The firm offers a range of services, including data advisory, enterprise data environment design and implementation, data science and machine learning, data visualization, and training. By partnering with clients, Lovelytics focuses on enhancing self-sufficiency and hands-on enablement, ultimately driving business outcomes and creating sustainable value. Through its expertise, the company aims to help clients optimize and modernize their data ecosystems, ensuring they can better understand and leverage their data for strategic decision-making.
Catalyst Software
Venture Round in 2023
Catalyst Software Corporation, headquartered in New York, develops an intuitive customer success platform designed to enhance customer experience and reduce churn for businesses. As a Software-as-a-Service (SaaS) provider, it offers a comprehensive suite of features including analytics, workflow automation, product usage tracking, and a task manager that consolidates various communication tools into a single interface. The platform allows users to log customer interactions automatically, create 360º profiles, and manage campaigns and account segmentation effectively. Additionally, it integrates with other SaaS applications to provide a unified dashboard that facilitates data-driven decision-making around customer success. Catalyst aims to empower teams to identify expansion opportunities and drive recurring revenue growth by aligning strategic actions with customer objectives. Founded in 2016, the company is now part of Totango.
Immuta
Venture Round in 2023
Immuta, Inc. is a data security company that specializes in providing organizations with a platform for managing data privacy, security, and access control. The Immuta Data Security Platform enables users to discover and classify sensitive data, implement access control policies, and monitor data usage without the need for coding. This automated data governance solution supports self-service access to data while ensuring compliance with various regulations. Immuta is utilized by a diverse range of industries, including finance, healthcare, government, and manufacturing, to facilitate cloud migration and secure collaboration. Founded in 2014 and headquartered in College Park, Maryland, with additional offices in Boston, Massachusetts and Columbus, Ohio, Immuta has established itself as a trusted partner for Fortune 500 companies and government agencies worldwide.
Okera Inc. is a data management company that operates an Active Data Access Platform designed to streamline data provisioning, access, governance, and auditing. Founded in 2016 and based in San Francisco, California, with an additional office in Seattle, Okera focuses on enhancing data security, privacy, compliance, and sensitive data management. The platform enables organizations to automatically discover and audit data lakes, create no-code access policies through a visual policy engine, and enforce fine-grained access controls across hybrid and multi-cloud environments, including AWS and Azure. By implementing comprehensive data access controls, Okera empowers data teams to confidently harness the potential of their data for innovation and growth while navigating the complexities of evolving data privacy regulations.
Perplexity
Series A in 2023
Perplexity is an artificial intelligence-based search engine platform that combines large language models with traditional search engines. It leverages natural language processing and generative AI technologies to deliver conversational responses to user queries, aiming to enhance the search experience beyond standard results. The platform is designed to facilitate the development of safe and beneficial artificial general intelligence, offering an open-source environment accessible to the public. This allows clients to acquire skills and knowledge in software development, positioning Perplexity as a bridge between conventional search capabilities and advanced AI-driven assistance.
Matillion
Venture Round in 2022
Matillion Ltd. specializes in cloud data integration software solutions that empower companies to effectively utilize their data. The company offers a range of products, including Matillion ETL for Amazon Redshift and Matillion ETL for Snowflake, both of which facilitate the extraction, loading, and transformation of structured and semi-structured data in cloud environments. Additionally, Matillion Data Loader serves as a SaaS-based tool that loads data from source systems into cloud data warehouses, enhancing data accessibility for informed decision-making. Matillion also provides a business intelligence solution for self-service reporting and analytics, alongside Matillion Exchange, a marketplace for users to share and download integration jobs. With a client base that includes Fortune 500 companies and mid-sized tech enterprises, Matillion operates from its headquarters in Manchester, United Kingdom, and has offices in New York, Denver, and Seattle. The company, established in 2010, is dedicated to accelerating data readiness and maximizing the impact of data across various industries.
DataJoy
Acquisition in 2022
DataJoy is a developer of a revenue intelligence platform that integrates data across various organizational functions, including marketing, sales, product, and finance. The platform utilizes machine learning algorithms to analyze this unified data, providing insights that help companies understand and enhance their revenue performance. By tracking key performance indicators and detecting anomalies, DataJoy enables organizations to make informed projections and optimize their strategies for growth. Ultimately, the company aims to assist businesses in building a repeatable, profitable, and predictable revenue model.
Tecton is a technology company that specializes in providing an enterprise-level feature store platform for machine learning (ML). Its core offering enables ML teams to efficiently build, serve, and scale features from diverse data sources, aiming to enhance model performance and drive tangible business outcomes. The platform addresses the unique data challenges faced by ML teams, making it accessible to a broader range of companies.
Cortex Labs
Acquisition in 2022
Cortex Labs is a developer of a serverless computing platform that supports machine learning engineering teams by providing cloud-native model serving infrastructure. The platform is designed to facilitate the deployment of large-scale machine learning applications, including computer vision and natural language processing. It enables users to build and integrate APIs into any application, manage both real-time and batch inference workloads, and create streamlined, reproducible workflows. By doing so, Cortex Labs empowers engineering teams to efficiently ship machine learning applications into production.
Hex Technologies
Series B in 2022
Hex is a software company that provides collaborative data science and analytics. They provide individuals to learn and organizations to know things so they can make better decisions. Hex brings together SQL, Python, R, and no-code in powerful notebooks and allows users to publish projects as interactive data apps that anyone can use with one click.
Dbt Labs is a developer of an open-source analytics engineering tool founded in 2016 and headquartered in Philadelphia, Pennsylvania. The company focuses on empowering data analysts to create and disseminate organizational knowledge through its innovative platform. By enabling users with SQL knowledge to build data transformation workflows, Dbt Labs provides a transformation workflow tool that facilitates quick and collaborative deployment of analytics code. This tool emphasizes software engineering practices such as modularity, portability, and documentation, thereby allowing teams to streamline their analytics processes and enhance their ability to derive insights from data.
Arcion is a cloud-native data mobility platform that specializes in high-performance, real-time data pipelines. The company offers an autonomous migration and cloud-neutral database replication solution, allowing businesses to efficiently migrate database updates to streaming data pipelines. This capability provides a consistent, real-time view of customer data and business intelligence across various applications and business units. By streamlining data migration processes, Arcion helps organizations reduce migration and licensing costs while enhancing the productivity of their engineering teams.
Revelate is a developer of a data fulfillment platform that offers a comprehensive suite of capabilities for data sharing and commercialization. The platform is designed to alleviate the challenges faced by data teams in distributing data according to consumer needs, both within and outside their organizations. By seamlessly integrating into existing data ecosystems, Revelate empowers companies to prepare, package, and distribute data efficiently and effectively from any source to any recipient. This innovative approach helps organizations fully realize the value of their data assets.
Hunters is a cybersecurity company that specializes in developing an artificial intelligence-based platform for detecting and responding to cyber threats. Founded in 2018 and headquartered in Tel Aviv, Israel, the company offers its solution, Hunters.AI, which autonomously identifies cyberattacks that may evade traditional security measures across various IT environments, including cloud and network systems. Hunters.AI integrates diverse security telemetry and intelligence, enriching threat signals with detailed tactics, techniques, and procedures to enhance detection capabilities. By utilizing machine learning and cloud-based analytics, the platform correlates threat patterns and generates high-fidelity attack narratives, enabling cybersecurity teams to respond swiftly and effectively to potential breaches.
Labelbox, Inc. is a technology company that specializes in providing an AI-driven platform for data labeling and management. Founded in 2018 and headquartered in San Francisco, California, the company enables businesses to outsource their data annotation needs, facilitating the creation and management of datasets essential for machine learning applications. The platform features a visual workflow interface, annotation tools, quality control capabilities, and performance analytics, which collectively streamline the data labeling process. Labelbox supports teams in utilizing the latest advancements in generative AI and large language models, ensuring that AI systems receive appropriate human oversight and automation. Its services are utilized by prominent enterprises, including Walmart, Procter & Gamble, Genentech, and Adobe, as well as numerous leading AI teams across various industries.
8080 Labs
Acquisition in 2021
8080 Labs is a software development company that specializes in creating tools to enhance the accessibility of data science for users of all skill levels. The company's flagship product, bamboolib, is a user-friendly, UI-based data science tool that allows users to quickly and efficiently explore and transform data without the need for coding. By streamlining the data manipulation process, 8080 Labs empowers data scientists to boost their productivity, enabling them to focus on analysis rather than programming. The company offers both paid and open-source software, reinforcing its commitment to making data science tools available to a broader audience.
Redash
Acquisition in 2020
Redash, Ltd. is a company founded in 2015 and based in Tel Aviv-Yafo, Israel, that specializes in developing an open-source platform designed to facilitate data-driven decision-making for organizations. The platform enables data scientists and SQL analysts to integrate various data sources, such as operational databases and data lakes, into cohesive dashboards. By democratizing data access, Redash allows enterprises to visualize and share data insights effectively, fostering a culture of data utilization within organizations. In June 2020, Redash became a subsidiary of Databricks Inc., further enhancing its capabilities in the data analytics landscape.
Theom is an IT company founded in 2020 and headquartered in San Francisco, California. It specializes in cloud and data security services, offering a platform designed to discover, track, and protect enterprise data within cloud environments. Theom's platform enables rapid deployment and provides immediate value by identifying risks related to data loss and prioritizing corrective actions. This allows enterprises to concentrate on their growth while securely leveraging their data in the cloud.
Neon is a cloud-native, fully managed Postgres as a service. By separating storage from computing, Neon offers autoscaling, branching, and bottomless storage to give developers a simple, reliable, and powerful experience. Neon aims to provide a highly performant and cost-effective database infrastructure by leveraging cloud-native technologies and innovative architectural features.