Pulse AI
Seed Round in 2025
Pulse converts complex information into LLM-ready inputs. Their API supports all document formats, from PDFs to Word, Excel, etc. Pulse integrates seamlessly with any existing data pipeline in minutes without any training or complexity.
It was founded in 2024 and is located in San Francisco, California, United States.
Use AI to automatically map data between any two schemas in seconds. Lume AI helps teams ingest client data, normalize data from different sources, and build data pipelines, automatically.
Paces is developing a data platform aimed at enhancing the development and operation of green infrastructure projects. By providing actionable data signals across 3,500 U.S. counties, the platform encompasses a variety of critical information, including environmental, permitting, zoning, interconnection data, and competition risks. Additionally, Paces offers modeled signals that assess climate risks, profitability forecasts, and parcel rankings. This granular data is tailored to specific land parcels and green infrastructure assets, enabling developers, operators, and investors to make informed decisions about profitable construction and investment in sustainable projects.
Codified
Seed Round in 2024
Codified is a cloud-based SaaS company that specializes in data governance for Generative AI and Retrieval-Augmented Generation (RAG) applications. Its platform helps enterprises manage and control data usage within these tools, addressing concerns about data exposure and potential leaks of sensitive information. Codified automatically categorizes data into manageable buckets and suggests rules for each category, enabling businesses to maintain data security, comply with regulations, and mitigate risks associated with sensitive data exposure in AI-driven applications.
Bluebirds
Seed Round in 2023
Bluebirds is an AI-based data platform to find customer contacts that switched jobs.
Dynamo AI
Series A in 2023
DynamoFL empowers enterprises to deploy Gen AI solutions in a safe, private, and compliant manner. Our tools offer a scalable and efficient solution for defense against adversarial AI threats, ensuring the integrity and security of ML models throughout their lifecycle. Our suite of proactive AI model defense solutions can be deployed to both identify and remediate security and robustness risks in Generative AI systems. We enable organizations to deploy state-of-the-art LLMs on substantially cheaper hardware, even including CPU resources.
Integration Labs
Seed Round in 2022
Integration Labs, formerly RootFi, provides a unified data integration platform and API that lets companies read and write customers' accounting, e-commerce, and payments data across platforms. The solution supports automating accounting, underwriting credit risk, and deriving business insights by syncing data across systems with standardized data models, enabling clients to streamline AP/AR, payroll accounting, and to build new tools and applications.
Blacksun
Pre Seed Round in 2022
Fetch Complex Blockchain Data in minutes instead of days
Dynamo AI
Seed Round in 2022
DynamoFL empowers enterprises to deploy Gen AI solutions in a safe, private, and compliant manner. Our tools offer a scalable and efficient solution for defense against adversarial AI threats, ensuring the integrity and security of ML models throughout their lifecycle. Our suite of proactive AI model defense solutions can be deployed to both identify and remediate security and robustness risks in Generative AI systems. We enable organizations to deploy state-of-the-art LLMs on substantially cheaper hardware, even including CPU resources.
Paces
Pre Seed Round in 2022
Paces is developing a data platform aimed at enhancing the development and operation of green infrastructure projects. By providing actionable data signals across 3,500 U.S. counties, the platform encompasses a variety of critical information, including environmental, permitting, zoning, interconnection data, and competition risks. Additionally, Paces offers modeled signals that assess climate risks, profitability forecasts, and parcel rankings. This granular data is tailored to specific land parcels and green infrastructure assets, enabling developers, operators, and investors to make informed decisions about profitable construction and investment in sustainable projects.
Daohq is a global platform facilitating the discovery, investment, and management of Decentralized Autonomous Organizations (DAOs). It offers advanced analytics for over 2,000 DAOs, enabling investors, researchers, and developers to access comprehensive data on financial, governance, and social statistics via its DAOHQ DAO Data API. The company's platform aims to fuel the decentralized future by providing a one-stop shop for managing organizations and companies on the blockchain.
Redbird
Pre Seed Round in 2022
Redbird is an AI-powered analytics platform for teams to easily automate advanced analytics work in minutes, without writing code. Redbird's solution offers a comprehensive set of automated data services, including collection, wrangling, modeling, and reporting, as well as a user-friendly interface that does not require specialized engineering knowledge to use. Redbird allows data engineers to be freed up to work on more complex or technical assignments.
Atlas
Venture Round in 2022
Atlas is an all-in-one customer support tool that helps you transform your customer support team from a cost center into an engine of product innovation. We bring together key information from across your customers’ journey into a single location so you can give faster, more effective responses and can analyze and learn from your customers’ holistic needs.
Pareto.AI
Seed Round in 2022
Pareto.AI operates a data labeling platform that connects artificial intelligence companies with a network of skilled data workers. The company focuses on providing tailored services to meet client needs, ranging from same-day experimental data to fully-managed teams. By combining the expertise of real people with machine automation, Pareto.AI enables entrepreneurs to delegate time-consuming tasks, allowing them to concentrate on more critical work. This approach not only enhances efficiency but also empowers diverse professionals worldwide to contribute to AI training. Ultimately, Pareto.AI offers clients the flexibility to refine and optimize various aspects of their AI and large language models, streamlining the development process and improving outcomes.
Nanonets automates document processing and data extraction workflows using AI. It leverages advanced OCR and deep learning models to convert unstructured documents like invoices, receipts, and contracts into structured output. By integrating with existing systems via APIs, Nanonets reduces manual effort by up to 90%, delivering industry-leading accuracy and cost savings of up to 50%.
Superdao
Seed Round in 2022
Superdao is a platform designed for the management and operation of decentralized autonomous organizations (DAOs). It provides tools for creating and customizing intelligent contracts, managing contributors, and accessing a member directory and treasury dashboard. Additionally, Superdao offers a wallet analytics feature that delivers insights into dapp users, NFT and token holders, DAO voters, and quest participants, allowing organizations to analyze their audiences and gather competitive insights. The platform is tailored for Web3 growth teams, facilitating improved decision-making and governance structures for decentralized ventures. With preloaded data from numerous top Web3 projects, Superdao enhances the ability of businesses to build target lists and leverage third-party applications, ultimately supporting the collaborative management of innovative and economically viable decentralized applications.
Secoda is the self-serve data discovery tool that saves teams thousands of hours every year by making data discovery, trust, and understanding faster and easier.
Today, data teams are collecting tons of data, but most employees don't know what data exists, how to use it, and what data to trust. This confusion happens because different components of company data get collected in fragmented tools. Teams use dbt and Snowflake for data cataloging, Google Sheets for events, Confluence for general knowledge, Slack to manage data requests, Mode for reports, Github, Looker, the list goes on and on.
Secoda lets teams manage their queries, charts, data catalog, data dictionary, and documentation in one place.
After teams switch to Secoda, data knowledge is centralized in one place. Instead of having to switch between Snowflake, dbt, Confluence Google Sheets, BI tool, and your query editor, Secoda keeps all your data knowledge in one place for everyone who needs it.
REWORTH
Seed Round in 2021
REWORTH facilitates interactions between banks and merchants by leveraging cashback incentives. It uses anonymous transactional data to personalize offers, boost engagement, and provide real-time business intelligence. REWORTH's API infrastructure enables third-party services like credit scoring and fraud prevention.
Solipay
Seed Round in 2021
Solipay is democratizing personal data for everyone.
Biodock
Seed Round in 2021
Biodock is a developer of a cloud-based analysis platform that specializes in the analysis and storage of biological data, particularly microscopy images. Its innovative technology automates what traditionally takes months of analysis into just minutes, leveraging an end-to-end AI architecture. The platform offers auto-scaling storage and GPU compute capabilities, resulting in analysis that is 30-50% more accurate. Designed with scientists in mind, it provides a user-friendly interface that enables clinicians to efficiently upload, analyze, and download data at a single-cell level, complete with publication-ready graphs. Additionally, Biodock processes millions of cells weekly for its academic and enterprise customers, contributing to the creation of a vast image dataset that enhances its competitive edge in the market.
Flatfile is a data onboarding platform that enables developers to validate, map, and import customer data from spreadsheets into software applications. It embeds a JavaScript snippet to add an import button in web apps and provides a secure Concierge workspace for collaboration to manage data ingestion. The platform supports CSV, XLS, and TSV uploads, allows users to configure a target model for data validation, and learns over time to improve accuracy. It is API-first and widely adopted by hundreds of companies, including AstraZeneca, Square, and Sage.
Founded in 2019, Explo operates a user-friendly platform that connects directly to databases or warehouses for exploration, analysis, and visualization of data. It integrates any data source and enables insights sharing without requiring SQL knowledge.
Streamdal
Seed Round in 2020
Streamdal, previously known as Batch, is a developer of a data performance monitoring tool designed to enhance visibility within messaging systems. This innovative tool enables users to observe their data queues and replay past events, which is crucial for diagnosing outages and managing data-related issues. By providing the capability to revert changes, Streamdal helps organizations maintain data integrity and respond effectively to potential data disasters. The company's focus on monitoring and replaying data empowers users to streamline their operations and improve overall system reliability.
Flatfile
Seed Round in 2020
Flatfile is a data onboarding platform that enables developers to validate, map, and import customer data from spreadsheets into software applications. It embeds a JavaScript snippet to add an import button in web apps and provides a secure Concierge workspace for collaboration to manage data ingestion. The platform supports CSV, XLS, and TSV uploads, allows users to configure a target model for data validation, and learns over time to improve accuracy. It is API-first and widely adopted by hundreds of companies, including AstraZeneca, Square, and Sage.
Glisten.AI
Seed Round in 2020
Glisten.AI is a San Francisco-based company that develops artificial intelligence software specifically for the e-commerce industry. The company's technology automates the representation of product data for retailers, transforming inconsistent and unstructured information into a structured format that is easier to manage. By consolidating and classifying product information according to a standardized taxonomy, Glisten.AI enables merchants to streamline the processing of product data. Their software employs computer vision to generate structured attribute information and enhances product identification, making the data more suitable for machine learning, search, analytics, and other technological applications. Founded by Sarah Wooders and Alice Deng, Glisten.AI aims to facilitate better data management for e-commerce businesses.
Founded in 2019, Explo operates a user-friendly platform that connects directly to databases or warehouses for exploration, analysis, and visualization of data. It integrates any data source and enables insights sharing without requiring SQL knowledge.
Datasaur
Seed Round in 2020
Datasaur, Inc. is a company based in Sunnyvale, California, founded in 2019, that specializes in data labeling solutions for natural language processing tasks. The company offers a machine learning platform that provides ad hoc data labeling tools tailored to clients' specific needs. Datasaur's platform supports various applications, including contract summarization, customer service call transcript analysis, invoice understanding, product review evaluation, and fake news detection. It allows teams to collaborate on projects, facilitating multiple users to work together while utilizing a specialized review tool to identify discrepancies among team members. Additionally, Datasaur incorporates built-in intelligence to catch errors, ensuring higher accuracy in the data labeling process.
Biobot Analytics
Seed Round in 2018
Biobot Analytics, Inc. is a wastewater epidemiology company based in Cambridge, Massachusetts, founded in 2017. It specializes in transforming wastewater infrastructure into public health observatories through innovative technology that analyzes sewage for health-related data. By deploying proprietary sampling devices and spatial analytics, Biobot measures various health threats, including infectious diseases, antibiotic resistance, and drug consumption patterns. The company's Opioid Consumption Monitoring Program specifically estimates opioid use in urban areas by analyzing wastewater. This approach generates anonymized public health data that is independent of traditional hospital reporting systems, allowing for rapid adaptability to emerging public health threats. Biobot Analytics offers its insights as a service, providing government officials with actionable data to prioritize interventions and address public health challenges effectively.
Nanonets
Seed Round in 2017
Nanonets automates document processing and data extraction workflows using AI. It leverages advanced OCR and deep learning models to convert unstructured documents like invoices, receipts, and contracts into structured output. By integrating with existing systems via APIs, Nanonets reduces manual effort by up to 90%, delivering industry-leading accuracy and cost savings of up to 50%.
Pathmind
Seed Round in 2016
Pathmind develops a SaaS platform that leverages cloud computing and AI to optimize decisions in industrial operations and supply chains. It integrates with simulation software like AnyLogic, enabling users to discover better decision paths through complex scenarios.
Sensai
Venture Round in 2015
Sensai is a content analytics platform designed to assist sales, operations, and finance teams in large enterprises. The company specializes in providing advanced analytics technology that empowers business analysts to explore and derive insights from various data sets without needing extensive support from data science or programming teams. Sensai's innovative content-discovery language allows users to pose questions and efficiently uncover emerging concepts within both enterprise repositories and external data feeds. Its unique methods for clustering and tuning results facilitate rapid testing of theories and enable users to quantify and iterate on data patterns, making it a valuable tool for research, governance, and anti-fraud initiatives.