LlamaIndex
Series A in 2025
LlamaIndex is a versatile framework that connects custom data sources with large language models. It offers advanced data connectors, secure cloud infrastructure, and customizable indexing to enable businesses to create scalable AI agents for document research, workflow automation, and insights generation.
Rockset, Inc. is a technology company based in San Mateo, California, founded in 2016. It specializes in developing a serverless search and analytics engine designed to facilitate the creation of applications without the need for data pipelines or extensive preparation. The platform allows users to work directly with raw data, seamlessly integrating and continuously syncing new data from various sources. Utilizing familiar SQL, Rockset enables clients to perform hybrid search and real-time analytics on diverse data types, including vector, text, geospatial, and JSON data, without requiring a fixed schema. This approach simplifies the data management process and enhances the efficiency of data-driven applications.
LlamaIndex
Seed Round in 2023
LlamaIndex is a versatile framework that connects custom data sources with large language models. It offers advanced data connectors, secure cloud infrastructure, and customizable indexing to enable businesses to create scalable AI agents for document research, workflow automation, and insights generation.
Bluesky
Seed Round in 2022
Bluesky develops data cloud software that focuses on providing intelligent workload optimization and cost governance tools for data-driven organizations. By offering next-generation data infrastructure, Bluesky enables CIOs and senior management to manage and optimize data cloud expenses effectively. Its solutions enhance productivity for data analysts and data scientists while simplifying processes for data engineers. This approach allows organizations to innovate while maintaining control over their costs, thereby addressing the complex challenges associated with data management in cloud environments.
Gretel.ai
Series B in 2021
Founded in 2019, Gretel.ai is a San Diego-based company that specializes in developing software for data categorization and identification. It offers a multimodal synthetic data platform powered by advanced generative AI and privacy-enhancing technologies.
Gem is a comprehensive recruiting platform that integrates seamlessly with LinkedIn, Gmail, Outlook, and applicant tracking systems, designed to enhance the efficiency of talent acquisition teams. It allows recruiters to compile targeted lists, find email addresses, and automate follow-up communications, significantly improving response rates and saving time. The platform provides visibility into the entire hiring funnel by automatically tracking each touchpoint and offering insights into potential biases related to gender, race, and ethnicity throughout the interview process. This data-driven approach enables teams to collaborate effectively, ensuring that no candidate is contacted multiple times. Additionally, managers gain insights into their team's recruiting pipeline, with all interactions and activities synchronized for optimal data integrity.
Snorkel AI
Series C in 2021
Snorkel AI, Inc. is a technology company that specializes in developing an end-to-end machine-learning platform designed to facilitate the creation, management, and monitoring of AI applications. Founded in 2016 and based in Palo Alto, California, the company offers Snorkel Flow, a platform that enables programmatic data labeling, augmentation, and curation. This platform supports the clean integration and management of data for AI, as well as the training and deployment of various models. Snorkel AI's technology is particularly adept at extracting entities, relationships, and structured information from complex documents and forms, while also ranking content based on relevance and other factors. The company serves diverse sectors, including finance, government, telecommunications, insurance, healthcare, and e-commerce, and is recognized for its contributions to AI research through peer-reviewed publications.
Snorkel AI
Series B in 2021
Snorkel AI, Inc. is a technology company that specializes in developing an end-to-end machine-learning platform designed to facilitate the creation, management, and monitoring of AI applications. Founded in 2016 and based in Palo Alto, California, the company offers Snorkel Flow, a platform that enables programmatic data labeling, augmentation, and curation. This platform supports the clean integration and management of data for AI, as well as the training and deployment of various models. Snorkel AI's technology is particularly adept at extracting entities, relationships, and structured information from complex documents and forms, while also ranking content based on relevance and other factors. The company serves diverse sectors, including finance, government, telecommunications, insurance, healthcare, and e-commerce, and is recognized for its contributions to AI research through peer-reviewed publications.
Rockset, Inc. is a technology company based in San Mateo, California, founded in 2016. It specializes in developing a serverless search and analytics engine designed to facilitate the creation of applications without the need for data pipelines or extensive preparation. The platform allows users to work directly with raw data, seamlessly integrating and continuously syncing new data from various sources. Utilizing familiar SQL, Rockset enables clients to perform hybrid search and real-time analytics on diverse data types, including vector, text, geospatial, and JSON data, without requiring a fixed schema. This approach simplifies the data management process and enhances the efficiency of data-driven applications.
Gretel.ai
Venture Round in 2020
Founded in 2019, Gretel.ai is a San Diego-based company that specializes in developing software for data categorization and identification. It offers a multimodal synthetic data platform powered by advanced generative AI and privacy-enhancing technologies.
Gem is a comprehensive recruiting platform that integrates seamlessly with LinkedIn, Gmail, Outlook, and applicant tracking systems, designed to enhance the efficiency of talent acquisition teams. It allows recruiters to compile targeted lists, find email addresses, and automate follow-up communications, significantly improving response rates and saving time. The platform provides visibility into the entire hiring funnel by automatically tracking each touchpoint and offering insights into potential biases related to gender, race, and ethnicity throughout the interview process. This data-driven approach enables teams to collaborate effectively, ensuring that no candidate is contacted multiple times. Additionally, managers gain insights into their team's recruiting pipeline, with all interactions and activities synchronized for optimal data integrity.
Snorkel AI
Series A in 2020
Snorkel AI, Inc. is a technology company that specializes in developing an end-to-end machine-learning platform designed to facilitate the creation, management, and monitoring of AI applications. Founded in 2016 and based in Palo Alto, California, the company offers Snorkel Flow, a platform that enables programmatic data labeling, augmentation, and curation. This platform supports the clean integration and management of data for AI, as well as the training and deployment of various models. Snorkel AI's technology is particularly adept at extracting entities, relationships, and structured information from complex documents and forms, while also ranking content based on relevance and other factors. The company serves diverse sectors, including finance, government, telecommunications, insurance, healthcare, and e-commerce, and is recognized for its contributions to AI research through peer-reviewed publications.
Snorkel AI
Seed Round in 2020
Snorkel AI, Inc. is a technology company that specializes in developing an end-to-end machine-learning platform designed to facilitate the creation, management, and monitoring of AI applications. Founded in 2016 and based in Palo Alto, California, the company offers Snorkel Flow, a platform that enables programmatic data labeling, augmentation, and curation. This platform supports the clean integration and management of data for AI, as well as the training and deployment of various models. Snorkel AI's technology is particularly adept at extracting entities, relationships, and structured information from complex documents and forms, while also ranking content based on relevance and other factors. The company serves diverse sectors, including finance, government, telecommunications, insurance, healthcare, and e-commerce, and is recognized for its contributions to AI research through peer-reviewed publications.
Gretel.ai
Seed Round in 2020
Founded in 2019, Gretel.ai is a San Diego-based company that specializes in developing software for data categorization and identification. It offers a multimodal synthetic data platform powered by advanced generative AI and privacy-enhancing technologies.
Trifacta is a data engineering cloud platform that enables data wrangling, preparation, quality assurance, and automated data pipelines. It offers AI-assisted, self-service data transformation and collaborative tooling with universal connectivity to access data from any source for any application. The platform helps analysts and data professionals evaluate, correct, and validate data quality, accelerate transformation, and automate pipelines at scale. Used across industries worldwide, thousands of users at more than 10,000 companies leverage Trifacta to prepare data for analytics, visualization, and downstream systems.
Rockset, Inc. is a technology company based in San Mateo, California, founded in 2016. It specializes in developing a serverless search and analytics engine designed to facilitate the creation of applications without the need for data pipelines or extensive preparation. The platform allows users to work directly with raw data, seamlessly integrating and continuously syncing new data from various sources. Utilizing familiar SQL, Rockset enables clients to perform hybrid search and real-time analytics on diverse data types, including vector, text, geospatial, and JSON data, without requiring a fixed schema. This approach simplifies the data management process and enhances the efficiency of data-driven applications.
GrowingIO
Series B in 2018
GrowingIO is a prominent analytics platform provider based in Beijing, China, established in May 2015 by Simon Zhang, Dingding Ye, Justin Chen, Yuanming Shan, and Jonathan Wu. The company specializes in helping businesses drive growth through data insights. GrowingIO's platform enables customers to track user behavior data across apps, mini-apps (such as WeChat), and websites, build dashboards, and analyze quick insights. By providing real-time behavioral data and efficient management of core business indicators, GrowingIO empowers companies with data-driven information to support their growth strategies.
DIRT Protocol
Seed Round in 2018
DIRT Protocol, founded in 2017 and based in the United States, is a platform designed for decentralized curation of trusted data sets. It facilitates the organization and accessibility of information by allowing users to contribute information similarly to Wikipedia. However, DIRT Protocol employs a unique mechanism of token staking to incentivize honesty among contributors. Each participant must deposit tokens to submit data, ensuring that only accurate information is shared. If inaccuracies are identified, other users can challenge the data and earn tokens for correcting it. This economic model promotes the integrity of the data set, making it less likely for misinformation to persist. DIRT Protocol thus serves as a foundational tool for building Token Curated Registries, helping communities maintain high-quality information standards.
Trifacta is a data engineering cloud platform that enables data wrangling, preparation, quality assurance, and automated data pipelines. It offers AI-assisted, self-service data transformation and collaborative tooling with universal connectivity to access data from any source for any application. The platform helps analysts and data professionals evaluate, correct, and validate data quality, accelerate transformation, and automate pipelines at scale. Used across industries worldwide, thousands of users at more than 10,000 companies leverage Trifacta to prepare data for analytics, visualization, and downstream systems.
GrowingIO
Series A in 2016
GrowingIO is a prominent analytics platform provider based in Beijing, China, established in May 2015 by Simon Zhang, Dingding Ye, Justin Chen, Yuanming Shan, and Jonathan Wu. The company specializes in helping businesses drive growth through data insights. GrowingIO's platform enables customers to track user behavior data across apps, mini-apps (such as WeChat), and websites, build dashboards, and analyze quick insights. By providing real-time behavioral data and efficient management of core business indicators, GrowingIO empowers companies with data-driven information to support their growth strategies.
Ozlo
Venture Round in 2016
Ozlo is a platform that empowers intelligent systems with meaningful interactions. It offers a unique knowledge index combining probabilistic assertions with factual data, providing a more nuanced understanding of the world.
Trifacta is a data engineering cloud platform that enables data wrangling, preparation, quality assurance, and automated data pipelines. It offers AI-assisted, self-service data transformation and collaborative tooling with universal connectivity to access data from any source for any application. The platform helps analysts and data professionals evaluate, correct, and validate data quality, accelerate transformation, and automate pipelines at scale. Used across industries worldwide, thousands of users at more than 10,000 companies leverage Trifacta to prepare data for analytics, visualization, and downstream systems.
Trifacta is a data engineering cloud platform that enables data wrangling, preparation, quality assurance, and automated data pipelines. It offers AI-assisted, self-service data transformation and collaborative tooling with universal connectivity to access data from any source for any application. The platform helps analysts and data professionals evaluate, correct, and validate data quality, accelerate transformation, and automate pipelines at scale. Used across industries worldwide, thousands of users at more than 10,000 companies leverage Trifacta to prepare data for analytics, visualization, and downstream systems.
Trifacta is a data engineering cloud platform that enables data wrangling, preparation, quality assurance, and automated data pipelines. It offers AI-assisted, self-service data transformation and collaborative tooling with universal connectivity to access data from any source for any application. The platform helps analysts and data professionals evaluate, correct, and validate data quality, accelerate transformation, and automate pipelines at scale. Used across industries worldwide, thousands of users at more than 10,000 companies leverage Trifacta to prepare data for analytics, visualization, and downstream systems.
Timeful Inc., founded in 2012 and headquartered in Mountain View, California, is a technology company focused on revolutionizing time management. The company's intelligent time-management platform helps users manage their schedules by tracking commitments, categorizing tasks, and promoting healthy habits. Timeful leverages machine learning to offer personalized scheduling recommendations based on user behavior, availability, and preferences. The application is designed to assist individuals in maximizing their time for meaningful activities. As of May 4, 2015, Timeful operates as a subsidiary of Alphabet Inc.
Trifacta is a data engineering cloud platform that enables data wrangling, preparation, quality assurance, and automated data pipelines. It offers AI-assisted, self-service data transformation and collaborative tooling with universal connectivity to access data from any source for any application. The platform helps analysts and data professionals evaluate, correct, and validate data quality, accelerate transformation, and automate pipelines at scale. Used across industries worldwide, thousands of users at more than 10,000 companies leverage Trifacta to prepare data for analytics, visualization, and downstream systems.
Freshplum
Seed Round in 2011
Freshplum is a provider of data analytic software focused on enhancing decision-making for companies engaged in electronic commerce. The company specializes in revenue analytics, offering tools that enable businesses to leverage data science effectively. By equipping e-commerce companies with insights derived from data, Freshplum aims to improve their operational efficiency and drive revenue growth.
Sociogramics
Seed Round in 2011
Sociogramics, founded in 2011, specializes in leveraging emerging data sets and machine learning to enhance online identity, employment, and income verification processes. The company addresses the limitations of traditional consumer scoring methods by integrating advanced analytics to provide more accurate and timely verification solutions.
Actifio specializes in enterprise data management solutions. It offers Virtual Data Pipeline technology that decouples data from infrastructure, enhancing business resilience, agility, and cloud access. The company's application-centric approach enables customers to capture, manage, and utilize data more efficiently across various environments.
Lumigent Technologies
Venture Round in 2008
Lumigent Technologies specializes in providing audit and compliance solutions through its automated Governance, Risk, and Compliance software. The company's flagship product, AppGRC, is designed to reduce costs and mitigate risks associated with auditing and compliance reporting by continuously monitoring application-specific data and controls. This software safeguards business applications, ensuring the integrity of critical information while translating audit and compliance requirements for individual applications. In addition to AppGRC, Lumigent offers regulatory compliance solutions that include application risk assessments, trusted audit trails, application visibility, user management, policy violation notifications, and automated compliance reports. The company also provides Audit DB 6.2 solutions, which focus on data auditing, policy management, and database assessment, further enhancing its comprehensive approach to compliance and risk management.
Farecast is a fare-prediction service founded in 2003 that helps travelers determine the optimal time to purchase airline tickets. By analyzing 175 billion points of historical airfare data, Farecast predicts whether ticket prices will rise or fall up to a week in advance, boasting a success rate of 70-75%. This innovative approach distinguishes Farecast from other travel companies, as it is the only platform that offers such predictive capabilities. In addition to airfare predictions, Farecast has expanded its services to include hotel price comparisons, displaying results from various travel search sites on an interactive map. This feature helps users identify whether a hotel is overpriced or attractively priced, using color coding to highlight deals. Overall, Farecast aims to assist travelers in making informed decisions to save on travel costs.
Full Capture Solutions
Series B in 2006
Full Capture Solutions, Inc. develops predictive, analytic-driven solutions for the insurance industry. Its analytic technology is applied to insurance claims for discovery, forensic analysis, and predictive modeling. Its applications include automobile, environmental, liability, property, workers compensation, and recovery operations. Full Capture Solutions, was founded in 2004 and is headquartered in East Hartford, Connecticut.
Farecast is a fare-prediction service founded in 2003 that helps travelers determine the optimal time to purchase airline tickets. By analyzing 175 billion points of historical airfare data, Farecast predicts whether ticket prices will rise or fall up to a week in advance, boasting a success rate of 70-75%. This innovative approach distinguishes Farecast from other travel companies, as it is the only platform that offers such predictive capabilities. In addition to airfare predictions, Farecast has expanded its services to include hotel price comparisons, displaying results from various travel search sites on an interactive map. This feature helps users identify whether a hotel is overpriced or attractively priced, using color coding to highlight deals. Overall, Farecast aims to assist travelers in making informed decisions to save on travel costs.
ClearForest
Series C in 2004
ClearForest is a software company established in 1998, specializing in text analytics and text mining solutions. Headquartered just outside Boston, the company also has development operations in Israel, near Tel Aviv. ClearForest focuses on providing text-driven business intelligence software, which includes text analytics tools, publishing software, and market research services. Through its innovative solutions, ClearForest aims to help organizations harness the power of unstructured text data to generate insights and inform decision-making.
HyperRoll
Series C in 2004
HyperRoll is a provider of data warehouse performance acceleration software that significantly enhances the efficiency of data management. Its flagship offering, the HyperRoll Data Performance Management Suite, improves load times, query times, and overall throughput for data warehouses, enabling organizations to respond quickly to changes in data and metadata. The suite is designed to reduce the total cost of ownership for data warehouses through its innovative software architecture. In addition to its core performance management solutions, HyperRoll also delivers a range of services including advisory, consulting, education, and technical support, catering to various sectors such as consumer products, retail, finance, supply chain, and customer relationship management.
Data Domain
Series B in 2003
Data Domain is a provider of deduplication storage systems that focuses on disk-to-disk backup, data archiving, and disaster recovery solutions. Founded in October 2001 and based in Hopkinton, Massachusetts, the company caters to a diverse range of industries, including education, healthcare, finance, and technology, across North America and Europe. Data Domain’s offerings include software products such as replicators for disaster recovery, boost software to enhance data backup efficiency, encryption software for securing incoming data, and retention lock software that supports compliance with IT governance policies. Additionally, the company provides virtual tape library software for SAN environments and a web-based application for system management known as Enterprise Manager. Its comprehensive infrastructure solutions address backup and recovery, archiving, compliance, and virtualization needs, positioning Data Domain as a key player in the data protection and management market.
ClearForest
Series C in 2003
ClearForest is a software company established in 1998, specializing in text analytics and text mining solutions. Headquartered just outside Boston, the company also has development operations in Israel, near Tel Aviv. ClearForest focuses on providing text-driven business intelligence software, which includes text analytics tools, publishing software, and market research services. Through its innovative solutions, ClearForest aims to help organizations harness the power of unstructured text data to generate insights and inform decision-making.
Lumigent Technologies
Venture Round in 2003
Lumigent Technologies specializes in providing audit and compliance solutions through its automated Governance, Risk, and Compliance software. The company's flagship product, AppGRC, is designed to reduce costs and mitigate risks associated with auditing and compliance reporting by continuously monitoring application-specific data and controls. This software safeguards business applications, ensuring the integrity of critical information while translating audit and compliance requirements for individual applications. In addition to AppGRC, Lumigent offers regulatory compliance solutions that include application risk assessments, trusted audit trails, application visibility, user management, policy violation notifications, and automated compliance reports. The company also provides Audit DB 6.2 solutions, which focus on data auditing, policy management, and database assessment, further enhancing its comprehensive approach to compliance and risk management.
Data Domain
Series A in 2002
Data Domain is a provider of deduplication storage systems that focuses on disk-to-disk backup, data archiving, and disaster recovery solutions. Founded in October 2001 and based in Hopkinton, Massachusetts, the company caters to a diverse range of industries, including education, healthcare, finance, and technology, across North America and Europe. Data Domain’s offerings include software products such as replicators for disaster recovery, boost software to enhance data backup efficiency, encryption software for securing incoming data, and retention lock software that supports compliance with IT governance policies. Additionally, the company provides virtual tape library software for SAN environments and a web-based application for system management known as Enterprise Manager. Its comprehensive infrastructure solutions address backup and recovery, archiving, compliance, and virtualization needs, positioning Data Domain as a key player in the data protection and management market.
HyperRoll
Series B in 2002
HyperRoll is a provider of data warehouse performance acceleration software that significantly enhances the efficiency of data management. Its flagship offering, the HyperRoll Data Performance Management Suite, improves load times, query times, and overall throughput for data warehouses, enabling organizations to respond quickly to changes in data and metadata. The suite is designed to reduce the total cost of ownership for data warehouses through its innovative software architecture. In addition to its core performance management solutions, HyperRoll also delivers a range of services including advisory, consulting, education, and technical support, catering to various sectors such as consumer products, retail, finance, supply chain, and customer relationship management.
Acta Technology
Venture Round in 2001
Acta Technology is a provider of a real-time data integration platform that facilitates seamless communication among enterprises, their customers, suppliers, employees, and partners. Founded in 1997 and headquartered in Mountain View, California, the company specializes in batch and real-time data integration solutions within the extraction, transformation, and loading (ETL) segment. Acta Technology's offerings empower organizations to efficiently manage and integrate data, enhancing operational effectiveness and collaboration. The company was acquired by Business Objects, further expanding its influence in the data integration market.
Snorkel AI, Inc. is a technology company that specializes in developing an end-to-end machine-learning platform designed to facilitate the creation, management, and monitoring of AI applications. Founded in 2016 and based in Palo Alto, California, the company offers Snorkel Flow, a platform that enables programmatic data labeling, augmentation, and curation. This platform supports the clean integration and management of data for AI, as well as the training and deployment of various models. Snorkel AI's technology is particularly adept at extracting entities, relationships, and structured information from complex documents and forms, while also ranking content based on relevance and other factors. The company serves diverse sectors, including finance, government, telecommunications, insurance, healthcare, and e-commerce, and is recognized for its contributions to AI research through peer-reviewed publications.
Founded in 2018, Bytez is a New York-based company that develops software providing access to machine learning papers for developers and data scientists. Its platform aims to simplify the discovery, understanding, and use of open-source AI, offering thousands of serverless models, interactive research papers, personalized feeds, and an AI-powered research agent.