Soma Capital

Soma Capital is a venture capital firm established in 2015 and headquartered in San Francisco, California. It focuses on investing in seed-stage startups primarily in the software sector, with a particular interest in areas such as B2B SaaS, artificial intelligence, fintech, clean-tech, frontier-tech, prop-tech, cryptocurrency, health tech, and consumer industries. Soma Capital aims to support founders by leveraging insights from its own founding experience. The firm has successfully invested in several high-profile startups, which collectively are valued at over $60 billion, including notable companies such as Cruise, Rappi, Ironclad, Human Interest, Razorpay, Rippling, and Lambda School. Soma Capital operates as a Registered Investment Adviser and targets opportunities across diverse regions, including Africa, the Middle East, South America, and Asia.

Juan Abundes

Venture Partner

Willem van den Bosch

Investment Partner

Douglas Carney

Investor

Mir Faiyaz

Partner and Head of Growth Investments

Fouad Farhat

Investment Partner

Juliet Fern

Investor

Nikhita Jaaswal

Investment Partner

Taryn Livingstone

Investor

Vishnu Srinivasan

Investor

Past deals in Big Data

Optery

Seed Round in 2023
Developer of a privacy management technology designed to put consumers in control of their personal data. The company's technology removes a user's information from dozens of data brokers and prevents identity theft and fraud, reduces phishing, spam calls, and emails, along with provides tools and services that give consumers visibility into their profiles, enabling consumers and data brokers to interact in fairness and transparency in a way that honors consumers' preferences and legal rights.

Openlayer

Seed Round in 2023
Unbox is a novel, collaborative error analysis platform that "opens up" machine learning models by detecting and eliminating failure patterns and biases. Unbox makes it easy to keep track of all models and datasets, allowing the team to focus on building production-ready models.

Mentum

Seed Round in 2022
Developer of a business intelligence and analytics tool intended to help organizations optimize their supply chain process. The company's platform offers customizable dashboards, real-time data visualization, and advanced reporting features, enabling enterprises to effectively analyze and interpret complex data sets, improve operational efficiency, and drive strategic insights.

Neat

Convertible Note in 2022
Developer of a cloud storage platform designed to store data for future perspectives. The company's software offers a file manager, security, and data management server with shortcut widgets for quick access, and preview at a glance, enabling users to increase the productivity of work in a short time with increased accuracy.

Redbird

Pre Seed Round in 2022
Developer of automation platform designed to automate data-intensive work in minutes, without writing code. The company's platform specializes in deeper automation, orchestration across the data lifecycle, and custom applications that solve the specific needs of different stakeholders, enabling technical and non-technical users to automate workflows to eliminate the need for manual tasks, improve productivity, and easy to use point and click interface.

Transpose

Seed Round in 2022
Transpose operates a data analytics platform designed to simplify access to Web3 data. By converting low-level blockchain data into a suite of high-level, human-readable APIs, Transpose enables users to efficiently obtain comprehensive and real-time blockchain information. This approach streamlines the development process in the Web3 space, making it faster and more accessible for developers and businesses to utilize blockchain technology. Transpose aims to set a new standard for how blockchain data is accessed and utilized, enhancing the overall experience for users dealing with complex blockchain information.

Pareto.AI

Seed Round in 2022
Pareto.AI operates a data labeling platform that connects artificial intelligence companies with a network of skilled data workers. The company focuses on providing tailored services to meet client needs, ranging from same-day experimental data to fully-managed teams. By combining the expertise of real people with machine automation, Pareto.AI enables entrepreneurs to delegate time-consuming tasks, allowing them to concentrate on more critical work. This approach not only enhances efficiency but also empowers diverse professionals worldwide to contribute to AI training. Ultimately, Pareto.AI offers clients the flexibility to refine and optimize various aspects of their AI and large language models, streamlining the development process and improving outcomes.

Nanonets

Series A in 2022
NanoNets is machine learning API for developers which requires 1/10th of data and no machine learning expertise to train a model. Upload the data, wait for a few minutes and get a model you can query over their easy to use cloud API. Often companies do not have enough data to train a machine learning model on their own using state of the art algorithms as well as don't have enough data scientists to work on those problems. NanoNets solves both these problems for companies.

Secoda

Seed Round in 2021
Developer of a collaborative knowledge management tool designed to make data teams share metadata, queries, charts, and documentation. The company's platform helps employees find and understand the right information in a few time, and track the relationships between people and data to help users visualize all the interactions between the different collaborators of the enterprise, enabling users to implement data discovery efficiently.

Akiba Digital

Pre Seed Round in 2021
Akiba Digital is a financial data aggregation company, that houses a data consolidation and enrichment engine that clusters, visualises and predicts consumer behaviour. Our vision is to build a company that is most effective at understanding and enhancing personal financial management for the African consumer.

Haystack

Seed Round in 2021
Our mission is to bring the competitive advantage data-driven engineering teams like Google have to everyone. We help teams track their delivery process and optimization opportunities instead of relying solely on gut feelings - resulting in 40% faster delivery on average.

Tractian

Seed Round in 2021
Your machines are talking, we are listening. TRACTIAN is a platform that analyzes vibration and temperature data collected by our sensors to reduce unplanned downtime inside industries.

Albedo

Seed Round in 2021
Albedo designs and operates satellites that capture imagery at a resolution 9x higher than what's available today. We're modernizing how satellite imagery is used by providing transparent, near real-time access to our ultra-high resolution datasets.

Biodock

Seed Round in 2021
Biodock's cloud platform accelerates microscopy analysis, automating months of microscopy analysis and infrastructure to minutes with our end-to-end AI architecture. Scientists enjoy auto-scaling storage, GPU compute, and 30-50% more accurate analysis. We also have a data flywheel built into our product - we process millions of cells weekly for our free academic and enterprise customers, and we're building the largest image dataset for microscopy images, which gives us a competitive data advantage.

Flatfile

Series A in 2021
Flatfile Inc. offers a platform designed to streamline the data onboarding process for businesses by allowing developers to validate, map, and import data from various web applications. Founded in 2018 and based in Denver, Colorado, the company's key products include Portal, an import button that integrates seamlessly into software applications through a JavaScript snippet, and Concierge, which provides secure workspaces for collaboration on data ingestion challenges. Flatfile's platform supports multiple file formats, including CSV, XLS, and TSV, and features a JavaScript configurator that enables users to define target models for data validation. This automation helps organizations manage and structure imported data effectively, reducing the time spent on data cleaning and allowing teams to focus on utilizing their data for decision-making.

Gisual

Pre Seed Round in 2020
Gisual provides outage intelligence for telecoms and service providers We automate the collection and dissemination of 3rd party outage intel. Gisual’s data intelligence dramatically reduces complexity, costs, and resolution times. We enhance your network monitoring system, but instead of telling you what's happening inside your network we tell you everything happening outside of your network.

Gisual

Pre Seed Round in 2020
Gisual provides outage intelligence for telecoms and service providers We automate the collection and dissemination of 3rd party outage intel. Gisual’s data intelligence dramatically reduces complexity, costs, and resolution times. We enhance your network monitoring system, but instead of telling you what's happening inside your network we tell you everything happening outside of your network.

Explo

Seed Round in 2020
Explo, founded in 2019 and headquartered in San Francisco, California, operates a data exploration and analysis platform that facilitates the creation of customer-facing dashboards and reports. The platform connects directly to various data sources, allowing users to manipulate and visualize data through a user-friendly point-and-click interface. This functionality enables clients to make informed, data-driven decisions, ultimately supporting their business growth and scalability.

batch

Seed Round in 2020
Batch is a technology company that provides an observability and replay platform specifically designed for messaging systems. Founded in 2020 and based in Portland, Oregon, Batch's platform enables organizations to efficiently manage outage recovery and implement disaster recovery strategies. It facilitates the setup of independent data stores and supports load testing and data integrity testing to identify and resolve bugs. By allowing customers to observe and replay data, Batch helps businesses quickly diagnose issues and revert unwanted changes, effectively acting as a "Time Machine" for corporate data.

Flatfile

Seed Round in 2020
Flatfile Inc. offers a platform designed to streamline the data onboarding process for businesses by allowing developers to validate, map, and import data from various web applications. Founded in 2018 and based in Denver, Colorado, the company's key products include Portal, an import button that integrates seamlessly into software applications through a JavaScript snippet, and Concierge, which provides secure workspaces for collaboration on data ingestion challenges. Flatfile's platform supports multiple file formats, including CSV, XLS, and TSV, and features a JavaScript configurator that enables users to define target models for data validation. This automation helps organizations manage and structure imported data effectively, reducing the time spent on data cleaning and allowing teams to focus on utilizing their data for decision-making.

Flatfile

Seed Round in 2020
Flatfile Inc. offers a platform designed to streamline the data onboarding process for businesses by allowing developers to validate, map, and import data from various web applications. Founded in 2018 and based in Denver, Colorado, the company's key products include Portal, an import button that integrates seamlessly into software applications through a JavaScript snippet, and Concierge, which provides secure workspaces for collaboration on data ingestion challenges. Flatfile's platform supports multiple file formats, including CSV, XLS, and TSV, and features a JavaScript configurator that enables users to define target models for data validation. This automation helps organizations manage and structure imported data effectively, reducing the time spent on data cleaning and allowing teams to focus on utilizing their data for decision-making.

Explo

Seed Round in 2020
Explo, founded in 2019 and headquartered in San Francisco, California, operates a data exploration and analysis platform that facilitates the creation of customer-facing dashboards and reports. The platform connects directly to various data sources, allowing users to manipulate and visualize data through a user-friendly point-and-click interface. This functionality enables clients to make informed, data-driven decisions, ultimately supporting their business growth and scalability.

Datasaur

Seed Round in 2020
Datasaur, Inc. develops a data labeling solution for natural language processing related tasks. It offers Datasaur, a machine learning platform that provides ad hoc data labeling tools for clients’ data labeling needs. The company’s Datasaur is used in various applications, such as contract summarization and understanding, customer service call transcripts, receipt and invoice understanding, product review analysis, and fake news detection. The company was founded in 2019 and is based in Sunnyvale, California.

Gisual

Seed Round in 2019
Gisual provides outage intelligence for telecoms and service providers We automate the collection and dissemination of 3rd party outage intel. Gisual’s data intelligence dramatically reduces complexity, costs, and resolution times. We enhance your network monitoring system, but instead of telling you what's happening inside your network we tell you everything happening outside of your network.

Baotris

Seed Round in 2019
Baotris, Inc. is a technology company based in Lafayette, California, established in 2019. It specializes in developing a multi-platform data collection system that assists mobile developers in efficiently collecting and transmitting large-scale data. The company provides software development kits (SDKs) compatible with popular mobile platforms, including JavaScript, iOS, and Android applications. Additionally, Baotris offers an e-commerce marketing automation platform that features an aggregated data dashboard, persona development, website UX/UI optimization, search engine optimization, and influencer marketing services. This comprehensive approach enables businesses to gain valuable insights, improve customer retention, and ultimately increase revenue.

BlueCargo

Seed Round in 2018
BlueCargo is a predictive algorithm platform that brings visibility to optimize operations in seaport terminals. Their proprietary predictive algorithms streamline container management to increase terminal’s productivity while reducing operational costs and enhancing the quality of the service.

Pachyderm

Series A in 2018
Pachyderm, Inc. is a software company that specializes in developing a data engineering platform for managing complex, multi-stage data pipelines. Founded in 2014 and based in San Francisco, California, Pachyderm's platform facilitates the automation of data processes while ensuring reproducibility and data lineage throughout the machine learning development lifecycle. It offers a unique architecture that supports version control for data, allowing users to build scalable and language-agnostic machine learning and ETL workflows. The company's solutions include a free open-source version, an enterprise version, and a hub version, providing flexibility for users to choose the best fit for their needs. By enabling automatic scaling and parallel processing, Pachyderm empowers data science teams to work with any language, framework, or tool, making it an effective choice for sophisticated data transformations.

Biobot Analytics

Seed Round in 2018
Biobot Analytics, Inc. is a wastewater epidemiology company based in Cambridge, Massachusetts, founded in 2017. The company specializes in transforming sewer systems into public health observatories through innovative technology that analyzes wastewater samples. By deploying proprietary sampling devices and spatial analytics, Biobot measures various health threats, including infectious disease outbreaks, antibiotic resistance, and fluctuations in drug consumption. One of its key initiatives is the Opioid Consumption Monitoring Program, which assesses opioid usage by analyzing sewage to provide estimates of drug consumption in urban areas. This approach creates a health database that operates independently of traditional hospital reporting systems, enabling rapid adaptation to emerging public health challenges while minimizing societal biases in healthcare access.

Nanonets

Seed Round in 2017
NanoNets is machine learning API for developers which requires 1/10th of data and no machine learning expertise to train a model. Upload the data, wait for a few minutes and get a model you can query over their easy to use cloud API. Often companies do not have enough data to train a machine learning model on their own using state of the art algorithms as well as don't have enough data scientists to work on those problems. NanoNets solves both these problems for companies.

Skymind

Seed Round in 2016
Skymind Inc. is a business intelligence and enterprise software company based in San Francisco, California, founded in 2014. The company specializes in developing solutions that classify, cluster, and predict outcomes across various data types, including text, images, video, time series, and sound, to identify patterns that can influence business decisions. Skymind is known for its open-source deep-learning framework, Deeplearning4j, which is designed for use in production environments, and ND4J, a scientific computing library optimized for performance and minimal memory usage. In addition to its core offerings, Skymind's technologies enable businesses to leverage advanced analytics and machine learning to enhance their operational efficiency and decision-making processes.

Pachyderm

Seed Round in 2015
Pachyderm, Inc. is a software company that specializes in developing a data engineering platform for managing complex, multi-stage data pipelines. Founded in 2014 and based in San Francisco, California, Pachyderm's platform facilitates the automation of data processes while ensuring reproducibility and data lineage throughout the machine learning development lifecycle. It offers a unique architecture that supports version control for data, allowing users to build scalable and language-agnostic machine learning and ETL workflows. The company's solutions include a free open-source version, an enterprise version, and a hub version, providing flexibility for users to choose the best fit for their needs. By enabling automatic scaling and parallel processing, Pachyderm empowers data science teams to work with any language, framework, or tool, making it an effective choice for sophisticated data transformations.

Yhat

Seed Round in 2015
Yhat (pronounced Y-hat) provides an end-to-end data science platform for developing, deploying, and managing real-time decision APIs. Yhat eliminates painful IT obstacles involved in cloud-based data science like server setup and config. With Yhat, data scientists can transform static insights into production-ready decision making APIs that integrate seamlessly with any customer- or employee-facing app. Yhat also created Rodeo, an open source integrated development environment (IDE) for Python. Yhat was founded in 2013 and is based in New York. The team is composed of entrepreneurs, data scientists, and engineers formerly at OnDeck, AppNexus, Guidespark, Shareablee and the Washington Post.

Sensai

Venture Round in 2015
Sensai is a Content Analytics platform for Sales, Operations, and Finance teams. Analysts use Sensai to find and trend emerging concepts within enterprise repositories and from public and private feeds. Sensai features novel methods of clustering and tuning results so that users can quickly test theories, quantify and iterate on patterns in the data.
Spot something off? Help us improve by flagging any incorrect or outdated information. Just email us at support@teaserclub.com. Your feedback is most welcome.