BoxGroup

Founded in 2009 and based in New York City, BoxGroup is an early-stage investment fund focusing on pre-seed to Series A rounds. It invests globally, with a primary focus on New York City, Silicon Valley, and Los Angeles. The firm typically allocates between $50,000 and $250,000 per investment across various sectors including consumer, enterprise, fintech, healthcare, life science, marketplace, synthetic biology, and climate.

Brian Aledort

CFO

Adina Davis

Investor

Disha Karale

Investor

Nimi Katragadda

Partner

Greg Rosen

Partner

Past deals in Data Mining

Reducto

Series A in 2025
Reducto develops a retrieval augmented generation platform that converts complex, unstructured documents into optimized chunks compatible with vector databases and LLM retrieval pipelines. Its software uses AI and computer vision to parse and extract information from various data formats, enhancing RAG performance.

Clay

Series B in 2025
Clay offers advanced tools for growth teams to conduct comprehensive customer research. By integrating over 100 data sources, including first-party, intent, and third-party data, Clay enables deep insights into ideal customers.

David AI

Seed Round in 2025
David AI is a technology company that specializes in transforming audio data into valuable insights. It develops an AI-powered platform designed to help businesses manage and analyze multimodal data, enabling them to create high-quality, proprietary datasets for AI model training. The platform automates data labeling, qualifies and manages data quality, and generates metadata to track data origin and changes. This allows clients to enhance their AI models, detect and remove irrelevant or duplicated data, and create new revenue streams.

Tuva Health

Seed Round in 2024
Tuva Health develops open source software that cleans, normalizes, and transforms unstructured healthcare data. Based in Salt Lake City, Utah, the company provides data management tools and services that fix data quality issues in raw claims and medical records. Its software enables providers to unify disparate data, enrich records, and quality-test data to support analytics, artificial intelligence development, and real-world evidence programs, helping organizations assemble reliable data blocks for advanced reporting and decision making.

Reducto

Seed Round in 2024
Reducto develops a retrieval augmented generation platform that converts complex, unstructured documents into optimized chunks compatible with vector databases and LLM retrieval pipelines. Its software uses AI and computer vision to parse and extract information from various data formats, enhancing RAG performance.

Clay

Series B in 2024
Clay offers advanced tools for growth teams to conduct comprehensive customer research. By integrating over 100 data sources, including first-party, intent, and third-party data, Clay enables deep insights into ideal customers.

Zephyr AI

Seed Round in 2022
Zephyr AI is a healthcare technology company focused on transforming drug discovery and precision medicine. By leveraging large complex datasets and proprietary algorithms, Zephyr AI aims to redefine drug development and streamline clinical trials. The company collaborates with leading health systems, health insurance plans, and biotechnology innovators to enhance healthcare quality, improve patient outcomes, and reduce costs. Through its innovative approach, Zephyr AI seeks to address challenges in disease treatment and clinical decision support, ultimately contributing to advancements in the healthcare sector.

Rose AI

Seed Round in 2021
Rose AI is a data management platform that helps organizations find, clean, organize, and interact with large volumes of data to support data-driven decisions. It provides an integrated data workspace, analytics engine, and marketplace that transform, share, and monetize vetted, quality-controlled data in a single platform. The platform includes a data marketplace exposing datasets from reputable providers, with transparency into how datasets are constructed and the ability to pull underlying data into Excel or Jupyter Notebooks. By offering a common set of tools for manipulating and collaborating on data, Rose AI aims to streamline data preparation, cross-source integration, and insight generation for decision makers.

Heron Data

Seed Round in 2020
Heron Data develops AI-powered software that automates document-heavy workflows. Its platform extracts structured data from documents, syncing it directly with CRM systems or other platforms.

Clay

Series A in 2019
Clay offers advanced tools for growth teams to conduct comprehensive customer research. By integrating over 100 data sources, including first-party, intent, and third-party data, Clay enables deep insights into ideal customers.

Etleap

Seed Round in 2018
Etleap provides an ETL platform that lets data analytics teams build and maintain data pipelines from day one without extensive coding or engineering. Founded in 2012 and based in San Francisco, it automates most ETL setup and maintenance, reducing manual work and enabling analysts to own quick, ten-minute tasks. The tool collects data from diverse sources, cleans, structures, and manages it to simplify access to big data and improve data integration across warehouses, enhancing business value.

Clay

Seed Round in 2017
Clay offers advanced tools for growth teams to conduct comprehensive customer research. By integrating over 100 data sources, including first-party, intent, and third-party data, Clay enables deep insights into ideal customers.

Clay

Pre Seed Round in 2015
Clay offers advanced tools for growth teams to conduct comprehensive customer research. By integrating over 100 data sources, including first-party, intent, and third-party data, Clay enables deep insights into ideal customers.

Thanx

Seed Round in 2012
Thanx is a guest engagement platform for offline businesses, including brick-and-mortar retailers, restaurants, and malls, that links purchases from card networks to a CRM and marketing automation suite. The platform enables data capture from card transactions, personalized customer interactions, and loyalty programs through messages, email marketing, and real-time feedback tools, along with revenue reporting and online ordering integrations. Founded in 2011 and headquartered in San Francisco, Thanx helps merchants improve retention and revenue by turning purchase data into personalized engagement across channels.

Dataminr

Series A in 2011
Dataminr is a technology company that leverages artificial intelligence to monitor public information sources. It specializes in detecting and alerting clients to high-impact events and emerging risks in real-time, enabling them to make informed decisions across various sectors including finance, security, crisis management, and news.

David AI

David AI is a technology company that specializes in transforming audio data into valuable insights. It develops an AI-powered platform designed to help businesses manage and analyze multimodal data, enabling them to create high-quality, proprietary datasets for AI model training. The platform automates data labeling, qualifies and manages data quality, and generates metadata to track data origin and changes. This allows clients to enhance their AI models, detect and remove irrelevant or duplicated data, and create new revenue streams.
Spot something off? Help us improve by flagging any incorrect or outdated information. Just email us at support@teaserclub.com. Your feedback is most welcome.