The following completed and ongoing projects have been funded as part of the OSS Development Program (thanks to funds from the Alfred P. Sloan Foundation).
Completed Projects
User Interface platform that helps pull data from the NIH HEAL Data Platform and push it to the Qualitative Data Repository (QDR).
Project advised by Sebastian Karcher (QDR) and James Myers (Dataverse).
Ongoing Projects


Tariff Digitization
An automated solution for digitizing complex tariff documents from the early/mid 20th century. The system extracts structured data from scanned PDFs containing hierarchical commodity descriptions and associated tariff rates, transforming them into usable digital formats.
Project advised by Collin Capano and Will Gearty (SU OSPO) and Kristy Buzard (Maxwell School).

Digitization and Analysis of the Montesinos Tapes
Digitization and natural language analysis of the Montesinos tape transcripts from En la sala de la corrupción (In the Corruption Room). The recordings document meetings between Vladimiro Montesinos and various politicians and criminals during the Peruvian presidency of Alberto Fujimori.
Project advised by Jessie Trudeau (Maxwell School).



Preprint Alert Bot
Preprint servers like the arXiv represent the bleeding edge of science, with up to hundreds of papers uploaded every single day. This natural-language based bot monitors a preprint server and notifies it’s owner if any relevant papers have been uploaded based on their research interests.
