Skip to content
Change the repository type filter

All

    Repositories list

    • SQL scripts for dumping FOIArchive data to CSV
      0000Updated Sep 27, 2024Sep 27, 2024
    • Scripts for updating corpus-specific topic models in the FOIArchive database.
      Shell
      MIT License
      0000Updated Sep 25, 2024Sep 25, 2024
    • Scripts, configuration and examples for the PostgREST proof of concept
      PLpgSQL
      0100Updated Sep 18, 2024Sep 18, 2024
    • REST API for Freedom of Information Archive (FOIArchive)
      Python
      03102Updated Sep 4, 2024Sep 4, 2024
    • Course materials for the Summer '23 Archiving Digital Records workshop
      1300Updated Jun 21, 2024Jun 21, 2024
    • Describes access to History Lab data for Mosaic LLM project
      Shell
      0000Updated May 22, 2024May 22, 2024
    • Jupyter Notebook
      1000Updated May 20, 2024May 20, 2024
    • Example of querying the FOIArchive REST API via a Python program
      Jupyter Notebook
      MIT License
      0000Updated Feb 27, 2024Feb 27, 2024
    • Research project investigating OCR evaluation mechanisms at Columbia's History Lab.
      Python
      1000Updated Feb 13, 2024Feb 13, 2024
    • Scripts for preprocessing and loading of metadata and text for the History Lab-Muckrock COVID-19 Collection
      Python
      MIT License
      0000Updated Oct 16, 2023Oct 16, 2023
    • piir-eval

      Public
      Framework for PII redaction evaluation
      PLpgSQL
      0100Updated Apr 28, 2023Apr 28, 2023
    • Database schema objects for UN Archives metadata and text
      0000Updated Nov 5, 2022Nov 5, 2022
    • Java
      1015Updated Jun 21, 2022Jun 21, 2022
    • xmpdf

      Public
      A Python module for extracting emails from a PDF.
      Python
      MIT License
      0200Updated Mar 22, 2022Mar 22, 2022
    • pdf2mbox

      Public
      a command-line utility and Python package for converting PDF emails to MBOX format
      Python
      MIT License
      0500Updated Mar 22, 2022Mar 22, 2022
    • Jupyter Notebook
      1000Updated Aug 22, 2019Aug 22, 2019
    • cabinet

      Public
      1100Updated Apr 3, 2018Apr 3, 2018
    • 0000Updated Jul 21, 2017Jul 21, 2017
    • Scraper for state department consular names and positions
      Python
      0000Updated Apr 28, 2014Apr 28, 2014