Data science, research software, and open source work
CIB Mango Tree: Detecting Coordinated Campains In Social Media
Data Science • Civic Tech • 2024-present
website • demo dashboard • blog post • code
Collaborating on a open-source project to develop software that tests for coordinated inauthentic behavior in social media datasets.
Executable Science: Research Software Engineering for Reproducible Neuroscience
Research Software • Reproducibility • 2025
paper (in press) • code • interactive report • slides
Reproduced and replicated results from a published neuroimaging study to understand barriers to computational reproducibility. While sharing code and data helps, poor software engineering practices still prevent reliable reproduction of scientific results.
Transformer In-Context Retrieval Across Time And Scale
AI Research • 2022-2024
paper • code
Developed a pipeline to test how transformer language models retrieve information from context at different scales (14M to 12B parameters) and across training.
Python Data Reader for Invasive Brain Recordings
Open Source • 2023-2024
tool
Developed a new data reading plug-in and a test dataset for Neuralynx systems in the MNE-Python ecosystem of neuroscience analysis tools (open source)
A Contextual Encoding Model for Human Brain Responses to a Spoken Narrative
NeuroAI • Open Source • 2020-2023
conference paper • code
Used word embeddings from language models to model electrical activity (ECoG) in neurosurgical datasets provided by our our cross-institutional clinical partners.
Short-term memory in Neural Language Models
AI Research • PyTorch • 2021-2022
paper • video • code
Tested how well language models, recurrent neural networks and transformers, remember exact text they’ve seen before. Applied research methods from cognitive science to understand AI model behavior.
Large-Scale NeuroAI Dataset
Data • Open Science • 2020-2022
data • paper
Curated and published a 10-hour dataset of brain activity recorded while people listened to stories. Designed the data structure, documentation and validation analyses for reuse by other researchers.
Brain Activity and Language Prediction
Research • Statistical Modeling • 2017-2019
paper • code
Analyzed how brain oscillations relate to language prediction during story listening. Built statistical models connecting neural signals to computational language measures.
The Embodied Mind (Book Translation)
Translation • 2016-2017
book link
Translated the cognitive science classic from English to Slovenian. First translation of this work into Slovenian, making it accessible to local researchers and students.