Publications
My publications, sorted from most to least recent. The list includes both peer-reviewed papers and preprints.
2025
- Explaining the Reasoning of Large Language Models Using Attribution Graphs. arXiv preprint arXiv:2512.15663, 2025
- Metric-Driven Attributions for Vision Transformers. In The Thirteenth International Conference on Learning Representations, 2025
- Verifiable Natural Language to Linear Temporal Logic Translation: A Benchmark Dataset and Evaluation Suite. arXiv preprint arXiv:2507.00877, 2025
- Explaining ViTs Using Information Flow. In International Conference on Artificial Intelligence and Statistics, 2025
- GinSign: Grounding Natural Language Into System Signatures for Temporal Logic Translation. arXiv preprint arXiv:2512.16770, 2025
- Detecting and Removing Adversarial Patches using Frequency Signatures. In 2025 International Joint Conference on Neural Networks (IJCNN), 2025
2024
- Attribution quality metrics with magnitude alignment. In Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, IJCAI-24, 2024
- Integrated decision gradients: Compute your attributions where the model makes its decision. In Proceedings of the AAAI Conference on Artificial Intelligence, 2024
- Out-of-Distribution Detection for Contrastive Models Using Angular Distance Measures. In 2024 International Conference on Machine Learning and Applications (ICMLA), 2024
2023
- Adversarial pixel and patch detection using attribution analysis. In MILCOM 2023 - 2023 IEEE Military Communications Conference (MILCOM), 2023