Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data. arXiv, 2026.

PDF

Position: agentic AI orchestration should be Bayes-consistent. ICML, 2026.

PDF

REPO: Detoxifying LLMs via Representation Erasure-based Preference Optimization. CATS@ICML, 2026.

PDF OpenReview

Less is More: Undertraining Experts Improves Model Upcycling. arXiv, 2025.

PDF

Continual Learning in Vision-Language Models via Aligned Model Merging. arXiv, 2025.

PDF

From Dormant to Deleted: Tamper-Resistant Unlearning Through Weight-Space Regularization. NeurIPS, 2025.

PDF OpenReview

Leveraging Per-Instance Privacy for Machine Unlearning. ICML, 2025.

PDF PMLR OpenReview

On Traceability in $\ell_p$ Stochastic Convex Optimization. NeurIPS, 2025.

PDF OpenReview

The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws. ICLR, 2025.

PDF OpenReview

Soup to go: mitigating forgetting during continual learning with model averaging. arXiv, 2025.

PDF

Torque-Aware Momentum. arXiv, 2024.

PDF

Improved Localized Machine Unlearning Through the Lens of Memorization. TMLR, 2024.

PDF OpenReview

Unlearning in- vs. out-of-distribution data in LLMs under gradient-based methods. SafeGenAI@NeurIPS, 2024.

PDF OpenReview

Mechanistic Unlearning: Robust Knowledge Unlearning and Editing via Mechanistic Localization. ICML, 2024.

PDF PMLR OpenReview

The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse. arXiv, 2024.

PDF

Mixture of Experts in a Mixture of RL settings. RLC, 2024.

PDF OpenReview

Are we making progress in unlearning? Findings from the first NeurIPS unlearning competition. arXiv, 2024.

PDF

Data Selection for Transfer Unlearning. arXiv, 2024.

PDF

SSFL: Discovering Sparse Unified Subnetworks at Initialization for Efficient Federated Learning. TMLR, 2024.

PDF OpenReview

Simultaneous linear connectivity of neural networks modulo permutation. ECML PKDD, 2024.

PDF DOI

Evaluating Interventional Reasoning Capabilities of Large Language Models. CaLM@NeurIPS, 2024.

PDF OpenReview

Mixtures of Experts Unlock Parameter Scaling for Deep RL. ICML, 2024.

PDF PMLR OpenReview

Dataset Difficulty and the Role of Inductive Bias. arXiv, 2024.

PDF

Leveraging Function Space Aggregation for Federated Learning at Scale. TMLR, 2023.

PDF OpenReview

The Cost of Scaling Down Large Language Models: Reducing Model Size Affects Memory before In-context Learning. ICLR, 2023.

PDF OpenReview

Identifying Spurious Biases Early in Training through the Lens of Simplicity Bias. AISTATS, 2023.

PDF PMLR OpenReview

JaxPruner: A concise library for sparsity research. CPAL, 2023.

PDF Code OpenReview

Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask? ICLR, 2023.

PDF

Limitations of Information-Theoretic Generalization Bounds for Gradient Descent Methods in Stochastic Convex Optimization. ALT, 2023.

PDF

Blog

A research overview on when memorization helps, when it hurts, and how it can be controlled in deep learning.

