publications

2026

  1. jailbreak.jpg
    Learning Adversarial Manifold for Jailbreaking at Scale
    Shivank Rajput*Himanshu Singh*, and A V Subramanyam
    2026
    *Equal Contribution Under Review
  2. toxicity.jpg
    Do Prompts Guarantee Safety? Mitigating Toxicity from LLM Generations through Subspace Intervention
    Himanshu SinghZiwei XuA V Subramanyam, and Mohan Kankanhalli
    2026
    Preprint Under Review
  3. nnprat.jpg
    Nearest Neighbor Projection Removal Adversarial Training
    IEEE Transactions on Artificial Intelligence, Apr 2026

2024

  1. LGAP.jpg
    Language Guided Adversarial Purification
    Himanshu Singh, and A V Subramanyam
    In ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2024