Hardik Srivastava

Seeking Summer 2026 Internships

Graduate Research Assistant, University of Washington

hardiksrwork@gmail.com

Connect via Topmate

LeetCode

About

I am a Masters in Data Science student at the University of Washington and a Graduate Research Assistant at the Paul G. Allen School of Computer Science & Engineering, advised by Prof. Zaid Harchaoui. My current research sits at the intersection of Generative AI, privacy, and controllability. I'm working on the mechanics of style-aware text rewriting and stylistic perturbation for authorship obfuscation. I develop controllable Transformer-based systems using attribution-guided modeling to isolate specific linguistic features in text. This involves training lightweight, parameter-efficient adapters (LoRA) to capture distinct stylistic dimensions, and applying constrained decoding algorithms at inference time to steer LLM generation, under strict privacy-fidelity constraints.

Prior to UW, I spent over two years as an Applied Scientist at JPMorgan Chase, where I led applied research and deployment of Gradient Boosting and Semi-Supervised Graph ML models within the Payments Fraud Prevention team. By capturing complex anomalies in transactional networks, I engineered solutions that prevented over $220M in fraud annually.

My work spanned architecting Named Entity Recognition systems for Address verification, Entity Resolution (Record Linkage) models for Sanctions Screening, and Learning-to-Rank models for resource-efficient KYC systems—leveraging tools like Spark, MLflow/MLServing, Kubernetes and AWS to productionize these pipelines at massive scale. Furthermore, I designed and productionized multi-modal Fraud pipelines using Contrastively Pre-Trained Transformers (ViT Encoders & BERT-based decoders) for OCR-based Check Fraud prevention.

My foundational research experience includes shipping word-sense disambiguation models in produciton for Samsung Bixby assistant at Samsung Research, and conducting research in Human-Centered AI and multimodal NLP at the Accessible Computing Lab (ACT) at McGill University.

I am passionate about building AI systems that are rigorous, reproducible, and grounded in theory, while being explicitly engineered for deployment at enterprise scale.

Let's connect if your team is building something at the intersection of learning, reasoning, and trustworthy AI.

Research Interests

My core research interests lie at the the intersection of Natural Language Processing, Generative AI, Graph Representation Learning, Human-AI Interaction, Causality, and Trustworthy ML. I focus on designing language and multimodal systems that are interactive, explainable, and adaptable to diverse user needs, and capable of operating reliably in high-stakes environments.

Skills

Machine Learning & AI

Probabilistic Reasoning
Graph Neural Networks (GNNs)
Zero & Few-shot Learning
Generative AI
Contrastive Learning
XAI
Self-Supervised Learning
Ranking & Recommendation

Natural Language Processing

Computational Linguistics
Large Language Modeling
Named Entity Recognition
Entity Resolution
Transformer Models (BERT, GPT)
Vision-Language Modeling

Research Tooling & MLOps

PyTorch
TensorFlow
HuggingFace
MLflow
Scikit-Learn
Weights & Biases
LaTeX

Big Data

Apache Spark
Kafka
Hadoop
ZooKeeper
Splunk

Cloud Platforms

AWS
Docker
Kubernetes
Jenkins
CI/CD Pipelines

Backend & APIs

FastAPI
Flask
Spring Boot
Microservices

Core Programming & CS

Python
Java
SQL
Data Structures & Algorithms
Distributed Systems

News

Publications

Multi-Modal Sentiment Analysis Using Text and Audio for Customer Support Centers

Hardik Srivastava, Sneha Sunil, K. Shantha Kumari, P. Kanmani

ICACTCE'23: International Conference on Advances in Communication Technology and Computer Engineering

Decision Support Complaint Prioritization System using a Statistical Multi-Method Algorithmic approach

Hardik Srivastava, Mayank Jha, T. Karthick

B.Tech Thesis, SRM 2023

Neural Text Style Transfer with Custom Language Styles for Personalized Communication Systems

Hardik Srivastava, Sneha Sunil, K. Shantha Kumari

ICKECS'22: International Conference on Knowledge Engineering and Communication Systems

Automatic Screening and Staging of Multi-Stage Diabetic Retinopathy using Deep Learning techniques

Hardik Srivastava, T. Rajalakshmi

Preprint

Using NLP Techniques for Enhancing Augmentative and Alternative Communication Applications

Hardik Srivastava

IJREAM'21: International Journal for Research in Engineering Application & Management

Multi-Modal Sentiment Analysis Using Text and Audio for Customer Support Centers

Hardik Srivastava, Sneha Sunil, K. Shantha Kumari, P. Kanmani

ICACTCE'23: International Conference on Advances in Communication Technology and Computer Engineering

Decision Support Complaint Prioritization System using a Statistical Multi-Method Algorithmic approach

Hardik Srivastava, Mayank Jha, T. Karthick

B.Tech Thesis, SRM 2023

Neural Text Style Transfer with Custom Language Styles for Personalized Communication Systems

Hardik Srivastava, Sneha Sunil, K. Shantha Kumari

ICKECS'22: International Conference on Knowledge Engineering and Communication Systems

Automatic Screening and Staging of Multi-Stage Diabetic Retinopathy using Deep Learning techniques

Hardik Srivastava, T. Rajalakshmi

Preprint

Using NLP Techniques for Enhancing Augmentative and Alternative Communication Applications

Hardik Srivastava

IJREAM'21: International Journal for Research in Engineering Application & Management

Projects

KeyCognition - Predicting Cognitive Load using Keystrokes
An AI system that predicts cognitive load purely from keystroke dynamics by using fine-grained keystroke features like dwell time, flight time, pauses, and error patterns in real-time.
Auto-Explainability and Evaluation of LLMs
Enabled interpretability and auto-evaluation of LLMs by developing a framework for Transformer Explainability that visualizes prediction attributions using Integrated Gradients
AI-Powered Child-Labor Prevention Platform
Grievance redressal platform that leverages multi-method statistical models to re-rank grievances for efficient prioritization using weighted-ranking
No-Code Algo-Trading Portfolio Manager
Platform to create/execute trades using a No-Code Drag-and-Drop UI. Powered by a Domain-specifc language framework that internally builds a trading algorithm powered by hybrid recommendation models to minimize the risk and shortfall for a specifc trade
Meta Kaggle Forum Posts Clustering
Clustering Meta Kaggle forum posts for insigts using Heirarchical Clustering + Fine-tuned Word2Vec to include Kaggle-specifc OOV words

Articles

MLOps 101 — Machine Learning Workflow Orchestration using TensorFlow Extended
Deploying Machine Learning workloads efficiently in Production using TFX
A/B Testing in Data Science
Learning how to let the customer do the talking, without actually talking.

Vitæ

Here is my Resume.

  • University of Washington September 2025 - Present
    Graduate Research Assistant
    Privacy-Preserving Text Generation in LLMs
  • University of Washington September 2025 - Present
    MS Student
    Masters in Data Science
  • JPMorgan Chase May - Aug 2025
    Applied Scientist II
    Fraud Prevention, Graph ML and Contrastive Pre-Training
  • JPMorgan Chase June 2023 - May 2025
    Applied Scientist
    Fraud Intelligence and Probalistic Reasoning
  • JPMorgan Chase. Feb - June 2023
    Software Engineer Intern - AI/ML
    Search Software and Semi-Supervised Ranking
  • McGill University June - Sept 2022
    Research Intern
    NLP/HCI at Accessible Computing Lab
  • Samsung Global Research Jan - June 2022
    Machine Learning Engineer - Intern
    Samsung Bixby's search optimization using Quantization-Aware Training
  • Inventuers Sept - Nov 2023
    Founder & Tech Lead
    Co-founded and released Curae, a platform to strealine healtcare services
  • DRDO, Govt. of India Aug - Nov 2021
    Research Intern
    Video Stabalization and Denoising for Long-Range Camera-feeds in Military Drones
  • SoftSensor.ai Apr - Oct 2021
    Data Scientist Intern
    Pattern Recognition in Whole Slide Imaging (Lymphoma)
  • SRM University Jan - July 2021
    Undergraduate Research Assistant
    Predictive Modeling for Diabetic Retinopathy detection
  • SRM University May 2019 - June 2023
    B.Tech Student
    Computer Science Engineering with a specialization in Big Data Analytics