Wikipedia → LLaMA Fine-Tuning (RAG + LoRA)
Complete RAG pipeline with chunking, embeddings, FAISS retrieval and LoRA fine-tuning of LLaMA for factual QA. ROUGE-1: 0.033 • ROUGE-L: 0.024.
Transformer — English ↔ Hindi (From Scratch)
Full Transformer encoder–decoder with custom attention, masking and beam search. BLEU 49.76.
Amazon Reviews NLP Analysis (3M+ Reviews)
Scalable pipeline including sentiment, aspect-mining and BART/T5 summarization for millions of reviews.
SMS Spam Classification — Naive Bayes, GDA, Logistic Regression
TF-IDF + classical models identifying high-impact spam tokens with strong recall on noisy text.
Plant Disease Detection — ResNet18
ResNet18 transfer-learning pipeline with Grad-CAM explainability; high accuracy across 38 classes.
Heart Stroke Prediction System
End-to-end ML pipeline for stroke risk prediction with feature engineering and class imbalance handling.
Credit Card Fraud Detection — Anomaly Models
Hybrid anomaly-detection workflow (PCA, UMAP, Isolation Forest). Isolation Forest delivered best precision-recall.
Student Performance & Ranking System
Stacked regression with PCA, SVM, and Gradient Boosting. R² = 0.90.