Projects
A two-stage training framework that first compresses a large "teacher" vision-language model into a smaller "student" using standard knowledge distillation, then further improves the student using reinforcement learning rewards.
This project presents VesselNet, a multi-pathway deep CNN for 3D hepatic vessel segmentation that classifies each voxel using three orthogonal 2D patches (sagittal/coronal/transverse) to better capture vessel structure in 3D.
A Gradio-based app for RAG banking chatbot using LangChain + Llama 2 + FAISS. This project is a banking-focused data science that explores fraud detection and risk analytics across multiple datasets and modeling approaches for banking chatbot.
A GraphRAG system designed for Vision-Language Models (VLMs) to answer employee questions from policy/legal documents and visually rich PDFs. It combines vector retrieval over multimodal chunks with a knowledge graph to improve scope correctness, reduce wrong-but-similar retrieval, and produce explainable answers with citations and reasoning paths.