Open-source work spanning generative AI, retrieval systems, and LLM alignment.
A reasoning-first image editor powered by Vision–Language Models
An advanced AI image editing pipeline that understands complex natural language instructions and applies precise, context-aware edits. Unlike traditional tools, V-EditR first reasons about scene context, spatial relationships, and object identities before making any modification — handling requests like "remove the chair behind the table" or "make the person holding the phone wear a black jacket".
Tech Stack
Key Features
Trustworthy Retrieval-Augmented Generation for the medical domain
A medical-domain QA system built on a hybrid dense + sparse retrieval pipeline with hallucination prevention. Uses ~230 K Wikipedia medical passages and refuses to return answers it cannot verify — hallucinated content is replaced with explicit, grounded refusal messages.
Tech Stack
Key Features
Localized Image Editing via Masked Velocity Blending
A novel image editing method built on top of Stable Diffusion 3 that enables precise, localized edits using only text prompts — no manual masks required. Extends the FlowEdit technique by automatically identifying which regions to modify via velocity field analysis, then applying masked velocity blending to confine edits to the relevant area.
Tech Stack
Key Features
DPO paper implementation for LLM alignment
A clean implementation of the Direct Preference Optimization algorithm for aligning language models with human preferences. Fine-tunes TinyLlama-1.1B on a sentiment classification task using preference pairs (chosen vs. rejected outputs) generated by gemma3:4b via Ollama.
Tech Stack
Key Features
FlowEdit paper implementation — text-guided image editing with flow-matching diffusion
A faithful implementation of the FlowEdit paper, enabling precise text-guided image transformations using Stable Diffusion 3 via source and target text prompts — without re-training or fine-tuning any model. Used as the baseline in the FocusFlow project above.
Tech Stack
Key Features
Multi-agent RAG system powered by LLMs
A multi-agent document assistant that answers questions from internal documents (PDFs, HTML, emails) with guaranteed citations and built-in PII/secrets safeguards. Exposes a FastAPI REST interface for ingestion and querying.
Tech Stack
Key Features