CSU Bakersfield · 2025–2026

Research.

Undergraduate research on multi-model generative pipelines and HIPAA-compliant medical NLP — under advisors Dr. Chengwei Lei and Dr. Kanwal Kaur at CSU Bakersfield.

in prep
Manuscript submission for CSCSU 2026
Video generation pipeline · co-authored with Dr. Chengwei Lei

Video Generation Pipeline

manuscript in prep
advisor
Dr. Chengwei Lei
period
Jan 2025 – May 2026
venue
CSCSU 2026 (manuscript in preparation)

Multi-model video-generation pipeline combining YOLO11 detection, BLIP2 captioning, and AnimateDiff diffusion. Uses confidence-weighted keyword fusion and a temporal-chaining scoring function for cross-frame visual consistency.

methods
Python PyTorch YOLO11 BLIP2 AnimateDiff ControlNet

Medical NLP Translation

proposal co-authored
advisor
Dr. Kanwal Kaur
period
Jan 2025 – May 2026
compliance
HIPAA

HIPAA-compliant bidirectional medical translation system for real-time doctor–patient communication. Whisper speech-to-text feeds MarianMT and NLLB-200 translation, then a TTS layer renders the response. Designed for clinical deployment with audit logging and on-device inference where feasible.

methods
Python PyTorch Whisper MarianMT NLLB-200 Hugging Face

Crop Disease Detection

completed
period
May 2026
dataset
PlantVillage + PlantDoc + PlantSeg (38 merged classes)
results
mAP@50 = 0.995 · 99.14% top-1 accuracy

Two-stage detection pipeline for crop-disease severity — YOLOv11x for lesion localization, HSV-space analysis for severity grading. Training used cosine learning-rate scheduling with warmup and standard augmentation. Built a per-class evaluation suite (ROC curves, confusion matrices) and an automated overfitting-diagnostic dashboard. Planned ablation study addresses full-image bounding-box limitations identified in mentor review.

methods
Python PyTorch YOLOv11x OpenCV Google Colab