MSc AI Student & Research Assistant passionate about LLM fine-tuning, computer vision, and developing innovative AI solutions. Experience in model compression, 3D mesh processing, and cutting-edge deep learning applications.
My professional journey in AI & Computer Vision
Working on the AI Native Software Engineering (ANSE) Project, exploring AI-driven approaches to software engineering standards.
Founding member working on LLM fine-tuning and compression for efficient large model deployment.
PyTorch, Transformers, PEFT, bitsandbytes
Llama, Qwen, Mistral, Gemma, Phi, Deepseek
Led multiple computer vision projects including model compression, virtual tours, and facial recognition systems.
Freelance work through Upwork involving data annotations to train Generative AI models.
Supported students in electronics and communications labs, graded assignments, and conducted tutorials.
Supported students and patrons at the university library, trained new employees.
Academic background
Advanced studies in AI with focus on deep learning, computer vision, NLP, and agent technology.
Graduated with Grade 1.73, Minor in Intelligent Mobile Systems.
Non-Linearity in Wireless Communications and Deep Learning
Tools and technologies I work with
Some of my recent work and contributions
Vision transformers for Nepali text detection with RoBERTa-based tokenizer.
Explored VAE, DCGAN, and WGAN for generating synthetic facial images.
TFLite model for ASL gestures. Ranked 371/1315 teams.
U-Net and LSTM for hand movement prediction from EMG data. Ranked 12/31.
Deep learning for modeling non-linearities in wireless communications.
Evaluating compression techniques on LLM task-specific performance.
Specializations and courses completed
Research papers and academic contributions
Find me across the web