DSN Sathvik

Contact

Email: sathvikdantu45@gmail.com
LinkedIn: https://linkedin.com/in/sathvik-dantu
GitHub: https://github.com/sathvik-web
Resume: Download Resume

About Me

I am an AI & Machine Learning engineering student with hands-on experience in building end-to-end machine learning and NLP systems. I specialize in designing data pipelines, fine-tuning deep learning models, and applying modern NLP and large language models (LLMs) to real-world problems.

My interests include Natural Language Processing, Large Language Models, federated learning, and applied AI systems. I enjoy experimenting with models, evaluating performance, and building scalable and practical solutions.

---

Experience

Research Assistant Intern – Woxsen Agentic Lab

Dec 2025 – Present

Fine-tuned open-source large language models for agent-based AI tasks using LoRA and QLoRA
Conducted multiple controlled experiments and analyzed model performance to support optimization decisions
Collaborated with a research team to improve LLM training and evaluation workflows

---

Projects

🔹 Multimodal Image Captioning

GitHub: https://github.com/sathvik-web/multimodal_image_captioning

Developed a multimodal deep learning model that generates descriptive captions for images by combining computer vision and natural language processing techniques.
Extracted visual features using a pre-trained CNN encoder and generated captions with an LSTM-based sequence decoder.
Built an end-to-end pipeline including image preprocessing, feature extraction, tokenization, and sequence generation.
Evaluated caption quality using BLEU scores and qualitative analysis to assess language fluency and relevance.

---

🔹 Handwritten OCR System

GitHub: https://github.com/sathvik-web/Handwritten-OCR-System

Built a handwritten Optical Character Recognition (OCR) system to recognize handwritten text using deep learning techniques.
Implemented image preprocessing steps such as noise reduction, normalization, and segmentation to improve recognition accuracy.
Trained a convolutional neural network (CNN) model to classify handwritten characters from image inputs.
Developed an inference pipeline that converts handwritten images into machine-readable text for automated document processing.

---

🔹 Voice Authentication Anti-Spoofing System

GitHub: https://github.com/sathvik-web/voice-authentication-anti-spoofing-system

Developed a voice authentication system capable of detecting spoofed or synthesized audio used in voice-based security systems.
Extracted audio features such as MFCCs and spectral representations for training machine learning models.
Built a classification model to distinguish between genuine and spoofed voice samples.
Designed the system to enhance security in voice biometric authentication applications.

---

🔹 RAG-Based PDF Question Answering Bot

GitHub: https://github.com/sathvik-web/rag-pdf-qa-bot

Built an intelligent Retrieval-Augmented Generation (RAG) system to answer user questions from PDF documents
Implemented document embedding, vector search, and context retrieval for accurate response generation
Designed a modular inference pipeline suitable for real-world document analysis and research workflows

Key skills: NLP, embeddings, vector search, LLM inference, pipeline design

---

🔹 Abstractive Text Summarization using T5 Transformer

GitHub: https://github.com/sathvik-web/text-summarizer-app

Fine-tuned a pre-trained T5 Transformer model to generate human-like abstractive summaries
Built a complete NLP pipeline including text cleaning, tokenization, attention masks, and sequence padding
Evaluated summarization quality using ROUGE metrics and manual readability checks
Developed a simple Python interface for model inference

Key skills: Transformers, NLP pipelines, evaluation, deep learning

---

🔹 Federated Learning using CNN

GitHub: https://github.com/sathvik-web/federated-learning-chestxray

Built a federated learning system in which multiple clients collaboratively trained a CNN model without sharing raw data
Designed client-side training workflows and simulated multiple federated learning rounds
Implemented FedAvg aggregation and analyzed convergence behavior across training rounds

Key skills: Distributed learning, CNNs, privacy-preserving ML, PyTorch, TensorFlow

---

Skills

Programming: Python, Java, JavaScript

Machine Learning & AI: Scikit-learn, TensorFlow, Keras, Feature engineering, Pytorch, Pandas

Cloud & Databases: AWS, Docker, POSTGRESQL,

Tools: Git, GitHub, Jupyter Notebook

Other: Problem solving

---

Patent

Autonomous Flood Monitoring System (Design Patent – India)

Co-invented and registered an autonomous flood monitoring system
Designed a solution integrating environmental sensors, solar power, and wireless communication
Developed for real-time flood monitoring and early-warning support for disaster management

---

Education

B.Tech – Artificial Intelligence & Machine Learning
Woxsen University, Hyderabad
GPA: 8.61 / 10
Duration: 2023 – 2027 (Pursuing)