Hey there, I'm

Harsha Vardhan Reddy Emani

AI Research & Systems Engineer specializing in HPC and LLM optimization. Speaker @ SCI 2025. Ex-Intern @ C-DAC. Building production systems used by 5,000+ users.

Who I Am

I'm Harsha Vardhan Reddy Emani, an AI Research & Systems Engineering student specializing in High-Performance Computing and Large Language Model optimization. Based in Guntur, Andhra Pradesh, India.

Hands-on experience fine-tuning models and building them from scratch on AMD MI300X, Intel Gaudi3, and NVIDIA L40S GPU platforms. Speaker at SCI 2025, where I delivered a tutorial on the mathematical foundations of LLMs.

Core Member & Platform Manager at AIHUB VVIT: deployed 5+ production projects, organized hackathons, and mentored peers in applied AI.

5K+ Users Served
5+ Deployments
8.7 GPA

What I Do

  • LLM Development & Optimization

    Building LLMs from scratch, fine-tuning with LoRA, multi-GPU training

  • High-Performance Computing

    CUDA, ROCm, OpenMP, GPU-accelerated AI pipelines

  • RAG & Multi-Agent Systems

    LangGraph, LangChain, Vector DBs, FastAPI deployment

  • DevOps & Production Deployment

    Docker, Linux servers, scalable backend architecture

All Projects

Live data from GitHub

Experience

  1. December 2025

    Technical Speaker — SCI 2025 Conference

    International Conference

    Selected to deliver a 3-hour tutorial titled "Mathematical Foundations for Large and Small Language Models" to AI researchers and engineers. Conducted live deployment and benchmarking across Intel Gaudi3, AMD MI300X (ROCm), and NVIDIA GPUs, comparing throughput, latency, and memory efficiency. Designed curriculum spanning Transformer internals, attention, positional encodings, and scaling laws.

    Public Speaking · LLMs · GPU Benchmarking
  2. April 2025 – June 2025

    HPC Intern

    Centre for Development of Advanced Computing (C-DAC), Pune

    Contributed to the HPC sub-project "High Performance Data Management Using Berkeley DB for Telecom Traffic Data". Designed and optimized graph-based C/C++ data structures for large telecom networks with millions of nodes and edges. Leveraged Berkeley DB and OpenMP for parallelized large-scale data access, with hands-on exposure to HPC clusters, profiling tools, and scalable visualization workflows.

    HPC · Berkeley DB · OpenMP · C/C++
  3. 2023 – Ongoing

    Core Member & Platform Manager — AIHUB VVIT

    AI/ML Community Organization

    Lead AI/ML project development, mentor 100+ peers in applied machine learning, and organize 3 to 4 student hackathons per year. Architected, deployed, and maintain the official AIHUB VVIT website and backend infrastructure to ensure high availability for the campus AI community.

    Leadership · DevOps · Mentorship · Community
  4. Aug 2023 – Present

    B.Tech — Artificial Intelligence & Machine Learning

    Vasireddy Venkatadri University (VVITU), Guntur

    GPA: 8.7/10. Specializing in HPC and LLM optimization. Hands-on experience fine-tuning models and building them from scratch on AMD MI300X, Intel Gaudi3, and NVIDIA L40S GPU platforms. Member of the ACM student chapter.

    AI/ML · HPC · LLM Optimization · ACM Member

Certifications & Achievements

Achievements

01

Technical Speaker — SCI 2025 Conference

3-hour tutorial: "Mathematical Foundations for Large and Small Language Models"

02

Finalist - 5G Innovation Hackathon 2025

Recognized for an innovative solution in 5G technology

03

Ex-Intern @ C-DAC Pune (HPC Group)

Berkeley DB · OpenMP · Performance Profiling

04

Core Member & Platform Manager — AIHUB VVIT

Deployed 5+ production projects · Organized 3-4 hackathons

05

ACM Student Chapter Member

Participation in seminars, coding events, and tech talks

Oracle Cloud Certifications

OCI 2025 Certified Generative AI Professional

Oracle · Issued Oct 2025 · Expires Oct 2027

OCI · GenAI · RAG · LLMs

OCI 2025 Certified AI Foundations Associate

Oracle · Issued Aug 2025 · Expires Aug 2027

AI · OCI · ML Foundations

Development & AI Certifications

Complete Flutter Development Bootcamp with Dart

Udemy · Issued Oct 2025

Flutter · Dart · Mobile Dev

Programming in Java

NPTEL Online Certification · 2024

Java · OOP · Multithreading

Generative AI by Google Cloud

Google Cloud Skills Boost (L4G) · 2024

LLMs · Prompt Engineering · Foundation Models

Skills & Technologies

Languages

  • Python
  • C / C++
  • Java
  • JavaScript
  • Dart
  • SQL
  • HTML / CSS

AI / ML Frameworks

  • PyTorch
  • Hugging Face
  • LangChain / LangGraph
  • Scikit-learn
  • DeepSpeed
  • LoRA / DDP

Web & Mobile

  • React
  • FastAPI
  • Node.js / Express
  • Flutter
  • Scrapy
  • Tailwind CSS

Databases

  • MongoDB
  • MySQL
  • Oracle SQL
  • Berkeley DB
  • Vector Databases

GPU & HPC

  • CUDA
  • AMD ROCm
  • Intel Gaudi3
  • OpenMP
  • AMD MI300X
  • NVIDIA L40S

DevOps & Tools

  • Docker
  • Git / GitHub
  • Linux (Ubuntu)
  • Server Deployment
  • VS Code
  • Jupyter

Get In Touch

Open to opportunities, collaborations, and interesting conversations.

Email

GitHub

@harsha4261

LeetCode

Eharsha4261