Available for opportunities

๐Ÿ‘‹ Hi, I'm Aaditya Punekar

I build Data|
that drives decisions.

Data Scientist & AI Engineer with expertise in Machine Learning, NLP, and GenAI. Building predictive models and intelligent automation that deliver real business value.

model.py
# AI-powered prediction engine
import torch, transformers
from sklearn.ensemble import XGBClassifier

class RetailBrain:
    def predict(self, data):
        forecast = self.model(data)
        return {
          "stockout_risk": forecast.prob,
          "confidence": 0.94
        }
0+ AI Projects Built
0+ Years Experience
0+ Technologies
0% Avg Accuracy Boost

About Me

Empowering businesses with intelligent data solutions.

I am a Data Scientist & AI Engineer with over 2 years of experience applying machine learning, NLP, and GenAI tools to deliver actionable insights and automation across retail, pharma, and insurance sectors.

With a proven track record in building predictive models, streamlining data workflows, and driving data-informed business decisions, my goal is to bridge complex algorithms with tangible value. I specialize in deploying LLMs, constructing dashboards, and scaling data architectures using Python, SQL, and modern cloud deployment strategies.

Currently based in the United Kingdom, I hold an MSc in Computer Science from Queen Mary University of London.

Machine Learning
GenAI & LLMs
Data Analytics
Cloud & APIs
2+ Years of
Experience
United Kingdom

Skills & Technologies

Languages & Frameworks

Python SQL JavaScript FastAPI Flask PyTorch TensorFlow Scikit-learn

AI / ML

LLMs (GPT-4, Gemini) HuggingFace BERT / RoBERTa XGBoost Reinforcement Learning NLP / RAG Computer Vision

Data & Cloud

PostgreSQL MongoDB Azure Docker Airflow Power BI Tableau Pandas / NumPy

Experience & Education

May 2023 โ€“ Jan 2026

Data Analyst

JustinClicks  ยท  Remote, Mumbai

Fine-tuned LLMs such as BERT and RoBERTa using Python and Hugging Face to enhance sentiment tagging accuracy by 30% across ORM datasets. Designed dynamic SEO and brand performance dashboards in Power BI and Looker Studio. Automated ORM data ingestion pipelines with SQL and Airflow, decreasing manual processing by 60%. Performed competitor intelligence driving a 15% traffic growth.

PythonBERTPower BIAirflow
Oct 2020 โ€“ Jun 2021

Data Analyst

Deep Punch  ยท  Remote, Mumbai

Improved inventory forecasting using regression models, resulting in a 25% increase in accuracy. Created supplier performance dashboards using Power BI to reduce procurement delays by 20%. Automated reporting workflows driving a 12% reduction in sourcing costs.

SQLPythonPower BIRegression
2021 โ€“ 2023

MSc Computer Science

Queen Mary University of London

Specialised in advanced computing algorithms and data science paradigms. Graduated with a focus on machine learning systems and intelligent data architectures.

2017 โ€“ 2020

BE Computer Engineering

University of Mumbai

Built a strong engineering foundation in software architecture, computational logic, and data structures.

Featured Projects

Machine Learning

Retail Brain โ€” Sainsbury's

Full-stack AI demand forecasting system for Sainsbury's. Features JWT-authenticated REST API, real-time WebSocket alerts, POS connectors, and ML-powered stockout prediction for 1,000+ SKUs with 94% accuracy.

PythonFastAPIXGBoost PostgreSQLDocker
Reinforcement Learning

Retail Ops Copilot

End-to-end decision intelligence system using Reinforcement Learning to automate staff task assignments. Features MILP optimization baseline, digital twin simulation, and an interactive dashboard for retail operations.

PythonRL (PPO)MILP DashPlotly
GenAI / Agents

AgentSentinel

Multi-agent AI monitoring system providing real-time oversight of autonomous AI pipelines. Implements structured guardrail layers, audit logging, and enterprise-grade observability for agentic AI workflows.

PythonLLM AgentsFastAPI WebSockets
Automation

YouTube Automation Pipeline

Fully automated AI-driven content pipeline that generates scripts, voiceovers, thumbnails, and video compilations using GPT-4 and ElevenLabs. Targets fitness & health niche with engaging AI personas.

PythonGPT-4ElevenLabs FFmpegn8n
GenAI Agent

Logistics Reconciliation Engine

Zero-hallucination AI agent for retail logistics that eliminates "Ghost Inventory" by autonomously ingesting unstructured supplier data (PDFs, emails) and reconciling against master inventory with structured discrepancy reports.

PythonLLM AgentsRAG PostgreSQL
NLP / GenAI

GenAI Mention Intelligence

LLM-powered brand and entity mention analysis system. Fine-tuned Longformer models for multi-label entity tagging in pharma and insurance datasets, improving classification accuracy by 30โ€“40% with automated JSON pipelines.

PythonLongformerHuggingFace NLPAirflow
Automation

Excel to PDF Summarizer

AI-powered document intelligence tool that ingests Excel and CSV reports, applies LLM-generated natural language summaries, and exports polished PDF reports. Reduces manual reporting time significantly.

PythonOpenAI APIPandas ReportLab
Machine Learning

Hybrid Malware Detection

High-precision malware classification system using PE header analysis and ensemble models. Engineered feature extraction pipelines combining CNN binary visualisation, XGBoost, and SVM architectures.

PythonCNNXGBoost SVMPE Analysis

Let's Collaborate

Let's build something
intelligent together.

I'm open to new opportunities in data science, ML engineering, and AI product development. Whether it's a full-time role, contract work, or a collaboration โ€” let's talk.

Call me
+44 7776 663911
Location
United Kingdom