
🎯 New: Oracle GenAI Professional Certification Added!

Engineering Insights

"Containerization isn't just deployment—it's a contract between dev and ops for predictable behavior at scale."

MLOps Best Practice

"RAG systems fail when retrieval accuracy < 85%. Always benchmark your embeddings before production."

GenAI Architecture

"Explainability isn't optional—SHAP values turn black boxes into trusted production systems."

Responsible AI

● AI/ML Engineer · GenAI Systems · FastAPI · Cloud-Native

Designing, Shipping, and Owning Production GenAI Systems — RAG, Agentic Workflows, and Async FastAPI at Scale

Driving core platform work at Sylox on a multi-tenant DPSM SaaS: 44 routers, 400+ async REST endpoints, sub-500ms p95 across AKS, EKS, and GKE

400+

Async REST Endpoints

<500ms

p95 Latency @ 80c

14+

2025–26 Certifications


Technical Rigor

Production-grade expertise across the AI systems lifecycle

APIs & Async

FastAPI

Pydantic v2

SQLAlchemy 2 (async)

Celery · RQ

asyncpg · uvloop

GenAI

LangGraph · LangChain

LlamaIndex · OpenAI SDK

Agentic AI · RAG · Hybrid

Claude · OpenAI · Bedrock

Azure AI Search · Doc Intelligence

Cloud & K8s

Kubernetes (AKS · EKS · GKE)

Helm · Docker · Compose

Terraform · Multi-Cloud

Azure DevOps · Cloud Build

Azure · AWS · GCP · OCI

Pinecone · pgvector · Oracle 23ai

Security & Identity

OAuth2 · OIDC

Okta · Auth0 · Azure AD

SCIM 2.0 · RBAC

Argon2id · JWT

RFC 3161 Timestamping

Role Fit Matrix

JD pillars mapped to production evidence — what every line of the role looks like in real shipped systems

API Development

Python + FastAPI APIs

460+ Endpoints in Prod 44 Routers

Independently designed and shipped, and now maintain, 44 routers and 400+ async REST endpoints at Sylox, plus 40+ in the AI Privacy Verifier and 20+ in the matter-isolated RAG. FastAPI + Pydantic v2 + async SQLAlchemy 2.0 across every project, with Flask as a fallback.

FastAPI Pydantic v2 SQLAlchemy 2 (async) Flask

Async & Concurrency

Concurrent & Scalable Systems

<500ms p95 @ 80c 1,200 RPS · p99 <180ms

asyncpg + uvloop sustaining sub-500ms p95 at 80-concurrency autoscale on a multi-tenant 11-schema Postgres. Celery + Redis + RQ workers with circuit breakers, per-tenant rate budgets, and partitioned tables for bursty load.

asyncio · uvloop asyncpg Celery · RQ Redis
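
The per-tenant rate budgets mentioned above can be illustrated with a token bucket per tenant. This is a minimal sketch under stated assumptions, not the production implementation: the class names TokenBucket and TenantBudgets are hypothetical, and time is passed in explicitly so the behavior is deterministic.

```python
from dataclasses import dataclass, field


@dataclass
class TokenBucket:
    """Token bucket that refills at `rate` tokens per second up to `capacity`."""
    rate: float
    capacity: float
    tokens: float = 0.0
    updated_at: float = 0.0

    def allow(self, now: float, cost: float = 1.0) -> bool:
        # Refill according to elapsed time, never exceeding capacity.
        elapsed = max(0.0, now - self.updated_at)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated_at = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


@dataclass
class TenantBudgets:
    """Hypothetical registry holding one bucket per tenant."""
    rate: float
    capacity: float
    buckets: dict[str, TokenBucket] = field(default_factory=dict)

    def allow(self, tenant_id: str, now: float) -> bool:
        bucket = self.buckets.get(tenant_id)
        if bucket is None:
            # A new tenant starts with a full bucket.
            bucket = TokenBucket(self.rate, self.capacity,
                                 tokens=self.capacity, updated_at=now)
            self.buckets[tenant_id] = bucket
        return bucket.allow(now)
```

Because each tenant owns its bucket, a burst from one tenant exhausts only that tenant's budget and leaves the others unaffected.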

Generative AI

LangChain · LlamaIndex · OpenAI SDK

7-Stage LangGraph 7-Vendor LLM Layer

LangGraph 7-stage classification pipeline at Sylox with a Claude verifier; LangChain + LlamaIndex RAG over policy documents; provider-agnostic LLM abstraction across 7 vendors (OpenAI, Anthropic, Gemini, Mistral, Cohere, AWS Bedrock, and one cloud-native provider) so a vendor switch is a config flag.

LangGraph LangChain LlamaIndex OpenAI Python SDK Anthropic SDK
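
One way to make a vendor switch a config flag is a small registry behind a shared interface. The sketch below is illustrative only: LLMClient, EchoClient, register, get_client, and the llm_provider key are hypothetical names, and EchoClient stands in for real vendor SDK wrappers.

```python
from typing import Callable, Protocol


class LLMClient(Protocol):
    """Interface every vendor adapter is assumed to satisfy."""
    def complete(self, prompt: str) -> str: ...


class EchoClient:
    """Stand-in for a vendor SDK wrapper; returns a tagged echo of the prompt."""
    def __init__(self, vendor: str) -> None:
        self.vendor = vendor

    def complete(self, prompt: str) -> str:
        return f"[{self.vendor}] {prompt}"


_REGISTRY: dict[str, Callable[[], LLMClient]] = {}


def register(name: str, factory: Callable[[], LLMClient]) -> None:
    """Map a provider name to a factory that builds its client."""
    _REGISTRY[name] = factory


def get_client(config: dict[str, str]) -> LLMClient:
    """Resolve the client from configuration alone."""
    name = config["llm_provider"]
    try:
        factory = _REGISTRY[name]
    except KeyError:
        raise ValueError(f"unknown LLM provider: {name!r}") from None
    return factory()


register("openai", lambda: EchoClient("openai"))
register("anthropic", lambda: EchoClient("anthropic"))
```

Calling code depends only on complete(), so changing llm_provider in configuration swaps the vendor without touching call sites.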

Containers & Orchestration

Docker · Kubernetes · Helm · Compose

<180MB Distroless Images AKS · EKS · GKE

Multi-stage Dockerfiles (distroless runtime, non-root UID, <180MB images); Helm chart with values-{dev,stage,prod}.yaml, HPAs, PodDisruptionBudgets, NetworkPolicies, pre-rendered ServiceMonitors. Docker Compose for local parity with make demo bootstrapping the full stack in <90s.

Kubernetes Helm Docker · Compose HPA · PDB · NetworkPolicy

Testing

Unit · Integration · Performance Tests

2,195 Tests · 80% Coverage mypy --strict

Strict TDD with red-tests-first commits — pytest + pytest-asyncio, testcontainers (real Postgres + Redis, no mocks), Hypothesis property tests at 500+ examples, Playwright + Vitest E2E, schemathesis contract tests, k6 + Locust performance harnesses; gated by diff-cover ≥ 80% on every PR.

pytest · pytest-asyncio testcontainers Hypothesis k6 · Locust schemathesis

DevOps · CI/CD

Multi-Stage CI/CD Pipelines

45min → 9min PR-to-Deploy 30-Gate Quality System

Owned multi-stage pipelines (build → test → security-scan → publish → cluster deploy via Helm) with environment approvals, blue/green rollout, and automatic rollback on failed health gates. Cut PR-to-deploy from ~45min → ~9min via dependency caching, test sharding, and BuildKit cache mounts. Same patterns shipped on Azure DevOps and GCP Cloud Build.

Azure DevOps GCP Cloud Build GitHub Actions Helm Deploys Trivy · CodeQL

Security & Identity

OAuth · Okta · Auth0 · RBAC in AKS

Zero Cross-Tenant Leakage Workload Identity · No Long-Lived Secrets

OAuth2 + OIDC with Auth0 (pluggable to Okta or Azure AD via the same provider interface), JWT (python-jose), Argon2id hashing, refresh-token rotation, per-endpoint RBAC via require_permission(). AKS workload identity, NetworkPolicies default-deny, Pod Security Standards: restricted, Key Vault CSI mount. SCIM 2.0 provisioning at Sylox.

OAuth2 · OIDC Okta · Auth0 RBAC in AKS SCIM 2.0 Argon2id · JWT
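
The description names require_permission() but not its signature, so the following is a framework-free sketch of how such a per-endpoint check could look. Principal and PermissionDenied are hypothetical names; in a FastAPI service the returned checker would typically be wired in as a dependency that receives the principal resolved from a validated token.

```python
from dataclasses import dataclass


class PermissionDenied(Exception):
    """Raised when the caller lacks the permission an endpoint requires."""


@dataclass(frozen=True)
class Principal:
    """Hypothetical authenticated caller with a flat permission set."""
    subject: str
    permissions: frozenset[str]


def require_permission(permission: str):
    """Build a checker that passes the principal through or raises."""
    def checker(principal: Principal) -> Principal:
        if permission not in principal.permissions:
            raise PermissionDenied(f"{principal.subject} lacks {permission!r}")
        return principal
    return checker
```

Building the checker from a permission string keeps the required permission visible at each endpoint definition rather than buried in handler logic.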

Scripting & Automation

Bash · Shell · PowerShell

<90s Stack Bootstrap One-Command Demo

Bash + PowerShell automation for stack bootstrap, evidence-pack export, and on-call diagnostics; the make demo and make preflight targets bring up the full environment and assert the canary→PDF pipeline in under 90 seconds.

Bash PowerShell Make Shell

Professional Experience

Production GenAI work across SaaS platforms, contracts, and independent research

Feb 2026 — Present · Onsite

AI Engineer · Sylox Technologies, Pune

Driving core platform engineering on IRIS, a multi-tenant Data Privacy & Security Management SaaS for DPDP Act 2023, GDPR, HIPAA, RBI, and SEBI compliance. 44 routers, 400+ async REST endpoints, 11-schema Postgres, sub-500ms p95 at 80-concurrency. Architected a 7-stage LangGraph classification pipeline (regex → Presidio + spaCy → Claude verifier → analyst override) lifting sensitive-data detection accuracy 18% across 10K+ records. Productionized cloud-portable Terraform deploys across GCP Cloud Run, AKS, and EKS. Operationalized GCP Cloud Build CI/CD with a 30-gate verification system and 161 idempotent Alembic migrations.

FastAPI LangGraph Claude Postgres GCP · AKS · EKS docling · dlt SCIM 2.0
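
The escalation pattern in the classification pipeline (cheap checks first, costlier verification later, analyst override last) can be sketched in plain Python. This is an assumption-laden illustration, not the LangGraph graph itself: the stage functions, the Verdict type, and the 0.9 threshold are hypothetical, and the NER and verifier stages are trivial placeholders for Presidio/spaCy and an LLM call.

```python
import re
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass(frozen=True)
class Verdict:
    label: str
    confidence: float
    stage: str


Stage = Callable[[str], Optional[Verdict]]

EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")


def regex_stage(text: str) -> Optional[Verdict]:
    # Deterministic patterns get high confidence.
    if EMAIL_RE.search(text):
        return Verdict("EMAIL", 0.99, "regex")
    return None


def ner_stage(text: str) -> Optional[Verdict]:
    # Placeholder for an NER pass; a keyword heuristic stands in here.
    if "passport" in text.lower():
        return Verdict("GOV_ID", 0.6, "ner")
    return None


def verifier_stage(text: str) -> Optional[Verdict]:
    # Placeholder for an LLM verifier; always returns a weak fallback.
    return Verdict("NONE", 0.5, "verifier")


STAGES: list[Stage] = [regex_stage, ner_stage, verifier_stage]


def classify(text: str, stages: list[Stage], threshold: float = 0.9,
             override: Optional[str] = None) -> Verdict:
    """Stop at the first confident stage; an analyst override always wins."""
    if override is not None:
        return Verdict(override, 1.0, "analyst_override")
    best: Optional[Verdict] = None
    for stage in stages:
        verdict = stage(text)
        if verdict is None:
            continue
        if verdict.confidence >= threshold:
            return verdict
        if best is None or verdict.confidence > best.confidence:
            best = verdict
    return best or Verdict("NONE", 0.0, "default")
```

Ordering stages from cheapest to most expensive means most records never reach the verifier, which is where the cost of a pipeline like this tends to concentrate.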

Nov 2025 — Jan 2026 · Remote (Contract)

AI Engineer · LOPhils Inc.

Architected a multi-tenant Pinecone × Claude RAG over 5K+ legal-matter documents, eliminating cross-tenant context leakage via strict namespace isolation. Async ingestion pipeline processed 1GB+ batches across direct upload, n8n webhooks, S3 async, and ZIP batch — cut redundant embeddings 35% and embedding cost 30% via SHA-256 dedup and 50-per-request batching. Lifted retrieval relevance 28% with hybrid semantic + BM25 + cross-encoder reranking; query latency < 2s p95. Shipped 20+ FastAPI endpoints on AWS (EC2, S3, RDS, Redis) behind NGINX.

Pinecone Claude Voyage law-2 FastAPI Redis · RQ AWS n8n
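
The SHA-256 dedup and 50-per-request batching described above can be sketched as two small generators. The function names dedup_chunks and batched are hypothetical, and the embedding call itself is left out; only the dedup and batching steps are shown.

```python
import hashlib
from typing import Iterable, Iterator


def dedup_chunks(chunks: Iterable[str], seen: set[str]) -> Iterator[str]:
    """Yield only chunks whose SHA-256 digest has not been seen before."""
    for chunk in chunks:
        digest = hashlib.sha256(chunk.encode("utf-8")).hexdigest()
        if digest in seen:
            continue  # identical content was already embedded
        seen.add(digest)
        yield chunk


def batched(items: Iterable[str], size: int = 50) -> Iterator[list[str]]:
    """Group items into lists of at most `size` for one embedding request."""
    batch: list[str] = []
    for item in items:
        batch.append(item)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch
```

Persisting the seen set across ingestion runs (for example in Redis) is what would let repeated uploads of the same file skip the embedding call entirely; that storage choice is an assumption here.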

Aug 2025 — Oct 2025

Independent AI Engineer & Researcher · Pune, India

Used the post-graduation runway to deepen production GenAI specialization. Earned eleven industry certifications in eight weeks across GenAI, Vector Search, MLOps, Kubernetes, and Apache Airflow: Oracle OCI 2025 (5 certs), Databricks (2), Astronomer Apache Airflow 3 (2), and KodeKloud (RAG + K8s). Took freelance fine-tuning and RAG-prototyping engagements; built open-source side projects including Cerevra (ContextOS) and early prototypes of Sentinel.

Oracle OCI Databricks Astronomer LangGraph Llama 3 · Mistral

Project Showcase

Production-grade AI systems deployed at scale

Multi-Tenant SaaS

AI Privacy Verifier — Continuous AI Vendor Governance

1,200 RPS · p99 < 180ms 38% LLM Cost Cut

9-service event-driven backend that continuously probes AI provider APIs and produces RFC 3161 trusted-timestamped evidence packets — canary traps, membership inference probes, model fingerprinting, policy surveillance via Azure Document Intelligence. 40+ FastAPI endpoints, provider-agnostic LLM layer over 7 vendors, Helm-deployed on AKS with Azure DevOps blue/green pipelines.

FastAPI LangChain · LlamaIndex Azure OpenAI AKS · Helm Auth0 · OIDC RFC 3161
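
A canary trap of the kind listed above can be sketched with a keyed hash: each vendor receives a unique marker, and a marker appearing in later output points back to the vendor it was planted with. The function names and the cnry- prefix are hypothetical. The evidence_digest helper only computes a SHA-256 digest of a record, which is the kind of value an RFC 3161 timestamp request carries as its message imprint; talking to a timestamp authority is out of scope for this sketch.

```python
import hashlib
import hmac
import json


def canary_token(secret: bytes, vendor: str) -> str:
    """Derive a stable, vendor-specific marker from a secret key."""
    digest = hmac.new(secret, vendor.encode("utf-8"), hashlib.sha256).hexdigest()
    return f"cnry-{digest[:16]}"


def plant(text: str, token: str) -> str:
    """Embed the marker in content sent to a vendor."""
    return f"{text} (ref {token})"


def detect_leaks(output: str, secret: bytes, vendors: list[str]) -> list[str]:
    """Return the vendors whose markers appear in the observed output."""
    return [v for v in vendors if canary_token(secret, v) in output]


def evidence_digest(record: dict) -> str:
    """SHA-256 over a canonical JSON encoding of an evidence record."""
    canonical = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

Deriving markers from a key rather than storing random strings means detection needs only the secret and the vendor list, not a lookup table of planted tokens.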

Production RAG

Matter-Isolated RAG — Pinecone × AWS × n8n

Zero Cross-Matter Leakage +28% Retrieval Relevance

Multi-tenant retrieval over 5K+ legal-matter documents. Pushed isolation into the vector DB via per-matter Pinecone namespaces — enforced before retrieval, not after. Async ingestion across upload, n8n, S3, and ZIP batch (1GB+); SHA-256 dedup cut embedding cost 30%; hybrid BM25 + reranking lifted relevance 28% at < 2s p95.

Pinecone Claude Voyage law-2 FastAPI Redis · RQ AWS
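
Enforcing isolation before retrieval means the query never sees documents outside its namespace, rather than filtering results afterwards. The in-memory NamespacedIndex below is a hypothetical stand-in for a vector database with per-matter namespaces, with a simple term count standing in for similarity scoring.

```python
from collections import defaultdict


class NamespacedIndex:
    """Toy index in which every read and write is scoped to one namespace."""

    def __init__(self) -> None:
        self._store: dict[str, list[tuple[str, str]]] = defaultdict(list)

    def upsert(self, namespace: str, doc_id: str, text: str) -> None:
        self._store[namespace].append((doc_id, text))

    def query(self, namespace: str, term: str, top_k: int = 5) -> list[str]:
        if not namespace:
            # Refuse unscoped queries instead of searching everything.
            raise ValueError("namespace is required")
        # Only this namespace's documents are ever candidates.
        docs = self._store.get(namespace, [])
        scored = [(text.lower().count(term.lower()), doc_id)
                  for doc_id, text in docs]
        hits = sorted((s for s in scored if s[0] > 0),
                      key=lambda s: (-s[0], s[1]))
        return [doc_id for _, doc_id in hits[:top_k]]
```

Because the candidate set is restricted before scoring, a bug in ranking or post-filtering cannot surface another matter's documents.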

Open Source · v0.1.0

Cerevra (ContextOS) — Zero-Data-Loss Deterministic RAG Engine

WAL · 1.2–2.1s Recovery 2,195 Tests · 80% Coverage

Production-grade RAG engine that treats retrieval like a database — write-ahead logging with fsync, deterministic crash recovery, hardware-grade tenant isolation, no vector store dependency. BM25-first (no GPU), 4-dim Context Quality Scoring, citations anchored to source offsets. Docker, K8s manifests, Python + TypeScript SDKs. 219 commits.

Python 3.11+ FastAPI rank-bm25 Prometheus · OTel Kubernetes
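
Write-ahead logging with fsync and deterministic recovery can be illustrated with a small append-only JSON-lines log. This is a generic sketch of the technique, not Cerevra's code: the WriteAheadLog class, the record shape, and the choice to stop replay at the first unreadable line are all assumptions.

```python
import json
import os


class WriteAheadLog:
    """Append-only log; state is rebuilt by replaying records in order."""

    def __init__(self, path: str) -> None:
        self.path = path

    def append(self, record: dict) -> None:
        line = json.dumps(record, sort_keys=True) + "\n"
        with open(self.path, "a", encoding="utf-8") as fh:
            fh.write(line)
            fh.flush()
            # Ask the OS to push the write to stable storage before returning.
            os.fsync(fh.fileno())

    def replay(self) -> dict[str, str]:
        state: dict[str, str] = {}
        if not os.path.exists(self.path):
            return state
        with open(self.path, "r", encoding="utf-8") as fh:
            for line in fh:
                try:
                    record = json.loads(line)
                except json.JSONDecodeError:
                    # A torn final write from a crash; ignore it and stop.
                    break
                if record["op"] == "put":
                    state[record["key"]] = record["value"]
                elif record["op"] == "delete":
                    state.pop(record["key"], None)
        return state
```

Replaying the same log always yields the same state, which is what makes recovery deterministic; a real engine would add checksums and log truncation on top of this.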


Agentic AI

Sentinel — Agentic Code-Review & Security-Triage

LangGraph Multi-Agent Tool-Level RBAC

LangGraph-orchestrated multi-agent system: a router dispatches to specialized Claude and Codex agents (diff analysis, secret scan, dependency audit, playbook retrieval), each with tool-level RBAC. Async FastAPI webhooks consume GitHub PR events; long reviews run on Kubernetes worker pods deployed via a custom Helm chart, with full audit trail of every tool invocation.

LangGraph Claude · Codex FastAPI pgvector + BM25 Kubernetes · Helm Okta · Auth0
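
Tool-level RBAC with an audit trail can be sketched as a gateway that every agent call passes through. ToolGateway, the agent names, and the toy secret_scan tool are hypothetical; the point is only that the permission check and the audit record happen in one place, before the tool runs.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class ToolGateway:
    """Single choke point for tool calls made by agents."""
    tools: dict[str, Callable[[str], str]]
    grants: dict[str, set[str]]  # agent name -> tool names it may call
    audit_log: list[dict] = field(default_factory=list)

    def invoke(self, agent: str, tool: str, payload: str) -> str:
        allowed = tool in self.grants.get(agent, set())
        # Record every attempt, including denied ones.
        self.audit_log.append({"agent": agent, "tool": tool, "allowed": allowed})
        if not allowed:
            raise PermissionError(f"{agent} may not call {tool}")
        return self.tools[tool](payload)


def secret_scan(diff: str) -> str:
    # Toy check standing in for a real secret scanner.
    return "found" if "AKIA" in diff else "clean"


gateway = ToolGateway(
    tools={"secret_scan": secret_scan},
    grants={"security_agent": {"secret_scan"}},
)
```

Logging before the permission decision is enforced means denied attempts are captured too, which is usually the more interesting half of an audit trail.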

Research · Peer-Reviewed

SHAR — Scalable Human Activity Recognition

92% Detection Accuracy 100ms / Frame

Final-year capstone (Team Lead) — fine-tuned Vision Transformer for real-time suspicious-activity detection from user-controlled camera APIs. Three-stage pipeline with WebSocket / e-mail / SMS alerting. Trained on UCF-Crime + KTH Action; 95% manual-audit alert precision. Published in TIJER (May 2025).

PyTorch · ViT WebSockets UCF-Crime Computer Vision


Publication & Writing

Peer-reviewed research and long-form technical writing

TIJER · May 2025

Team Lead ISSN 2349-9249

SHAR: Scalable Human Activity Recognition

Vision Transformer–based real-time suspicious-activity recognition from user-controlled camera APIs. 92% detection accuracy on UCF-Crime, 100ms / frame, 95% manual-audit alert precision. Authors: Ashwin Upadhyay, Sarrah Burmawala, Amit Bhosale (TCOER, SPPU).

ViT PyTorch UCF-Crime


Medium · Mar 2024

March 2024 Career

From Code to Insights: Web Development to Data Science

A career-narrative piece on building cross-disciplinary depth — from HTML/CSS/JS and full-stack web foundations into SQL, R, Python, ML, and neural networks.

Career Data Science


Medium · Mar 2024

March 2024 ML Primer

Harnessing Future Insights: Predictive Analytics

A primer on the five foundational ML models — Linear Regression, Logistic Regression, Decision Trees, Random Forests, and Gradient Boosting — framed for business audiences.

ML Predictive Analytics


Licenses & Certifications

Industry-recognized credentials validating production-scale expertise

Active

Academy Accreditation - AI Agent Fundamentals

Databricks

Issued Oct 2025 • Expires Oct 2026

ID: 163890456

Generative AI Agentic AI

Active

Academy Accreditation - Generative AI Fundamentals

Databricks

Issued Oct 2025 • Expires Oct 2026

ID: 163827419

Generative AI LLM

🚀

Certified

DAG Authoring for Apache Airflow 3

Astronomer

Issued Oct 2025

ID: 863162ef

Apache Airflow ETL DAGs

🚀

Certified

Apache Airflow 3 Fundamentals

Astronomer

Issued Oct 2025

ID: ef9a1b33

Apache Airflow Kafka Pipelines

OCI

Active

Oracle AI Vector Search Certified Professional

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2027

ID: 102977056DB23AIOCP

Vector Search RAG Oracle DB 23ai

OCI

Active

OCI 2025 Certified GenAI Professional

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2026

ID: 102964238OCI25GAIOCP

LLM RAG LangChain Vector DB

OCI

Active

OCI 2025 Certified Data Science Professional

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2027

ID: 102964238OCI25DSOCP

ML Lifecycle MLOps ADS SDK

OCI

Active

OCI 2025 Certified Architect Associate

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2027

ID: 103043421OCI25CAA

Cloud Architecture Networking

OCI

Active

OCI 2025 Certified Developer Professional

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2027

ID: 103043421OCID25CP

Microservices Kubernetes

Top 5%

AI Fluent Tech Professional

Calyptus

Issued Nov 2025 • Expires Nov 2035

ID: 14

Agentic AI Explainable AI GenAI

🔐

Active

AI Security & Governance

Securiti

Issued Feb 2026

AI Security Governance Compliance

Active

Identity Security for the AI Age

Saviynt

Issued Dec 2025

Identity SCIM RBAC

K8s

Certified

Kubernetes

KodeKloud

Issued Oct 2025

Kubernetes Helm Container Orchestration

RAG

Certified

RAG Crash Course

KodeKloud

Issued Oct 2025

RAG LLM Vector DB

OCI

Active

OCI 2025 — AI Foundations Associate

Oracle Cloud Infrastructure

Issued Dec 2025

AI Fundamentals ML OCI

Verified

SQL (Advanced)

HackerRank

Issued Jul 2023

ID: 77f5f2dd7935

SQL Database


Technical Articles

In-depth explorations of MLOps, GenAI, and Cloud-Native architectures

MLOps

Oct 15, 2025 8 min read

Production-Grade MLOps: From Jupyter to Kubernetes

A comprehensive guide to deploying ML models at scale using containerization, orchestration, and CI/CD pipelines.

Kubernetes Docker MLflow

GenAI

Oct 10, 2025 12 min read

Building Production RAG Systems: Beyond the Basics

Deep dive into advanced RAG architectures with hybrid search, reranking, and vector database optimization.

RAG LangChain Vector DB

Cloud

Oct 5, 2025 10 min read

Multi-Cloud Strategy: Avoiding Vendor Lock-in

Architecting resilient AI systems across AWS, Azure, and OCI with Terraform and cloud-agnostic patterns.

Multi-Cloud Terraform Architecture

XAI

Sep 28, 2025 15 min read

Explainable AI in Production: SHAP, LIME, and Beyond

Implementing model interpretability frameworks for regulatory compliance and stakeholder trust.

SHAP XGBoost Compliance

Selected Achievements

Production impact, recognition, and competitive milestones

2024 — 2026

500+ LeetCode · 500-day Streak

500+ problems solved with a 500-day submission streak. 100 Days Badge ’24 / ’25 / ’26 · 200 Days Badge ’25 · Annual Badge 2025 · monthly badges April–November 2025 plus February 2026.

Algorithms DSA Python

Nov 2025

Top 5% Globally — Calyptus AI Fluent Tech Professional

Earned the AI Fluent Tech Professional credential in the top 5% of the global Calyptus cohort, validated across Agentic AI, Explainable AI, and GenAI competencies.

Agentic AI Explainable AI GenAI

Apr 2026

Cerevra (ContextOS) v0.1.0 Released

Open-source maintainer of Cerevra — 219 commits, 2,195 tests, 80% diff-coverage. GitHub badges: Pair Extraordinaire · Pull Shark ×2 · YOLO · Quickdraw across 36 public repositories.

Open Source RAG Engine WAL · Recovery

2025 — 2026

Production Impact Across AI Systems

Lifted production retrieval accuracy to 90% across AI-driven systems; reduced end-to-end ML pipeline latency 40% through async I/O, caching, and batching. Eliminated cross-tenant leakage in two multi-tenant platforms via vector-DB-level namespace isolation and OIDC-bound RBAC.

Async I/O Caching Multi-tenant

May 2025

Lead-Author Publication · TIJER

Team Lead on the SHAR (Scalable Human Activity Recognition) capstone — peer-reviewed publication in TIJER (ISSN 2349-9249, Volume 12, Issue 5).

ViT PyTorch Research



Get in Touch

Open to AI/ML Engineer · GenAI Engineer · Backend AI Platform roles — Hybrid / Remote / Onsite (Pune, Bengaluru, Hyderabad)

Email

ashwinupadhyay09@gmail.com

Phone

+91 8329396282

Location

Pune, Maharashtra, India 411046

Education

B.E. Information Technology · TCOER, SPPU · CGPA 8.53 / 10 · 2021–2025
