🎯 New: Oracle GenAI Professional Certification Added!

Engineering Insights

"Containerization isn't just deployment—it's a contract between dev and ops for predictable behavior at scale."

MLOps Best Practice

"RAG systems fail when retrieval accuracy < 85%. Always benchmark your embeddings before production."

GenAI Architecture

"Explainability isn't optional—SHAP values turn black boxes into trusted production systems."

Responsible AI
AI/ML Engineer · GenAI Systems · FastAPI · Cloud-Native

Designing, Shipping, and Owning Production GenAI Systems — RAG, Agentic Workflows, and Async FastAPI at Scale

Driving core platform work at Sylox on a multi-tenant DPSM SaaS: 44 routers, 400+ async REST endpoints, and sub-500ms p95 across AKS, EKS, and GKE

400+
Async REST Endpoints
<500ms
p95 Latency @ 80 Concurrency
14+
2025–26 Certifications

Technical Rigor

Production-grade expertise across the AI systems lifecycle

APIs & Async

FastAPI
Pydantic v2
SQLAlchemy 2 (async)
Celery · RQ
asyncpg · uvloop

GenAI

LangGraph · LangChain
LlamaIndex · OpenAI SDK
Agentic AI · RAG · Hybrid
Claude · OpenAI · Bedrock
Azure AI Search · Doc Intelligence

Cloud & K8s

Kubernetes (AKS · EKS · GKE)
Helm · Docker · Compose
Terraform · Multi-Cloud
Azure DevOps · Cloud Build
Azure · AWS · GCP · OCI
Pinecone · pgvector · Oracle 23ai

Security & Identity

OAuth2 · OIDC
Okta · Auth0 · Azure AD
SCIM 2.0 · RBAC
Argon2id · JWT
RFC 3161 Timestamping

Role Fit Matrix

JD pillars mapped to production evidence — what every line of the role looks like in real shipped systems

API Development

Python + FastAPI APIs

460+ Endpoints in Prod
44 Routers

Independently designed and shipped, and now maintain, 44 routers and 400+ async REST endpoints at Sylox, plus 40+ in the AI Privacy Verifier and 20+ in the matter-isolated RAG. FastAPI + Pydantic v2 + async SQLAlchemy 2.0 across every project, with Flask as a fallback when needed.

FastAPI Pydantic v2 SQLAlchemy 2 (async) Flask
Async & Concurrency

Concurrent & Scalable Systems

<500ms p95 @ 80 concurrency (Sylox)
1,200 RPS · p99 <180ms (AI Privacy Verifier)

asyncpg + uvloop sustaining sub-500ms p95 at 80-concurrency autoscale on a multi-tenant 11-schema Postgres. Celery + Redis + RQ workers with circuit breakers, per-tenant rate budgets, and partitioned tables for bursty load.

asyncio · uvloop asyncpg Celery · RQ Redis
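
A per-tenant rate budget like the one mentioned above can be sketched as one token bucket per tenant. This is a minimal in-process illustration, not the Sylox implementation: the class names, the `capacity` and `rate` parameters, and the injectable `clock` are all assumptions, and a multi-worker deployment would more likely keep the counters in Redis.

```python
import time
from typing import Callable, Dict


class TokenBucket:
    """Holds up to `capacity` tokens, refilled at `rate` tokens per second."""

    def __init__(self, capacity: float, rate: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.capacity = capacity
        self.rate = rate
        self.clock = clock
        self.tokens = capacity
        self.updated = clock()

    def try_acquire(self, cost: float = 1.0) -> bool:
        now = self.clock()
        # Refill for the elapsed time, never exceeding capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.updated) * self.rate)
        self.updated = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


class TenantRateBudget:
    """One independent bucket per tenant, so a bursty tenant cannot starve others."""

    def __init__(self, capacity: float, rate: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._capacity = capacity
        self._rate = rate
        self._clock = clock
        self._buckets: Dict[str, TokenBucket] = {}

    def allow(self, tenant_id: str) -> bool:
        bucket = self._buckets.get(tenant_id)
        if bucket is None:
            bucket = TokenBucket(self._capacity, self._rate, self._clock)
            self._buckets[tenant_id] = bucket
        return bucket.try_acquire()
```

A request handler would call `allow(tenant_id)` before enqueueing work and reject or defer the request when it returns `False`.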
Generative AI

LangChain · LlamaIndex · OpenAI SDK

7-Stage LangGraph
7-Vendor LLM Layer

LangGraph 7-stage classification pipeline at Sylox with a Claude verifier; LangChain + LlamaIndex RAG over policy documents; provider-agnostic LLM abstraction across 7 vendors (OpenAI, Anthropic, Gemini, Mistral, Cohere, AWS Bedrock, and one cloud-native provider) so a vendor switch is a config flag.

LangGraph LangChain LlamaIndex OpenAI Python SDK Anthropic SDK
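
The "vendor switch is a config flag" idea can be illustrated with a small registry behind a shared interface. This is a sketch under stated assumptions: `LLMClient`, `EchoClient`, `register`, `get_client`, and the `llm_provider` config key are hypothetical names, and the stand-in client only echoes the prompt where a real adapter would call the vendor SDK.

```python
from typing import Callable, Dict, Protocol


class LLMClient(Protocol):
    """Interface every provider adapter is assumed to satisfy."""

    def complete(self, prompt: str) -> str: ...


class EchoClient:
    """Stand-in adapter for the sketch; a real one would wrap a vendor SDK."""

    def __init__(self, name: str) -> None:
        self.name = name

    def complete(self, prompt: str) -> str:
        return f"[{self.name}] {prompt}"


_REGISTRY: Dict[str, Callable[[], LLMClient]] = {}


def register(name: str, factory: Callable[[], LLMClient]) -> None:
    """Make a provider selectable by name."""
    _REGISTRY[name] = factory


def get_client(config: dict) -> LLMClient:
    """Pick the provider named in config; callers never import a vendor SDK."""
    name = config["llm_provider"]
    try:
        factory = _REGISTRY[name]
    except KeyError:
        raise ValueError(f"unknown LLM provider: {name!r}") from None
    return factory()


register("openai", lambda: EchoClient("openai"))
register("anthropic", lambda: EchoClient("anthropic"))
```

Because call sites depend only on `complete()`, changing `llm_provider` in configuration is the only edit needed to route traffic to a different vendor.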
Containers & Orchestration

Docker · Kubernetes · Helm · Compose

<180MB Distroless Images
AKS · EKS · GKE

Multi-stage Dockerfiles (distroless runtime, non-root UID, <180MB images); Helm chart with values-{dev,stage,prod}.yaml, HPAs, PodDisruptionBudgets, NetworkPolicies, pre-rendered ServiceMonitors. Docker Compose for local parity with make demo bootstrapping the full stack in <90s.

Kubernetes Helm Docker · Compose HPA · PDB · NetworkPolicy
Testing

Unit · Integration · Performance Tests

2,195 Tests · 80% Coverage
mypy --strict

Strict TDD with red-tests-first commits — pytest + pytest-asyncio, testcontainers (real Postgres + Redis, no mocks), Hypothesis property tests at 500+ examples, Playwright + Vitest E2E, schemathesis contract tests, k6 + Locust performance harnesses; gated by diff-cover ≥ 80% on every PR.

pytest · pytest-asyncio testcontainers Hypothesis k6 · Locust schemathesis
DevOps · CI/CD

Multi-Stage CI/CD Pipelines

45min → 9min PR-to-Deploy
30-Gate Quality System

Owned multi-stage pipelines (build → test → security-scan → publish → cluster deploy via Helm) with environment approvals, blue/green rollout, and automatic rollback on failed health gates. Cut PR-to-deploy from ~45min → ~9min via dependency caching, test sharding, and BuildKit cache mounts. Same patterns shipped on Azure DevOps and GCP Cloud Build.

Azure DevOps GCP Cloud Build GitHub Actions Helm Deploys Trivy · CodeQL
Security & Identity

OAuth · Okta · Auth0 · RBAC in AKS

Zero Cross-Tenant Leakage
Workload Identity · No Long-Lived Secrets

OAuth2 + OIDC with Auth0 (pluggable to Okta or Azure AD via the same provider interface), JWT (python-jose), Argon2id hashing, refresh-token rotation, per-endpoint RBAC via require_permission(). AKS workload identity, NetworkPolicies default-deny, Pod Security Standards: restricted, Key Vault CSI mount. SCIM 2.0 provisioning at Sylox.

OAuth2 · OIDC Okta · Auth0 RBAC in AKS SCIM 2.0 Argon2id · JWT
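
A per-endpoint check in the spirit of `require_permission()` might look like the following framework-free sketch. Only the function name comes from the text above; the role table, the permission strings, the `PermissionDenied` exception, and the shape of the `user` dict are assumptions. In a FastAPI service the same check would typically be wired in through `Depends` rather than a decorator.

```python
from functools import wraps
from typing import Callable, Dict, FrozenSet

# Hypothetical role-to-permission mapping, for illustration only.
ROLE_PERMISSIONS: Dict[str, FrozenSet[str]] = {
    "analyst": frozenset({"records:read"}),
    "admin": frozenset({"records:read", "records:delete"}),
}


class PermissionDenied(Exception):
    """Raised when the caller lacks the permission an endpoint requires."""


def require_permission(permission: str) -> Callable:
    """Decorator factory: run the handler only if the user holds `permission`."""

    def decorator(handler: Callable) -> Callable:
        @wraps(handler)
        def wrapper(user: dict, *args, **kwargs):
            granted = set()
            for role in user.get("roles", []):
                granted |= ROLE_PERMISSIONS.get(role, frozenset())
            if permission not in granted:
                raise PermissionDenied(f"missing permission: {permission}")
            return handler(user, *args, **kwargs)

        return wrapper

    return decorator


@require_permission("records:delete")
def delete_record(user: dict, record_id: str) -> str:
    return f"deleted {record_id}"
```

Unknown roles grant nothing, so the check fails closed when a token carries a role the service does not recognize.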
Scripting & Automation

Bash · Shell · PowerShell

<90s Stack Bootstrap
One-Command Demo

Bash + PowerShell automation for stack bootstrap, evidence-pack export, and on-call diagnostics; make demo and make preflight bring the full environment up and assert the canary→PDF pipeline end to end in under 90 seconds.

Bash PowerShell Make Shell

Professional Experience

Production GenAI work across SaaS platforms, contracts, and independent research

Feb 2026 — Present · Onsite

AI Engineer · Sylox Technologies, Pune

Driving core platform engineering on IRIS, a multi-tenant Data Privacy & Security Management SaaS for DPDP Act 2023, GDPR, HIPAA, RBI, and SEBI compliance. 44 routers, 400+ async REST endpoints, 11-schema Postgres, sub-500ms p95 at 80-concurrency. Architected a 7-stage LangGraph classification pipeline (regex → Presidio + spaCy → Claude verifier → analyst override), lifting sensitive-data detection accuracy by 18% across 10K+ records. Productionized cloud-portable Terraform deploys across GCP Cloud Run, AKS, and EKS. Operationalized GCP Cloud Build CI/CD with a 30-gate verification system and 161 idempotent Alembic migrations.

FastAPI LangGraph Claude Postgres GCP · AKS · EKS docling · dlt SCIM 2.0
Nov 2025 — Jan 2026 · Remote (Contract)

AI Engineer · LOPhils Inc.

Architected a multi-tenant Pinecone × Claude RAG over 5K+ legal-matter documents, eliminating cross-tenant context leakage via strict namespace isolation. Async ingestion pipeline processed 1GB+ batches across direct upload, n8n webhooks, S3 async, and ZIP batch — cut redundant embeddings 35% and embedding cost 30% via SHA-256 dedup and 50-per-request batching. Lifted retrieval relevance 28% with hybrid semantic + BM25 + cross-encoder reranking; query latency < 2s p95. Shipped 20+ FastAPI endpoints on AWS (EC2, S3, RDS, Redis) behind NGINX.

Pinecone Claude Voyage law-2 FastAPI Redis · RQ AWS n8n
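
The SHA-256 dedup and 50-per-request batching described above can be sketched with the standard library alone. The function names and the in-memory `seen` set are assumptions; a production pipeline would presumably persist the digests (for example in Redis or Postgres) and pass each batch to the embedding API.

```python
import hashlib
from typing import Iterable, Iterator, List, Set

BATCH_SIZE = 50  # mirrors the 50-per-request batching mentioned above


def dedup_chunks(chunks: Iterable[str], seen: Set[str]) -> Iterator[str]:
    """Yield only chunks whose SHA-256 digest has not been seen before."""
    for chunk in chunks:
        digest = hashlib.sha256(chunk.encode("utf-8")).hexdigest()
        if digest in seen:
            continue  # identical content was already embedded
        seen.add(digest)
        yield chunk


def batched(items: Iterable[str], size: int = BATCH_SIZE) -> Iterator[List[str]]:
    """Group items into lists of at most `size` for one request each."""
    batch: List[str] = []
    for item in items:
        batch.append(item)
        if len(batch) == size:
            yield batch
            batch = []
    if batch:
        yield batch  # final partial batch


# Example: 200 chunks with only 120 distinct texts.
chunks = [f"chunk-{i % 120}" for i in range(200)]
seen_digests: Set[str] = set()
batches = list(batched(dedup_chunks(chunks, seen_digests)))
```

Hashing the chunk text rather than the file name means the same clause uploaded through two ingestion paths is embedded once.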
Aug 2025 — Oct 2025

Independent AI Engineer & Researcher · Pune, India

Used the post-graduation runway to deepen production GenAI specialization. Earned eleven industry certifications in eight weeks across GenAI, Vector Search, MLOps, Kubernetes, and Apache Airflow: Oracle OCI 2025 (5), Databricks (2), Astronomer Apache Airflow 3 (2), and KodeKloud (2: RAG and Kubernetes). Took freelance fine-tuning and RAG-prototyping engagements and built open-source side projects, including Cerevra (ContextOS) and early prototypes of Sentinel.

Oracle OCI Databricks Astronomer LangGraph Llama 3 · Mistral

Project Showcase

Production-grade AI systems deployed at scale

Multi-Tenant SaaS

AI Privacy Verifier — Continuous AI Vendor Governance

1,200 RPS · p99 < 180ms
38% LLM Cost Cut

9-service event-driven backend that continuously probes AI provider APIs and produces RFC 3161 trusted-timestamped evidence packets — canary traps, membership inference probes, model fingerprinting, policy surveillance via Azure Document Intelligence. 40+ FastAPI endpoints, provider-agnostic LLM layer over 7 vendors, Helm-deployed on AKS with Azure DevOps blue/green pipelines.

FastAPI LangChain · LlamaIndex Azure OpenAI AKS · Helm Auth0 · OIDC RFC 3161
Production RAG

Matter-Isolated RAG — Pinecone × AWS × n8n

Zero Cross-Matter Leakage
+28% Retrieval Relevance

Multi-tenant retrieval over 5K+ legal-matter documents. Pushed isolation into the vector DB via per-matter Pinecone namespaces — enforced before retrieval, not after. Async ingestion across upload, n8n, S3, and ZIP batch (1GB+); SHA-256 dedup cut embedding cost 30%; hybrid BM25 + reranking lifted relevance 28% at < 2s p95.

Pinecone Claude Voyage law-2 FastAPI Redis · RQ AWS
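
"Enforced before retrieval, not after" means the search only ever scans the caller's namespace, instead of filtering a global result list afterwards. The toy in-memory index below illustrates that property. It is not the Pinecone API: `NamespacedIndex`, its methods, and the dot-product scoring are assumptions made for the sketch.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Vector = Tuple[float, ...]


class NamespacedIndex:
    """Toy vector index partitioned by namespace (one namespace per matter)."""

    def __init__(self) -> None:
        self._spaces: Dict[str, List[Tuple[str, Vector]]] = defaultdict(list)

    def upsert(self, namespace: str, doc_id: str, vector: Vector) -> None:
        self._spaces[namespace].append((doc_id, vector))

    def query(self, namespace: str, vector: Vector, top_k: int = 3) -> List[str]:
        if not namespace:
            # Fail closed: there is no "search everything" code path.
            raise ValueError("a namespace is required for every query")
        # Only this namespace's vectors are ever scored.
        candidates = self._spaces.get(namespace, [])
        scored = sorted(
            candidates,
            key=lambda item: -sum(a * b for a, b in zip(item[1], vector)),
        )
        return [doc_id for doc_id, _ in scored[:top_k]]


index = NamespacedIndex()
index.upsert("matter-a", "a1", (1.0, 0.0))
index.upsert("matter-a", "a2", (0.0, 1.0))
index.upsert("matter-b", "b1", (1.0, 0.0))
```

Documents from `matter-b` cannot appear in a `matter-a` result even when their vectors are identical, because they are never candidates in the first place.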
Open Source · v0.1.0

Cerevra (ContextOS) — Zero-Data-Loss Deterministic RAG Engine

WAL · 1.2–2.1s Recovery
2,195 Tests · 80% Coverage

Production-grade RAG engine that treats retrieval like a database — write-ahead logging with fsync, deterministic crash recovery, hardware-grade tenant isolation, no vector store dependency. BM25-first (no GPU), 4-dim Context Quality Scoring, citations anchored to source offsets. Docker, K8s manifests, Python + TypeScript SDKs. 219 commits.

Python 3.11+ FastAPI rank-bm25 Prometheus · OTel Kubernetes
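
Write-ahead logging with fsync and deterministic recovery can be sketched in a few lines. This is an illustration of the general technique, not Cerevra's actual code: the `WriteAheadLog` class, the JSON-lines record format, and the key/value state are assumptions.

```python
import json
import os
from typing import Dict


class WriteAheadLog:
    """Append-only JSON-lines log; each record is fsynced before append returns."""

    def __init__(self, path: str) -> None:
        self.path = path

    def append(self, key: str, value: str) -> None:
        record = json.dumps({"key": key, "value": value}, sort_keys=True)
        with open(self.path, "a", encoding="utf-8") as fh:
            fh.write(record + "\n")
            fh.flush()
            os.fsync(fh.fileno())  # force the record to disk before acknowledging

    def recover(self) -> Dict[str, str]:
        """Replay the log in order; the same log always yields the same state."""
        state: Dict[str, str] = {}
        if not os.path.exists(self.path):
            return state
        with open(self.path, "r", encoding="utf-8") as fh:
            for line in fh:
                try:
                    record = json.loads(line)
                except json.JSONDecodeError:
                    break  # torn final write from a crash: stop replaying here
                state[record["key"]] = record["value"]
        return state
```

Replaying records strictly in file order and stopping at the first unparseable line is what makes recovery deterministic: an interrupted write is discarded rather than partially applied.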
Agentic AI

Sentinel — Agentic Code-Review & Security-Triage

LangGraph Multi-Agent
Tool-Level RBAC

LangGraph-orchestrated multi-agent system: a router dispatches to specialized Claude and Codex agents (diff analysis, secret scan, dependency audit, playbook retrieval), each with tool-level RBAC. Async FastAPI webhooks consume GitHub PR events; long reviews run on Kubernetes worker pods deployed via a custom Helm chart, with full audit trail of every tool invocation.

LangGraph Claude · Codex FastAPI pgvector + BM25 Kubernetes · Helm Okta · Auth0
Research · Peer-Reviewed

SHAR — Scalable Human Activity Recognition

92% Detection Accuracy
100ms / Frame

Final-year capstone (Team Lead) — fine-tuned Vision Transformer for real-time suspicious-activity detection from user-controlled camera APIs. Three-stage pipeline with WebSocket / e-mail / SMS alerting. Trained on UCF-Crime + KTH Action; 95% manual-audit alert precision. Published in TIJER (May 2025).

PyTorch · ViT WebSockets UCF-Crime Computer Vision

Publication & Writing

Peer-reviewed research and long-form technical writing

TIJER · May 2025
Team Lead · ISSN 2349-9249

SHAR: Scalable Human Activity Recognition

Vision Transformer–based real-time suspicious-activity recognition from user-controlled camera APIs. 92% detection accuracy on UCF-Crime, 100ms / frame, 95% manual-audit alert precision. Authors: Ashwin Upadhyay, Sarrah Burmawala, Amit Bhosale (TCOER, SPPU).

ViT PyTorch UCF-Crime
Medium · Mar 2024
Career

From Code to Insights: Web Development to Data Science

A career-narrative piece on building cross-disciplinary depth — from HTML/CSS/JS and full-stack web foundations into SQL, R, Python, ML, and neural networks.

Career Data Science
Medium · Mar 2024
ML Primer

Harnessing Future Insights: Predictive Analytics

A primer on the five foundational ML models — Linear Regression, Logistic Regression, Decision Trees, Random Forests, and Gradient Boosting — framed for business audiences.

ML Predictive Analytics

Licenses & Certifications

Industry-recognized credentials validating production-scale expertise

DB
Active

Academy Accreditation - AI Agent Fundamentals

Databricks

Issued Oct 2025 • Expires Oct 2026
ID: 163890456
Generative AI Agentic AI
DB
Active

Academy Accreditation - Generative AI Fundamentals

Databricks

Issued Oct 2025 • Expires Oct 2026
ID: 163827419
Generative AI LLM
🚀
Certified

DAG Authoring for Apache Airflow 3

Astronomer

Issued Oct 2025
ID: 863162ef
Apache Airflow ETL DAGs
🚀
Certified

Apache Airflow 3 Fundamentals

Astronomer

Issued Oct 2025
ID: ef9a1b33
Apache Airflow Kafka Pipelines
OCI
Active

Oracle AI Vector Search Certified Professional

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2027
ID: 102977056DB23AIOCP
Vector Search RAG Oracle DB 23ai
OCI
Active

OCI 2025 Certified GenAI Professional

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2026
ID: 102964238OCI25GAIOCP
LLM RAG LangChain Vector DB
OCI
Active

OCI 2025 Certified Data Science Professional

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2027
ID: 102964238OCI25DSOCP
ML Lifecycle MLOps ADS SDK
OCI
Active

OCI 2025 Certified Architect Associate

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2027
ID: 103043421OCI25CAA
Cloud Architecture Networking
OCI
Active

OCI 2025 Certified Developer Professional

Oracle Cloud Infrastructure

Issued Oct 2025 • Expires Oct 2027
ID: 103043421OCID25CP
Microservices Kubernetes
AI
Top 5%

AI Fluent Tech Professional

Calyptus

Issued Nov 2025 • Expires Nov 2035
ID: 14
Agentic AI Explainable AI GenAI
🔐
Active

AI Security & Governance

Securiti

Issued Feb 2026
AI Security Governance Compliance
SV
Active

Identity Security for the AI Age

Saviynt

Issued Dec 2025
Identity SCIM RBAC
K8s
Certified

Kubernetes

KodeKloud

Issued Oct 2025
Kubernetes Helm Container Orchestration
RAG
Certified

RAG Crash Course

KodeKloud

Issued Oct 2025
RAG LLM Vector DB
OCI
Active

OCI 2025 — AI Foundations Associate

Oracle Cloud Infrastructure

Issued Dec 2025
AI Fundamentals ML OCI
HR
Verified

SQL (Advanced)

HackerRank

Issued Jul 2023
ID: 77f5f2dd7935
SQL Database

Selected Achievements

Production impact, recognition, and competitive milestones

2024 — 2026

500+ LeetCode · 500-day Streak

500+ problems solved with a 500-day submission streak. 100 Days Badge ’24 / ’25 / ’26 · 200 Days Badge ’25 · Annual Badge 2025 · monthly badges April–November 2025 plus February 2026.

Algorithms DSA Python
Nov 2025

Top 5% Globally — Calyptus AI Fluent Tech Professional

Earned the AI Fluent Tech Professional credential in the top 5% of the global Calyptus cohort, validated across Agentic AI, Explainable AI, and GenAI competencies.

Agentic AI Explainable AI GenAI
Apr 2026

Cerevra (ContextOS) v0.1.0 Released

Open-source maintainer of Cerevra — 219 commits, 2,195 tests, 80% diff-coverage. GitHub badges: Pair Extraordinaire · Pull Shark ×2 · YOLO · Quickdraw across 36 public repositories.

Open Source RAG Engine WAL · Recovery
2025 — 2026

Production Impact Across AI Systems

Lifted production retrieval accuracy to 90% across AI-driven systems; reduced end-to-end ML pipeline latency 40% through async I/O, caching, and batching. Eliminated cross-tenant leakage in two multi-tenant platforms via vector-DB-level namespace isolation and OIDC-bound RBAC.

Async I/O Caching Multi-tenant
May 2025

Lead-Author Publication · TIJER

Team Lead on the SHAR (Scalable Human Activity Recognition) capstone — peer-reviewed publication in TIJER (ISSN 2349-9249, Volume 12, Issue 5).

ViT PyTorch Research

Get in Touch

Open to AI/ML Engineer · GenAI Engineer · Backend AI Platform roles — Hybrid / Remote / Onsite (Pune, Bengaluru, Hyderabad)

Email

ashwinupadhyay09@gmail.com

Phone

+91 8329396282

Location

Pune, Maharashtra, India 411046

Education

B.E. Information Technology · TCOER, SPPU · CGPA 8.53 / 10 · 2021–2025
