Kaushik Kumar.

Building Production Compound AI Systems.

I am an MS in Computer Science student at NJIT and an AI Engineer. I specialize in Agentic AI, scalable MLOps, and deterministic multi-agent workflows that bridge the gap between advanced research and business outcomes.

Kaushik Kumar

The Production Reality

Building a slick LangChain prototype takes a weekend. Deploying an autonomous multi-agent system that executives actually trust requires a totally different paradigm. My focus is entirely on the broken middle-tier: reliability, safety, and Total Cost of Ownership (TCO).

Deterministic Safety Rails

LLMs are probabilistic; enterprises need deterministic outcomes. I architect strict schema validation boundaries, PII filtering, and multi-layered jailbreak prevention to ensure agents never hallucinate catastrophic actions.

Deep Observability

If you can't trace it, you can't trust it. I build telemetry-first systems utilizing Arize Phoenix and Langfuse to monitor token usage, latency, and agent reasoning traces down to the individual span level.

Business ROI & TCO

I optimize models and vector queries not just for accuracy, but for cost. Transitioning from generic heavy APIs to specialized, self-hosted LLMs and intelligent routing drastically reduces inference costs while scaling.

Experience

AI Engineer @ FIELDWORKER.AI

Feb 2026 - May 2026

Reduced manual data entry time from 2 hours to under 3 minutes per document. Architected an end-to-end agentic SDR parsing system using a self-hosted LLM with strict check-then-update logic, driving an estimated 95% reduction in processing costs and accelerating client onboarding.

Agentic AICost ReductionHITL ValidationsPostgreSQL

Software Engineer Intern @ CISCO SYSTEMS, INC.

Feb 2024 - Jun 2024

Streamlined operational processes across 500+ enterprise accounts, resulting in a 20% increase in operational efficiency. Designed scalable Spring Boot microservices with optimized caching that reduced database load by 45% and improved response times by 30% during peak traffic.

Spring BootMicroservicesEfficiency ScalingMockito

ML Engineer @ VERZEO EDUTECH PVT. LTD.

Feb 2023 - Mar 2023

Reduced model latency by 25% and scaled resource capacity by 50% for peak workloads. Translated core business requirements into real-time CNN/RNN pipelines on GCP, presenting actionable model insights directly to non-technical stakeholders.

GCPLatency OptimizationMachine LearningDocker

Featured Architecture

google_workspace_mcp

google_workspace_mcp

The Problem: Fragmented Workspace data caused immense cross-functional friction.

Technical Leap: Engineered a unified Model Context Protocol (MCP) server architecture.

Outcome: Enabled a 3x speedup in information retrieval for operational teams.

PythonMCPData Unification
Antigravit

Antigravit

The Problem: Unreliable, hallucinating analytics agents eroding executive trust.

Technical Leap: Built a 6-node deterministic LangGraph architecture with self-correcting SQL generation.

Outcome: Achieved 100% routing accuracy and zero-data-exfiltration security.

LangGraphDeterministic AISecurity
RAG Foundry

RAG Foundry

The Problem: Enterprise context-collapse across disconnected knowledge bases.

Technical Leap: Integrated Hybrid Retrieval, Cross-Encoder reranking, and 6 custom guardrails.

Outcome: Delivered 0.98 Relevancy on Ragas eval and mitigated hallucination risk entirely.

PythonRAGASAI Guardrails

Project Archive

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model.

PythonMultimodalLLMs

langchain-postgres

LangChain abstractions backed by Postgres Backend.

PythonLangChainPostgres

AgenticRAG_-RAGAS

Advanced RAG implementation with automated evaluation via RAGAS.

PythonRAGASAgents

AI_BI_Copilot

An AI Copilot for Business Intelligence and data query generation.

PythonCopilotAnalytics

Strategic Perspectives

Thoughts on architecture, AI safety, and product strategy.

May 20265 min read

Why Enterprise AI Projects Fail at the Deployment Phase

The gap between a Jupyter Notebook prototype and a production-ready agentic system is where 80% of corporate AI budgets go to die. Here is how focusing on deterministic guardrails and TCO changes the game.

Read Article
April 20264 min read

The Broken Middle-Tier: Moving Beyond Basic RAG

Vector databases alone aren't enough. By implementing hybrid retrieval, cross-encoder reranking, and self-correcting agentic orchestration, we can solve the context-collapse problem in massive enterprise datasets.

Read Article

Skills & Arsenal

Agentic AI & Prompting

LangChain & LangGraphDeepagentsModel Context Protocol (MCP)CrewAIReAct & ReflectionA2A Orchestration

Machine Learning & RAG

PyTorch & TransformersHybrid RetrievalCross-Encoder RerankingXGBoostOptuna & SHAPCNNs & LSTMs

DevOps, Cloud & DBs

Microsoft AzureGoogle Cloud Platform (GCP)Qdrant & ChromaDBpgvectorDocker & KubernetesFastAPI & PostgreSQLCI/CD

AI Safety & Observability

Arize PhoenixLangfuseLLM-as-a-JudgeHallucination DetectionGuardrailsPII Filtering

Education

Master of Science in Computer Science

New Jersey Institute of Technology (NJIT)

Sep 2024 - May 2026

Focusing on Advanced Machine Learning, AI Systems, and Scalable Architectures.

Bachelor of Engineering in Information Science and Engineering

BMS College of Engineering

Sep 2020 - Jun 2024

Core focus on Algorithms, Data Structures, and Software Engineering.