Chinmai S D
000/ 100 neural route linked
chinmai s d / ai systems

I build AI systems that show their work.

~/focus

I am Chinmai, an Information Science undergraduate from DSCE. I like building AI that is not just impressive in a demo, but traceable, measurable, and useful when someone actually depends on it.

Bengaluru, India
about

The way I like to build

I keep coming back to one idea: AI should make decisions easier to inspect, not harder to trust.

I am studying Information Science at Dayananda Sagar College of Engineering, and most of my work sits in the space between AI models and real product behavior. I have built hiring evaluation infrastructure, astronaut health monitoring, an AI habit coach, and now TraceroAI for RAG observability and LLMOps.

What feels most like me: taking an AI idea, giving it a backend, a data contract, an interface, failure states, and enough evaluation that someone can reason about the output.

01 / Start from the user decisionI first ask what decision the system should make easier, faster, or safer.
02 / Define the data contractThen I lock down inputs, outputs, failure states, and what the UI should reveal.
03 / Ground the model in evidenceFor LLM systems, I prefer retrieved sources, citations, confidence, and review paths.
04 / Make latency and trust visibleI treat speed, observability, and uncertainty as product features, not backend details.
05 / Measure, review, and improveThe last step is closing the loop with metrics, feedback, and cleaner next iterations.
skills

The stack I actually use

This is filtered from my resume and repos, so it reflects what shows up repeatedly in my projects instead of every tool I have ever touched.

Trustworthy AILLM + ML systemsRAG + Citationsevidence pathsModel ToolingPyTorch + HFFastAPI Servingmodel APIsData LayerSQL, vectors, cacheProduct UXReact, ExpoEvaluationmetrics + failure
What matters most Grounded answers, measurable APIs, reliable data paths, and product-ready AI behavior.

AI / LLM Systems

LLM app developmentRAGPrompt engineeringContext-aware pipelinesStructured outputsModel evaluationHugging FaceTransformersAgentic workflowsLoRA/QLoRA basics

Backend / APIs

PythonFastAPIFlaskREST API designJWT authAsync processingModel serving APIsPostman

Data / ML

SQLPandasNumPyFeature engineeringTime-series analysisAnomaly detectionscikit-learnPyTorchTensorFlow/KerasXGBoostProphet

Frontend / Product

JavaScriptReactReact NativeExpoNext.jsTailwind CSSRechartsProduct dashboards

Storage / Deployment

PostgreSQLChromaDBInfluxDBMongoDBRedisDockerPipeline optimizationGit
projects

Four builds that explain me

If someone wants to understand how I think as an AI engineer, TraceroAI is the current build log and these are the proof-of-work systems around it.

TraceroAI

Current build: RAG observability and LLMOps for tracing retrieval, inspecting context, evaluating groundedness, and debugging hallucination failures.

Trace retrieval pathsInspect context windows
RAG ObservabilityLLMOpsTracingGroundedness EvalsContext InspectionDebug UX
Live build
metrics and signals

TraceroAI

Traceretrieval paths
Inspectcontext windows
Evaluategroundedness
Debughallucination failures
Open TraceroAI

SignalStack

I built SignalStack because resumes alone do not prove engineering ability. It reads real repositories, extracts evidence, and turns proof-of-work into reviewable hiring signals.

26 backend test filesRBAC admin/recruiter isolation
FastAPIReactOpenAIPostgreSQLRedisGitHub API
View GitHub
metrics and signals

SignalStack

26backend test files
RBACadmin/recruiter isolation
Redisbackground evaluation queue
Prometheuslatency and LLM cost metrics
Open repository

Astronaut Space Health

This is my IEEE-linked health AI work expanded into a full-stack monitoring system: streaming vitals, time-series storage, model-serving APIs, and dashboard-ready risk outputs.

<50ms prediction latency target144 step health window
FastAPINode.jsInfluxDBMongoDBXGBoostProphet
View GitHub
metrics and signals

Astronaut Space Health

<50msprediction latency target
144step health window
25%false-positive alert reduction
IEEEpublished health-monitoring work
Open repository

AI Coach

AI Coach is where I explored personal AI as a product: goals, habits, recent activity, and a coaching layer that responds with context instead of generic motivation.

<200ms context lookup target40%+ API latency reduction under load
FastAPIExpoReact NativeGeminiSQLAlchemyJWT
View GitHub
metrics and signals

AI Coach

<200mscontext lookup target
40%+API latency reduction under load
100+concurrent-user design target
128LRU response cache size
Open repository
portfolio lab

Challenge my build log

A focused interface for asking about Chinmai's projects, metrics, skills, and engineering decisions.

AI engineering game

Reliability Quest

0 / 100 XP
0 XP

Click each quest to unlock how I think about reliable AI systems. Every step maps to a real build.

TraceroAIGround the answer

Trace retrieval, inspect context, and evaluate groundedness so RAG answers can be debugged instead of blindly trusted.

chinmai-build-log

Ask me about TraceroAI, SignalStack, Astronaut Space Health, AI Coach, or Chinmai's AI/ML stack.

command layer

Type to navigate

Click anywhere outside the input, type a destination, and the portfolio routes there instantly.

traceroaigithublinkedinemailprojectscasesassistantcontact
case studies

What each build taught me

These are the stories I would use in an interview: the problem, the engineering choice, the result, and what I learned.

SignalStack

Evidence-first AI evaluation

problem

Hiring systems often over-trust keywords, resumes, and ungrounded AI summaries.

solution

Parse repositories, select source evidence, verify authorship signals, score capability separately from confidence, and show reviewers the audit trail.

result

The workflow surfaces fit score, code evidence, confidence, verification status, production-readiness signals, and recruiter decisions.

lesson

A reliable AI evaluator needs retrieval, deterministic checks, scoring design, and review UX, not only an LLM prompt.

Astronaut Space Health

Real-time ML needs clear contracts

problem

Streaming health predictions are fragile when feature order, time windows, model artifacts, and warm-up behavior are not explicit.

solution

Define a 144-step telemetry window, store raw/derived/context measurements, serve XGBoost and IsolationForest outputs, and expose documented API contracts.

result

The system returns risk labels, probabilities, anomaly scores, alert states, dominant drivers, and forecast outputs through product-facing endpoints.

lesson

The model is only useful when the data cadence, inference path, and dashboard semantics are engineered together.

AI Coach

Personalization without prompt chaos

problem

A generic chatbot cannot coach well if it ignores goals, habits, recent logs, intent, and safety boundaries.

solution

Inject only relevant user context, route by intent, block obvious prompt-injection patterns, cache stable responses, and keep mobile workflows simple.

result

Users can sign up, manage goals, log habits, track streaks, and receive coaching grounded in their actual behavior.

lesson

Useful AI products start with clean product state and careful context selection.

TraceroAI

RAG observability needs a debug surface

problem

RAG apps can fail silently when developers cannot see retrieved chunks, context assembly, groundedness, or hallucination points.

solution

Build an LLMOps workflow for tracing retrieval, inspecting context, evaluating groundedness, and debugging hallucination failures.

result

The live build frames TraceroAI as a developer-facing observability layer for retrieval quality and LLM behavior.

lesson

Reliable RAG needs instrumentation and evaluation surfaces as much as better prompts.

failure wall

Things I had to learn by building

I want this portfolio to show judgment, not perfection. These are the lessons that changed how I build the next version.

grounding

Do not let fluent output outrun evidence

This is probably the biggest rule I keep coming back to. If evidence is thin, the system should lower confidence instead of sounding more polished.

systems

Model demos need warm-up and failure states

The astronaut platform taught me that a model is not always ready just because an endpoint exists. Readiness, warm-up, and missing data need UI states.

prompt design

Context should be selected, not dumped

In AI Coach, better answers came from sending less context, but the right context. That lesson changed how I think about prompt design.

github live stats

Public build signal

Fetched from @chinmai-sd-123. Status: fallback.

18public repositories
3repos updated in 180 days
12recent public push events
recentlatest public repo update
currently building

Work in motion

Right now I am focused on TraceroAI: RAG observability, retrieval traces, groundedness checks, and developer-facing LLM debugging.

TraceroAI

My current work-in-motion: an end-to-end RAG observability and LLMOps project for retrieval traces, context inspection, groundedness evaluation, and hallucination debugging.

Next.jsRAGLLMOpsTracingEvalsDebugging
Live build
live build
AI Coach

I am hardening the coach beyond MVP: memory, personalization, rate limits, deployment readiness, and better evaluation around response quality.

FastAPIExpoGeminiJWTSQLAlchemyCaching
active
writing

Ideas I keep circling

These are the topics I naturally come back to while building: evidence, latency, model contracts, and human review.

LLM evals

Evidence-backed AI evaluation

How SignalStack separates capability, evidence confidence, verification, and production readiness.

Discuss this
ML systems

Health telemetry as an ML product

What real-time prediction systems need beyond a trained model: cadence, artifacts, APIs, and monitoring.

Discuss this
LLMOps

RAG observability is product work

TraceroAI's core idea: make retrieval, context, groundedness, and hallucination failures visible to developers.

Discuss this
contact

Let us talk about useful AI

The best way to evaluate me is simple: read the repos, inspect the systems, then ask me how I would make the next version stronger.

Best way to reach me

I am most interested in AI systems where reliability, evaluation, and user trust matter. Send me the problem, the constraints, and what a good answer should prove.

Bengaluru, India