|
i'm an AI/ML engineer based in the US. right now i'm building production AI systems at Reallytics.ai and Verticiti, mostly getting large language models to do useful things in the real world. not demos, actual systems with real users and real traffic. before this i was at Afiniti and Cloud Kinetics for a few years. fraud detection, voice analytics, enterprise search. the kind of stuff that pages you at 3am when something breaks. honestly what keeps me going is when an agent you built solves something you never explicitly told it to do. that feeling never gets old. what i'm working on right now:
|
|
|
Agentic AI Workflows |
RAG Enterprise Search |
|
Voice AI Platform |
LLM Fine-Tuning LoRA |
|
RLHF LLM Optimization |
Sentinel Fraud Detection |
not going to pretend i use everything equally. here's what i actually reach for:
the full picture (click to expand)
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM and GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| data and vector | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud and MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i write about what i'm building and learning. nothing polished, more like notes to my future self that happen to be public.
Fine Tuning Open Source Llms For Proprietary Use C
|
Automated Evaluation And Monitoring Of Llms In Pro
|
Real Time Ai Inference Optimization
|
Production Scale Retrieval Augmented Generation R
|
💬 Commented on [Feature] MiniMax-M3 in sgl-project/sglang (2026-06-08)
💬 Commented on [APEX研究] ⚙️ 技术架构 零一万物 Yi 跨模型技术架构深度对比分析 in 01-ai/Yi (2026-06-08)
💬 Commented on feat: Add retrieve_online_documents_v2 support to Redis onli in feast-dev/feast (2026-06-08)
💬 Commented on [feat]Add 5 built-in URL validators with enhanced domain ver in guardrails-ai/guardrails (2026-06-08)
💬 Commented on [Bug] Image upload triggers full page reload on Android Go ( in QwenLM/Qwen (2026-06-08)
⭐ Starred zengxiao-he/tessera (2026-06-08)
⭐ Starred MemMachine/MemMachine (2026-06-08)
⭐ Starred ferrumox/fox (2026-06-08)
stuff i've been digging into recently. mostly papers, blog posts, and rabbit holes that kept me up too late.
🔬 Automated Evaluation and Monitoring of LLMs in Production
🔬 Fine-Tuning Open-Source LLMs for Proprietary Use Cases
🔬 Real-Time AI Inference Optimization
🔬 Agent-Based Workflow Automation in Production
🔬 Fine-Tuning Large Language Models with Parameter-Efficient Techniques
🔬 Production-Scale Retrieval-Augmented Generation (RAG) in Enterprise Search
📌 Structured Output Validator for JSON Schema — Production Pattern (Python) (2026-06-08)
📌 Vector Similarity Search with FAISS — Production Pattern (Python) (2026-06-07)
📌 Agent Tool Registry with Dynamic Discovery — Production Pattern (Python) (2026-06-07)
🤖 Profile auto-updated on 2026-06-08 16:11 UTC


