Skip to content
View esolnguyen's full-sized avatar

Block or report esolnguyen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
esolnguyen/README.md

Hi, I'm Thang Nguyen πŸ‘‹

AI/ML Engineer Β· Computer Vision Β· LLM Applications Β· MLOps

Machine Learning Engineer with 5+ years of experience and a robust foundation in Software Engineering and Full-Stack Development. Specialized in decomposing complex problems into scalable, production-grade AI solutions. Proven expertise in architecture end-to-end ML pipelines, deploying Large Language Models (LLMs) and RAG systems, and orchestrating serverless workflows on AWS. Adept at bridging the gap between research models and reliable business applications.


About Me

  • 🧠 AI/ML Engineer with 5+ years spanning Software Engineering, Full-Stack Development, and applied AI
  • πŸŽ“ MSc in Artificial Intelligence @ University of Science, VNU (2023–2026)
    • Thesis: Extracting Information from Tires Using CNNs and Large Language Models
  • πŸŽ“ BSc in Software Engineering @ University of Science, VNU (2016–2021)
  • 🏒 Currently Machine Learning Engineer at TecAlliance (promoted from SWE in 2022)
  • 🌍 Based in Ho Chi Minh City, Vietnam
  • πŸ—£οΈ Vietnamese (Native) Β· English (Professional Working Proficiency)
  • πŸ“« thangnguyen31081998@gmail.com

What I Build

πŸ”¬ Machine Learning Track

Computer Vision      β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ  Object detection (YOLO), image retrieval, OCR
Deep Learning        β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    Model training, fine-tuning, CNN pipelines, PyTorch
MLOps & Infra        β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ       AWS Step Functions, CDK, Docker, CI/CD

πŸ€– AI Engineering Track

LLM Applications     β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ  Multi-agent orchestration, RAG, tool-using agents
Prompt Engineering   β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ    Structured outputs, few-shot retrieval, eval loops
Backend / APIs       β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ       FastAPI, microservices, REST API design

Tech Stack

Languages Python TypeScript JavaScript SQL C#

AI & Machine Learning PyTorch scikit-learn YOLO OpenCV MMOCR LangChain OpenAI Anthropic

Web & APIs FastAPI React Angular

Cloud, DevOps & MLOps AWS Azure Docker Azure DevOps

Databases & Vector Stores PostgreSQL MSSQL MongoDB DynamoDB ChromaDB Pinecone


Featured Work

πŸ”¬ Machine Learning

πŸ›ž Tire Sidewall Extraction Β Β·Β  MSc Thesis End-to-end CV pipeline extracting structured data from tire sidewall images.

  • Boosted detection accuracy from 93.55% β†’ 95.05% (@ IoU 0.5) by enhancing YOLOv11 with multi-branch features
  • Geometric unwrapping + CLAHE preprocessing for low-light garage images
  • Multi-LLM validation loop for robust JSON standardization

πŸ”Ž Visual Search Engine Β Β·Β  TecAlliance CNN-based car part image retrieval with scalable vector indexing.

  • 88% top-k accuracy in similarity search; evaluated multiple CNN backbones
  • Reduced manual part identification from hours to seconds
  • Production vector index on Pinecone

πŸš— Vehicle Matching Pipeline Β Β·Β  TecAlliance Serverless AWS pipeline reconciling New Zealand government schemas with internal datasets.

  • 98% matching accuracy using decision trees + rule-based logic
  • High-precision reconciliation across disparate schema sources

πŸ€– AI Engineering

πŸ“„ Multi-Agent Extraction Platform Β Β·Β  TecAlliance LLM-powered platform replacing manual internal document workflows.

  • Significant reduction in operational workload across high-volume document pipelines
  • FastAPI + React with Okta auth, multi-model orchestration, human-in-the-loop feedback
  • Designed for external client expansion via REST API

🌐 Multi-Agent Domain Translation  ·  TecAlliance Domain-adaptive translation pipeline with autonomous quality control.

  • Reduced LLM hallucination rates via LangChain multi-agent orchestration
  • Knowledge-Graph-based n-gram index for terminological consistency
  • Cycle Consistency validation loop for self-verifying outputs

πŸ’¬ RAG SQL Assistant Β Β·Β  TecAlliance Natural language interface over databases and Azure Application Insights.

  • Few-shot retrieval via ChromaDB + schema-aware tool usage
  • Self-correcting execution loops eliminate ad-hoc engineering support for PO queries

βš™οΈ Platform & MLOps

🧰 ML Developer Experience (DevEx)  ·  TecAlliance Internal Python platform library standardizing ML development.

  • Unified Okta auth + OCR/LLM integration across internal ML projects
  • Embedded CloudWatch telemetry wrapper - traces performance and meters OCR/LLM cost per user and per project

☁️ MLOps & Orchestration  ·  TecAlliance End-to-end ML infrastructure on AWS.

  • Multi-agent batch workflows in AWS Step Functions with automated retries
  • Standardized Infrastructure-as-Code via AWS CDK
  • CI/CD automation through Azure DevOps

Career

  • TecAlliance - Machine Learning Engineer (Oct 2022 – Present), Software Engineer (Nov 2021 – Oct 2022)
  • Netwealth - Full-Stack Developer (Mar 2020 – Nov 2021).
  • KΓ€rcher - Software Engineer Intern/Junior (Aug 2019 – Mar 2020).

Certifications

  • Professional Scrum Master I (PSM I)
  • Professional Scrum Master II (PSM II)

Connect

πŸ“§ thangnguyen31081998@gmail.com

Popular repositories Loading

  1. nl-to-sql nl-to-sql Public

    Turn plain-English questions into executable SQL. LangChain agents over a schema-aware FastAPI service, with example-SQL retrieval (RAG), per-DB instructions, OpenAI fine-tuning, and streaming gene…

    Python 2

  2. tire-sidewall-text-extraction tire-sidewall-text-extraction Public

    Extract structured info (manufacturer, model, size, DOT) from tire sidewall images via a YOLO + TRBA OCR + Gemini LLM pipeline, served over FastAPI.

    Jupyter Notebook 1

  3. booking-chatbot booking-chatbot Public

    AI corporate travel booking assistant that recommends flight and hotel combos with policy validation, confidence scoring, and human-in-the-loop approvals.

    Python 1

  4. bedrock-pr-review bedrock-pr-review Public

    AI-powered PR review agent on AWS Bedrock (Claude). Reviews GitHub and Azure DevOps pull requests for security, code quality, and requirements coverage, with Jira/Azure Boards work item linking. De…

    Python 1

  5. vehicle-ds-project vehicle-ds-project Public

    Data science & ML pipeline on the NZ Motor Vehicle Register cleaning, EDA, and EV adoption forecasting

    Jupyter Notebook 1

  6. kiro-ui-tool kiro-ui-tool Public

    Visual dashboard for managing Kiro AI agents, commands, skills, and pipelines a full GUI layer over the ~/.kiro directory.

    TypeScript 1