Tung Nguyen
Lead Data Engineer | Solution Architect | Project Manager
Professional Philosophy
Can be very down to earth to write code, draw design diagrams, write docs, collect & structure requirements, working side by side with engineers to get things done.
But can also be 10,000 feet high working with senior stakeholders/executives to provide strategy, tactics, define project plans, inspiring but also realistic.
Key Expertise
- Databricks: Delta Live Table, Materialized View, Change Data Capture (Batch & Streaming)
- MLOps: Designed & delivered MLOps for Prudential following Databricks best practices
- Architecture: Large-scale system design and delivery for production environments
- Leadership: Cross-functional team coordination and stakeholder management
Portfolio Projects
Deep Analysis Open Source Integration
Input: Messy bunch of xlsx, sqlite, markdown, pdf, powerpoint, prompt
Current Output: PDF report with text and charts
Future Output: DLT pipeline, DBT repo, PBI model, docs
Technology: vLLM, NodeJS, MCP, Agents
AWM Fund Reporting Tool
Input: User prompt query about stock market, investment advice
Output: Chart from real-time data
Technology: Python crawler, LLM, matplotlib, NextJS
Marketing Agency Automation Tool
Input: Original YouTube video
Output: New video with celebrity lip sync
Technology: F5-TTS, Latent Sync ByteDance, OpenAI Whisper, FastAPI
Insurance Voice & Text Chatbot
Features: RAG from insurance contracts, support text & voice chatbot
Technology: Qwen, Deepseek, VectorDB, vLLM
Major Client Projects
Top Tier Insurance Company – Prudential (Solution Architect)
- Defined Regional MLOps design
- Researched and compared Azure ML Studio vs Databricks across multiple criteria
- Proposed detailed MLOps design following Databricks best practices
- Implemented design for 2 models in 2 countries
- Setup infrastructure and DevOps using Databricks Asset Bundle
- Implemented Lakehouse monitoring and Unity Catalog Lineage
- Presented results and received approval from CDO
Top Tier Coffee Chain (Solution Architect)
- Performed gap analysis of current vs best practice design
- Defined data flow design: ADF copy, Delta Live Table (DLT), Bronze Silver Gold
- Defined Unity Catalog design for 6 countries, 4 data domains
- Built ADF data pipeline, DBR workflow, DLT following best practices
- Conducted workshops to collect business requirements and build Gold layer
Top Tier Furniture Retailer (Data Modelling Architect)
Led data modeling initiatives for large-scale retail operations.
Multiple Generative LLM AI Projects (Solution Architect & Delivery Manager)
Architected and delivered various AI solutions across different industries.
Technical Skills
Programming
SQL, Python
Data Platforms
Databricks, Synapse, ADF
Cloud Infrastructure
Azure Infra, VNET, Private Endpoint
DevOps
Azure DevOps CI/CD
Business Intelligence
Power BI
Certifications
Azure Data Engineer Associate