Dongin Kim’s Data Science Portfolio
Welcome to my project showcase. Here you’ll find a collection of my key data science projects, demonstrating my expertise in solving complex business problems through advanced analytics and machine learning. These projects highlight the skills I’ve developed over the years and showcase my ability to deliver impactful solutions across various domains.
For a comprehensive overview of my technical skills and the tools I use:
🚀 Explore My Interactive Skills Dashboard 🚀
Featured Projects
- Navigating Uncertainty: ETA Prediction at Scale
- Built a large-scale ETA prediction system (2M+ daily deliveries) for food logistics, addressing high uncertainty from weather, traffic, and operational variability.
- Applied hybrid models (Neural Net + GBM), multi-task learning, and causal weather adjustment to improve robustness and accuracy.
- Key technologies: Python, Deep Learning, Statistical Modeling, Causal Analysis, MLOps
- Forecasting & Monitoring: Anomaly Detection in Time Series
- Developed a real-time anomaly detection system for transactional time series, combining forecasting models with statistical tests and stream-based alerts.
- Built end-to-end MLOps pipelines with automated retraining, validation, and deployment for continuous monitoring.
- Key technologies: Python, Time Series Forecasting, Stream Processing, Statistical Modeling, MLOps
- FoodieNet: AI-powered Food Allergy Detection
- Developed an end-to-end system that combines computer vision and LLMs to detect ingredients and provide comprehensive allergen information from food images, improving ingredient detection accuracy especially for allergenic substances
- Key technologies: CLIP, Vision Transformers, Large Language Models, MLOps
- Comic Book Recommendation System for Mobile Webtoon Users
- Built a hybrid recommendation system for a newly launched mobile webtoon service, addressing the cold start problem with content-based and collaborative filtering.
- Implemented real-time adaptation, A/B testing frameworks, and scalable architecture for growing user bases.
- Key technologies: Python, LightFM, Collaborative Filtering, Content-Based Filtering, A/B Testing
- RAG와 LLM을 활용한 기업 지식 관리 시스템 PoC
- AWS 기반 RAG 아키텍처를 설계하여 기업 문서 검색 및 LLM 기반 답변 생성 시스템 구축
- Hybrid Search, Re-ranking, Query Expansion 등 검색 품질 최적화 기법 적용
- Key technologies: AWS Bedrock, OpenSearch, RAG, LLM, Prompt Engineering
Technical Proficiency
Programming Languages
ML & DL
MLOps & Model Serving
Data Infrastructure & Query Engines
Visualization Tools
Cloud Platforms