Ryan Besto Saragih.
Bridging the gap between Data Science and Software Engineering.
I specialize in
Building End-to-End ML Systems.
About Me
Background
As a Computer Science student, I bridge the gap between software engineering principles and modern Data Science.
My expertise goes beyond just analyzing data; I focus on building end-to-end systems—from architecting scalable ETL pipelines to deploying machine learning models into production.
I am proficient in Python,SQL, andCloud Platforms. Currently exploring the intersection of MLOps and Generative AI to solve complex real-world problems.
Technical Arsenal
Tools I use to engineer robust data solutions.
Experience
Junior Data Scientist

Led an end-to-end Customer Churn Analysis for a Telecommunications company using the CRISP-DM framework.
- Analyzed 7,000+ customer records using Python (Pandas) to detect churn drivers (42.7% churn rate found).
- Discovered $139k+ in lost Monthly Recurring Revenue (MRR) & formulated retention strategy.
- Designed interactive Looker Studio Dashboard to track high-risk segments.
- Proposed 'Contract Migration' strategy to increase customer Lifetime Value (LTV).
Data Engineer

Collaborated to architect a centralized Banking Data Warehouse, resolving fragmented data issues.
- Designed Star Schema Data Warehouse on SQL Server (FactTransaction & DimTables).
- Engineered modular ETL jobs in Talend Open Studio from 8 disparate sources.
- Implemented advanced transformations (tMap, tUnite) for data deduplication.
- Automated daily transaction summaries using T-SQL Stored Procedures.
Featured Projects

DATAION | Data Contract & AutoML Platform
Dec 2025 - Present | Full-Stack MLOps Engineer
Engineered a full-stack MLOps platform designed to solve production failures caused by schema drift. DATAION enforces strict data contracts between data producers and ML pipelines, ensuring only high-quality data is processed. The platform features automated EDA, advanced model selection (XGBoost, LightGBM), and a real-time prediction simulator.

SmartConvert | AI-Powered CRM & XAI
Oct 2025 - Jan 2026 | AI Engineer (VINIX7)
An Intelligent CRM system designed to optimize banking sales operations by predicting customer conversion probability (Lead Scoring) and providing transparent AI reasoning (Explainable AI). Integrated SHAP for model interpretability, allowing stakeholders to explore 'Next Best Conversation' insights.

Automated ETL & Banking Data Warehouse
Jun 2025 - Jun 2025 | Data Engineer (id/x partners)
An end-to-end data warehousing solution that transforms raw, multi-source banking data into a centralized, analytics-ready asset. Implemented automated ETL pipelines in Talend to ingest data from 8 disparate sources and architected a Star Schema on SQL Server.
Licenses & Certifications

Generative AI with Large Language Models
Credential ID
WYMCHFVTJB1Q

Intermediate Data Science
Credential ID
19510809840-944

Back End Development and APIs
Credential ID
-

Data Classification with IBM Granite
Credential ID
-

Science & Technology Track
Credential ID
-

Generative AI: Alibaba Cloud & Qwen
Credential ID
20010199840-991

LLM-Based Tools & Gemini API
Credential ID
03972/H8/CSR/MBA/I/2026
Get In Touch
Let's Connect
I am currently seeking Internship opportunities in Data Science, Machine Learning, or Web Development. Whether you have a question or just want to say hi, my inbox is always open!