RyanBestoSaragih.
Data Scientist/Machine Learning Engineer
About Me
Background
As a Computer Science student, I bridge the gap between software engineering principles and modern Data Science.
My expertise goes beyond just analyzing data; I focus on building end-to-end systems—from architecting scalable ETL pipelines to deploying machine learning models into production.
Currently exploring the intersection of MLOps and Generative AI to solve complex real-world problems.
Universitas Negeri Surabaya
Aug 2023 - Jul 2027 (Expected)
Bachelor of Computer Science (S1)
Technical Arsenal
Engineering data solutions.
Relevant Coursework
Specializing in data-heavy computing and algorithms.
Professional Experience
Data Management & Analysis Intern

Unit Placement: Directorate General of Sea Transportation (Ditjen Hubla) | Division: Organization & Governance (Ortala). Supporting organizational efficiency and digital governance through rigorous data management and workflow analysis.
- Collection & Digitization: Assisting in the collection, validation, and digitization of critical organizational data assets.
- Workflow Analysis: Analyzing operational data and internal workflows to identify key opportunities for process optimization.
- Data Integrity: Collaborating with the governance team to maintain high data integrity and support strategic decision-making processes.
Junior Data Scientist

Led an end-to-end Customer Churn Analysis for a Telecommunications company using the CRISP-DM framework.
- Analyzed 7,000+ customer records using Python (Pandas) to detect churn drivers (42.7% churn rate found).
- Discovered $139k+ in lost Monthly Recurring Revenue (MRR) & formulated retention strategy.
- Designed interactive Looker Studio Dashboard to track high-risk segments.
- Proposed 'Contract Migration' strategy to increase customer Lifetime Value (LTV).
Data Engineer

Collaborated to architect a centralized Banking Data Warehouse, resolving fragmented data issues.
- Designed Star Schema Data Warehouse on SQL Server (FactTransaction & DimTables).
- Engineered modular ETL jobs in Talend Open Studio from 8 disparate sources.
- Implemented advanced transformations (tMap, tUnite) for data deduplication.
- Automated daily transaction summaries using T-SQL Stored Procedures.
Featured Projects

PlayLens AI | Neural Market Intelligence
Feb 2026 - Present | AI & Full-Stack Engineer
An enterprise-grade Market Intelligence platform that decodes global user feedback using advanced RAG (Retrieval-Augmented Generation) and LLMs. Features include automated Play Store scraping, multilingual sentiment analysis via XLM-RoBERTa, and an autonomous AI agent that provides context-aware customer support responses and market trend synthesis.

DATAION | Data Contract & AutoML Platform
Dec 2025 - Present | Full-Stack MLOps Engineer
A Full-stack MLOps platform designed to solve production failures caused by schema drift. DATAION enforces strict data contracts between data producers and ML pipelines, ensuring only high-quality data is processed. The platform features automated EDA, advanced model selection (XGBoost, LightGBM), and a real-time prediction simulator.

SmartConvert | AI-Powered CRM & XAI
Oct 2025 - Jan 2026 | AI Engineer (VINIX7)
An Intelligent CRM system designed to optimize banking sales operations by predicting customer conversion probability (Lead Scoring) and providing transparent AI reasoning (Explainable AI). Integrated SHAP for model interpretability, allowing stakeholders to explore 'Next Best Conversation' insights.
Organization Experience

Event Logistic Committee
Google Developer Group on Campus
Coordinated end-to-end logistics for 'Tech Talk Series #4: AI x Business', ensuring equipment readiness for 100+ attendees and reducing procurement costs by 15-20%.

Information & Communication Staff
Naposo HKBP Surabaya
Managing the organization's digital ecosystem, including end-to-end Instagram content creation and designing 10+ digital promotional materials for community events.
Licenses & Certifications

Generative AI with Large Language Models
Credential ID
WYMCHFVTJB1Q

Intermediate Data Science
Credential ID
19510809840-944

Back End Development and APIs
Credential ID
-

Data Classification with IBM Granite
Credential ID
-

Science & Technology Track
Credential ID
-

Generative AI: Alibaba Cloud & Qwen
Credential ID
20010199840-991

LLM-Based Tools & Gemini API
Credential ID
03972/H8/CSR/MBA/I/2026
Get In Touch
Let's Connect
I am currently seeking Internship opportunities in Data Science, Machine Learning, or Web Development. Whether you have a question or just want to say hi, my inbox is always open!