Rob Kras

DevOps & Data Engineer

Building the data platform foundation at Rabobank — CI/CD, infrastructure-as-code, reliable data pipelines, and observability.

Rob Kras

About Me

I'm a DevOps and Data Engineer at Rabobank, building and operating the data platform foundation that supports analytics and machine learning workloads across the bank.

My work bridges platform engineering and data engineering — from CI/CD and infrastructure-as-code to reliable data pipelines and observability — making complex systems reproducible, scalable, and trustworthy.

I hold an MSc in Computer Science (Data Science & AI) from Leiden University and bring strong foundations in Python, SQL, and production ML workflows, with hands-on research experience in modern AI architectures.

Education

MSc Computer Science Universiteit Leiden Data Science & AI · Feb 2024 – Jul 2025
BSc Computer Science Vrije Universiteit Amsterdam Minor: Data Science · Sep 2020 – Dec 2023

Certifications

Databricks Fundamentals Databricks · Mar 2026
Generative AI Fundamentals Databricks · Mar 2026
C1 Advanced Cambridge English · May 2018

Technical Skills

DevOps & Cloud

Microsoft Azure Databricks CI/CD Infrastructure-as-Code Git Linux Containerisation Monitoring & Observability

Data Engineering

ETL/ELT Pipelines Async Python MySQL Pandas NumPy Data Quality & Lineage

ML & Deep Learning

XGBoost CatBoost Random Forest TensorFlow Keras PyTorch CNNs Ensemble Methods

AI & NLP

HuggingFace Transformers Vision-Language Models LLaMA SAM2 OpenAI API spaCy Multimodal AI

Data Tooling

Scikit-Learn Optuna SHAP SMOTE Matplotlib Seaborn Jupyter

Languages

Python SQL Scala Bash C/C++ JavaScript

Experience

Oct 2025 – Present

DevOps Engineer / Data Engineer

Rabobank — Data Management & Data Products, W&R Markets and Treasury Tribe

  • Build and operate cloud and DevOps infrastructure on Microsoft Azure and Databricks, supporting analytics and ML workloads across the bank — CI/CD, infrastructure-as-code, and monitoring.
  • Design and maintain reliable, reproducible data pipelines on Databricks for the Reporting & Value Chain Services domain, with a focus on observability, lineage, and recovery.
  • Automate deployments, environment provisioning, and operational workflows on Azure so data engineers and analysts can ship faster with fewer incidents.
  • Lead ML engineering on an approved departmental innovation pilot to automate operational tasks using machine learning, owning model design, data integration, and productionisation end-to-end.
  • Own non-functional concerns — reliability, security, cost, and disaster recovery — through guardrails and automation rather than manual intervention.
Azure Databricks CI/CD Infrastructure-as-Code MLOps Observability

Featured Projects

A selection of projects showcasing my expertise in data engineering, machine learning, and cloud infrastructure.

Rainfall Prediction

Top 0.2%

Feature engineering, K-Folds, and ensemble methods. Discovered that simpler algorithms (KNN) can outperform complex ensembles when properly optimized. Rank 5 / 2,529.

KNN Ensemble K-Folds Feature Engineering

House Prices Prediction

Top 1%

Regression techniques with domain knowledge and SHAP for feature importance. Achieved rank 37 / 3,935. Discovered and exploited data leakage for near-perfect score.

Regression SHAP XGBoost Data Leakage

Loan Payback Prediction

Top 4%

Binary classification for financial risk assessment. Feature engineering combining credit metrics, debt-to-income ratios, and payment history. Ensemble of XGBoost, LightGBM, and CatBoost. Rank 172 / 3,724.

Classification Ensemble SHAP XGBoost LightGBM

Music BPM Prediction

Top 5%

Predicting song tempo from audio features. Combined signal processing with machine learning, leveraging music theory for feature engineering. Rank 131 / 2,581.

Audio ML Regression Signal Processing Gradient Boosting

Road Accident Risk

Top 8%

Ensemble methods predicting accident risk. Created temporal and weather interaction features. Containerized training pipeline using Docker. Rank 313 / 4,082.

Ensemble Optuna Docker Feature Engineering

Bank Marketing

Top 17%

Customer response prediction in 7 days. YAML configuration management for rapid prototyping. Rank 576 / 3,367.

Classification Optuna YAML Config