Skip to content
View IlliaSator's full-sized avatar

Block or report IlliaSator

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
IlliaSator/README.md

Illia Sator — Software Engineer (ML)

Software Engineering graduate (BSUIR, Minsk) transitioning into ML/DS.
Building production-ready machine learning services with deployment, testing, and CI/CD.
Passionate about applying ML to real business problems — from credit risk to predictive maintenance and LLM-powered systems.


Stack

ML & Data Science
Python scikit-learn CatBoost XGBoost LightGBM PyTorch pandas numpy MLflow LangChain NLTK

Statistics & Analysis
A/B testing hypothesis testing feature engineering EDA matplotlib seaborn

Deploy & Backend
FastAPI Docker docker-compose GitHub Actions uvicorn REST API

Languages & Tools
Python SQL C# C++ Git pytest Jupyter Linux


Projects

Production-like ML monitoring service for credit scoring: FastAPI, PostgreSQL, Evidently drift reports, prediction logging, alerts, baseline management, model performance tracking, retraining triggers, Docker, CI/CD.

Production-style Computer Vision project for industrial safety monitoring: YOLO detection, PPE compliance checks, worker tracking, danger-zone alerts, FastAPI inference, Docker, training/evaluation pipeline, benchmarking, tests and CI.

Production-style NLP system for contract document analysis.
It classifies contract clauses, detects heuristic risk indicators, retrieves semantically similar clauses, and generates structured risk reports through a FastAPI service. Built with a real LEDGAR data pipeline, TF-IDF + Logistic Regression baseline, optional transformer training pipeline, evaluation reports, semantic search, Docker, tests and CI. The project connects my legal background with practical ML engineering and focuses on reproducibility, honest limitations, and human-in-the-loop document review.

Credit default prediction service on real banking data · 150k records

  • Model: GradientBoosting · ROC-AUC 0.868 · PR-AUC 0.400
  • MLflow experiment tracking · optimal threshold selection by F1 (+60% vs default)
  • FastAPI + web UI + Docker + CI/CD GitHub Actions

Built an end-to-end ML automation project for predicting whether a news article will trend. The project combines a scikit-learn training pipeline with time-based validation, threshold tuning, local model deployment with rollback, Telegram and Google Sheets integrations, Dockerized execution, CI via GitHub Actions, and visual workflow orchestration in n8n.

Aircraft engine failure prediction based on NASA CMAPSS telemetry

  • Model: CatBoost · ROC-AUC 0.991 · early stopping at iteration 115/1000
  • FastAPI + interactive dashboard + pytest + Docker + CI/CD

Multi-agent market trend analysis system powered by LLM

  • 5 specialized agents: Router · Researcher · Analyst · Extractor · Editor
  • LangChain + FastAPI + Docker + async pytest + CI/CD

InsightData Analyst — a local RAG-powered SQL agent that translates natural language into SQL queries. Built with LangGraph, Ollama, and Qdrant: the agent retrieves real table schemas from a vector store, generates accurate SQL, and self-corrects on errors — all without sending data to the cloud.

Chest X-Ray Pneumonia Classifier - Detection of pneumonia in chest X-ray images using ResNet18 transfer learning.


📍 Location & Availability

📍 Minsk, Belarus
💼 Open to: remote DS/ML/DA positions · Russian companies · Relocation EU

📫 Contact

Telegram LinkedIn

@IlliaSator's activity is private