Pinned
- Building-A-Scalable-Data-Architecture-With-Microservices (Public)
Explores the design and implementation of a modern, adaptable data infrastructure using microservices.
- fraud-detection-ml (Public)
End-to-end fraud detection: 500k SA transactions · PySpark · XGBoost vs RF vs LR vs MLP · SHAP · MLflow · PR-AUC 0.709
Jupyter Notebook 1
- idm-debt-pipeline (Public)
End-to-end Medallion data pipeline for SA consumer debt portfolio: Azure Databricks, ADF, PySpark, ADLS Gen2
Python 1
- ieee-fraud-gru (Public)
Sequential Financial Fraud Detection using a 2-Layer GRU with Bahdanau Attention. Features PySpark sequence engineering for 590k+ transactions and ZAR business impact optimization (R1.69M+ net bene…
Jupyter Notebook 1
- ecommerce-sentiment-analysis (Public)
End-to-end NLP sentiment analysis pipeline for Amazon Appliance reviews. Compares LSTM, Conv1D, and BiLSTM architectures using PySpark and GloVe embeddings, featuring automated CI/CD validation.
Jupyter Notebook 1
- voice-of-customer-pipeline (Public)
Enterprise-grade VoC data pipeline using a Medallion Architecture (PySpark & Delta Lake) to process 1M+ Amazon reviews. Features distributed text normalization, salted joins for skew mitigation, an…
Python 1