Skip to content
View selengetu's full-sized avatar

Organizations

@DataExpert-io

Block or report selengetu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
selengetu/README.md

👋 Well Hello There! ✨

I’m Selenge Tulga , I’m a Data Engineer with 7 years (4.5 DE + 2.5 SWE) of experience and a background in Computer Science (B.S.) and Data Science (M.S.). I focus on building reliable, production-grade data platforms that replace manual reporting and support finance, operations, and business decision-making.

I enjoy working at the intersection of data systems, analytics, and real-world constraints, with a strong emphasis on data correctness, observability, and long-term maintainability. I care less about flashy tools and more about systems that are easy to reason about and trust.

📍 Based in Austin, TX 🇺🇸

LinkedIn Email


🛠️ Core Stack

Data Engineering & Analytics

  • Warehousing & Modeling: Snowflake, dbt, PostgreSQL, Redshift
  • Pipelines & Orchestration: Airflow, Prefect, AWS Glue, ETL/ELT
  • Streaming & Processing: Kafka, Flink, Spark, Databricks (when scale or latency requires it)
  • Cloud: AWS (primary), GCP
  • Visualization: Tableau, Power BI, Amazon QuickSight

I choose tools based on reliability, scale, and operational needs — not trends.


🎓 Certifications

  • AWS Certified Data Engineer – Associate
  • AWS Certified Machine Learning Engineer – Associate
  • AWS Certified Machine Learning – Specialty
  • Databricks Certified Data Engineer – Associate
  • Databricks Certified Data Engineer – Professional

🚀 What I’m Focused On Now

  • Preparing for Big Tech Data Engineer roles
  • Deepening expertise in SQL, data modeling, and system design
  • Building small, focused projects that demonstrate production-ready data pipelines
  • Practicing failure handling, monitoring, and data quality patterns

Pinned Loading

  1. realtime_reddit_trend realtime_reddit_trend Public

    End-to-end streaming data pipeline that ingests Reddit posts via the PRAW API, processes them with PySpark Structured Streaming, stores results in a serverless lakehouse (S3 + Glue + Athena), and v…

    Python

  2. ai_quality_check ai_quality_check Public

    An end-to-end data quality pipeline that uses Claude AI to automatically diagnose root causes when data quality checks fail — and delivers plain-English incident reports to Slack and a live Streaml…

    Python 1

  3. order_tracking_power_bi order_tracking_power_bi Public

    This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order status, sales trends, shipping details, and warehouse assignments.

    10

  4. HR-Analysis HR-Analysis Public

    A Power BI dashboard analyzing HR data, visualizing employee distribution, satisfaction, training costs, and department-wise insights.

    4 1

  5. neetcode-submissions-gzmz9xhb neetcode-submissions-gzmz9xhb Public

    My NeetCode.io problem submissions

    Python

  6. DE_AI_Interviewer DE_AI_Interviewer Public

    An AI-powered mock interview tool that reads a real job description and generates 12 tailored interview questions across 6 data engineering pillars. Answer everything at once, then get a full score…

    JavaScript 9 3