SamiraSiavash/README.md

👋 Hi, I'm Samira Siavash

Python Developer | Data Engineering Enthusiast | BI & Analytics

I have a background in financial software implementation and support, where I gained hands-on experience working with business processes, databases, and real-world data.

My interests lie in data engineering, analytics, and automation. I enjoy building data pipelines, integrating data from different sources, transforming raw data into meaningful insights, and creating solutions that support data-driven decision-making.

Currently, I am expanding my skills through practical projects involving Apache Kafka, Apache Airflow, Apache Spark, PostgreSQL, MongoDB, ClickHouse, Grafana, and Power BI, while continuing to strengthen my expertise in Python and SQL.


🔧 Tech Stack

Programming & Data Processing

  • Python
  • SQL
  • Scala (Apache Spark Fundamentals)
  • Linux

Data Engineering

  • Apache Kafka
  • Apache Airflow
  • Apache Spark
  • ETL Development
  • Data Cleaning & Transformation

Databases

  • PostgreSQL
  • SQL Server
  • MongoDB
  • ClickHouse

Analytics & Visualization

  • Power BI
  • Grafana

Tools

  • Git & GitHub
  • Docker
  • REST APIs
  • Web Scraping

🚀 Featured Projects

📡 Kafka Spam Detection Pipeline

Real-time spam classification pipeline that streams SMS messages through Kafka, labels them as spam or ham with a Scikit-Learn model, and uses PostgreSQL for storage.

Tech: Kafka, Python, PostgreSQL, Scikit-Learn
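
A minimal sketch of the consume, classify, store loop behind a pipeline like this one. It is not the project's code: `queue.Queue` stands in for the Kafka topic, SQLite stands in for PostgreSQL, and the keyword rule in `classify` is a hypothetical placeholder for the trained Scikit-Learn model. The table name `predictions` and the sample messages are also assumptions.

```python
import queue
import sqlite3

# Hypothetical keyword rule standing in for the trained Scikit-Learn model.
SPAM_HINTS = ("free", "winner", "prize", "claim")


def classify(message: str) -> str:
    """Return 'spam' if any hint appears as a whitespace-separated token."""
    words = message.lower().split()
    return "spam" if any(hint in words for hint in SPAM_HINTS) else "ham"


def run_pipeline(incoming: queue.Queue, conn: sqlite3.Connection) -> int:
    """Drain the queue, classify each message, and store the result."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS predictions (message TEXT, label TEXT)"
    )
    processed = 0
    while True:
        try:
            message = incoming.get_nowait()
        except queue.Empty:
            break  # nothing left to consume
        conn.execute(
            "INSERT INTO predictions (message, label) VALUES (?, ?)",
            (message, classify(message)),
        )
        processed += 1
    conn.commit()
    return processed


# In-memory stand-ins: queue.Queue for the Kafka topic, SQLite for PostgreSQL.
topic = queue.Queue()
for text in ["Claim your free prize now", "Are we still meeting at noon?"]:
    topic.put(text)

db = sqlite3.connect(":memory:")
count = run_pipeline(topic, db)
rows = db.execute(
    "SELECT message, label FROM predictions ORDER BY rowid"
).fetchall()
```

Keeping `classify` separate from `run_pipeline` means the placeholder rule could be swapped for a real model's prediction call without touching the storage step.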


🔄 SMS ETL Pipeline

End-to-end ETL workflow for extracting, transforming, and loading SMS datasets.

Tech: Python, Pandas, PostgreSQL, MongoDB
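
The extract, transform, load steps can be sketched with the standard library alone. This is an illustration under assumptions, not the project's implementation: the input is taken to be tab-separated `label<TAB>message` lines, SQLite stands in for PostgreSQL, and the sample lines, the table name `sms`, and the `is_spam` column are made up for the example.

```python
import csv
import io
import sqlite3

# Two made-up lines in a label<TAB>message layout (not real dataset rows).
SAMPLE = "ham\tSee you at  lunch \nspam\tWIN a prize, reply now\n"


def extract(source):
    """Yield (label, text) pairs from a tab-separated text stream."""
    reader = csv.reader(source, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) == 2:  # skip malformed lines
            yield row[0], row[1]


def transform(pairs):
    """Collapse repeated whitespace and turn the label into a 0/1 flag."""
    for label, text in pairs:
        yield " ".join(text.split()), 1 if label == "spam" else 0


def load(records, conn):
    """Insert cleaned records into a table and return the total row count."""
    conn.execute("CREATE TABLE IF NOT EXISTS sms (text TEXT, is_spam INTEGER)")
    conn.executemany("INSERT INTO sms (text, is_spam) VALUES (?, ?)", records)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM sms").fetchone()[0]


conn = sqlite3.connect(":memory:")
loaded = load(transform(extract(io.StringIO(SAMPLE))), conn)
stored = conn.execute("SELECT text, is_spam FROM sms ORDER BY rowid").fetchall()
```

Each stage is a generator, so rows flow through one at a time and the whole file never has to sit in memory.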


🌐 Divar API to PostgreSQL

Automated data ingestion pipeline that extracts data from the Divar API and stores it in PostgreSQL.

Tech: Python, REST API, PostgreSQL


⚙️ Divar Airflow Pipeline

Workflow orchestration and scheduling with Apache Airflow: a sample DAG fetches city data from the Divar API and saves it as CSV.

Tech: Apache Airflow, Python, PostgreSQL
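
As a rough illustration, the kind of function an Airflow task in such a pipeline might call to write fetched records to CSV could look like the sketch below. Everything specific here is an assumption: the function name `save_cities_csv`, the `id` and `name` columns, and the sample payload are hypothetical, and the fetch step is left out because the Divar API's endpoints are not described in this README.

```python
import csv
import os
import tempfile


def save_cities_csv(cities, path):
    """Write a list of city dicts to a CSV file and return the row count."""
    fieldnames = ["id", "name"]  # hypothetical columns
    with open(path, "w", newline="", encoding="utf-8") as handle:
        # extrasaction="ignore" drops keys that are not in fieldnames.
        writer = csv.DictWriter(
            handle, fieldnames=fieldnames, extrasaction="ignore"
        )
        writer.writeheader()
        writer.writerows(cities)
    return len(cities)


# Hypothetical payload standing in for an API response.
sample_cities = [
    {"id": 1, "name": "Tehran", "slug": "tehran"},
    {"id": 2, "name": "Shiraz", "slug": "shiraz"},
]

out_path = os.path.join(tempfile.mkdtemp(), "cities.csv")
written = save_cities_csv(sample_cities, out_path)
```

In an Airflow DAG, a plain function like this would typically be passed to a `PythonOperator` through its `python_callable` argument, which keeps the logic testable outside the scheduler.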


🧹 Data Cleaning Project

Data preprocessing, cleaning, and transformation workflow for analytics-ready datasets.

Tech: Python, Pandas


🔥 Spark Data Processing Project

Large-scale dataset processing, cleaning, and transformation using Apache Spark.

Tech: Spark, Python, Scala, Parquet


🏛️ ClickHouse Analytics Project

Analytical processing and reporting using ClickHouse, PostgreSQL, and Grafana.

Tech: ClickHouse, PostgreSQL, Grafana


📊 Grafana Analytics Dashboard

Interactive dashboards and KPI monitoring for business analytics.

Tech: Grafana, PostgreSQL


📈 Students Dashboard

Interactive Power BI dashboard for analyzing student performance metrics.

Tech: Power BI, DAX, Data Modeling


🌸 Perfume Scraper

Web scraping project for collecting and storing perfume product data.

Tech: Python, Requests, BeautifulSoup, SQLite
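
A small sketch of the parse-and-store half of a scraper like this. To stay dependency-free it uses the standard library's `html.parser` in place of BeautifulSoup and parses an inline string instead of fetching a page with Requests. The markup, class names, product names, and prices are all hypothetical; a real shop page will be structured differently.

```python
import sqlite3
from html.parser import HTMLParser

# Hypothetical markup; a real shop page will use different tags and classes.
PAGE = """
<ul>
  <li class="product"><span class="name">Rose Water</span><span class="price">120</span></li>
  <li class="product"><span class="name">Amber Night</span><span class="price">95</span></li>
</ul>
"""


class ProductParser(HTMLParser):
    """Collect (name, price) pairs from spans with class 'name' and 'price'."""

    def __init__(self):
        super().__init__()
        self.products = []
        self._field = None  # which span we are currently inside, if any
        self._current = {}

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "span" and css_class in ("name", "price"):
            self._field = css_class

    def handle_data(self, data):
        if self._field:
            self._current[self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None
        elif tag == "li":
            # Only keep items where both fields were found.
            if {"name", "price"} <= self._current.keys():
                self.products.append(
                    (self._current["name"], int(self._current["price"]))
                )
            self._current = {}


parser = ProductParser()
parser.feed(PAGE)

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE perfumes (name TEXT, price INTEGER)")
conn.executemany("INSERT INTO perfumes VALUES (?, ?)", parser.products)
conn.commit()
total = conn.execute("SELECT COUNT(*) FROM perfumes").fetchone()[0]
```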


📚 Currently Learning

  • Apache Spark
  • ClickHouse
  • Data Warehousing
  • Data Modeling
  • Distributed Data Processing
  • Real-Time Data Pipelines

🎯 Career Focus

  • Data Engineering
  • ETL & ELT Pipelines
  • Data Platforms
  • Analytics Engineering
  • Big Data Processing
  • Workflow Automation

📫 Connect With Me

LinkedIn: https://linkedin.com/in/samira-siavash

GitHub: https://github.com/SamiraSiavash

Email: s.siavash.m@gmail.com

Let’s connect and talk about data, BI, and engineering solutions!

Pinned

  1. kafka-spam-detection-pipeline (Public)

    Real-time spam detection pipeline using Apache Kafka, Python, Scikit-learn, and PostgreSQL. Incoming SMS messages are streamed through Kafka, classified as Spam/Ham using a machine learning model, …

    Python 1

  2. divar-airflow-pipeline (Public)

    A sample Airflow DAG that fetches city data from the Divar API and saves it as CSV.

    Python 1

  3. sms-etl-pipeline (Public)

    ETL pipeline that processes the SMSSpamCollection dataset using Python, Pandas, PostgreSQL, and MongoDB, with both structured and unstructured data storage.

    Python 1

  4. divar-api-to-postgres (Public)

    A simple end-to-end ETL pipeline that extracts real-world data from the Divar API, transforms it using Pandas, and loads it into a PostgreSQL database.

    Python 1

  5. Grafana-Northwind-Dashboard (Public)

    Grafana dashboard built using the Northwind dataset (JSON export included).

  6. Students-dashboard (Public)

    Interactive Power BI dashboard for student performance.