Data Analyst & Applied Data Scientist | Econometrics • Statistical Modelling • Enterprise Statistics
Welcome to my GitHub!
I am a Data Analyst and Applied Data Scientist with a strong foundation in econometrics, statistical modelling, data analysis, forecasting, and reproducible analytical workflows.
I combine PhD-level research experience with practical skills in Python, R, SQL, and statistical software to build transparent, interpretable, and well-structured analytical solutions that support evidence-based decision-making.
My work spans Data Science, Econometrics, Enterprise Statistics, Data Integration, and Business Analytics, with a particular interest in transforming complex data into meaningful statistical and analytical insights.
I work extensively with Python, R, SQL, Power BI, and modern analytical tools across a range of methodological and applied projects.
- Data Science
- Econometrics
- Enterprise Statistics
- Statistical Modelling
- Data Integration
- Business Analytics
- Reproducible Analytical Workflows
Languages & Analytics:
Python • R • SQL • Econometrics • Statistical Modelling • Time Series • Forecasting
Python Stack:
pandas • numpy • scikit-learn • matplotlib • OOP • ETL pipelines • APIs
R Stack:
dplyr • tidyr • purrr • ggplot2 • lubridate • renv • panel data workflows • economic indicator construction
Data Ops & Workflow:
Git • GitHub • Reproducible Workflows • Data Cleaning • Data Preparation
BI & Visualization:
Power BI • Tableau
Statistical Tools:
Stata • GLM • Statistical Inference • Model Evaluation • EDA
A reproducible methodological workflow demonstrating statistical editing, harmonization, multisource integration, and structural indicator production using synthetic enterprise data.
Tech: R, dplyr, statistical validation, harmonization, data integration, reproducible workflows, renv
A modern finance analytics solution integrating data engineering, profitability modeling, and customer churn prediction — designed for banking and financial institutions.
Tech: Python, SQL, data engineering, PowerBI
Supervised learning project building a transparent ML workflow to classify ESG indicators using feature engineering, model evaluation, and reproducible pipelines.
Tech: Python, pandas, scikit-learn, feature engineering, classification models
Modular, object-oriented pipeline for extracting ESG-related data using APIs, structured processing steps, and reusable components.
Tech: Python, OOP, APIs, ETL, data cleaning
End-to-end actuarial risk modeling using GLMs to predict frequency, severity, and pure premium.
Tech: Python, GLM, Negative Binomial, model diagnostics, visualization
Time series forecasting using SARIMA and feature engineering to evaluate tourism dynamics in European hotel markets.
Tech: Python, SARIMA, time series modeling, feature engineering
I am currently developing a Retail Data Intelligence project, combining:
- Python OOP pipelines
- DuckDB SQL modeling
- API-based data extraction
- Forecasting & clustering
- Power BI dashboarding
- Business-driven retail KPIs
Repository will be published soon.
🔗 LinkedIn: https://www.linkedin.com/in/golib-sanaev
📧 Email: gsanaev80@gmail.com
Feel free to reach out — I am open to opportunities in Data Science, Data Analytics, Enterprise Statistics, Applied Econometrics, and Business Analytics roles.