Theme: Pakistan's Digital Payments Landscape
This pipeline extracts data from World Bank Group's Data360 portal, historical popularity data of related search keywords on Google (using the Google Trends API), and related news articles from NewsAPI.
- Install dependencies for a conda environment using the
requirements.txtfile. - Set API keys in
.envfile (your NewsAPI key) - Run
python run_pipeline.pyto start the EL pipeline - Analysis: Run the
python src/exploratory_analysis.ipynbnotebook for some analysis.
The pipeline will extract, transform, and load data into the specified output format for further analysis.
Note: Answers to the given questions are given in the docs/part_1_questions.md and docs/part_2_questions.md
Note: Due to size constraints, data/raw's data files have been deleted.
