You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
O objetivo deste projeto é demonstrar como processar eficientemente um arquivo de dados massivo contendo 1 bilhão de linhas (~15GB), especificamente para calcular estatísticas (Incluindo agregação e ordenação que são operações pesadas) utilizando Python.
A complete guide to ensuring seamless compatibility between pandas and cuDF in GPU-accelerated data processing. Covers data type handling, missing values, and merge operations step-by-step. Perfect for developers transitioning from pandas to FireDucks. 🚀
This project analyzes an online retail dataset using data cleaning, feature engineering, exploratory data analysis (EDA), and customer segmentation. It also optimizes performance using the Fireducks library for faster computations. Key insights include sales trends, customer behavior, and RFM-based segmentation to enhance business decisions.