This project focused on transforming large-scale retail transaction data into actionable insights through advanced analytics and interactive visualization. Working with a dataset of over 200,000 transactions, I used Python and SQL to perform comprehensive data cleaning, transformation, and validation to ensure consistency across multiple regions and product categories. This rigorous preprocessing established a reliable foundation for conducting detailed revenue and profitability analysis, enabling accurate business performance comparisons and data-driven decision-making.
I conducted extensive exploratory data analysis (EDA) using statistical and visualization techniques to uncover the key drivers of sales trends and category-level performance. By applying correlation analysis, distribution mapping, and outlier detection, I identified patterns in regional performance and seasonal fluctuations that directly influenced revenue outcomes. These insights revealed growth opportunities and supported data-backed strategies for optimizing sales operations.
To make these insights accessible and interactive, I designed a Power BI dashboard featuring DAX-powered dynamic filters and calculated KPIs that allowed stakeholders to explore revenue, profit, and margin trends by time period and location. In parallel, I replicated the same analytical structure and visual logic in Streamlit, developing a fully interactive Python-based dashboard that mirrored the Power BI interface. This dual deployment showcased the flexibility of my data engineering approach, combining enterprise-grade reporting with open-source accessibility for rapid iteration and sharing.
Overall, this project demonstrated how structured data pipelines, advanced analytics, and cross-platform visualization can transform raw transaction records into strategic business intelligence. The combined Power BI and Streamlit ecosystem not only ensured analytical consistency but also provided scalable, automated tools for continuous performance tracking and optimization.