TransactIQ

TransactIQ: Forecasting India’s Journey Beyond Cash with UPI

banner transaction

TransactIQ is a data-driven project analyzing and forecasting India’s transition to a cashless economy through UPI adoption. Using real data sets (NPCI, RBI, World Bank), exploring trends, patterns, and future predictions of digital payments in India.

Unified Payments Interface (UPI) is a system that enables multiple bank accounts to be linked to a single mobile app, allowing for easy fund transfers, merchant payments, and peer-to-peer transactions. It was launched as a pilot by NPCI with 21 member banks on 11th April 2016 in Mumbai, inaugurated by Dr. Raghuram G. Rajan, then RBI Governor. Soon after, banks started releasing UPI-enabled apps on the Google Play Store.

Table of Contents

  1. Project Overview
  2. Data Sources
  3. Data Collection & Merging
  4. Data Cleaning & Preprocessing
  5. Exploratory Data Analysis (EDA)
  6. Visualization & Dashboards
  7. Forecasting using Time Series Model
  8. Conclusion

1. Project Overview

2. Data Sources

The dataset for this project was obtained from the official NPCI (National Payments Corporation of India) UPI Statistics Portal Website. It is an umbrella organization that operates retail and settlement payment systems in India, including UPI, RuPay, FASTag, and other payment systems. For this project, data from April 2016 to April 2025 has been compiled by consolidating the annual Excel files available on the NPCI website into a single, structured dataset. This ensures accuracy, transparency, and completeness of information for analysis.

3. Data Collection & Merging

The monthly UPI statistics from April 2016 to April 2025 were collected from the official NPCI website and merged into a single dataset. The merging and processing were carried out in Python using libraries such as pandas (for data manipulation and merging), openpyxl (for handling Excel files), and glob (for retrieving file paths and iterating through multiple Excel files). The complete code implementation is provided in the attached Jupyter Notebook (.ipynb) file, and the final consolidated dataset is exported and attached as an Excel (.xlsx) file for reference.

Jupyter File

Merged Excel File (File)

4. Data Processing and Cleaning (Processed File)

The dataset was cleaned and prepared using Power Query, as it provided an efficient and user-friendly approach for managing the available data. Several preprocessing steps were carried out to ensure data quality and consistency:

Raw Data (Before Cleaning)

banner

Cleaned Data (After Processing)

banner


banner

5. Exploratory Data Analysis (EDA)

Exploratory Data Analysis (EDA) was carried out to understand the structure, patterns, and key insights from the dataset. The objective of this stage was to examine trends, detect anomalies, and establish relationships between variables before proceeding to a deeper interpretation.

Dataset Overview

EDA on UPI Adoption and Comparative Analysis with Cash and Card Payments



Key Insights from the EDA on UPI Transactions

The dataset for this section has been extracted from the Reserve Bank of India’s (RBI) Database on Indian Economy (DBIE) under the Payment and Settlement Systems section Website.It provides monthly and yearly statistics on transaction volumes and values across various payment modes, including UPI, Card, and Cash, covering the period from November 2019 to June 2025. Data cleaning and preparation were carried out using Power Query, ensuring both efficiency and simplicity in managing the dataset.

Dataset Overview

Unprocessed Data

Processed Data

Jupyter File



Key Insights from the comparison

6. Interactive Visualizations & Dashboard


7. Forecasting using Time Series Models (ARIMA)

Jupyter File

Assumptions

To ensure the validity of the forecasting analysis, the following assumptions are made:

Risk Assessment

The following risks may influence the forecasting results:

8. Conclusion

Github Link