Automating Excel Data Comparison with Streamlit and Python

satya bollojusatya bolloju
2 min read

Working with Excel data across multiple sheets and comparing rows against reference blocks is tedious β€” especially when done manually.

I built a lightweight web tool using Streamlit that automates this for analysts inside our company.


πŸ” What the Tool Does

  • Accepts two uploaded Excel files:

    • A base file (rows to compare)

    • A source file (multiple sheets of reference blocks)

  • For each row in the base file:

    • Fuzzy matches the correct sheet from the source file

    • Finds the "weight group" position

    • Extracts a 12x8 block of data

    • Inserts it into the base file

    • Applies Excel formulas and formatting to highlight differences

  • Outputs a downloadable Excel file instantly


🧠 Why I Built This

This was designed to eliminate repetitive manual work:

  • Cross-checking 100+ rows manually against 6–10 sheets

  • Copying specific blocks

  • Writing formulas per row

  • Formatting results for stakeholder visibility

Now it’s a few clicks and one download.


πŸ›  Tech Stack

ToolPurpose
PythonBackend logic
StreamlitWeb UI
openpyxlExcel manipulation
fuzzywuzzySheet name matching
BytesIOIn-memory file handling

βš™οΈ Architecture Overview

User Uploads Excel Files
        ↓
Base file read row by row
        ↓
Match each row’s sheet using fuzzywuzzy
        ↓
Find weight group in matched sheet
        ↓
Extract 12x8 block + paste into base sheet
        ↓
Apply formulas + styling
        ↓
Generate and download updated file

πŸ“‚ Expected Input Format

ColumnPurpose
BShape
FWeight Group
MSheet Name
N β†’ WOutput block insert

πŸ”— GitHub Repo

πŸ‘‰ streamlit-excel-comparator


🧩 Real Use Case

This tool is now actively used by analysts at our company to:

  • Compare price matrices

  • Evaluate diamond inventory vs vendor pricing

  • Reduce error-prone manual work

Result: faster decisions and cleaner Excel files.


πŸ§‘β€πŸ’» About Me

Data Analyst
Currently working @ Sheetal Manufacturing Co.
MSc Data Science 2026

πŸ”— LinkedIn πŸ”— GitHub

0
Subscribe to my newsletter

Read articles from satya bolloju directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

satya bolloju
satya bolloju