Knowledge Mining with Azure Search

Meet vedMeet ved
4 min read

β˜• Building a Search-Driven Customer Insight Solution with Azure AI Search

πŸ–ŠοΈ by Meet Ved | πŸ“… 13 June 2025


πŸ” Overview

As a Data Analyst passionate about unlocking value from raw data, I recently explored Azure AI Search by completing a hands-on knowledge mining lab for a fictional company called Fourth Coffee.

The goal? To extract actionable insights from customer reviews using Azure’s powerful AI tools β€” a process that mirrors real-world needs in industries like retail, e-commerce, and customer service.


πŸ”§ What I Built

In this project, I created a knowledge mining solution that enables smart search over unstructured customer feedback.

πŸ”Ή The pipeline included:

  • βœ… Creating Azure AI Search and Azure AI Services

  • βœ… Setting up Azure Blob Storage

  • βœ… Uploading customer review documents

  • βœ… Using Cognitive Skills:

    • OCR

    • Sentiment detection

    • Key phrase extraction

    • Image analysis

  • βœ… Building a Search Index and Knowledge Store

  • βœ… Querying enriched data for deep insights


🧠 Key Learnings

βœ… 1. Connecting Data with AI

I enriched plain-text reviews with AI to extract:

  • πŸ”Έ Sentiment (positive/negative)

  • πŸ”Έ Key Phrases (e.g., "slow service", "great ambiance")

  • πŸ”Έ Locations mentioned in the text

  • πŸ”Έ Image tags and captions

➑️ These skills turned raw data into searchable insights.


βœ… 2. Creating a Knowledge Store

The Knowledge Store acted like a structured database for AI-enriched fields:

  • πŸ”— Linked tables using entity keys

  • πŸ“Š Enabled visual analysis and dashboarding

  • 🧠 Persisted all AI outputs for deeper exploration


βœ… 3. Search Indexing with AI

Using Azure’s Import Data Wizard, I created a pipeline that:

  • Indexed review documents from blob storage

  • Ran AI skillsets automatically

  • Enabled filtering by location, sentiment, and more

πŸ” Example JSON Query:

{
  "search": "sentiment:'negative'",
  "count": true
}

βœ… 4.πŸ“¦ Customer Review Data Overview

The input data consisted of 9 Microsoft Word (.docx) files, each containing real or simulated customer reviews for a fictional coffee chain, Fourth Coffee. These documents varied in structure but typically included:

πŸ“ Input Data (Raw Customer Reviews)

  1. Each Word document contained:

    • Multiple paragraphs of free-text reviews

    • Informal language, including typos or casual expressions

    • Mentions of locations (e.g., "Chicago", "Toronto")

    • Implicit sentiment (positive, neutral, negative)

    • Occasional embedded images (coffee, cafΓ© scenes)


πŸ”  Sample Raw Review (from input)


🧠 AI-Enriched Output (Sample)

After processing with Azure AI Search + Cognitive Skills, the documents were enriched with structured metadata like this:

βœ… Sample JSON Output from Azure Search Index:

    jsonCopyEdit{
      "metadata_storage_path": "https://<storage>.blob.core.windows.net/coffee-reviews/review3.docx",
      "merged_content": "I absolutely loved the coffee in Toronto... Not recommended at all.",
      "locations": ["Toronto", "Chicago"],
      "sentiment": "mixed",
      "keyphrases": ["slow service", "great mocha", "terrible experience"],
      "imageCaption": "a coffee cup on a table",
      "imageTags": ["coffee", "cup", "table", "cafe"]
    }

πŸ—ƒοΈ Stored in Knowledge Store as:

  • Tables for:

    • KeyPhrases

    • Locations

    • Entities

    • Image Projections

  • Blob Containers for:

    • Original and enriched documents

    • Captions and tags from images


This structure allows you to:

  • Search by sentiment, location, or key phrase

  • Build dashboards to visualize customer satisfaction

  • Query for trends across locations or time

🌍 Real-World Applications

This isn’t just an academic exercise β€” here’s how businesses can use it:

πŸ’¬ Customer Service Teams

  • Detect common complaints (via key phrases)

  • Identify patterns by location/sentiment

  • Prioritize areas for quality improvement

πŸ›’ Retail & E-Commerce

  • Find product-level feedback

  • Segment reviews by keywords or geography

  • Automate review summaries for decision-making

  • Extract entities (names, dates, clauses)

  • Index thousands of legal contracts

  • Make legal archives searchable and reportable


πŸ› οΈ Skills & Tools Used

  • Azure AI Search

  • Azure Cognitive Services

  • Azure Blob Storage

  • Search Explorer (JSON Queries)

  • Knowledge Store (Tables + Image Projections)


πŸ”š Final Thoughts

This lab showed me how to bridge unstructured feedback with structured insight using the power of Microsoft Azure.

πŸ”§ Whether it’s enhancing search, supporting sentiment dashboards, or building smarter applications β€” Azure AI Search is a valuable tool for data-driven teams.

2
Subscribe to my newsletter

Read articles from Meet ved directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

Meet ved
Meet ved