How to Search For Specific Words In All Fabric Notebooks In A Workspace

Sandeep PawarSandeep Pawar
2 min read

My personal Fabric trial tenant is a mess. I create items, test them and don’t name them properly. For example, I have a workspace with 226 notebooks named Notebook 1 , Notebook 2 etc 🙄 I am sure I am not alone of this crime. I needed to find a notebook I created a few months ago that contained a specific code logic. Fabric global search only searches for item names and not item content. So here is what I came up with:

import sempy.fabric as fabric 
import json

def extract_notebook_cells(notebook_name, workspace):

    return [{'cell_type': cell.get('cell_type'), 'source': ''.join(cell.get('source', '')).replace('\n', ' ')} 
            for cell in json.loads(notebookutils.notebook.getDefinition(notebook_name, workspaceId=workspace)).get('cells', [])]

def search_notebooks(word, workspace=None):
    if workspace is None: #workspace id of the notebook if not given
        workspace = fabric.get_notebook_workspace_id() 
    else:
        workspace = fabric.resolve_workspace_id(workspace) #get workspace id

    notebooks = [nb['displayName'] for nb in notebookutils.notebook.list(workspace)]
    matching_notebooks = []
    print(f"Workspace: {fabric.resolve_workspace_name(workspace)}")
    for notebook_name in notebooks:
        try:
            cells = extract_notebook_cells(notebook_name=notebook_name, workspace=workspace)
            if any(word.lower() in str(cell['source']).lower() for cell in cells):
                matching_notebooks.append(notebook_name)
        except Exception as e:
            print(f"Error {notebook_name}: {str(e)}")

    return matching_notebooks

search_notebooks("Cache")

Logic:

  • Get a list of notebooks in a workspace

  • For each notebook, get the definition, strip everything and get only cell/markdown blocks

  • Search for the keyword

  • Depending on the number of notebooks and notebook content, this can take a while. You can certainly speed it up with some optimization but this is sufficient for me. You can also use your own regex pattern for more advanced search.

💡
One use case for this - as you might now, mssparkutils is being deprecated and instead notebookutils should be used. With above, you can search for all the notebooks that use mssparkutils and replace it.
2
Subscribe to my newsletter

Read articles from Sandeep Pawar directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

Sandeep Pawar
Sandeep Pawar

Microsoft MVP with expertise in data analytics, data science and generative AI using Microsoft data platform.