Unfolding the Mysteries of Protein Structure: ColabFold on Lilypad

Emma CoochinEmma Coochin
4 min read

TLDR: Lilypad is making ColabFold, a powerful AI tool for predicting protein structures, accessible to everyone. Researchers will soon be able to easily and affordably run complex protein folding simulations on a vast, decentralized network, accelerating drug discovery, enzyme engineering, and fundamental biological research.


The world of protein folding is a bit like origami on a molecular scale. Imagine trying to predict how a long, complex chain of amino acids will twist and turn to form a specific 3D structure. It's a puzzle that has baffled scientists for decades, but thanks to advancements in AI, we're finally starting to crack the code.

One of the most exciting tools in this field is ColabFold, a powerful software suite that leverages deep learning to predict protein structures with remarkable accuracy. However, while ColabFold has made significant strides in accessibility, running complex predictions still requires substantial computational resources and can be a hassle to set up. [1]

That's where Lilypad comes in. We're not just providing the compute power needed to run ColabFold; we're making it easier than ever for researchers to unlock the secrets of protein structure.

What is ColabFold?

Think of ColabFold as a super-powered protein origami predictor. You feed it a sequence of amino acids (the building blocks of proteins), and it uses a combination of clever algorithms and AI to predict the final 3D shape. It's like giving someone a string of letters and having them tell you exactly how that string would fold into a specific, intricate knot. [2]

ColabFold combines two key components:

  1. MMseqs2: A lightning-fast search engine that sifts through millions of known protein sequences to find similar ones.

  2. AlphaFold2 or RoseTTAFold: A powerful AI system that uses these similar proteins as a guide to predict the final 3D structure.

A 3D model of a protein structure with various coiled and looping strands, colored in a gradient from green to purple, set against a light blue background.

Image 1: A large protein complex known as a gap junction

Why Does ColabFold Matter?

Understanding protein structure is crucial for a wide range of applications:

  • Drug Discovery: Designing drugs that effectively target specific proteins to treat diseases.

  • Enzyme Engineering: Creating enzymes with enhanced properties for industrial or therapeutic purposes.

  • Biomaterial Development: Developing new materials with unique properties inspired by protein structures.

  • Fundamental Biological Research: Gaining a deeper understanding of how proteins function and interact within living organisms.

A complex 3D molecular structure forming a multicolored star shape with intertwining ribbons in shades of purple and blue against a black background.

Image 2: A large protein complex known as a hemichannel

Lilypad is Democratizing Access to Protein Structure Prediction

Lilypad takes ColabFold to the next level by making it accessible to everyone, regardless of their computational resources. We've packaged the entire ColabFold system into a Docker container, a self-contained unit that will be deployed and run on our decentralized network.

This means researchers will soon be able to:

  • Access vast compute power: Run complex protein structure predictions on thousands of GPUs distributed across the globe.

  • Simplify setup and deployment: Eliminate the hassle of configuring software and managing dependencies.

  • Reduce costs: Access affordable compute resources without the need for expensive hardware investments.

With Lilypad, protein structure prediction will become as simple as submitting a job to our network. We will empower researchers to accelerate their discoveries and unlock the mysteries of the protein world.

Video 1: AI generated Boltzmann distribution samples for an enzyme

Join the Lilypad Revolution

Ready to explore the fascinating world of protein folding with ColabFold? Join the Lilypad community and experience the power of decentralized computing for yourself.

Together, let's push the boundaries of scientific discovery and build a more open and accessible future for researchers.

About Lilypad
Lilypad is a serverless, distributed compute network thatl enables internet-scale data processing for AI, ML, and other arbitrary computation. Unleashing idle processing power by leveraging decentralized infrastructure networks, Lilypad unlocks a new marketplace for compute, making AI more accessible, efficient, and transparent for developers and users.

0
Subscribe to my newsletter

Read articles from Emma Coochin directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

Emma Coochin
Emma Coochin