Introduction to Generative AI (Part 2)
This is a collection of notes from the Generative AI course on Google Cloud taught by Dr. Gwendolyn Stripling. Some images are taken from the course itself.
This is a detailed compilation; annotated excerpts will be available on my LinkedIn profile.
The following is a continuation of the Introduction to Generative AI (Part 1) article. Make sure to go over it before continuing below.
Course Overview
The course is divided into four parts. We will go through the last two in this article.
Defining Generative AI
Working of Generative AI
Generative AI Model Types
Generative AI Applications
Model Types
Generative models can be divided into four types, as follows:
text-to-text
These models learn a mapping between pairs of texts.
text-to-image
These models are trained on images paired with short text descriptions.
text-to-video and text-to-3D
These models generate a video representation from input text, which can be a single sentence or a full script. Similarly, text-to-3D models generate 3D models from a text description.
text-to-task
These models are trained to perform specific tasks, such as question answering, search, or prediction, based on text input.
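The four categories above can be summarized as a small lookup table. The following is only an illustrative sketch: the `Modality` and `ModelType` names, the registry, and the helper function are hypothetical and are not part of the course material or of any Google library.

```python
from dataclasses import dataclass
from enum import Enum


class Modality(Enum):
    TEXT = "text"
    IMAGE = "image"
    VIDEO = "video"
    THREE_D = "3d"
    TASK = "task"


@dataclass(frozen=True)
class ModelType:
    name: str
    input: Modality   # every category in these notes takes text as input
    outputs: tuple    # modalities the model can produce


# Hypothetical registry mirroring the four categories listed in the notes.
MODEL_TYPES = [
    ModelType("text-to-text", Modality.TEXT, (Modality.TEXT,)),
    ModelType("text-to-image", Modality.TEXT, (Modality.IMAGE,)),
    ModelType("text-to-video and text-to-3D", Modality.TEXT,
              (Modality.VIDEO, Modality.THREE_D)),
    ModelType("text-to-task", Modality.TEXT, (Modality.TASK,)),
]


def types_producing(output):
    """Return the names of the categories that can produce `output`."""
    return [m.name for m in MODEL_TYPES if output in m.outputs]


if __name__ == "__main__":
    for modality in Modality:
        print(modality.value, "<-", types_producing(modality))
```

The point of the table is simply that all four categories share the same input (text) and differ only in what they produce.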
Applications
The course gives an example application: code generation using Bard. We can give Bard a prompt, get the results, and export them to Google Colab.
Bard also provides a number of other functionalities, such as code debugging, code explanation, code translation, and documentation.
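To make the code-related tasks above more concrete, here is a minimal sketch of how prompts for each of them might be phrased. The template wording, the task keys, and the `build_prompt` helper are my own assumptions for illustration; they are not taken from the course or from Bard itself, and the sketch only builds prompt strings without calling any service.

```python
# Hypothetical prompt templates, one per task mentioned in the notes.
PROMPT_TEMPLATES = {
    "generate": "Write a {language} function that {goal}.",
    "debug": "Find and fix the bug in this {language} code:\n{code}",
    "explain": "Explain this {language} code line by line:\n{code}",
    "translate": "Translate this code from {language} to {target}:\n{code}",
    "document": "Write documentation for this {language} code:\n{code}",
}


def build_prompt(task, **fields):
    """Fill in the template for `task`; raises ValueError for unknown tasks."""
    if task not in PROMPT_TEMPLATES:
        raise ValueError(f"unknown task: {task!r}")
    return PROMPT_TEMPLATES[task].format(**fields)


if __name__ == "__main__":
    print(build_prompt("generate", language="Python", goal="reverses a string"))
    print(build_prompt("translate", language="Python", target="Go",
                       code="print('hello')"))
```

The resulting string is what we would paste into the chat window; the quality of the answer depends heavily on how specific the prompt is.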
Foundation Models
Foundation models are large AI models trained on vast quantities of data. They can be adapted (for example, fine-tuned) to perform downstream tasks such as sentiment analysis, image captioning, and object recognition.
Google Cloud's Vertex AI provides Model Garden, which consists of various foundation models for a variety of use cases. Examples of such foundation models include PaLM (Pathways Language Model) for chat and text, Stable Diffusion, and CLIP.
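The idea of adapting a pretrained model to a downstream task can be illustrated with a toy example. This is a heavily simplified sketch under stated assumptions: `base_features` is a hand-written stand-in for a frozen foundation model (a real one would return learned embeddings), and the adaptation step is a tiny perceptron head trained on top of those fixed features, which is a simplification of fine-tuning rather than the method used by any real model. The vocabulary and training sentences are made up.

```python
# Assumption: a fixed vocabulary stands in for the knowledge of a pretrained model.
VOCAB = ["good", "great", "bad", "poor"]


def base_features(text):
    """Frozen 'foundation model' stand-in: text -> fixed-size feature vector."""
    words = text.lower().split()
    return [float(words.count(w)) for w in VOCAB]


def train_head(examples, epochs=10, lr=1.0):
    """Train a perceptron head on frozen features. Labels are +1 or -1."""
    weights = [0.0] * len(VOCAB)
    bias = 0.0
    for _ in range(epochs):
        for text, label in examples:
            x = base_features(text)  # the base itself is never updated
            score = sum(w * xi for w, xi in zip(weights, x)) + bias
            predicted = 1 if score > 0 else -1
            if predicted != label:
                # Perceptron update: move the boundary toward the true label.
                weights = [w + lr * label * xi for w, xi in zip(weights, x)]
                bias += lr * label
    return weights, bias


def predict(head, text):
    """Return +1 (positive sentiment) or -1 (negative sentiment)."""
    weights, bias = head
    score = sum(w * xi for w, xi in zip(weights, base_features(text))) + bias
    return 1 if score > 0 else -1


if __name__ == "__main__":
    # Made-up labeled data for the downstream task (sentiment analysis).
    data = [
        ("good great product", 1),
        ("bad poor service", -1),
        ("great value", 1),
        ("poor quality", -1),
    ]
    head = train_head(data)
    print(predict(head, "a good day"), predict(head, "bad weather"))
```

Only the small head is trained here; the base stays fixed and could be reused for other tasks with a different head, which is the property that makes foundation models economical to adapt.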
Google's Offerings
Generative AI Studio
Generative AI Studio lets you quickly explore and customize generative AI models that you can use in your applications on Google Cloud. Other functionalities provided by Generative AI Studio are as follows:
Fine-tune models
Deploy models to production
Create chatbots
Image generation
Community forum
Vertex AI Search and Conversation
Build generative AI applications with little to no coding or ML experience.
Use a drag-and-drop interface.
Create and edit application content with a visual editor.
Let users interact with the app in natural language through the built-in conversational AI engine.
Create digital assistants, chatbots, custom search engines, and knowledge bases.
PaLM API & MakerSuite
The PaLM API can be used to test, experiment with, and prototype generative applications using Google's LLMs and generative AI tools. The PaLM API is integrated into MakerSuite.
MakerSuite provides access to the API through a graphical user interface and contains tools such as the following:
Tools for model training
We can try different types of algorithms and check which one suits our dataset or use case better.
Tools for model deployment
We can deploy our model with a variety of different options.
Tools for model monitoring
We can track our model's performance using a dedicated dashboard. We can also use different metrics to evaluate our model's performance.
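The notes do not say which metrics the monitoring tools report, so as an assumption, here is a minimal sketch of four common ones for a binary classifier (accuracy, precision, recall, and F1), computed in plain Python. The function names and the sample labels are hypothetical and are not part of MakerSuite.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary task."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def evaluate(y_true, y_pred, positive=1):
    """Return accuracy, precision, recall, and F1 as a dictionary."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Made-up labels and predictions, purely for illustration.
    y_true = [1, 0, 1, 1, 0, 0]
    y_pred = [1, 0, 0, 1, 1, 0]
    for name, value in evaluate(y_true, y_pred).items():
        print(f"{name}: {value:.3f}")
```

Looking at several metrics together is useful because a model can score well on accuracy while still missing most of the positive cases, which precision and recall would reveal.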
Ending Note
The course provides an extensive list of documents that can help enhance our understanding. You can follow the link here to access these documents.
We will cover the Introduction to Large Language Models course next in the learning path. More details about Transformer models, Stable Diffusion, and the attention mechanism will be discussed in subsequent posts. Stay tuned for lots of Generative AI content!
Written by Akshit Keoliya