Personal tools

Generative AI, Large Language Models, and Foundation Models

The University of Chicago_061422A
[The University of Chicago]

 

- Overview

Artificial intelligence (AI) is a very broad field that encompasses the study of many different types of problems, from ad targeting to weather forecasting, self-driving cars to photo tagging, chess playing to speech recognition. While the field of AI research as a whole has always included parallel work on many different topics, the focus involving the most exciting advances seems to have changed over the years.

Loosely speaking, we can say that in the early 2010s, significant progress was made in image classification and speech recognition; in the mid-2010s, the focus turned to reinforcement learning (especially games such as Go and StarCraft); in the late 2010s and early 2020s, There was a boom in language and image generation.

This chronological breakdown is very approximate, and any researcher will tell you that work in all of these fields (and more) has been going on throughout this period and long before.

The point is not to draw clear boundaries, but to explain the emergence of terms like generative AI, large language models, and foundational models, in an attempt to point out a range of research directions and AI systems that have become particularly noteworthy in recent years.

Please refer to the following for more information:

 

- Generative AI

Generative artificial intelligence (GenAI) is a type of machine learning (ML) algorithm that can produce a variety of novel content, such as: Images, video, music, voice, text, software code, product design, and simulations. 

Some examples of GenAI tools include: GPT-4, ChatGPT, AlphaCode, GitHub Copilot, Bard. It works by learning patterns and relationships in a dataset of human-created content, and then using those patterns to generate new content. 

The results can be indistinguishable from human-generated content, or they can seem uncanny, depending on the quality of the model and how well it matches the use case. 

GenAI has been around since the 1960s, when it was introduced in chatbots. Recent interest in GenAI has been driven by the simplicity of new user interfaces that allow users to create high-quality content in seconds. It has a variety of applications, including: Improving customer interactions, Exploring unstructured data, and Assisting with repetitive tasks. 

GenAI uses various ML techniques, such as Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), and Large Language Models (LLMs), to generate new content. One such technique is diffusion models, which slowly add random noise to training data during the forward diffusion process, and then reverse the noise to reconstruct the data samples during the reverse diffusion process. 

Novel data can then be generated by running the reverse denoising process starting from entirely random noise. However, one study warns that the influx of AI-generated content could rapidly devalue content, impacting eCommerce and marketing.

Some potential issues with generative AI include:

  • Difficulty adapting to new circumstances
  • Harmful bias
  • Intellectual property rights
  • Lack of transparency
  • New cybersecurity dangers
  • Potential problems with accuracy and appropriatenes

 

- Large Language Models (LLMs)

Generative AI (GenAI) is a broad category of AI that can create new content, such as text, images, music, and code. Large language models (LLMs) are a specific type of GenAI that focus on producing text that sounds like human writing. LLMs are trained on large amounts of text to understand existing content and generate original content.

Generative AI (GenAI) applications are built on top of large language models (LLMs) and foundation models. 

LLMs are a type of AI program that can recognize and generate text. They are trained on huge sets of data, such as trillions of words, across many natural-language tasks. LLMs are built on ML, specifically a type of neural network called a transformer model. 

LLMs are proficient in generating text, producing fluent, succinct, and precise linguistic expressions. They are also proficient in language comprehension tasks, such as sentiment analysis, text categorization, and processing factual input. 

GenAI is a broader concept of AI systems capable of generating various types of content. It is a pinnacle achievement, particularly in the intricate domain of Natural Language Processing (NLP). 

LLMs provide context and memory capabilities, while GenAI enables the production of engaging responses. This results in more natural, humanlike, interactive conversations.

 

- Foundation Models (FMs)

AI Foundation models (FMs) are a type of machine learning (ML) model that can perform a variety of tasks and applications. They are large AI models that can generate a wide range of outputs, including text, images, or audio. Foundation models can be standalone systems or used as a base for other applications. 

FMs are a form of GenAI that can generate output from one or more inputs in the form of human language instructions. They are based on complex neural networks, including generative adversarial networks (GANs), transformers, and variational encoders. 

FMs are trained on large, unlabeled datasets and fine-tuned for an array of applications. They are pre-trained with extremely large data sets scraped from the public internet. 

The term "foundation model" was coined in August 2021 by the Stanford Institute for Human-Centered Artificial Intelligence's (HAI) Center for Research on Foundation Models (CRFM).

Trained on massive datasets, FMs are large deep learning neural networks that have changed the way data scientists approach machine learning (ML). 

Rather than develop AI from scratch, data scientists use a foundation model as a starting point to develop ML models that power new applications more quickly and cost-effectively.

FMs can be very large, up to trillions of parameters in size, so adapting the entirety of a foundation model can be computationally expensive. Therefore, developers sometimes adapt only the last neural layer or only the bias vectors to save time and space. 

For particularly niche applications, specific data may also not be available to adapt the foundation model sufficiently. In such circumstances, data must be manually labeled, which is costly and can demand expert knowledge.

 

- Applications of GenAI and LLMs

Generative AI (GenAI) and large language models (LLM) are revolutionizing personal and professional lives. From powerful digital assistants that manage email to seemingly omniscient chatbots that can communicate with corporate data across industries, languages, and professions, these technologies are driving a new era of convenience, productivity, and connectivity. 

GenAI applications are built on top of large language models (LLMs) and foundation models. LLMs are specialized AI models created to comprehend and produce text-based content. LLMs can decipher the nuances of language, while GenAI can create accurate translations and localized versions of the content. LLMs can be utilized alongside GenAI models to improve content translation and localization.

Here are some ways generative AI and LLMs can be used:

  • Personalized recommendations: LLMs can analyze shopper preferences and generate personalized recommendations.
  • Chatbots and virtual assistants: LLMs can provide context and memory capabilities, while GenAI can enable the production of engaging responses, resulting in more natural, humanlike, interactive conversations.
  • Summarizing data: LLMs can provide a textual overview of key players, case highlights, and suggested next steps.
  • Video creation: Generative AI-powered video creation tools can be used to send video walkthroughs.

Some examples of GenAI include: Midjourney, DALL-E, Stable Diffusion, Replit's Ghostwriter, and GitHub Copilot. Some examples of LLMs include: OpenAI's ``GPT'' series and ChatGPT.

 

San Francisco_CA_032721A
[San Francisco, CA - Civil Engineering Discoveries]

- Generative AI vs. Predictive AI

Generative AI (GenAI) and predictive AI are two different approaches to AI. GenAI focuses on creating new content, while predictive AI focuses on making accurate predictions.

Predictive AI analyzes existing data to make predictions, recommendations, and decisions. It uses various AI and ML techniques. Predictive AI is commonly used in industries such as finance, healthcare, and marketing.

Predictive AI enables informed decision-making, cost reduction, and risk mitigation. It can also predict a company's future needs or events, such as foreseeing upcoming trends or predicting risks and their solutions.

GenAI and predictive AI are both types of AI that use ML algorithms, but they have different purposes and applications:

  • Generative AI: Creates new content, such as images, videos, music, text, virtual worlds, artwork, or design concepts. It uses neural networks and deep learning to generate novel and creative outputs that mimic human-like patterns. Generative AI requires some initial creative input, such as a prompt, seed, or example, to start the creative process.
  • Predictive AI: Analyzes patterns in existing data to make predictions about future outcomes, such as stock market trends, customer behavior, disease progression, financial forecasting, or demand forecasting. It uses statistical and machine learning models to forecast future scenarios and trends mathematically.

GenAI can build on predictive AI capabilities by harnessing data for insights and creating content. For example, predictive AI can help companies optimize inventory management and supply chain operations by anticipating future demand for products or services

 

- Generative Integration

Generative integration is an advanced approach to data and application integration that leverages generative AI (GenAI) and large language models (LLMs). 

This innovative approach securely automates the creation of integration pipelines, simplifying the process of connecting disparate systems and data sources. 

By leveraging AI and machine learning (ML) capabilities, generative integration can understand, interpret, and generate code, significantly reducing manual workload and increasing the efficiency and accuracy of data integration tasks.

In technology, generative integration is a combination of the terms generative AI (GenAI) (i.e., ChatGPT) and integration (i.e., technology that connects data, systems, and applications). 

Generative integration is an advanced approach to data and application integration that leverages GenAI and large language models (LLM) to securely automate integration pipelines and simplify the process of connecting disparate systems and data sources. 

By leveraging AI and ML capabilities, generative integration understands, interprets and generates code, significantly reducing IT teams' manual workload and increasing the efficiency and accuracy of data integration tasks. 

This revolutionary combination of technologies provides an opportunity to make data and application integration and business process automation easy to implement, accessible to non-technical staff, and more effective.

 

[More to come ...]



Document Actions