Multimodality
Information can be presented in various modalities – for example, text, images, videos, audio, and so on. Typically, machine learning (ML) models deal with a single modality on each side: for example, a model might take a video as input and produce a text description of that video as output. Imagine how great it would be if you could ask a large language model (LLM) a question about a specific image. In that case, your input would become both text and an image (or perhaps only text, or only an image). Multimodality is the capability of an ML model to accept as input, or produce as output, data of several modalities at the same time (e.g., text, image, video, audio); a multimodal model can handle different modalities on the input side, the output side, or both. [1]
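To make this concrete, here is a minimal sketch of such a text-plus-image request, assuming the OpenAI Python SDK and a vision-capable model such as gpt-4o; the model name and image URL are placeholders, and any multimodal model API could be used in the same way.

```python
# A minimal sketch of a multimodal request: text and an image in one prompt.
# Assumes the OpenAI Python SDK (pip install openai) and the OPENAI_API_KEY
# environment variable; the model name and image URL are placeholders.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o",  # a vision-capable model
    messages=[
        {
            "role": "user",
            "content": [
                # The text part of the question...
                {"type": "text", "text": "What trend does this chart show?"},
                # ...and the image it refers to (placeholder URL)
                {
                    "type": "image_url",
                    "image_url": {"url": "https://example.com/chart.png"},
                },
            ],
        }
    ],
)

print(response.choices[0].message.content)  # a text answer about the image
```

The key point is that a single prompt now carries two modalities at once: the model reasons over the text and the image jointly rather than over either one in isolation.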
Our documents often contain multimodal content (a very simple example: images or charts in a document that carry important meaning). Imagine how much better our retrieval augmented generation...