Developing multimodal RAGs
In the previous chapter, we discussed a classic example of RAG: a Q&A application on enterprise data. Often, the source of the data is PDFs that contain important information in the form of images – pie charts, graphs, and other types of visualizations.
We have two problems in front of us. First, we have to determine how to extract images from the underlying documents. Second, once we have both text and images from a document, we need to know how to prepare the context for the LLM. Let’s look at these problems one by one. Of course, you can expand this approach to other types of content.
Extracting images from PDF documents
Ideally, we should have images as a separate source of data for our RAG applications. However, in practice, images are often part of PDF files and other unstructured data sources, and we’d like our Q&A application to take them into account. That means we need to extract them during the pre-processing stage.