Evaluating GenAI Applications
Large Language Models (LLMs) have demonstrated strong performance on a wide range of Natural Language Processing (NLP) tasks, and even capabilities in commonsense reasoning. When new LLMs are released, they are typically tested on general-purpose datasets, and their results are published on public benchmarks and leaderboards. Still, when building Generative AI (GenAI) applications, we need to evaluate performance on the specific task (or tasks) we're working on. We need this for two reasons: to ensure we meet product requirements and quality expectations, and to compare different architectures or prompting techniques so we can pick the best setup for our use case.
In this chapter, we're going to discuss how you can evaluate a GenAI application, briefly touch on using LangSmith to trace and debug your application, and explore Vertex AI's evaluation capabilities with LangChain.
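As a quick taste of the tracing side, the sketch below shows one common way to turn on LangSmith tracing through environment variables before running any LangChain code. The project name, the placeholder API key, and the Gemini model choice are assumptions for illustration, and it presumes you already have Vertex AI credentials configured; it is a minimal sketch, not the chapter's full setup.

```python
import os

# Tracing is enabled through environment variables; once set, LangChain
# components report their runs to LangSmith automatically.
os.environ["LANGCHAIN_TRACING_V2"] = "true"
os.environ["LANGCHAIN_API_KEY"] = "<your-langsmith-api-key>"   # placeholder key
os.environ["LANGCHAIN_PROJECT"] = "genai-evaluation-demo"      # assumed project name

from langchain_google_vertexai import ChatVertexAI

# Every call from here on is recorded as a run you can inspect in the LangSmith UI.
# The model name is an assumption; any model available to your project works.
llm = ChatVertexAI(model_name="gemini-1.5-flash")
print(llm.invoke("Give one reason to evaluate a GenAI application.").content)
```

With tracing in place, each chain and model invocation shows up as a run with its inputs, outputs, and latency, which is what makes debugging and later evaluation practical.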
Here, we’ll cover...