Pairwise evaluations
As we’ve already mentioned, pairwise evaluators use an LLM to compare two outputs produced by two differently configured versions of your application. A configuration change might be anything: a different prompt, a different foundation model, a new ingestion or chunking mechanism, or just a change in the temperature argument. You don’t get a score on a specific scale; instead, you get preferences, and you can compute the share of cases in which the output from version A is preferred over the output from version B.
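For instance, once you have collected the judge’s verdicts over a set of test inputs, the preference share is simply the fraction of cases in which version A wins. A minimal sketch (the list of verdicts here is purely illustrative):

```python
# Hypothetical verdicts from a pairwise judge over a test set:
# "A" means version A was preferred, "B" means version B was preferred.
verdicts = ["A", "A", "B", "A", "B", "A", "A", "B", "A", "A"]

share_a = sum(v == "A" for v in verdicts) / len(verdicts)
print(f"Version A preferred in {share_a:.0%} of cases")  # 70% in this toy example
```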
LangChain offers you a few out-of-the-box evaluators:
- The `pairwise_string` and `labeled_pairwise_string` evaluators predict which of the two outputs is preferred; the second one additionally takes the golden answer as input. As usual, an evaluator provided with the expected output will most probably perform more reliably and produce preferences that correlate better with human ones. A usage sketch built around `from langchain.evaluation import load_evaluator` follows below.
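A minimal sketch of loading and calling the labeled pairwise evaluator; the question, the two predictions, and the reference answer are illustrative, and the default judge model assumes an OpenAI API key is configured (you can pass your own model via the `llm=` argument of `load_evaluator`):

```python
from langchain.evaluation import load_evaluator

# The labeled variant also expects a reference (golden) answer.
evaluator = load_evaluator("labeled_pairwise_string")

result = evaluator.evaluate_string_pairs(
    input="What is the capital of France?",
    prediction="Paris is the capital of France.",  # output of version A
    prediction_b="I think it might be Lyon.",      # output of version B
    reference="Paris",                             # golden answer
)

# The result contains the verdict ("A" or "B"), a score, and the judge's reasoning.
print(result["value"], result["score"])
print(result["reasoning"])
```

The unlabeled `pairwise_string` evaluator is called the same way, just without the `reference` argument.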