You're reading from Practical Automated Machine Learning Using H2O.ai Discover the power of automated machine learning, from experimentation through to deployment to production

Product type Paperback

Published in Sep 2022

Publisher Packt

ISBN-13 9781801074520

Length 396 pages

Edition 1st Edition

Tools

H2O

Concepts

Machine Learning

Author (1):

Salil Ajgaonkar

View More author details

Table of Contents (19) Chapters

Preface

1. Part 1 H2O AutoML Basics

2. Chapter 1: Understanding H2O AutoML Basics FREE CHAPTER

3. Chapter 2: Working with H2O Flow (H2O’s Web UI)

4. Part 2 H2O AutoML Deep Dive

5. Chapter 3: Understanding Data Processing

6. Chapter 4: Understanding H2O AutoML Architecture and Training

7. Chapter 5: Understanding AutoML Algorithms

8. Chapter 6: Understanding H2O AutoML Leaderboard and Other Performance Metrics

9. Chapter 7: Working with Model Explainability

10. Part 3 H2O AutoML Advanced Implementation and Productization

11. Chapter 8: Exploring Optional Parameters for H2O AutoML

12. Chapter 9: Exploring Miscellaneous Features in H2O AutoML

13. Chapter 10: Working with Plain Old Java Objects (POJOs)

14. Chapter 11: Working with Model Object, Optimized (MOJO)

15. Chapter 12: Working with H2O AutoML and Apache Spark

16. Chapter 13: Using H2O AutoML with Other Technologies

17. Index

Why subscribe?

18. Other Books You May Enjoy

Experimenting with parameters that support imbalanced classes

One common problem you will often face in the field of ML is classifying rare events. Consider the case of large earthquakes. Large earthquakes of magnitude 7 and higher occur about once every year. If you had a dataset containing the Earth’s tectonic activity of each day since the last decade with the response column containing whether or not an earthquake occurred, then you would have approximately 3,650 rows of data; that is, one row for each day in the decade, with around 8-12 rows showing large earthquakes. That is less than a 0.3% chance that this event will occur. 99.7% of the time, there will be no large earthquakes. This dataset, where the number of large earthquake events is so small, is called an imbalanced dataset.

The problem with the imbalanced dataset is that even if you write a simple if-else function that marks all tectonic events as not earthquakes and call this a model, it will still show the...

The rest of the chapter is locked

You're reading from Practical Automated Machine Learning Using H2O.ai Discover the power of automated machine learning, from experimentation through to deployment to production

Table of Contents (19) Chapters

Experimenting with parameters that support imbalanced classes

Authors (1)

Personalised recommendations for you

You're reading from Practical Automated Machine Learning Using H2O.ai Discover the power of automated machine learning, from experimentation through to deployment to production

Table of Contents (19) Chapters

Experimenting with parameters that support imbalanced classes

Unlock this book and the full library FREE for 7 days

Authors (1)

Personalised recommendations for you