Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Healthcare Analytics Made Simple

You're reading from   Healthcare Analytics Made Simple Techniques in healthcare computing using machine learning and Python

Arrow left icon
Product type Paperback
Published in Jul 2018
Publisher Packt
ISBN-13 9781787286702
Length 268 pages
Edition 1st Edition
Languages
Arrow right icon
Authors (2):
Arrow left icon
Vikas (Vik) Kumar Vikas (Vik) Kumar
Author Profile Icon Vikas (Vik) Kumar
Vikas (Vik) Kumar
Shameer Khader Shameer Khader
Author Profile Icon Shameer Khader
Shameer Khader
Arrow right icon
View More author details
Toc

Table of Contents (11) Chapters Close

Preface 1. Introduction to Healthcare Analytics FREE CHAPTER 2. Healthcare Foundations 3. Machine Learning Foundations 4. Computing Foundations – Databases 5. Computing Foundations – Introduction to Python 6. Measuring Healthcare Quality 7. Making Predictive Models in Healthcare 8. Healthcare Predictive Models – A Review 9. The Future – Healthcare and Emerging Technologies 10. Other Books You May Enjoy

Splitting the data into train and test sets

Now that we have our response variable, the next step is to split the dataset into train and test sets. In data science, the training set is the data that is used to determine the model coefficients. In the training phase, the model takes into account the predictor variable values together with the response value to "discover" the rules and the weights that will guide the prediction of new data. The testing set is then used to measure our model performance, as we discussed in Chapter 3, Machine Learning Foundations. Typical splits use 70-80% for the training data and 20-30% for the testing data (unless the dataset is very large, in which case a smaller percentage can be allotted toward the testing set).

Some practitioners also have a validation set that is used to train model parameters, such as the tree size in the random...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image