Explore Products

Best Sellers

New Releases

Books

Videos

Audiobooks

Learning Hub

Free Learning

You're reading from Hands-On Ensemble Learning with Python Build highly optimized ensemble machine learning models using scikit-learn and Keras

Product type Paperback

Published in Jul 2019

Publisher Packt

ISBN-13 9781789612851

Length 298 pages

Edition 1st Edition

Languages

Python

Tools

Keras

Concepts

Machine Learning

Authors (2):

Konstantinos G. Margaritis

George Kyriakides

View More author details

Table of Contents (20) Chapters

Preface

1. Section 1: Introduction and Required Software Tools FREE CHAPTER

2. A Machine Learning Refresher

3. Getting Started with Ensemble Learning

4. Section 2: Non-Generative Methods

5. Voting

6. Stacking

7. Section 3: Generative Methods

8. Bagging

9. Boosting

10. Random Forests

11. Section 4: Clustering

12. Clustering

13. Section 5: Real World Applications

14. Classifying Fraudulent Transactions

15. Predicting Bitcoin Prices

16. Evaluating Sentiment on Twitter

17. Recommending Movies with Keras

18. Clustering World Happiness

19. Another Book You May Enjoy

Leave a review - let other readers know what you think

Getting Twitter data

There are a number of ways to gather Twitter data. From web scraping to using custom libraries, each one has different advantages and disadvantages. For our implementation, as we also need sentiment labeling, we will utilize the Sentiment140 dataset (http://cs.stanford.edu/people/alecmgo/trainingandtestdata.zip). The reason that we do not collect our own data is mostly due to the time we would need to label it. In the last section of this chapter, we will see how we can collect our own data and analyze it in real time. The dataset consists of 1.6 million tweets, containing the following 6 fields:

The tweet's polarity
A numeric ID
The date it was tweeted
The query used to record the tweet
The user's name
The tweet's text content

For our models, we will only need the tweet's text and polarity. As can be seen in the following graph, there...

The rest of the chapter is locked

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $19.99/month. Cancel anytime

Authors (2)

Kyriakides

George Kyriakides is a Ph.D. researcher, studying distributed neural architecture search. His interests and experience include automated generation and optimization of predictive models for a wide array of applications, such as image recognition, time series analysis, and financial applications. He holds an M.Sc. in computational methods and applications, and a B.Sc. in applied informatics, both from the University of Macedonia, Thessaloniki, Greece.

See other products by Kyriakides

Margaritis

Konstantinos G. Margaritis has been a teacher and researcher in computer science for more than 30 years. His research interests include parallel and distributed computing as well as computational intelligence and machine learning. He holds an M.Eng. in electrical engineering (Aristotle University of Thessaloniki, Greece), as well as an M.Sc. and a Ph.D. in computer science (Loughborough University, UK). He is a professor at the Department of Applied Informatics, University of Macedonia, Thessaloniki, Greece.

See other products by Margaritis