Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Biostatistics with Python

You're reading from   Biostatistics with Python Apply Python for biostatistics with hands-on biomedical and biotechnology projects

Arrow left icon
Product type Paperback
Published in Nov 2024
Publisher Packt
ISBN-13 9781837630967
Length 374 pages
Edition 1st Edition
Languages
Concepts
Arrow right icon
Author (1):
Arrow left icon
Darko Medin Darko Medin
Author Profile Icon Darko Medin
Darko Medin
Arrow right icon
View More author details
Toc

Table of Contents (24) Chapters Close

Preface 1. Part 1:Introduction to Biostatistics and Getting Started with Python
2. Chapter 1: Introduction to Biostatistics FREE CHAPTER 3. Chapter 2: Getting Started with Python for Biostatistics 4. Chapter 3: Exercise 1 – Cleaning and Describing Data Using Python 5. Chapter 4: Part 1 Exemplar Project – Load, Clean, and Describe Diabetes Data in Python 6. Part 2:Introduction to Python for Biostatistics – Methodology and Examples
7. Chapter 5: Introduction to Python for Biostatistics 8. Chapter 6: Biostatistical Inference Using Hypothesis Tests and Effect Sizes 9. Chapter 7: Predictive Biostatistics Using Python 10. Chapter 8: Part 2 Exercise – T-Test, ANOVA, and Linear and Logistic Regression 11. Chapter 9: Biostatistical Inference and Predictive Analytics Using Cardiovascular Study Data 12. Part 3:Clinical Study Design, Analysis, and Synthesizing Evidence
13. Chapter 10: Clinical Study Design 14. Chapter 11: Survival Analysis in Biomedical Research 15. Chapter 12: Meta-Analysis – Synthesizing Evidence from Multiple Studies 16. Chapter 13: Survival Predictive Analysis and Meta-Analysis Practice 17. Chapter 14: Part 3 Exemplar Project – Meta-Analysis of Survival Data in Clinical Research 18. Part 4:Biological and Statistical Variables and Frameworks, and a Final Practical Project from the Field of Biology
19. Chapter 15: Understanding Biological Variables 20. Chapter 16: Data Analysis Frameworks and Performance for Life Sciences Research 21. Chapter 17: Part 4 Exercise – Performing Statistics for Biology Studies in Python 22. Index 23. Other Books You May Enjoy

How computation can help answer different questions in life sciences

It is generally believed that biostatistics is mostly about numbers and graphs. The reality is quite different. Biostatistics is also about understanding life science problems and finding ways to resolve those using statistical methods. There are six main problem-solving skills in biostatistics:

  • Helping life science professionals resolve research problems in these domains through the use of data
  • Helping life science professionals interpret the results of their research
  • Making sure the published research is both statistically and biologically valid
  • Helping R&D professionals make decisions in the projects
  • Revealing objective truths about different phenomena through the use of data
  • Explaining the abstract features of mathematics and biology in an intuitive and easy-to-understand way

One of the most important impacts of biostatistics is transitioning from statistical knowledge to actual problem solutions in life sciences. This will be discussed in more detail in the rest of this chapter.

Biostatistics is needed to derive insights from life science experiments and convert measurements and observations to life science solutions.

Professionals in life science and biostatisticians, working together, design different types of experiments, measurements, and observations. All these can be written or stored as data. Data is a source of information from those experiments, measurements, and observations.

Data can originate from observations, too. One example of observation is the diagnosis by a dermatologist or the identification of species by biologists.

Biostatisticians are there to help make sure this data is valid and make it meaningful. Further, data should be organized and structured, often presented in the form of tables to be prepared for further analysis and interpretation.

To make the data useful, we must understand all the details about the data and how these are related to domains where biostatistics is applied. One of the most important aspects of biostatistics is the context around the data. This context can significantly affect the results and is one of the reasons why biostatisticians are more specialized in life science domains than general statisticians.

One of the main goals of biostatistics is to take all available inputs in the form of data and process them in such a way as to produce meaningful insights, answers, and conclusions and provide information to make decisions in life science.

Here is the biostatistics workflow:

Figure 1.3 – Biostatistics workflow

Figure 1.3 – Biostatistics workflow

There are two main types of data: numerical (for example, the measurement of the hemoglobin level in blood in which we are using numerical values such as grams per liter or g/L) and categorical, such as a doctor’s diagnoses of their patients in a form; “Yes” for a positive diagnosis or “No” for a negative diagnosis. These types of data can be further divided into subcategories, which will be discussed in detail in the next chapters.

Understanding data sources is essential for biostatistics. Biostatistics is focused on statistical models but also on domain knowledge and, as such, has evolved as a separate branch of both statistics and life sciences.

This book will provide many different examples that will show you how to use biostatistics specifically for different domains, such as diabetes research, cardiology, and biostatistical studies. Further, in this chapter, we will discuss how the Python programming language can facilitate the implementation of biostatistical methods.

Biostatistics and Python

Most biostatistical analyses today are implemented in some form of software or a programming language. I chose Python as a programming language for this book for several reasons. Python is one of the most advanced languages for data science and biostatistics. As programmers today are moving toward using Python, keep in mind that it is one of the most wanted skills in most areas that have to do with analytics. Libraries such as Biopython and SciPy are among the more than 100,000 libraries that make Python so versatile, meaning that almost any biostatistical analysis can be performed using this programming language. It is open source, meaning it is transparent and free for anyone to use.

The following figure is an example of using Python for biostatistics:

Figure 1.4 – Biostatistics and Python

Figure 1.4 – Biostatistics and Python

Its integration with advanced machine learning and bioinformatics algorithms gives a biostatistician a whole new spectrum of approaches and provides the most advanced frameworks for using biostatistical algorithms at this time.

Finally, the most important part – learning Python through a portfolio of practical projects provides you, as a reader, with two important qualities: being able to use one of the most wanted programming languages out there can be beneficial for your career, and having a portfolio of more than 10 practical projects using biostatistics and Python provides significant resources for your portfolio as someone who plans to use or advance your career by using biostatistics.

You have been reading a chapter from
Biostatistics with Python
Published in: Nov 2024
Publisher: Packt
ISBN-13: 9781837630967
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image