Formulating the scientific questions in life sciences and research
To be able to perform statistical analyses in life science and research, you will first need to learn how to address scientific questions in these areas. Scientific questions are a way to define what it is that we are trying to understand or what goal to achieve. In this chapter, you will learn by example how to formulate scientific questions related to various fields related to biostatistics, such as biomedical research, before any relevant statistical analysis is made. One of the first questions to answer is, “What is the goal of a statistical analysis?” This goal is closely related to different life science aspects, therapies, biological processes, or genetic characteristics, and in this section, those will be covered in more detail.
Once scientific questions are made, they are then used to formulate different scientific hypotheses. The main characteristic of any hypothesis is that it can be tested and there is an alternative (opposite) hypothesis to the main one. So, the baseline scenario assumption can be that there is no statistically significant result, and we can test the alternate scenario: that there is a significant result against the baseline or null scenario. We can call the null hypothesis H0 and the alternate hypothesis Ha.
How to formulate scientific questions related to diabetes
The effect of different lifestyles on the outcomes of type 2 diabetes mellitus has been debated for decades.
Let’s pose a couple of scientific questions about diabetes. We will use the letter Q for scientific questions:
- Q1. Is body weight related to type 2 diabetes mellitus?
- Q2. Are there other risk factors for type 2 diabetes mellitus among those studies?
- Q3. Which of the lifestyle factors is the most important risk factor in type 2 diabetes mellitus?
Now, let’s formulate these questions even better. We will mark formulations using the letter F:
- F1. Null hypothesis (H0): Body weight is not related to type 2 diabetes mellitus.
Alternate hypothesis (Ha): Body weight is related to type 2 diabetes mellitus.
- F2. Null hypothesis (H0): There are no other risk factors for type 2 diabetes mellitus among those studied.
Alternate hypothesis (Ha): There are other risk factors for type 2 diabetes mellitus among those studied.
- F3. This question will not have a null hypothesis as it is already assumed there are risk factors in the questions. So, the goal of answering this question is to compare the risk factors and identify the most important one. This would be an observational scientific question.
So, why do we usually formulate the null hypothesis as a negation of what’s being tested? Well, we want to know the following: Can I show evidence that contradicts that baseline negative assumption? If I can, then I can reject the null hypothesis. If there isn’t enough evidence to negate the null hypothesis, I can say that I cannot reject the null hypothesis (avoid the mistake of saying that no evidence is evidence of a null hypothesis).
How to formulate scientific questions related to cardiovascular disease
Is ST (the last wave on the electrocardiogram of the heartbeat) elevation closely related to heart disease? With this, we move to the following questions:
- Q4. Do cigarettes increase the risk of cardiovascular diseases?
- Q5. Is an ECG closely related to cardiovascular disease?
- Q6. Are there any other risk factors for cardiovascular disease among the studied parameters?
Let us make a more structured formulation as follows:
- F4. Null hypothesis (H0): Cigarettes do not increase the risk of cardiovascular diseases.
- Alternate hypothesis (Ha): Cigarettes increase the risk of cardiovascular diseases.
- F5. Null hypothesis (H0): ECG is not closely related to cardiovascular disease.
- Alternate hypothesis (Ha): ECG is not closely related to cardiovascular disease.
- F6. Practice yourself!
How to formulate scientific questions in biology
Here are a few examples for formulating questions in biology:
- Q7. Learn to explore which genes are highly suppressed in lung cancer.
- Q8. How similar are the genomes of mice and humans?
- Q9. What are the differences in plants and minerals collected from localities A and B (Ca, Mg, K)?
- Q10. Does water temperature affect plankton?
Practice formulating these questions as hypotheses or concrete study questions!
You may find the answers at the end of Chapter 1.