Remember! Everybody can and should read Part I. Part II today requires the Class background.
Video
Links: YouTube * Twitter – X * Rumble * Bitchute * Class Page * Jaynes Book * Uncertainty
HOMEWORK: You must look up and discover a Research Shows paper and see if it conforms to the conditions given below. Have you done it yet?
Lecture
Go back and re-read Class 32, so the notation makes sense. Recall we ended, as most epidemiological studies ended, by computing Pr(E|YX) and Pr(E|No YX), but that neither of these are Pr(Y|DEX), and that claiming they are is the epidemiologist fallacy. We want Pr(Y|DEX), the probability of the disease given exposure and a “sufficient” dose, and given whatever background X.
We can get to Pr(Y | DEX) easily enough using the correct rules of probability. First we see:
This simply applies Bayes theorem properly, and gives us the probability of having the disease given exposure (not yet dose), and given our background knowledge. Notice very carefully it does have the two elements in the epidemiologist fallacy, which are Pr(E | YX) and Pr(E | No YX). Which means epidemiologists started on the right path, but never completed their journey.
For Bonvicini (go back and review!) we can plug numbers in:
Also note that this applies only to the 82+41 = 123 people that were measured. If we want to use this as an estimate to people not yet measured, we must take more care (and we recall the 0.00027 is suspect). But these are niceties we can ignore today.
We can, and should, also calculate this:
Now Pr(Y | EX) > Pr(Y | X) and Pr(Y | EX) > Pr(Y | No EX), which is intriguing, and should be whispered to those who want to dig deeper. It is in no way proof that pesticides cause ALS. Because if that were true, then all exposed people would have got ALS. Which all did not. And some who were not exposed also got ALS. And these two facts at least mean other things besides pesticide are causing ALS. Which means those other things, which we have proved with certainty must exist, might account for all causes of ALS, and pesticide account for none. As long as there is no error in the exposure as claimed, that is.
We still have not got to dose. We can, using probability as before:
We now know Pr(Y | DEX), which means we also know Pr(No Y | DEX) (which is 1 – Pr(Y | DEX)). Yet we do not know what Pr(D | YEX) or Pr(D | No YEX) are. But we do know that if pesticides have anything to do with causing ALS then it must be that Pr(D | YEX) > Pr(D |No YEX).
If, for example, Pr(D | YEX) = Pr(D |No YEX) (for any value of either) then Pr(Y | DEX) = Pr(Y | EX), which says that knowing D is irrelevant after knowing E and X, a very curious thing to claim. It would mean that (always assuming X), after exposure, both diseased and non-disease patients have the same chance of receiving a dose. But recall the causes and causal pathways in any two people need not be the same.
Only if Pr(D | YEX) > Pr(D |No YEX) is Pr(Y | DEX) > Pr(Y | EX). Indeed, if Pr(D | YEX) < Pr(D |No YEX) then Pr(Y | DEX) < Pr(Y | EX), and the dose is in the direction of protective against the disease. Which might be the case if the dose has a hormesis effect.
Let’s pay careful attention to the equation Pr(D | YEX) > Pr(D |No YEX). In both cases (always accepting X), we assume exposure. Both the diseased and non-diseased have been exposed. We then want to know about dose. If, after being equally exposed the chance of getting a dose is also equal for both the diseased and non-diseased, then dose is irrelevant, as mentioned, and we only need Pr(Y|EX). We don’t need to know dose. But that means, since exposure is not dose, that if Pr(Y|EX) > Pr(Y|No EX), it must be something besides dose but related to exposure that accounts for the increased probability of disease.
There could be any number of things that are exposure but that are not dose-related that could be causally important in the disease. This is why any method that only gives, or gives functions of, mere exposure can be very badly misleading. Which is why most epidemiological studies that rely on exposure are in fact misleading.
Problem is, of course, that we typically don’t know what Pr(D | YEX) or Pr(D |No YEX) is. We certainly don’t in Bonvicini. Merely for the sake of argument, let us suppose that indeed Pr(D | YEX) > Pr(D |No YEX). Not knowing reasonable numbers, we might guess—and recall this assumes all people involved were exposed—Pr(D | YEX) = 0.1 and Pr(D |No YEX) = 0.08. Which is, some 10% of the exposed diseased people were dosed, and only 8% of the exposed non-disease people were. To stress: I made these numbers up. I have no idea of their accuracy.
With that we get Pr(Y | DEX) = 0.0008. Recall Pr(Y | EX) = 0.00063. That difference, 0.0008 – 0.00063 = 0.00017 accounts for adding dose to our evidence. It is a very small number (of course assuming our guesses are correct). But it is the difference, whatever it might be, that is important, because we already know that mere exposure in the absence of dose gives evidence of disease. Exposure itself can’t be a cause, so it means that things associated with exposure are our culprits.
All this analysis is before considering the importance to people outside those already measured. For that, we’d need a more sophisticated analysis. But we can do a quick preview of this. Suppose Pr(Y | DEX) = 0.0008, as above. We assumed Pr(D | YEX) = 0.1. And from above we had Pr(E | YX) = 0.32. For a group of new people, say a region of a state, or for whatever number of new unmeasured people, we want the probability that somebody has the disease and is exposed and is dosed. This is
Knowing nothing about new people except that they “fit” whatever information we have in X, then for every million new people, we’d expect about 26 (the Pr(YDE|X) times a million) people with the disease who were both exposed and dosed.
For those just exposed, we have Pr(YE|X) = \Pr(Y | EX)\Pr(E |X) = 0.00063 × 0.32 = 0.0002. And so in that same new million people, we’d expect about 202 people with the disease who have been exposed. That means about 202 – 26 = 176 come from “exposure”, or whatever it is that is causally related to exposure (because again it cannot be exposure itself).
These numbers are only one side of the story, and all must be balanced by whatever good the pesticide does. Suppose, to make up numbers, that we determine the pesticide saves the lives of 100 people out of every million (to low ball it). Then, in balance, the pesticide is still ahead on average. But this kind of thing is an analysis for another time.
Meanwhile, we have one more trick epidemiologists play. And that is not only to skip dose, but to the skip exposure. A model of exposure is substituted for exposure, and it is forgotten the model is a model, that a guess is not the real thing.
We do this the next, and final time, in our initial Epidemiology series.
Subscribe or donate to support this site and its wholly independent host using credit card click here. Or use the paid subscription at Substack. Cash App: $WilliamMBriggs. For Zelle, use my email: matt@wmbriggs.com, and please include yours so I know who to thank. BUY ME A COFFEE.
While you are sleeping among your numbers, equations, variables and stuff...
What if we asked one simple question instead of all this huge effort in the incomprehensible landscape:
Do we know that this (intervention, drug, chemical, measure, whatever) is 100% safe?
You don’t need any clinical studies or years-long super-paid research to answer it.
And then...
If we don’t know this, what are you doing with it? What for? At whose command? At what cost? Who will pay?
College education (even incomplete) is sufficient to handle such challenges.
Common sense works even better.
"Meanwhile, we have one more trick epidemiologists play. And that is not only to skip dose, but to the skip exposure. A model of exposure is substituted for exposure, and it is forgotten the model is a model, that a guess is not the real thing..."
It's a scientific study! It all looks so real! :-)
Models and statistics are great tools, but in the hands of some they're certainly terrible tools of propaganda. Of course you'd have to have some understanding of models and statistics to know that, if only more people did.