The Thyroid Madness: Two Apparently Contradictory Studies. Proof?

Recap: (See also: http://lesswrong.com/r/discussion/lw/nef/the_thyroid_madness_core_argument_evidence/ and previous posts)

Chronic Fatigue Syndrome and Fibromyalgia all look far too much like the classical presentation of hypothyroidism for comfort, but thyroid hormone blood tests are normal.

Many alternative medicine practitioners, most prominently John Lowe, and several conventional medical doctors, most prominently Kenneth Blanchard, a practising endocrinologist with a longstanding practice completely free of lawsuits, have tried diagnosing hypothyroidism 'by clinical symptoms', and treating it with various combinations of thyroid hormones, and they all report success, but the practice is dismissed as ignorant and dangerous quackery by conventional medicine.

I suspect that there are acquired 'hormone resistance' or 'type II' versions of all the various endocrine disorders. These would produce the symptoms without reducing the levels of the hormones in the blood. However hormone treatments should still work, simply by overwhelming the resistance.

We know that diabetes comes in two forms, (type I) gland failure and (type II), 'insulin resistance', and that the resistance version is usually acquired rather than inborn. The mechanism for the resistance version of diabetes is mysterious.

There are known to be corresponding 'gland failure' and 'resistance' versions of diseases associated with all the other endocrine hormones, but for some reason the resistance versions are thought to be very rare, and only to be inherited, never acquired.

Should such acquired resistance mechanisms exist and be common, then on evolutionary grounds they would have to be caused by the direct action of pathogens, be a side effect of immune defense against such pathogens, or have an environmental cause. Nothing else would be stable.

Chronic Fatigue Syndrome often seems to start with an infection.


 


I thought until recently that the problem must be rather complex, and depend on subtle balances of hormones in a complicated system. The idea is so simple and obvious that if it were straightforwardly true, it isn't credible that it would have been missed.

But it turns out that there have been two formal studies of the simplest possible version of idea (treat the symptoms of hypothyroidism with thyroxine) in the medical literature. And they're all I've managed to find. Further examples would be most welcome.

The two studies are apparently contradictory, but there's no real contradiction, in fact the second supports the first.

The first:

Clinical Response to Thyroxine Sodium in Clinically Hypothyroid but Biochemically Euthyroid Patients
G. R. B. SKINNER MD DSc FRCPath FRCOG, D. HOLMES, A. AHMAD PhD, J. A. DAVIES BSc and J. BENITEZ MSc

was an open trial done in 2000, by Gordon Skinner in Birmingham.

Dr Skinner took 139 patients, all of whom had symptoms consistent with a clinical diagnosis of hypothyroidism.

Of these the majority had been diagnosed with CFS or ME or Post-Viral Fatigue Syndrome, but thirty had been diagnosed with Major Depression, which also has all the right symptoms.

Dr Skinner started off with small doses of thyroxine, and slowly increased the doses, to quite high levels, until the patients got better. He reported that they all got considerably better. In fact his results are phenomenally good.

He mentioned the possibility of placebo effect, and the necessity of ruling it by placebo-controlled blinded randomised trial in the paper, but thought it unlikely. Many of these patients had been seriously ill for many years, and had usually tried a lot of things already.

[ From the study ]  In the absence of a control group, a placebo effect cannot be excluded in this or any study. However, the average duration of illness was 7.5 years in patients who had usually undergone an alarming array of traditional and alternative medications without significant improvement as evidenced by their wish to seek further medical advice. Secondly, certain clinical features allowed objective assessment, namely change in appearance, hair or skin texture, reduction in size of tongue and thyroid gland and increase in pulse rate.

If these patients hadn't had a hormone resistance, he would have done them very serious harm! He kept increasing the dose until it worked, and the highest dose he used was 300mg of thyroxine. That's more than the amount you'd usually use to completely replace the output of a removed thyroid gland. Given that all these people had normal hormone levels to start with, if the patient was not resisting the hormone, this should have caused a range of extremely unpleasant symptoms, including death.

He mentions no adverse effects whatsoever.

Dr Skinner wrote to the British Medical Journal suggesting that thryoxine should be tried in cases where the clinical symptoms of hypothyroidism were present but the blood tests were normal.

This prompted a small trial:

Thyroxine treatment in patients with symptoms of hypothyroidism but thyroid function tests within the reference range: randomised double blind placebo controlled crossover trial  

M Anne Pollock, Alison Sturrock, Karen Marshall, Kate M Davidson, Christopher J G Kelly, Alex D McMahon, E Hamish McLaren


This trial looks very well designed and established that:

(a) There was a huge placebo effect in the patients

(b) Thyroxine is very strongly disliked by the healthy controls (they could tell it from placebo and hated it)

(c) The patient group couldn't tell the difference between thyroxine and placebo (on average).

This result is very interesting of itself, and I make no criticism of the brave GPs who organised it in response to Skinner's letter, but unfortunately it has been taken as a refutation of Skinner's methods. Which it is not. In fact it supports him.

In fact there are two obvious relevant differences between what they did and what Skinner did:

(i) They used a fixed dose for everyone (100mg thyroxine / day) and made no attempt to tailor the dose to the patient.

I suspect that this would have made Skinner's treatment less effective, but it should still have worked.

(ii) They used very different criteria for selecting their patients.

Skinner had carefully done a 'clinical diagnosis' of hypothyroidism, using 16 symptoms, most of which were present in the majority of his patients.

The criteria for the formal trial were:

At least three of the following symptoms for six months: tiredness, lethargy, weight gain or inability to lose weight, intolerance to cold, hair loss, or dry skin or hair.

So a fat person with dry hair who didn't get enough sleep would have qualified as a patient.

This is utterly inadequate as a diagnosis of hypothyroidism! It is a famously difficult disease to diagnose!

Their patient group would have consisted mainly of people who didn't have the clinical symptoms of hypothyroidism. (EDIT: Obviously these people would have had symptoms of *something*, and thus probably been ill, but they are equally valid as symptoms of mild anaemia, or mild diabetes, which also seem to go undiagnosed a lot. The whole trick with hypothyroidism was to tell the difference between it and other similar diseases.)

If the type II version is rare or non-existent, then it would have included no real patients at all.

If the type II version is very common, then at least some of the patient group should have had the disease Skinner said he could cure.

What I think must have happened here is that the treatment produced great improvements in a few patients, and caused unpleasant symptoms in all the rest. This averaged out to 'can't tell the difference between placebo and treatment'. Remember that healthy people can!

I deduce that Skinner's treatment works pretty much as well as he thought it did, and that the disease he was curing is very common indeed.

Can anyone explain these two studies in any other way?




Conclusion

When combined with Sarah Myhill's paper showing that the principal cause of chronic fatigue is 'mitochondrial dysfunction', and that the action of the thyroid hormone is to stimulate the mitochondria, I think the case for a 'thyroid hormone resistance' disease manifesting as Chronic Fatigue Syndrome is unanswerable.

At the very least, this should be investigated.

I now believe my own argument, which until I saw Skinner's paper appeared even to me to be a wild idea made up from shreds of mathematical intuition and questionable evidence from biased sources. I think that Skinner's treatment is unlikely to be optimal, and research into what is actually going on needs to be done.

The problem, if it does exist, is likely to be extremely widespread, and explain far more than the mystery of Chronic Fatigue Syndrome and Fibromyalgia. I immediately claim Major Depressive Disorder and Irritable Bowel Syndrome as alternative labels for: 'type II hypothyroidism'. There is a large cluster of these diseases, all mysterious, all with very similar symptoms, known as the 'central sensitivity syndromes'.

And I should like to add that 'blood cholesterol' was once a test for hypothyroidism, so there are probably implications for heart disease as well. Anyone interested in the wider implications might want to take a look at Broda Barnes' work. I started off thinking he was a lunatic. I'm now fairly sure he must have been right all along.

I think it's now urgent to bring this to the attention of the medical profession and the sufferers' groups. Has anyone got any ideas how to do that?

 


 

Edit:

Two excellent arguments made on reddit's r/CFS group by EmergencyLies (I paraphrase/steelman him):

  • If there's a widespread hormone resistance version of hypothyroidism, where are the most severe cases?

(i) The mild version may be polymorphic, but the severe 'myxoedema' described in Victorian literature was the sort of thing that could be diagnosed on sight (or by hearing the voice) by anyone who'd seen a few severe cases.

(ii) One hears anecdotes of people who can tolerate insane levels of T3. If the hormone resistance can get that severe, why isn't the same problem killing people, or at least making them obviously hypothyroid?

I can't answer this one. Where are they? This is the best objection to this idea that I have seen in three months. Does anyone know of people with really obvious hypothyroidism and normal TSH values?

EDIT: Actually there are such people! They get diagnosed with 'central hypothyroidism', which is thought to be very rare.  John Lowe thought that about 1/4 of fibromyalgia cases were undiagnosed 'primary hypothyroidism', 1/2 were 'central hypothyroidism', and 1/4 were the 'hormone resistance version'. He thought that the hormone resistance version was very rare and genetic. I think it's more likely acquired in some way. Or it's possible that 'mild central hypothyroidism' is much more common than generally believed. It makes sense that the mild version should be more common than the severe version. It would be very difficult to tell the difference between 'central hypothyroidism' and 'acquired hormone resistance hypothyroidism'.


and:

  • CFS should look like hypothyroidism, but doesn't

(i) Skinner and Pollock together strongly suggest that there's a widespread form of hypothyroidism, undetected by usual blood tests, but treatable with thyroxine

(ii) Anyone with hypothyroidism but normal blood tests is going to get diagnosed with something like CFS/FMS/IBS/MDD etc...

(iii) Some of those people are going to end up diagnosed with CFS. Probably lots, if it's widespread.

(iv) Hypothyroidism causes lowered heart rate

(v) But CFS patients have raised heart rates, (on average?).

Those five things together look like a proof of contradiction, so one of them must be wrong.

 

I think it's (iv). Billewicz's clinical hypothyroidism test doesn't think heart rate has diagnostic value. Thus there were both low and high heart rates in hypothyroidism. I suspect that there's a low basal heart rate because of low metabolism, but that it goes high and stays high after even mild exercise because of the need to clear fatigue poison. Also, of course, hypothyroidism weakens the heart like any other muscle, so heart rate would actually need to be higher to pump the same amount of blood.

Comments

sorted by
magical algorithm
Highlighting new comments since Today at 2:05 AM
Select new highlight date
Rendering 50/79 comments  show more

I think it's now urgent to bring this to the attention of the medical profession and the sufferers' groups. Has anyone got any ideas how to do that?

At least posting this stuff here is a start, and thank you for doing this. I do not have this problem myself, but I saw the pattern often enough in others that I have independently (of any medical knowledge or opinions of other people) developed this reaction:

someone has chronically low energy / mood -> check if they have obvious problems with diet/sleep/exercise -> if not, suspect thyroid-related deregulation

This is in no way accurate science... but it makes my prior for your hypothesis pretty high, and I am inclined to accept it with high confidence, despite the fact that it could look unfounded or even dangerous to some.

I am going to mention this to the people I know, and judging by how common this type of condition is I expect I'm going to reach at least a few of those that could be helped by the knowledge, even if by pure chance.

Squirrel, it's bloody dangerous. I'm arguing that it should be investigated, not that everyone who feels tired should start glugging hormones.

But it's not unfounded. I've found loads of anecdotal and circumstantial evidence, and very little against (hot daytime fibro-turks is the most surprising thing, and the alternative people seem to think there's often also adrenal issues, but usually secondary). And now two studies from the 'proper' literature.

And I'm fairly sure that endocrinology has never bothered its pretty little head about type II versions, or the accuracy of its wretched TSH test, despite the example of diabetes staring it in the face.

I'm as sure as I can be without formal trial (so maybe about 25% confident?). But I'm now certain that the lack of investigation and of formal trials is inexcusable negligence. J' totally accuse.

The TSH test is actually very accurate. Third generation TSH assays are able to detect 0.02 mIU/L or less.

The problem is the way TSH testing levels are used without regard for the actual thyroid hormone levels. The relationship between TSH, T4, and T3 is much more complicated than it seems.

A good explanation of the latest research into it is here: https://www.researchgate.net/publication/263321383. The title of the paper is "Homeostatic equilibria between free thyroid hormones and pituitary thyrotropin are modulated by various influences including age, body mass index and treatment."

Hi FriendlyBuffalo, welcome to Less Wrong!

I've got no problem with the TSH test as a test for TSH. It's really good for that, and I seriously admire its cleverness and accuracy.

In fact it probably is a good test for primary gland failure. I can't see how the gland itself could go seriously wrong without driving TSH into the stratosphere.

What I hate is the idea that TSH normal <=> 'Thyroid Symptoms, improve when treated with thyroid hormones'. I think there are other dysfunctions going on. The very idea of assessing the state of a system that complicated by measuring one variable (or even three) is ridiculous.

I'm pretty much 'clear clinical picture => therapeutic trial, and sod the blood tests' at the moment, as I think Gordon Skinner was. Of course the problem with that is you end up endorsing leeches and aromatherapy that way. I do have a lot of sympathy for basal metabolic rate, and for waking temperature as a proxy for that.

I think we both agree that some CFS/FMS is just thyroid dysfunction, and will improve with various combinations of thyroid hormones.

The only remaining question for me now is 'Is all of CFS/FMS thyroid related, or just a significant portion of it'?

Lowe reckoned that it was 1/4 primary that had been missed, 1/2 central that there's no test for, and 1/4 the mysterious resistance that he had to overwhelm with high doses of TSH. I see no reason currently to doubt his word, and I'm pretty sure that his work has saved my life (I wouldn't have put up with CFS for much longer. It was awful, and there's no way I'd have found out it was thyroid without Lowe.) So I want to dig into his ideas until I can convince myself that they're either true or false.

If they're false that's really strange. There are now two different diseases, which came into being in the 1970s, which look exactly the same as hypothyroidism, only one of them is, and one of them isn't. I have real trouble with that on Occam's razor grounds.

And that's assuming FMS/CFS/MDD are the same thing. If not then there are three new diseases and some misdiagnosed/mistreated thyroid stuff all pretending to be each other.

I'm really really busy at the moment. I'm so sorry. I've been looking for someone who knows more about this than me for months, and now you show up willing to talk and I've got urgent other things to do. But let us resume discussion at a later date. I've written quite a lot about it here as the idea developed, feel free to debunk it all in absentia mea. I'll come back to all this soon, I'm absolutely sure. I'm obsessed.

I've printed off that paper, it looks really interesting. I'm going to try very very hard not to read it instead of doing the thing I'm supposed to be doing. Thank you very much, and hope to resume discussion soon.

Squirrel, it's bloody dangerous. I'm arguing that it should be investigated, not that everyone who feels tired should start glugging hormones.

Yes. And I'm not planning to, you know, convince anyone to do this; however I myself would have liked to know that this is a possibility, if I had this condition. What to do about this is a different matter entirely. I guess you can't easily obtain the relevant medications without consulting a doctor in any case.

Wim Hof demostrates that humans are able to regulate their body temperature when they train to do so. If hypothyroidism is basically about a downregulated metabolism, what happens if the techniques of Wim Hof are used?

Anyone who's been following this idea as it metastasised might be happy to know that I'm now a retired amateur endocrinologist. This is as good an argument as I'm ever going to make, and it should be turned over to adults now. Thank you all for your helpful comments and suggestions. Everyone's been wonderful.

Eric Drexler pointed out to me the other day that there probably is a path to superintelligent help without bringing on the apocalypse, so maybe we can get to paradise after all. His argument is very good. That looks like it might be more interesting than endocrinology, which seems to be more of a social problem than anything I can help with.

You wouldn't need to invoke the idea of 'hormone resistance' because TSH and T4 tests normally used to diagnose hypothyroidism don't measure the active hormone - T3. T4 is just a prohormone with very little direct activity on metabolic rate.

In primates, metabolism is regulated primarily in the liver by T4->T3 conversion, so if this is inhibited for any reason it will suppress metabolism without showing up on those tests. Low calorie intake, and poor nutrition are known to cause this (e.g. Euthyroid sick syndrome). In cases of poor liver conversion, supplementing T4 can actually make symptoms worse, as it will further suppress metabolism by lowering the small amount of T3 production from the thyroid (via the TSH feedback loop).

I assume you have heard of Ray Peat? I personally had good luck applying his ideas to increase my energy levels, and my pulse, body temperature, and cold tolerance raised as well - without supplementing thyroid. His general idea is pretty simple- just look at what conditions and nutrients maximize T4->T3 conversion, and provide them (low stress, high nutrient diet).

Broda Barnes work is very interesting. It blows my mind that he published a paper in The Lancet showing that desiccated thyroid lowered cholesterol levels and seemed to prevent cardiovascular disease in his patients, and that it remains virtually un-discussed and uncited (http://www.ncbi.nlm.nih.gov/pubmed/13796871).

The reason why Barnes' paper showing that desiccated thyroid lowering cholesterol levels and seeming to prevent cardiovascular disease isn't cited is because he was basically making his patients hyperthyroid. Lower cholesterol levels occur in hyperthyroidism.

There is a doctor I know of in California who gives his patients supra-physiological levels of T3 hormone (cytomel) to increase their metabolism, to help them lose weight, and to lower their cholesterol levels. It basically suppresses the thyroid's own production of hormone. In the short term, it works. It's brilliant. But it's crazy. We have no idea what the long-term consequences are. And since I'm pretty sure he's not running a study on it, we won't.

he was basically making his patients hyperthyroid

Why is this a reason not to reject it? He is essentially arguing that the major cause of cardiovascular disease is population-wide high rates of hypothyroidism. It would be a circular argument to dismiss that because his treatment leads to a greater than average metabolic rate. One would also need evidence of a disadvantage that outweighs the advantages. His patients seemed to be doing well, or at least he doesn't report them exhibiting any classic signs of hyperthyroidism. He was primarily adjusting dose based on body temperature to the upper end of the normal non-hyperthyroid range.

to help them lose weight

I have seen studies on thyroid supplementation as a weight loss strategy, and it causes loss of lean tissue (muscle, etc.) more than fat.

Hi, there can be all sorts of things going wrong! Mysterious resistances, gland failures, conversion disorders, broken pituitary, broken hypothalamus, faulty deiodinase enzymes, etc. All potentially inherited or acquired. We really do seem to have no idea how this complicated system works or what it's all for, or what can cause it to go wrong.

But I would have thought that if there was widespread 'central hypothyroidism', someone would have twigged by now, since that form does show up if you do a full panel of hormone tests.

Or I would have thought that when I wrote this. By now I am in such despair about the pitiful state of medical research that I wouldn't be surprised if they'd never thought to look, so maybe it is all just perfectly obvious from blood tests and the fools have ignored it.

And the question of 'what is the optimal treatment' is bound to be tricky. I'm just trying to demonstrate that the problems exist and are widespread and thus worth looking at!

Although Skinner certainly thought 'clinical hypothyroidism' could usually be fixed by bunging enough T4 at the problem. He does mention in his book that he sometimes used T3 or NDT, but he doesn't go into details. Various other people say 'mostly T4 with a bit of extra T3', but no-one has particularly clear ideas on what works and what doesn't or why.

Thanks for the reference to Ray Peat, I hadn't heard of him before. Can you link to the best expression of his thoughts?

I think it's important to have a more global vision of the problem. Knowing what the situation is in different countries could be a start. Post it as replies to this comment, with the country you live in, and the situation there (the best would be to ask one or more doctors about it, in order to be sure we can trust it). Personally, I live in France. Here, desiccated thyroid isn't sold anymore to everybody. According to two doctors I know, it's because bad things used to happen back in the days where it was completely accessible (in the 60' I believe). I am still researching informations about the hypothesis itself in my country, I'll post it if I learn anything. Thank you for your replies by advance!

That's interesting. In England and America I think that it would be illegal to sell desiccated thyroid as a prescription drug without a prescription. But it's perfectly legal to sell it as a food supplement. It's just dried bacon, after all.

It's quite difficult to find (people with TSH-detectable hypothyroidism get treated by their doctors with thyroxine, and for most people (~90%??) that seems to work perfectly), but it can be found.

I expect some fascist bastards will get round to outlawing that sooner or later. Before they do, we ought to find out whether it will cure CFS etc. After all, once it's illegal people will have to buy it from criminals, and I don't trust their quality control. Also it might put the price up slightly.

I'm interested in what the bad things that happened in France were. Obviously this is quite a potent drug, and so if it's for sale to the general public it is certain to cause harm from people taking far too much and overdosing.

But at the moment I think it's fairly safe in small doses for trial periods. And I'd very much like to know if that's not true.

Just buy a whole pig and eat it snout to tail :-)

Why is the Pollock trial evidence supporting your hypothesis? What outcome from the trial would you have considered to be evidence against it?

Also, what part suggests that the healthy controls could distinguish the treatment from placebo? From Table 4, it seems that the reverse is true.

At first glance, the results from that study look like straightforward evidence that this treatment is actively harmful. I’d also point out that RCTs need to be standardized across patients. I can’t say whether the inclusion criteria should have been different, but choosing a single dose is normal procedure. There are always better options, but it’s a weak argument on its own, in part because it can be applied in almost any circumstances.

Everyone who's ever tried fixing the clinical diagnosis of hypothyroidism with any kind of thyroid therapy either seems to think it works, or hasn't written about it on the internet or in the medical literature.

I admit I’m not an endocrinologist, but from what I’m reading I don’t think there is any recognized clinical diagnosis of hypothyroidism. The TSH test is the gold standard. That would suggest those who talk about it are primarily cranks and such.

That's a big claim. I'm making it in bold on Less Wrong. I expect someone to turn up some evidence against it. I would love to see that evidence.

Less Wrong might not be the best place for this, since there aren’t many biologists here. You have the burden of proof (i.e., the prior for arbitrary hypotheses is very low), so you shouldn’t be asking other people to disprove it. Could you summarize your support for this claim? Are these the only two peer-reviewed articles?

Assuming he's not just making up his data it's hard to explain his results.

There are lots of ways that data can be wrong without being made up. 90% of medical research findings are false, etc.

Thank you so much, intelligent and careful criticism like this is exactly what I started posting on Less Wrong for!

Why is the Pollock trial evidence supporting your hypothesis?

Well, it's only fairly weak evidence, but it does seem that the healthy controls reacted differently to the patient group. What it really proves is that thyroxine isn't just a nice recreational drug that everyone likes. Healthy people dislike it. But it seems to have been less bad for the patients on average. So I imagine there were some people in the patient group who reacted well.

What I'm saying is that Skinner got strong evidence for the idea, and wanted it confirmed by PCRT (and I agree, that's necessary). So they did a PCRT, but not very well because they didn't find patients carefully. And yet they seem to have supported him anyway, but everyone thinks that they refuted him, because they didn't quite understand what he was saying.

What outcome from the trial would you have considered to be evidence against it?

If none of the patients had had any sort of thyroid problem, I'd have expected it to be equally bad for everyone. If that had been the result, then I'd have had to think that 'type 2 hypothyroidism' is rare, or that 'fixed doses of thyroxine don't fix it'. For a long time that's exactly what I did think! I was assuming you might need T3 as well and you might need to adust the ratio carefully. Skinner and Pollock together make me think that it might be fairly common, and mostly fixable with T4 alone.

Also, what part suggests that the healthy controls could distinguish the treatment from placebo? From Table 4, it seems that the reverse is true.

That shows that when they were asked which was the active preparation, they couldn't tell. They appear to have had a 'nocebo' effect, where they interpreted everything they felt as an effect of the drug. That's as expected.

What makes me think that they felt bad on thyroxine is table 2, where all the 'self-reported' psychological scores have got worse from thyroxine. In particular p=0.007 for the decline in Vitality. Since, as you point out, they really didn't know which was which, it's hard to see how they could have faked that.

At first glance, the results from that study look like straightforward evidence that this treatment is actively harmful.

Absolutely this treatment is harmful to healthy people. It should cause 'hypermetabolism', which is unpleasant. And severe hypermetabolism is awful. Very like the manic phase of manic depression. You should be careful not to give drugs to people who don't need them. That's why in the old days, if they weren't sure, they'd give you a bit and watch to see what effect it had. That was pretty much their test, except in the obvious cases.

but choosing a single dose is normal procedure.

Yes, but that does mean that anything that needs careful dose control will get rejected. In this case I think it might have made the treatment less effective, but it shouldn't have ruined it. I'm not making any criticism of the people who did this trial, I think it was a brave try and they did it well. I just don't think it's enough to refute Skinner. In fact I think it was supportive.

From what I’m reading I don’t think there is any recognized clinical diagnosis of hypothyroidism. The TSH test is the gold standard.

There was once. The paper:

STATISTICAL METHODS APPLIED TO THE DIAGNOSIS OF HYPOTHYROIDISM by W. Z. BILLEWICZ, R. S. CHAPMAN, J. CROOKS, M. E. DAY, J. GOSSAGE, SIR EDWARD WAYNE, AND J. A. YOUNG

was the last word in 'clinical diagnosis'. It was very very difficult to do, and GPs tended to refer suspected cases to experts. In doubtful cases they just tried treating it with small amounts of thyroid and checked that people improved rather than being made anxious and hyper.

The TSH test replaced that around 1970. But they never seem to have checked that clinical and biochemical diagnoses detected the same things, and after that there was the slow emergence of all sorts of nasty diseases that look very like hypothyroidism in the clinical sense but have normal TSH.

The TSH test seems to have been accepted (and then ruthlessly enforced) on the basis of theoretical arguments that weren't checked experimentally.

I do think that the TSH test detects gland failure quite well, in fact I think that if your thyroid gland gets destroyed, your TSH value will become huge. My (excellent) GP tells me that he sees people with TSH 30 with no symptoms at all (yet! Their thyroids are obviously on the way out...).

In fact the original 'normal range' for TSH was very wide indeed. And I think that's probably right too. Over the years the 'normal range' has got narrowed to the point where it's now so narrow people with abnormal TSH usually don't have any symptoms, and the noise in the test can put you outside the range. That's kind of weird. See recent AACB study where they thought the upper limit of normal should be 2.5.

There was a recent attempt to define a new clinical score (Zulewski et al), but the authors of the paper who'd constructed it refused to endorse it because the symptoms didn't correlate with TSH. That says to me that the test isn't detecting the disease it's supposed to detect.

You have the burden of proof

Absolutely accept that! And if Skinner was right, it should be dead easy to prove. Just re-run the Scottish trial using Billewicz as the entry criterion. It would be better if you could adjust the dose, but it should work quite well with a fixed dose, if you accept you're going to under-treat some people and over-treat others. Actually I'd rather use titrated doses of desiccated thyroid, since that's what they used to do, or T4/T3 combinations, but if I believe Skinner then they should all work, and it's just a question of which works best.

Could you summarize your support for this claim? Are these the only two peer-reviewed articles?

These are the only ones I can find through google scholar / pubmed. That in itself is really surprising and one of the things I can't explain! Why has such an obvious thing not been ruled out? Real doctors seem to try it all the time, find it works, and then get persecuted for trying it.

All the rest of it is anecdotal, from alternative sources, but there's a mountain of it. Just google. If people have tried this and it didn't work, they're keeping very quiet. All I've heard against is 'it helps, but it doesn't fix it entirely'. And the alternative people say exactly that themselves, and reckon that there's usually something adrenal going on as well.

I'd point primarily to Broda Barnes, John Lowe, Kenneth Blanchard, Gordon Skinner, Sarah Myhill, Barry Durrant-Peatfield, the various thyroid activist groups, Kent Holtorf, and 'Wilson's syndrome', off the top of my head, but there's plenty more where that came from. And a lot of those guys are actual medical doctors. The big exception is John Lowe, who was a chiropractor. But I've read a lot of his stuff and he was a very careful, thoughtful man.

90% of medical research findings are false

Indeed. The whole thing is a disaster. John Ioannides said 'Evidence Based Medicine Has Been Hijacked'. But I think it's worse than that. By saying that you're going to ignore the experience of doctors, and only accept very expensive evidence that can only be provided by wealthy sources, and even then using methods so bad that they're practically guaranteed to produce false answers, you've completely cut yourselves off from the truth.

I'd go further and say 'Evidence Based Medicine Has Been A Catastrophe'. I'm not more than half-convinced this thryoid-craziness is true, but I think the fact that it's never been properly investigated is a complete scandal.

I'm not against "evidence based medicine" because it's based on evidence. I'm against "evidence based medicine" precisely because it's based on ignoring most of the evidence. -- GK Chesterton's Homeopath.

I was helping a consultant friend revise for an interview the other day, and one of the practice questions was 'describe the hierarchy of evidence'. He put 'expert opinion' bottom.

Really? Forty years of experience in treating patients is less valuable than a single anecdote published in a journal? Really?

And of course, it doesn't actually work that way. The TSH test ruling out hypothyroidism is expert opinion. Its reliability is unfounded dogma. I can't find any evidence for it as the sole measure of thyroid system function at all.

If none of the patients had had any sort of thyroid problem, I'd have expected it to be equally bad for everyone.

I’m talking about conservation of expected evidence. If X is positive evidence, then ~X is negative evidence. An experiment only supports a hypothesis if it was possible for it to come out another way that refutes it. And if an experiment that could have supported the hypothesis actually didn’t, then it’s evidence against.

What makes me think that they felt bad on thyroxine is table 2, where all the 'self-reported' psychological scores have got worse from thyroxine. In particular p=0.007 for the decline in Vitality. Since, as you point out, they really didn't know which was which, it's hard to see how they could have faked that.

Terminology then. When you said “Thyroxine is very strongly disliked by the healthy controls (they could tell it from placebo and hated it),” it suggests they could identify the active treatment.

Absolutely this treatment is harmful to healthy people.

The people in the study had symptoms. Even if you think their symptoms were mild or unrepresentative, you shouldn’t call them healthy. It’s fair to extend the conclusion to cover people without those symptoms, but I think that’s an important difference.

Yes, but that does mean that anything that needs careful dose control will get rejected.

It’s more that you need an easily followed protocol. Anything else, especially anything subjective, is unlikely to be practically feasible, and will probably not be reproducible.

The TSH test replaced that around 1970. But they never seem to have checked that clinical and biochemical diagnoses detected the same things, and after that there was the slow emergence of all sorts of nasty diseases that look very like hypothyroidism in the clinical sense but have normal TSH.

This is normal. Clinical presentations often have many causes, which makes it almost impossible to progress. Eventually we break them down based on their causal mechanisms so we can treat them individually. Each time we find a new cause, some of the cases will be left unexplained.

These are the only ones I can find through google scholar / pubmed. That in itself is really surprising and one of the things I can't explain! Why has such an obvious thing not been ruled out?

There are a lot of interesting hypotheses competing for resources, and we have to decide which ones are worth considering. I can’t say what the reason might be here, but there are a lot of possibilities. For example, it might not be possible to design a study like the one you want that could effectively answer the question.

Really? Forty years of experience in treating patients is less valuable than a single anecdote published in a journal? Really?

Yes. Expert opinion (i.e., the opinion of individual experts, not expert consensus) is the lowest level because you can find an expert to support pretty much any proposition that isn’t obviously ridiculous, and sometimes even if it is. In fact, this is true higher in the hierarchy as well, which is why we use syntheses of evidence so much. I can’t stress this enough: in biology, you can use peer-reviewed evidence to make plausible arguments for arbitrary hypotheses.

All the rest of it is anecdotal, from alternative sources, but there's a mountain of it.

The point of evidence-based medicine is that perceptions are unreliable. That includes the perceptions we call clinical experience (which once said that bloodletting was an important medical treatment). Keep in mind that doctors aren’t scientists and usually don’t even qualify as experts. EBM is unreliable too, but less so, just like science is unreliable but is still better than ancestral wisdom.

The TSH test ruling out hypothyroidism is expert opinion. Its reliability is unfounded dogma.

This sounds like you’re saying the TSH test doesn’t actually measure TSH, but I think you mean to say you disagree with the conclusions that it’s used for. But since hypothyroidism is defined as low thyroid hormone levels, some of this will be a dispute over definitions.

I can't find any evidence for it as the sole measure of thyroid system function at all.

I don’t think anyone who understands it would say it is. It measures TSH levels, and the question is what we do with that measurement. But we’re often limited by what we’re able to (easily) measure, and it might even be the only objective measurement we have.

I’m talking about conservation of expected evidence.

Sure, this experiment is evidence against 'all fat, tired people with dry hair get better with thryoxine'. No problem there.

Terminology then. When you said “Thyroxine is very strongly disliked by the healthy controls (they could tell it from placebo and hated it),” it suggests they could identify the active treatment.

Yes, it is kind of odd isn't it? One of the pills apparently made them a bit unwell, and yet they couldn't tell which one. I notice that I am confused.

The people in the study had symptoms.

You're right. I think I should have said "This treatment is harmful to most people".

This is normal...

But that's awful! Once, there was a diagnostic method, and a treatment that worked fine, that everyone thought was brilliant. Then they invented a test, which is very clever, and a good test for what it tests, and the result of that is that lots of people are ill and don't get the treatment any more and have to suffer horribly and die early.

If that's normal then there's something badly wrong with normal. A new way of measuring things should help!

we have to decide which ones are worth considering.

Sure, I'm trying to make a case that this one is worth considering.

it might not be possible to design a study ..

I think the Scottish study with stricter entry criteria for the patient group would do. If that failed, I would be quite surprised.

If someone did the same thing with stricter entry criteria and used desiccated thyroid and titrated doses and it failed I would be so surprised that I would give up.

Seriously, if 'start off with low doses and keep raising the dose until you get a response' is inaccessible to testing, then something is broken.

But in fact, just 'low basal metabolic rate in CFS' would be good evidence in favour, I think. We can work out optimal treatments later.

And if it turned out that there wasn't a subset of CFS patients with high Billewicz scores and low basal metabolic rates, I'd give up. No study needed.

I can’t stress this enough: in biology, you can use peer-reviewed evidence to make plausible arguments for arbitrary hypotheses.

At that point, we're all post-modernists aren't we? The truth is socially determined.

science is unreliable but is still better than ancestral wisdom

Science is not unreliable. If I can surprise a physicist or a chemist about something he is sure of, he will be very very interested, and science will quickly rearrange itself around the new fact. It took about five years to completely overturn classical physics and replace it with something we haven't managed to surprise yet, even though everyone knows that the new theories are broken and is actively trying to find things that happen that they don't predict. And classical physics is still damned good in the domains that it used to work in.

There's at least a possibility here that medical science is getting beaten hollow by chiropractors and quack doctors and internet loonies, none of whom have any resources or funding at all.

Even the possibility is enough to make me think that there's something appallingly badly wrong with the methods and structure of medical science.

in biology, you can use peer-reviewed evidence to make plausible arguments for arbitrary hypotheses.

Et tu, Brut? That is obviously true for humanities and for things like observational studies of nutrition, but do you think it extends to most / all of biology? "For any hypothesis there is a mouse strain which proves it true"? :-/

Hmmm

This piece claims that

...we face a replication crisis in the field of biomedicine, not unlike the one we’ve seen in psychology but with far more dire implications. Sloppy data analysis, contaminated lab materials, and poor experimental design all contribute to the problem.

...Freedman and his co-authors guessed that fully half of all results rest on shaky ground, and might not be replicable in other labs. These cancer studies don’t merely fail to find a cure; they might not offer any useful data whatsoever.

Oh God, where will this end? Is it really only physics and chemistry that aren't sloppy cargo-cults, or are they broken too?

A lot of this, I think is to do with taking tenure away from young academics. Once upon a time once you'd proved basic competence and cleverness, you could spend your whole career being careful about stuff. These days you've just got to turn out crap as fast as possible. And you spend most of your time applying for grants.

This open-access article discusses some of the issues in cancer research.

In most ways biology is intermediate between the hard and soft sciences, with all that implies. It’s usually impossible to identify all the confounders, most biologists are not trained in statistics, experiments are complex and you can get different results from slight variations in protocol, we're trying to generalize from imperfect models, many high-profile results don’t get tested by other labs, ... all these factors come together and we get something that people call a “replication crisis.”

tl;dr It's complicated.

Yes, I know. But it would be nice if people recognized that it is complicated and not pretend that we know more than we actually do.

By the way, what's you definition of hypothyroidism?

Since you are doubtful of the TSH test and point out that diagnosis by clinical symptoms is very hard (with the relevant implication that there will be a lot of mistakes), which exactly condition of the human body do you call hypothyroidism?

It's not "that which is made better by consuming dessicated thyroid", is it?

Actually I think it was only 'very hard' for GPs. It seems that most endocrinologists back in the day just ruled in or ruled out the obvious cases, and experimented with small amounts of thyroid on the rest. That doesn't look like a bad approach to me. Small amounts of thyroid with someone competent watching you won't hurt you.

just ruled in or ruled out the obvious cases, and experimented with small amounts of thyroid on the rest. That doesn't look like a bad approach to me.

I don't know about that. Let's try s/thyroid/bloodletting. Why, that looks just like what the doctors used to do a few centuries ago. I don't think this was a resounding success.

Well sure, and as an advocate of 'Traditional Western Medicine', I suppose I should be equally hurt by the disappearance of that once beloved (by doctor and patient alike!) treatment, so well supported by the humorous theories of Aristotle and Galen.

So I suggest that we appeal to Almighty God to show to us His Wille, by using the ritual of randomised controlled augury, as our fathers have shown us.

We should carefully select our patients using the strategems of Billewicz, and we should have three treatment arms, one with desiccated thyroid, one with leechwork and lancettry, and one with that flower of the modern schoole, graded exercise therapy.

I have a feeling that thryoid will come out well in the comparison. I am somewhat unclear as to GET vs leeches.

Let the best quackery win!

Something like 'inadequate thyroid-hormone-mediated regulation of metabolism'.

Certainly that would include primary, secondary, and tertiary hypothyroidism, as well as the various forms of 'peripheral resistance', and the conversion disorders. (TSH probably detects the primary form. The question is 'how widespread are the other things.')

A good clinical correlate for all of those would probably be the Billewicz score from '68. Which he pretty much did work out by 'that which is made better by consuming desiccated thyroid'.

We might want to call all that 'easily-treatable clinical hypothyroidism', or 'twimbbcdt'. Congratulations, trope-namer!

But actually there are consistent reports of people with all the usual symptoms who don't respond to the sane use of T4/T3 or NDT, but who do improve on insanely high levels of T3.

So I'd actually want to draw 'hypothyroidism' a bit wider than "that which is made better by consuming desiccated thyroid".

And I bet you can get all sorts of 'dysthyroidism' too, where there are some resistant tissues but some are fine, and you or your system raises levels of this or that to compensate, and makes some bits of you hyper and some bits hypo at the same time.

If there's that, we might actually need to understand how it works to treat it. Imagine...

And then, I bet if there are two acquired hormone resistances then there are others, and I bet they're all horribly intertwined, and lots of nasty pathogens either taking advantage of or causing them, so there's probably a whole swamp of horrors that get a bit better with thyroid but that no-one would call 'hypothyroidism'. We'd probably need to call those 'acquired generalised hormone resistance disorders', which gives the pleasantly onomatopoeic "aghrd". And we're not going to nail those without actually rolling up our sleeves and playing around.

At the current rate of progress it should all be sorted out well after we've destroyed the universe by careless use of computers.

Unless it is all being caused by nasty chemicals in the environment, in which case we'll probably make ourselves so stupid that we just blunder straight back into Malthus' trap in a few generations. And that should sort it all out 'the natural way'

Something like 'inadequate thyroid-hormone-mediated regulation of metabolism'.

That's wonderfully vague. I bet I can diagnose half the population with having "inadequate" regulation.

A definition should allow easy classification of observed phenomena into two classes: "fits the definition" and "doesn't fit the definition". This one... struggles.

we might actually need to understand how it works to treat

Yes, I have such a suspicion, too.

'acquired generalised hormone resistance disorders'

And this I can probably diagnose 90% of the population with? See above.

The meta issue is whether you want to medicalise deviations from the theoretical optimum. On the one hand, sure, it's nice to move closer to the optimum, on the other hand this means that no one is "healthy", everyone is "sick" and under care of doctors.

Well, 'hypothyroidism' was a very difficult and polymorphic badger in its day. But a thing that is difficult to detect can still be a thing. Consider neutrinos and gravity waves and unicornes, which no man nowadays doubts of.

And as for 'medicalise deviations from the theoretical optimum', most chronic fatigue people are already bothering their poor doctors incessantly, and being given (with the best will in the world) a selection of nasty things that mildly alleviate some of their symptoms. CFS is a horrible thing. As Hitler says in the film Downfall:

"'Chronic Fatigue Syndrome' ? they might as well call Leprosy : 'Chronic Dandruff Syndrome'".

Well, 'hypothyroidism' was a very difficult and polymorphic badger in its day.

Isn't it still "its day"?

Think of it this way. There is a set of people with some clinical symptoms which look maybe-possibly like hypothyroidism. There is a another set of people with abnormal TSH. These sets partially intersect and form three subsets. Subset one is the intersection: people with both clinical symptoms and abnormal TSH. They are a clear case and there are no problems here. Subset two is abnormal TSH and absence of clinical symptoms. We interpret that as thyroid gland falling apart and expect clinical symptoms to appear in the near future. We are not concerned with people either.

Subset three is the one you are interested in: people with normal TSH and clinical symptoms. What about them? Well, as you mention diagnosing hypothyroidism solely on the basis of clinical symptoms is difficult. So in this subset some but not all people will have a thyroid malfunction, and some will have other problems, maybe instead or maybe in addition to thyroid issues.

By the way, the people who you insist on calling "fat, tired, and with dry skin" are in subset three. They exhibit clinical symptoms of hypothyroidism.

Your suggestion is that we give some dessicated thyroid to subset three and see if it helps. Well, it's pretty clear that it will help some people and will not help other people (for example, those fat and tired ones). However that is true of many medical interventions.

For example, there are probably males in subset three with low testosterone. So giving testosterone to subset three males will also help some people and not help others. There also probably people with low-grade systemic infections in there. Giving broad-spectrum antibiotics to subset three might well help some people and not help others. There are likely people with autoimmune disorders there...

Basically, if you have little idea about what's wrong, trying a variety of drugs hoping for a lucky hit is not necessarily a horrible strategy (depends on the side-effects of the drugs and the consequences of doing nothing), but it's not much advancement from the good old times.

Let me quote from West Hunter:

Back in the good old days, Charles II, age 53, had a fit one Sunday evening, while fondling two of his mistresses.

Monday they bled him (cupping and scarifying) of eight ounces of blood. Followed by an antimony emetic, vitriol in peony water, purgative pills, and a clyster. Followed by another clyster after two hours. Then syrup of blackthorn, more antimony, and rock salt. Next, more laxatives, white hellebore root up the nostrils. Powdered cowslip flowers. More purgatives. Then Spanish Fly. They shaved his head and stuck blistering plasters all over it, plastered the soles of his feet with tar and pigeon-dung, then said good-night.

Tuesday. ten more ounces of blood, a gargle of elm in syrup of mallow, and a julep of black cherry, peony, crushed pearls, and white sugar candy.

Wednesday. Things looked good:: only senna pods infused in spring water, along with white wine and nutmeg.

Thursday. More fits. They gave him a spirituous draft made from the skull of a man who had died a violent death. Peruvian bark, repeatedly, interspersed with more human skull. Didn’t work.

Friday. The king was worse. He tells them not to let poor Nelly starve. They try the Oriental Bezoar Stone, and more bleeding. Dies at noon.

P.S. Note the awe-inspiring lack of smugness with which I present:

IMPAIRED ACTION OF THYROID HORMONE ASSOCIATED WITH SMOKING IN WOMEN WITH HYPOTHYROIDISM

BEAT MÜLLER , M.D., HENRYK ZULEWSKI , M.D., PETER HUBER , P H .D., JOHN G. RATCLIFFE , M.D., AND JEAN -JACQUES STAUB , M.D.

I bloody said it would turn out to be the reason smoking's bad for you, didn't I? And at the same time it's evidence that acquired hormone resistance exists, and this one fingers an environmental cause.

Isn't it still "its day"?

Opinions are divided. There's me and some dead guys, and everyone else. Everyone else thinks it's a solved problem.

By the way, the people who you insist on calling "fat, tired, and with dry skin" are in subset three. They exhibit clinical symptoms of hypothyroidism.

They absolutely do! Back in the day, they would have been referred to endocrinologists on suspicion of hypothyroidism, who would have (if they were very sophisticated and modern endocrinologists) used Billewicz' test to sort them into definite, definitely not, and 'therapeutic trial' groups. His test didn't rate these three symptoms, or lethargy or stupidity, because most everyone he saw had them, so he would look at all their other symptoms to make the diagnosis, looking for things like slow reflexes that are characteristic of hypothyroidism, and weight them to get a score. It really is a very careful piece of work, that test.

He would treat the 'definites' without further ado, send the 'definitely nots' off to people who were into diabetes etc, and be careful with the rest. Including all sorts of unreliable lab tests and therapeutic trials.

Luckily the therapeutic trials are not difficult to do, because with desiccated thyroid/T3 you seem to get either get a fairly rapid improvement, or you get hyper symptoms. (you might get both of course, in which case dose probably too high)

Other popular ways of trying to work it out involved cholesterol and basal metabolic rate.

Broda Barnes thought waking armpit temperature beat all this and just handed it out to anyone who woke up cold.

And the fact that it has been sprayed around at random for a hundred years without anyone having a word to say against it implies that it's pretty damned safe. If you give yourself a massive overdose, then sure, you can probably give yourself a heart attack, but you'd need to be way way more criminally careless than I can imagine any (modern) doctor being.

Osteoporosis and atrial fibrillation (both ghastly things) are associated with low TSH, so it's doubtless not a good idea to induce hyperthyroidism in people. And I think we should be careful not do that.

Barnes might have been deluded. I certainly started off thinking that he was, but one thing he was into was records and statistics. He thought his patients healthier than the general population. Including low rates of heart trouble. Which is just bizarre if what he was seeing was today's CFS etc population, who seem to be really ill and then go on to be even more ill. Unless his treatments actually helped.

Hell, let's do all four! If there's a subset of fat tired stupid lethargic CFS patients with dry skin and high Billewicz scores, low basal metabolic rates, high cholesterol, and low waking temperatures all at the same time, then let's run the Scottish trial on them and see what happens. That should be enough to break the TSH test, at which point, I imagine there will be an absolute explosion of research.

I couldn't agree more that it's really really important to understand mechanism. I'm into 'explanations' and 'causes'. I think you are too. I get the impression that they're a bit out of fashion in medicine.

Well, it's pretty clear that it will help some people

Ooh, is it me and you and some dead guys now? Welcome! Sorry some of us aren't that talkative. Damnit, that means I need another opponent. Devil's advocate isn't good enough. It needs to be someone who hates the idea.

Dies at noon.

Oh dear, poor Charles. The English crown was a bit of a poisoned chalice for the Stuarts wasn't it? Still, he made it to 53 and they did call him the Merry Monarch. Anyone who dies in office of excessive mistress-related-activity hasn't had a totally wasted life.