Alice: "I just flipped a coin [large number] times. Here's the sequence I got:
(Alice presents her sequence.)
Bob: No, you didn't. The probability of having gotten that particular sequence is 1/2^[large number]. Which is basically impossible. I don't believe you.
Alice: But I had to get some sequence or other. You'd make the same claim regardless of what sequence I showed you.
Bob: True. But am I really supposed to believe you that a 1/2^[large number] event happened, just because you tell me it did, or because you showed me a video of it happening, or even if I watched it happen with my own eyes? My observations are always fallible, and if you make an event improbable enough, why shouldn't I be skeptical even if I think I observed it?
Alice: Someone usually wins the lottery. Should the person who finds out that their ticket had the winning numbers believe the opposite, because winning is so improbable?
Bob: What's the difference between finding out you've won the lottery and finding out that your neighbor is a 500 year old vampire, or that your house is haunted by real ghosts? All of these events are extremely improbable given what we know of the world.
Alice: There's improbable, and then there's impossible. 500 year old vampires and ghosts don't exist.
Bob: As far as you know. And I bet more people claim to have seen ghosts than have won more than 100 million dollars in the lottery.
Alice: I still think there's something wrong with your reasoning here.
Substantial? No - it adds up to normality. Interesting? Yes.
Imagine two situations of equal improbability:
In one, Alice flips a coin N times in front of a crowd, and achieves some specific sequence M.
In the other, Alice flips a coin N / 2 times in front of a crowd, and achieves some specific sequence Q; she then opens an envelope, and reveals a prediction of exactly the sequence that she just flipped.
These two end results are equally improbable (both end results encode N bits of information - to see this, imagine that the envelope contained a different sequence than she flipped), but we attach significance to one result (appropriately) and not the other. What's the difference between the two situations?
I don't understand what you mean by this.
It is important to note that to capture this problem entirely we must make it explicit that the person observing the coin flips has not only a distribution over sequences of coin flips, but a distribution over world-models that produce the sequences. It is often implicit, and sometimes explicitly assumed, in coin flipping examples, that a normal human flipping a fair coin is something like our null hypothesis about the world. Most coins seem fair in our everyday experience. Alice correctly predicting the sequence that she achieves is evidence that causes a substantial update on our distribution over world-models, even if the two sequences are assigned equal probability in our distribution over sequences given that the null hypothesis is true.
You can also imagine it as the problem of finding an efficient encoding for sequences of coin flips. If you know that certain subsequences are more likely than others, then you should find a way to encode more probable subsequences with less bits. Actually doing this is equivalent to forming beliefs about the world. (Like 'The coin is biased in this particular way', or 'Alice is clairvoyant.')