The correlations achieved by separated measurements on entangled particles measurably exceed what is classically achievable. That's what's meant by "Bell inequality violations".
We can't transmit information instantaneously, but there are nevertheless certain distributed tasks we can do better at when we have a source of entangled states.
To paraphrase, Alice, Bob and Carol play a game where they can't communicate (after deciding on a strategy) and the referee shows each of them a bit. After being shown the bit they must reply with 0 or 1. The four possible combinations the referee chooses uniformly from is:
If the first combination was shown, the answers must have an even sum, otherwise the answers must have an odd sum.
First, without using probability, if A0 is the answer Alice gives when shown 0 and A1 when shown 1 (and similarly for Bob and Carol), you get the following set of equations needed to win always:
However, each term occurs twice on the left hand side, so when you add all equations up (mod 2) you would find the left hand side is even. However the right hand side sums to an odd number, thus all four equations can't hold simultaneously.
Now, probability doesn't help here, because any mixed strategy can be shown to be equivalent to a combination of pure strategies, none of which can guarantee a win.
Yet with a shared entangled state, it can be won 100% of the time. This isn't a probability thing - you can actually guarantee a win using the quantum strategy, even over arbitrarily large distances.