I have two series of numbers. Series A contains 1s and 0s, depending on whether a patient took a pill or not. Series B contains random numbers. The series B values that coincide with the patient taking a pill average 100, whereas those that coincide with NOT taking a pill average 101. There is a HUGE amount of data, so I am trying to find the formula that will show that there is a strong correlation between the two: that if the patient takes the pill, the most likely result is that their B measurement will change by 1 point. A standard correlation coefficient shows a low correlation... around .15. Any help would be greatly appreciated.

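The most commonly used correlation measure (Pearson's correlation) is not well suited to this problem. It reflects how much of the spread in B is accounted for by A, so a small shift in the mean relative to the noise in B gives a low coefficient even when the shift is real. I suggest that you measure two things: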

1. Magnitude: The difference between the mean of B where A is 0 and the mean of B where A is 1.

and...

2. Significance: Use a t-test or a bootstrap to establish that the true difference between the two means is unlikely to be zero.
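
For what it's worth, here is a minimal Python sketch of both steps, using only the standard library. The data is simulated, since I don't have yours: the group means of 100 and 101 come from the question, but the sample size, the standard deviation of 3.3, and all function names are my own assumptions. The p-value uses a normal approximation to Welch's t statistic, which should be reasonable only because the sample is large.

```python
import math
import random
from statistics import fmean, variance


def split_by_pill(a, b):
    """Split B into values where A == 1 (pill) and A == 0 (no pill)."""
    took = [y for x, y in zip(a, b) if x == 1]
    skipped = [y for x, y in zip(a, b) if x == 0]
    return took, skipped


def welch_t(took, skipped):
    """Welch's t statistic with a two-sided p-value.

    The p-value uses a normal approximation, an assumption that is
    only sensible for large samples.
    """
    se = math.sqrt(variance(took) / len(took) + variance(skipped) / len(skipped))
    t = (fmean(took) - fmean(skipped)) / se
    p = math.erfc(abs(t) / math.sqrt(2))
    return t, p


def bootstrap_ci(took, skipped, n_boot=500, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the difference in means."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(n_boot):
        # Resample each group with replacement, keeping its size.
        t_sample = rng.choices(took, k=len(took))
        s_sample = rng.choices(skipped, k=len(skipped))
        diffs.append(fmean(t_sample) - fmean(s_sample))
    diffs.sort()
    low = diffs[int(alpha / 2 * n_boot)]
    high = diffs[int((1 - alpha / 2) * n_boot) - 1]
    return low, high


# Simulated stand-in for the real data (assumed n and spread).
rng = random.Random(42)
a = [rng.randint(0, 1) for _ in range(4000)]
b = [rng.gauss(100 if x == 1 else 101, 3.3) for x in a]

took, skipped = split_by_pill(a, b)

# 1. Magnitude: difference between the two group means.
diff = fmean(took) - fmean(skipped)

# 2. Significance: t-test and bootstrap interval.
t_stat, p_value = welch_t(took, skipped)
ci_low, ci_high = bootstrap_ci(took, skipped)

print(f"difference in means: {diff:.3f}")
print(f"Welch t: {t_stat:.2f}, approximate p-value: {p_value:.3g}")
print(f"95% bootstrap interval: ({ci_low:.3f}, {ci_high:.3f})")
```

If the bootstrap interval excludes zero and the p-value is small, you can report the difference in means as the size of the effect, regardless of how low the correlation coefficient looks.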

