Figuring out an ad’s real effect is tricky. Clicks don’t tell the whole story. Even if the user clicked on the ad and made a purchase, we can’t be sure if the ad drove this sale, or if it would’ve happened anyway, especially if they saw multiple ads on different platforms. And sometimes, a user doesn’t click on the ad but makes a purchase because of it later. Attribution models can’t capture the true effect of the ad, so they give a biased image.
The solution lies in running tests to compare different scenarios: one where you advertise and one where you don’t. You want to create two different worlds which are similar in every parameter except for this specific ad you’re trying to investigate. Other campaigns can be going on at the same time too, as long as they are going on in both worlds. You don’t even care if the user interacted with the ad, because if the ad had any effect on the bottom line, a comparison between these two worlds will show it. You can also measure an entire platform, or any combination of ads, you just need to compare a scenario with it and a scenario without it. But how do you create and compare these worlds?
A/B testing is probably the first thing that comes to mind when mentioning testing. You randomly split your audience into two: some see the ad, the others don’t. Then, you track the people in both groups and compare KPIs to see the real impact of your ad. Any difference between these groups can be attributed to the ads. But there are three big limitations:
So, we need a smarter approach.
Geo Tests is a smart workaround. Instead of focusing on individuals, you identify whole geographic areas, usually DMAs or stats. Some areas see the ads, and some don’t, and you compare KPIs in these areas. This could mean billboards in certain states but not in others. This approach dodges privacy issues since you’re comparing areas, not individual people, and can be easily tracked even for offline ads. But there’s a catch: you need to make sure the areas you’re comparing have similar enough trends in your KPIs prior to the test. Otherwise, you can’t really say the difference in the KPI during the test was because of the ads.
For example, if you place ads for cars in Iowa but not in New York, we can’t say the difference in sales in these states is only attributed to the ads. New York and Iowa are very different places, with New York having a denser population and greater reliance on public transportation, while Iowa has more rural areas and a higher dependency on personal vehicles. So you need to be smart when dividing areas into test and control, and even use weights to make sure these two pseudo-populations made from different areas have similar enough trends to be compared. If we do that, we can estimate the true causal effect of the ad, and calculate the lift, incrementality and all the usual metrics.
To get accurate results from Geo Tests, keep these in mind: