The Merits of Externally Invalid Survey Experiments

Gustavo Diaz
McMaster University
gustavodiaz.org
diazg2@mcmaster.ca

Slides: gustavodiaz.org/talk

Limitations

“Future research should confirm if our findings generalize…”

…with a representative sample
…in other countries
…beyond the survey setting
…when using behavioral outcomes

Usual workflow

Research idea
Realize resource/ethical/practical limitations
Conduct experiment with limitations
Wave hands about external validity

Recent trends

Goal

Should we ever implement an externally invalid survey experiment on purpose?
Identify what makes external invalidity desirable
Challenge: Different kinds of external (in)validity

External validity concerns

Type	Concern
Samples	Does this apply to a different population?
Contexts	Does this apply in a different setting?
Treatments	Do they resemble real-world phenomena?
Outcomes	Do they reflect actual behaviors?

Examples

Samples

Contexts

Saudi Arabia and Kuwait were selected for their theoretical case value; both are high in gender inegalitarianism, and they offer tough tests. In addition, while these neighboring countries have much in common, both resource-rich and highly conservative, they also differ in important ways. Thus, if similar results are found, the case for generalizability across different interaction types and varying national circumstances will be strengthened.

Treatments

Outcomes

Conclusion

Invalid	Benefit
Samples	Contour generalizations
Contexts	Contour generalizations
Treatments	Statistical properties
Outcomes	Hypothetical/rare scenarios

Endline: Consider merits before implementation
What would persuade you to embrace external invalidity?