November 2007

You are currently browsing the monthly archive for November 2007.

King et al. (1994: 75, note 1): ‘[a]t its core, real explanation is always based on causal inferences’.

Limiting Discussion to positivist Qualitative Methods

Qualitative Methods can be roughly divided into positivist (case-studies etc.) and interpretive. I will limit my comments to positivist Qualitative Methods. The differences between positivist qualitative and quantitative methods ‘are only stylistic and are methodologically and substantively unimportant’ (King et al., 1994:4). Both methods share ‘an epistemological logic of inference: they all agree on the importance of testing theories empirically, generating an inclusive list of alternative explanations and their observable implications, and specifying what evidence might infirm or affirm a theory’ (King et al. 1994: 3).

Causal Inference in Empirical Data

To impute causality, science relies either on evidence based process knowledge, or clever experiment design that obviates (perhaps more correctly, mitigates) the need to know the process – though researchers often are encouraged to have a story to explain the process, and test variables implicated in the ‘story’.

Experimentation provides one of the best ways to reliably impute causality. However for experiments to have value outside (the labs), the ‘treatment’ must be ‘ecological’ as in reflect the typical values that the variables takes in the world –for example, effect of news is best measured with either real life news clips or something similar, and the findings must hold up in the field (through surveys/field experiments). The problem is that most problems in Social Science cannot be studied experimentally. Brady et al. (2001:8) write: ‘A central reason why both qualitative and quantitative research are hard to do well is that any study based on observational (i.e., non-experimental) data faces the fundamental inferential challenge of eliminating rival explanations’. While most Social Science papers (and certainly practitioners) talk about ‘eliminating’ rival explanations, one doesn’t quiet have to do that. Social Scientists often times include variables reflecting ‘rival explanations’ as ‘straw variables’ (to refer to the straw man they will blow at the end) in a regression equation, to show how much variance is ‘explained’ by their variable of choice as compared to the ‘straw variables’.

In Quantitative Methods, to impute causality, one makes a variety of assumptions including: a ‘ceteris paribus’ (all other things being equal – which may mean assigning away everything ‘else’ to randomization) clause, error term (non-systematic) part is not correlated with other independent variables, to infer the correlation between an explanatory variable x and dependent variable y can only be explained as x’s effect on y. In analyzing survey data, one ‘controls’ for variables using regression, or some other similar way. There are a variety of assumptions in regression models and penalty for violation of each of these assumptions. In Qualitative Methods, one can either analytically (or where possible empirically) control for variables, or trace the process.

Qualitative Methods: some handles on generalizability

Traditional probability sampling theories are built on the highly conservative assumption that we that we know nothing about the world. And the only systematic way to go about knowing it is through ‘random sampling’, a process that delivers ‘representative data’ on average. Newer sampling theories, however, acknowledge what we know about the world by selectively over-sampling things (or people) we are truly clueless about, and under-sampling where we have a good idea. For example, polling organizations under-sample self described partisans, and over-sample non-partisans. This provides a window for positivist qualitative methods to make generalizable claims. Qualitative methods can overcome their limitations and make legitimate generalizable claims if their sampling reflects the extent of prior knowledge about the world.

Other than sampling, there are other analytical ways of getting a handle on the variables that ‘moderate’ the effect of a particular variable that we may be interested in studying. For example, we can analytically think through how education will affect (or not affect) racist attitudes. Analytical claims are based on deductive logic and a priori assumptions or knowledge. Hence the success of analytical claims is contingent upon the accuracy of the knowledge, and the correctness of the logic.

One of the problems that have been repeatedly pointed out about Qualitative research is its propensity to select on the dependent variable. Selection on dependent variable deviously leaves out cases where for example the dependent variable doesn’t take extreme values. Selection bias can not only lead to misleading conclusions about causal effects but also about causal processes. It is important hence not to use a truncated dependent variable to do one’s analysis. One of the ways one can systematically drill down to causal processes in qualitative research is by starting off with the broadest palette, either in prior research or elsewhere, to grasp the macro-processes, and other variables that may affect the case. Then cognizant of the particularistic aspects of a particular case, analyze the microfoundations or microprocesses present in the system.

Positivism and Empiricism

Scientific method roughly refers to systematic analysis of empirical data. The quality and the strength of the ‘system’ – forever open to challenge – determine any claims of epistemic superiority that the ‘scientific method’ may make over other competing claims of gleaning ‘knowledge’ from data. The extent to which claims are solely arbitrated on scientific merit is limited by a variety of factors, as outlined by Lakatos, Kuhn, and Feyerabend, resulting in at best an inefficient process and at worst something far more pernicious. I however ignore such issues and focus narrowly on methodological questions around causality and generalizability in qualitative methods.

In science, inquiry into generalizable causal processes is greatly privileged, and for good reason – causality and generalizability taken together can provide basis for policy action, for mounting intervention, among other things. One can also think of knowledge of causal processes as providing predictive power. However, not all kinds of data make themselves readily accessible to imputing causality, or even to making generalizable descriptive statements. For example, causal inference in most historical research remains out of bounds. AddKeeping this in mind, I analyze how qualitative methods within Social Sciences (can) interrogate causality and generalizability.

Causality:

Hume felt that there was no place for causality within empiricism. He argued that the most we can find is that “the one [event] does actually, in fact, follow the other”. More broadly, causality is nothing but an illusion occasioned when events follow each other with regularity. That formulation however didn’t prevent Hume from believing in scientific theories for he felt that regularly occurring constant conjunctions were sufficient basis for scientific laws. Theoretical advances in the 200 or so years since Hume have been able to provide a deeper understanding of causality, including a process based understanding and an experimental understanding.

Donald Rubin, a Professor of Statistics at Harvard, defines causal effect as, “Intuitively, the causal effect of one treatment, E, over another, C, for a particular unit and an interval of time from t1 to t2 is the difference between what would have happened at time t2 if the unit had been exposed to E initiated at t1 and what would have happened at t2 if the unit had been exposed to C initiated at t1: ‘If an hour ago I had taken two aspirins instead of just a glass of water, my headache would now be gone,’ or because an hour ago I took two aspirins instead of just a glass of water, my headache is now gone.’ Our definition of the causal effect of the E versus C treatment will reflect this intuitive meaning.”

It is important to note that RCM as presented above depicts an elementary causal connection between two Boolean variables: one level of explanatory variable (two aspirins) with single effect (headache is gone). Often times, the variables take multiple values and to study that, we need to mount a series of experiments. Similarly one may want to analyze the effect of a variable in different subgroups of populations, for example, effect of aspirin on women as compared to men. All the above two scenarios do is highlight the problems in coming up with a robust understanding of causal processes between (here) two variables. Fundamentally though, our RCM understanding of causal effect remains unchanged.

Rubin Causal Model provides a counterfactual deterministic understanding of causality that is firmly based in the logic of experiment design. RCM formulation can be expanded to include a probabilistic understanding of causal effect. Just as a note: A probabilistic understanding of causality implicitly accepts that certain parts of the explanation are still missing, and hence is absent of a necessary and sufficient condition though attempts have been made to include necessary and sufficient clauses in probabilistic statements. David Papineau (Probabilities and Causes, 1985, Journal of Philosophy) writes, “Factor A is a cause of some B just in case it is one of a set of conditions that are jointly and minimally sufficient for B. In such a case we can write A&X ->B. In general there will also be other sets of conditions minimally sufficient for B. Suppose we write their disjunction as Y. If now we suppose further that B is always determined when it occurs, that it never occurs unless one of these sufficients sets (let’s call them B’s full causes) occurs first, then we have, A and X condition conjugated with Y is equivalent with B. Given this equivalence, it is not difficult to see why A’s causing B should be related to A’s being correlated with B. If A is indeed a cause of B, then there is a natural inference to Prob(B/A) > Prob(B/-A): for, given A, one will have B if either X or Y occurs, whereas without A one will get B only with Y. And conversely it seems that if we do find that Prob(B/A) > Prob(B/-A), then we can conclude that A is a cause of B: for if A didn’t appear in the disjunction of full causes which are necessary and sufficient for B, then it wouldn’t affect the chance of B occurring.”

Papineau’s definition is a bit archaic, and doesn’t quite cover the set of cases we define as probabilistically causal. John Gerring (Social Science Methodology: A Criterial Framework, 2001: 127,138; emphasis in original), provides a definition of probabilistic causality: ‘[c]auses are factors that raise the (prior) probabilities of an event occurring. (…) [Hence] a sensible and minimal definition: X may be considered a cause of Y if (and only if) it raises the probability of Y occurring.’

A still more ‘sensible’ and still ‘minimal’ definition of causality, can be found in Gary King et al. (Designing Social Inquiry: Scientific Inference in Qualitative Research, 1994: 81-82), ‘the causal effect is the difference between the systematic component of observations made when the explanatory variable takes one value and the systematic component of comparable observations when the explanatory variable takes on another value.’

Brief discussion on Causal Inference in Qualitative and Quantitative Methods

While the above formulations of causality – Rubin Causal Model, Gerring, and King – seem more quantitative, they can be applied equally to qualitative methods. A parallel understanding of causality, used much more often in qualitative social science, is a process based understanding of causality wherein you trace the causal process to construct a theory. Simplistically speaking, in Quantitative methods in Social Sciences, one often times deduces the causal process, while in Qualitative methods the understanding of the causal process is induced from deep and close interaction with ‘data’.

Both deduction and induction processes, however, are rife with problems. Deduction privileges formal rules – statistics – that straightjacket the systematic deductive process so that the deductions are systematic and cognizant of explicit assumptions (like normal distribution of data, linearity of the effect, lack of measurement error, etc.). The formal deductive process bestows a host of appealing qualities like generalizability (when an adequate random sample of population is taken) or even systematic handle on causal inference. In quantitative methods, the methodological assumptions for deduction are cleanly separated from data. The same separation – between the formal deductive process with a rather arbitrarily chosen statistical model and data – however makes the discovery process less than optimal, and sometimes deeply problematic. Recent research by Ho and King (Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference, Political Analysis, 2007), and methods like Bayesian Model Averaging by Volinsky, have gone some ways in providing ways to mitigate problems with model selection.

No such sharp delineation between method and data exists in qualitative research, where data is collected iteratively – [in studies using iterative abstraction (Sayer 1981, 1992; Lawson 1989, 1995) or grounded theory (Glaser 1978; Strauss 1987; Strauss and Corbin 1990)] – till it explains the phenomenon singled out for explanation. Grounded data driven Qualitative methods often run the risk of modeling in particularistic aspects of data, which impedes the reliability with which they can come up with a generalizable causal model. This is indeed only one kind of qualitative research for there are others who do qualitative analysis in the vein of experiments – for example with a 2×2 model, and yet others who will test apriori assumptions by analytically controlling for variables in a verbal regression equation to get at the systematic effect of explanatory variable on the explanandum. Perhaps more than grounded theory method, the pseudo-quantitative style qualitative analysis runs the risk of coming to deeply problematic conclusions based on the cases used.

On a slow news day, Western journalists, took to telling the world about how ‘Pakistan’s Musharraf’ – it was nice of them to put the possessive noun as it is not always clear – had declared emergency. In doing so, they paid as much attention to a third-world dictator enacting Martial law, as they have in some time. The calamitous event was reported with the usual combination of scant detail, high impact graphics, half-baked – sometimes skipping the oven all together – analysis, all topped with alarmist rhetorical flourishes (they will give you a heart attack if you don’t watch out).

The news day actually started a day earlier with reporting about speculation, and government warnings against speculated action. US, proudly and staunchly backing Musharraf at least since a day after that calamitous day in 2001, warned their pet dictator ‘against martial law’. (San Jose Mercury News) When the threat of declaring martial law grew, the ‘world grew concerned’ (Guardian) simultaneously. And when a ‘desperate Musharraf’ (Trend Information, Azerbaijan!) declared emergency, it left US in a ‘tizzy’ (Calcutta Telegraph), as it ‘reel[ed]‘ (AFP) under the blow. US immediately took umbrage and issued ‘condemnation’. The ‘world’, not to be left behind, ’roundly condemned’ (Duetche Welle, Germany), ‘slammed’ (Earthtimes, UK), and ‘rapped’ (The Province, Canada) General Sahib. Rice went further, for she wanted ‘Pakistan to evolve as a democracy’, and appealed for a return to the ‘constitutional course’ (Bloomberg) even as the phrase cruel and unusual punishments”, in US’s own constitution was being debated to see if ‘coercively inducing a drowning sensation’ met that criteria. All this condemnation must have left Musharraf chagrined.

It was a ‘Sad day for Pakistan’ (The Daily Star, Bangladesh) when an ‘out of control’ (Nation Multimedia, Thailand) Musharraf chose to launch a ‘coup within a coup’, and put Pakistan under his ‘iron fist’ (The Standard, Hong Kong – an area that reels under the iron fist of its own big brother). The wordsmiths at Hindustan Times online division, which doubles as a soft porn website, found time to craft the smart aleck headline “Under General Anesthesia” to describe the events of ‘Black Saturday’. (Malaysia Sun)

On that ‘black’ day, Western journalist’s thoughts didn’t stay long with people in Pakistan, as reporting on martial law gave way to more pertinent matters like ‘threat of nuclear weapons falling into wrong hands’, and ‘War on Terror’.

After all, the concerns of the media are solely determined by what (and how) they can best pander conditional on what is available. The pavlovian reactions to international crisis, the cues media uses to determine when to cry fire and when to cheer, are all rather simplistic – Democracy is good, autocrats are bad. Forget then that sometimes ‘enlightened moderation’ is the best alternative. Ignore too that nothing has changed substantively in Pakistani politics – control still wrests with the general for after all it was only a ‘coup within a coup’. Banish any thoughts that Benazir, the incomparable, ran a regime true to her name – only if in levels of corruption.

I misstate my point, as I often do, for people who know little don’t need to deliberately ignore. They simply write. Till a new story appears and fuels a new news cycle, provides more cause for alarm, and more time to run ads.

It appears that Pakistan has had its day in the sun. New York Times is not waiting with baited breath; there is a cell phone jammer in the market that can stop the person sitting next to you from yammering while you try to read about Paris Hilton. (Number 1 story on NY Times website) Yeah, she is back.