University of South Florida
Digital Commons @ University of South Florida
Textbooks Collection Tampa Library
2012
Social Science Research: Principles, Methods, and Practices
Anol Bhattacherjee
University of South Florida
Follow this and additional works at: https://digitalcommons.usf.edu/oa_textbooks
Part of the American Studies Commons, Education Commons, Public Health Commons, and the
Social and Behavioral Sciences Commons
Recommended Citation
Bhattacherjee, Anol, "Social Science Research: Principles, Methods, and Practices" (2012). Textbooks Collection. 3.
https://digitalcommons.usf.edu/oa_textbooks/3
This Book is brought to you for free and open access by the Tampa Library at Digital Commons @ University of
South Florida. It has been accepted for inclusion in Textbooks Collection by an authorized administrator of Digital
Commons @ University of South Florida. For more information, please contact [email protected].
SOCIAL SCIENCE RESEARCH:
PRINCIPLES, METHODS, AND PRACTICES
ANOL BHATTACHERJEE
SOCIAL SCIENCE RESEARCH:
PRINCIPLES, METHODS, AND PRACTICES
Anol Bhattacherjee, Ph.D.
University of South Florida
Tampa, Florida, USA
Second Edition
Copyright © 2012 by Anol Bhattacherjee
Published under the
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License
Social Science Research: Principles, Methods, and Practices, 2nd edition
By Anol Bhattacherjee
First published 2012
ISBN-13: 978-1475146127
ISBN-10: 1475146124
Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License:
Users are free to use, copy, share, distribute, display, and reference this book under the following
conditions:
ATTRIBUTION: Whole or partial use of this book should be attributed (referenced or cited)
according to standard academic practices.
NON-COMMERCIAL: This book may not be used for commercial purposes.
SHARE ALIKE: Users may alter, transform, or build upon this book, but must distribute the
resulting work under the same or similar license as this one.
For any reuse or distribution, the license terms of this work must be clearly specified. Your fair use
and other rights are in no way affected by the above.
Copyright © 2012 by Anol Bhattacherjee
Preface
This book is designed to introduce doctoral and graduate students to the process of
scientific research in the social sciences, business, education, public health, and related
disciplines. It is based on my lecture materials developed over a decade of teaching the
doctoral-level class on Research Methods at the University of South Florida. The target
audience for this book includes Ph.D. and graduate students, junior researchers, and professors
teaching courses on research methods, although senior researchers can also use this book as a
handy and compact reference.
The first and most important question potential readers should have about this book is
how is it different from other text books on the market? Well, there are four key differences.
First, unlike other text books, this book is not just about “research methods” (empirical data
collection and analysis) but about the entire “research process” from start to finish. Research
method is only one phase in that research process, and possibly the easiest and most structured
one. Most text books cover research methods in depth, but leave out the more challenging, less
structured, and probably more important issues such as theorizing and thinking like a
researcher, which are often prerequisites of empirical research. In my experience, most
doctoral students become fairly competent at research methods during their Ph.D. years, but
struggle to generate interesting or useful research questions or build scientific theories. To
address this deficit, I have devoted entire chapters to topics such as “Thinking Like a
Researcher” and “Theories in Scientific Research”, which are essential skills for a junior
researcher.
Second, the book is succinct and compact by design. While writing the book, I decided
to focus only on essential concepts, and not fill pages with clutter that can divert the students’
attention to less relevant or tangential issues. Most doctoral seminars include a fair
complement of readings drawn from the respective discipline. This book is designed to
complement those readings by summarizing all important concepts in one compact volume,
rather than burden students with a voluminous text on top of their assigned readings.
Third, this book is free in its download version. Not just the current edition but all
future editions in perpetuity. The book will also be available in Kindle e-Book, Apple iBook, and
on-demand paperback versions at a nominal cost. Many people have asked why I’m giving
away something for free when I could make money selling it. Well, it is not just to stop my students
from constantly complaining about the high price of text books, but also because I believe that
scientific knowledge should not be constrained by access barriers such as price and availability.
Scientific progress can occur only if students and academics around the world have affordable
access to the best that science can offer, and this free book is my humble effort to that cause.
However, free should not imply “lower quality.” Some of the best things in life such as air,
water, and sunlight are free. Many of Google’s resources are free too, and one can well imagine
where we would be in today’s Internet age without Google. Some of the most sophisticated
software programs available today, like Linux and Apache, are also free, and so is this book.
Fourth, I plan to make local-language versions of this book available in due course of
time, and those translated versions will also be free. So far, I have had commitments to
translate this book into Chinese, French, Indonesian, Korean, Portuguese, and Spanish (those versions will hopefully be available in 2012), and I’m looking for qualified researchers or
professors to translate it into Arabic, German, and other languages where there is sufficient
demand for a research text. If you are a prospective translator, please note that there will be no
financial gains or royalty for your translation services, because the book must remain free, but
I’ll gladly include you as a coauthor on the local-language version.
The book is structured into 16 chapters for a 16-week semester. However, professors
or instructors can add, drop, stretch, or condense topics to customize the book to the specific
needs of their curriculum. For instance, I don’t cover Chapters 14 and 15 in my own class,
because we have dedicated classes on statistics to cover those materials and more. Instead, I
spend two weeks on theories (Chapter 3), one week on discussing and conducting reviews for academic journals (not covered in the book), and one week on a final exam. Nevertheless, I felt it
necessary to include Chapters 14 and 15 for academic programs that may not have a dedicated
class on statistical analysis for research. A sample syllabus that I use for my own class in the
business Ph.D. program is provided in the appendix.
Lastly, I plan to continually update this book based on emerging trends in scientific
research. If there is any new or interesting content that you wish to see in future editions, please drop me a note, and I will try my best to accommodate it. Comments, criticisms, or corrections to any of the existing content will also be greatly appreciated.
Anol Bhattacherjee
E-mail: abhatt@usf.edu
Table of Contents
Introduction to Research
1. Science and Scientific Research.................................................................................................... 1
2. Thinking Like a Researcher ........................................................................................................... 9
3. The Research Process ................................................................................................................. 17
4. Theories in Scientific Research ................................................................................................... 25
Basics of Empirical Research
5. Research Design ......................................................................................................................... 35
6. Measurement of Constructs....................................................................................................... 43
7. Scale Reliability and Validity ....................................................................................................... 55
8. Sampling ..................................................................................................................................... 65
Data Collection
9. Survey Research ......................................................................................................................... 73
10. Experimental Research .............................................................................................................. 83
11. Case Research ............................................................................................................................ 93
12. Interpretive Research ............................................................................................................... 103
Data Analysis
13. Qualitative Analysis .................................................................................................................. 113
14. Quantitative Analysis: Descriptive Statistics ............................................................................ 119
15. Quantitative Analysis: Inferential Statistics ............................................................................. 129
Epilogue
16. Research Ethics ........................................................................................................................ 137
Appendix ............................................................................................................................................. 143
Chapter 1
Science and Scientific Research
What is research? Depending on who you ask, you will likely get very different answers
to this seemingly innocuous question. Some people will say that they routinely research
different online websites to find the best place to buy goods or services they want. Television
news channels supposedly conduct research in the form of viewer polls on topics of public
interest such as forthcoming elections or government-funded projects. Undergraduate students
research the Internet to find the information they need to complete assigned projects or term
papers. Graduate students working on research projects for a professor may see research as
collecting or analyzing data related to their project. Businesses and consultants research
different potential solutions to remedy organizational problems such as a supply chain
bottleneck or to identify customer purchase patterns. However, none of the above can be
considered “scientific research” unless: (1) it contributes to a body of science, and (2) it follows
the scientific method. This chapter will examine what these terms mean.
Science
What is science? To some, science refers to difficult high school or college-level courses
such as physics, chemistry, and biology meant only for the brightest students. To others,
science is a craft practiced by scientists in white coats using specialized equipment in their
laboratories. Etymologically, the word “science” is derived from the Latin word scientia
meaning knowledge. Science refers to a systematic and organized body of knowledge in any
area of inquiry that is acquired using “the scientific method” (the scientific method is described
further below). Science can be grouped into two broad categories: natural science and social
science. Natural science is the science of naturally occurring objects or phenomena, such as
light, objects, matter, earth, celestial bodies, or the human body. Natural sciences can be further
classified into physical sciences, earth sciences, life sciences, and others. Physical sciences
consist of disciplines such as physics (the science of physical objects), chemistry (the science of
matter), and astronomy (the science of celestial objects). Earth sciences consist of disciplines
such as geology (the science of the earth). Life sciences include disciplines such as biology (the
science of living organisms) and botany (the science of plants). In contrast, social science is the
science of people or collections of people, such as groups, firms, societies, or economies, and
their individual or collective behaviors. Social sciences can be classified into disciplines such as
psychology (the science of human behaviors), sociology (the science of social groups), and
economics (the science of firms, markets, and economies).
The natural sciences are different from the social sciences in several respects. The
natural sciences are very precise, accurate, deterministic, and independent of the person
making the scientific observations. For instance, a scientific experiment in physics, such as
measuring the speed of sound through a certain medium or the refractive index of water, should
always yield the exact same results, irrespective of the time or place of the experiment, or the
person conducting the experiment. If two students conducting the same physics experiment
obtain two different values of these physical properties, then it generally means that one or
both of those students must be in error. However, the same cannot be said for the social
sciences, which tend to be less accurate, deterministic, or unambiguous. For instance, if you
measure a person’s happiness using a hypothetical instrument, you may find that the same
person is more happy or less happy (or sad) on different days and sometimes, at different times
on the same day. One’s happiness may vary depending on the news that person received that
day or on the events that transpired earlier during that day. Furthermore, there is not a single
instrument or metric that can accurately measure a person’s happiness. Hence, one instrument
may calibrate a person as being “more happy” while a second instrument may find that the
same person is “less happy” at the same instant in time. In other words, there is a high degree
of measurement error in the social sciences and there is considerable uncertainty and little
agreement on social science policy decisions. For instance, you will not find many
disagreements among natural scientists on the speed of light or the speed of the earth around
the sun, but you will find numerous disagreements among social scientists on how to solve a social problem such as reducing global terrorism or rescuing an economy from a recession. Any
student studying the social sciences must be cognizant of and comfortable with handling higher
levels of ambiguity, uncertainty, and error that come with such sciences, which merely reflects
the high variability of social objects.
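To make the idea of measurement error more concrete, the short sketch below (a purely hypothetical simulation, not an example from any actual study) models two imperfect instruments measuring the same underlying happiness level and counts how often they disagree about which of two people is happier:

    import numpy as np

    rng = np.random.default_rng(42)
    n = 1000

    # True (unobservable) happiness of n people, on an arbitrary 0-10 scale.
    true_happiness = rng.uniform(0, 10, n)

    # Two hypothetical instruments, each adding its own measurement error.
    instrument_a = true_happiness + rng.normal(0, 1.5, n)
    instrument_b = true_happiness + rng.normal(0, 1.5, n)

    # For random pairs of people, check whether the two instruments agree
    # on which person is happier.
    i, j = rng.integers(0, n, 500), rng.integers(0, n, 500)
    agree = np.sign(instrument_a[i] - instrument_a[j]) == np.sign(instrument_b[i] - instrument_b[j])
    print(f"Instruments agree on who is happier in {agree.mean():.0%} of comparisons")

Even though both instruments track the same underlying construct, their independent errors cause them to disagree in a noticeable share of comparisons, which is exactly the kind of ambiguity that social science researchers must learn to live with.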
Sciences can also be classified based on their purpose. Basic sciences, also called pure
sciences, are those that explain the most basic objects and forces, relationships between them,
and laws governing them. Examples include physics, mathematics, and biology. Applied
sciences, also called practical sciences, are sciences that apply scientific knowledge from basic
sciences in a physical environment. For instance, engineering is an applied science that applies
the laws of physics and chemistry for practical applications such as building stronger bridges or
fuel efficient combustion engines, while medicine is an applied science that applies the laws of
biology for solving human ailments. Both basic and applied sciences are required for human
development. However, the applied sciences cannot stand on their own, but instead rely on the basic sciences for their progress. Of course, industry and private enterprises tend to focus
more on applied sciences given their practical value, while universities study both basic and
applied sciences.
Scientific Knowledge
The purpose of science is to create scientific knowledge. Scientific knowledge refers to
a generalized body of laws and theories to explain a phenomenon or behavior of interest that
are acquired using the scientific method. Laws are observed patterns of phenomena or
behaviors, while theories are systematic explanations of the underlying phenomenon or
behavior. For instance, in physics, the Newtonian Laws of Motion describe what happens when
an object is in a state of rest or motion (Newton’s First Law), what force is needed to move a
stationary object or stop a moving object (Newton’s Second Law), and what happens when two
objects collide (Newton’s Third Law). Collectively, the three laws constitute the basis of
classical mechanics, a theory of moving objects. Likewise, the theory of optics explains the
properties of light and how it behaves in different media, electromagnetic theory explains the
properties of electricity and how to generate it, quantum mechanics explains the properties of
subatomic particles, and thermodynamics explains the properties of energy and mechanical
work. An introductory college level text book in physics will likely contain separate chapters
devoted to each of these theories. Similar theories are also available in social sciences. For
instance, cognitive dissonance theory in psychology explains how people react when their
observations of an event are different from what they expected of that event, general deterrence theory explains why some people engage in improper or criminal behaviors, such as illegally downloading music or committing software piracy, and the theory of planned behavior explains how
people make conscious reasoned choices in their everyday lives.
The goal of scientific research is to discover laws and postulate theories that can explain
natural or social phenomena, or in other words, build scientific knowledge. It is important to
understand that this knowledge may be imperfect or even quite far from the truth. Sometimes,
there may not be a single universal truth, but rather an equilibrium of “multiple truths.” We
must understand that the theories, upon which scientific knowledge is based, are only
explanations of a particular phenomenon, as suggested by a scientist. As such, there may be
good or poor explanations, depending on the extent to which those explanations fit well with
reality, and consequently, there may be good or poor theories. The progress of science is
marked by our progression over time from poorer theories to better theories, through better
observations using more accurate instruments and more informed logical reasoning.
We arrive at scientific laws or theories through a process of logic and evidence. Logic
(theory) and evidence (observations) are the two, and only two, pillars upon which scientific
knowledge is based. In science, theories and observations are interrelated and cannot exist
without each other. Theories provide meaning and significance to what we observe, and
observations help validate or refine existing theory or construct new theory. Any other means
of knowledge acquisition, such as faith or authority, cannot be considered science.
Scientific Research
Given that theories and observations are the two pillars of science, scientific research
operates at two levels: a theoretical level and an empirical level. The theoretical level is
concerned with developing abstract concepts about a natural or social phenomenon and
relationships between those concepts (i.e., build “theories”), while the empirical level is
concerned with testing the theoretical concepts and relationships to see how well they reflect
our observations of reality, with the goal of ultimately building better theories. Over time, a
theory becomes more and more refined (i.e., fits the observed reality better), and the science
gains maturity. Scientific research involves continually moving back and forth between theory
and observations. Both theory and observations are essential components of scientific
research. For instance, relying solely on observations for making inferences and ignoring
theory is not considered valid scientific research.
Depending on a researcher’s training and interest, scientific inquiry may take one of two
possible forms: inductive or deductive. In inductive research, the goal of a researcher is to
infer theoretical concepts and patterns from observed data. In deductive research, the goal of
the researcher is to test concepts and patterns known from theory using new empirical data.
Hence, inductive research is also called theory-building research, and deductive research is
theory-testing research. Note here that the goal of theory-testing is not just to test a theory, but
possibly to refine, improve, and extend it. Figure 1.1 depicts the complementary nature of
inductive and deductive research. Note that inductive and deductive research are two halves of
the research cycle that constantly iterates between theory and observations. You cannot do
inductive or deductive research if you are not familiar with both the theory and data
components of research. Naturally, a complete researcher is one who can traverse the entire
research cycle and can handle both inductive and deductive research.
It is important to understand that theory-building (inductive research) and theory-
testing (deductive research) are both critical for the advancement of science. Elegant theories
are not valuable if they do not match with reality. Likewise, mountains of data are also useless
until they can contribute to the construction of meaningful theories. Rather than viewing these
two processes in a circular relationship, as shown in Figure 1.1, perhaps they can be better
viewed as a helix, with each iteration between theory and data contributing to better
explanations of the phenomenon of interest and better theories. Though both inductive and
deductive research are important for the advancement of science, it appears that inductive
(theory-building) research is more valuable when there are few prior theories or explanations,
while deductive (theory-testing) research is more productive when there are many competing
theories of the same phenomenon and researchers are interested in knowing which theory
works best and under what circumstances.
Figure 1.1. The Cycle of Research
Theory building and theory testing are particularly difficult in the social sciences, given
the imprecise nature of the theoretical concepts, inadequate tools to measure them, and the
presence of many unaccounted factors that can also influence the phenomenon of interest. It is
also very difficult to refute theories that do not work. For instance, Karl Marx’s theory of
communism as an effective means of economic production endured for decades before it was
finally discredited as being inferior to capitalism in promoting economic growth and social
welfare. Erstwhile communist economies like the Soviet Union and China eventually moved
toward more capitalistic economies characterized by profit-maximizing private enterprises.
However, the recent collapse of the mortgage and financial industries in the United States
demonstrates that capitalism also has its flaws and is not as effective in fostering economic
growth and social welfare as previously presumed. Unlike theories in the natural sciences,
social science theories are rarely perfect, which provides numerous opportunities for
researchers to improve those theories or build their own alternative theories.
Conducting scientific research, therefore, requires two sets of skills, theoretical and methodological, needed to operate at the theoretical and empirical levels, respectively. Methodological skills ("know-how") are relatively standard, invariant across disciplines, and easily acquired through doctoral programs. However, theoretical skills ("know-what") are considerably harder to master, require years of observation and reflection, and are tacit skills that cannot be "taught" but rather must be learned through experience. All of the greatest scientists in the history of mankind, such as Galileo, Newton, Einstein, Niels Bohr, Adam Smith, Charles
Darwin, and Herbert Simon, were master theoreticians, and they are remembered for the
theories they postulated that transformed the course of science. Methodological skills are
needed to be an ordinary researcher, but theoretical skills are needed to be an extraordinary
researcher!
Scientific Method
In the preceding sections, we described science as knowledge acquired through a
scientific method. So what exactly is the “scientific method”? Scientific method refers to a
standardized set of techniques for building scientific knowledge, such as how to make valid
observations, how to interpret results, and how to generalize those results. The scientific
method allows researchers to independently and impartially test preexisting theories and prior
findings, and subject them to open debate, modifications, or enhancements. The scientific
method must satisfy four key characteristics:
Logical: Scientific inferences must be based on logical principles of reasoning.
Confirmable: Inferences derived must match with observed evidence.
Repeatable: Other scientists should be able to independently replicate or repeat a
scientific study and obtain similar, if not identical, results.
Scrutinizable: The procedures used and the inferences derived must withstand critical
scrutiny (peer review) by other scientists.
Any branch of inquiry that does not allow the scientific method to test its basic laws or
theories cannot be called “science.” For instance, theology (the study of religion) is not science
because theological ideas (such as the presence of God) cannot be tested by independent
observers using a logical, confirmable, repeatable, and scrutinizable method. Similarly, arts, music,
literature, humanities, and law are also not considered science, even though they are creative
and worthwhile endeavors in their own right.
The scientific method, as applied to social sciences, includes a variety of research
approaches, tools, and techniques, for collecting and analyzing qualitative or quantitative data.
These methods include laboratory experiments, field surveys, case research, ethnographic
research, action research, and so forth. Much of this book is devoted to learning about these
different methods. However, recognize that the scientific method operates primarily at the
empirical level of research, i.e., how to make observations and analyze these observations. Very
little of this method is directly pertinent to the theoretical level, which is really the more
challenging part of scientific research.
Types of Scientific Research
Depending on the purpose of research, scientific research projects can be grouped into
three types: exploratory, descriptive, and explanatory. Exploratory research is often
conducted in new areas of inquiry, where the goals of the research are: (1) to scope out the
magnitude or extent of a particular phenomenon, problem, or behavior, (2) to generate some
initial ideas (or “hunches”) about that phenomenon, or (3) to test the feasibility of undertaking
a more extensive study regarding that phenomenon. For instance, if the citizens of a country
are generally dissatisfied with governmental policies during an economic recession,
exploratory research may be directed at measuring the extent of citizens’ dissatisfaction,
understanding how such dissatisfaction is manifested, such as the frequency of public protests,
and the presumed causes of such dissatisfaction, such as ineffective government policies in
dealing with inflation, interest rates, unemployment, or higher taxes. Such research may
include examination of publicly reported figures, such as estimates of economic indicators like gross domestic product (GDP), unemployment, and the consumer price index, as archived by third-party sources; information obtained through interviews of experts, eminent economists, or key government officials; and/or insights derived from studying historical examples of dealing with similar
problems. This research may not lead to a very accurate understanding of the target problem,
but may be worthwhile in scoping out the nature and extent of the problem and serve as a
useful precursor to more in-depth research.
Descriptive research is directed at making careful observations and detailed
documentation of a phenomenon of interest. These observations must be based on the
scientific method (i.e., must be replicable, precise, etc.), and therefore, are more reliable than
casual observations by untrained people. Examples of descriptive research are tabulation of
demographic statistics by the United States Census Bureau or employment statistics by the Bureau of Labor Statistics, which use the same or similar instruments for estimating employment by sector
or population growth by ethnicity over multiple employment surveys or censuses. If any
changes are made to the measuring instruments, estimates are provided with and without the
changed instrumentation to allow the readers to make a fair before-and-after comparison
regarding population or employment trends. Other descriptive research may include
chronicling ethnographic reports of gang activities among adolescent youth in urban
populations, the persistence or evolution of religious, cultural, or ethnic practices in select
communities, and the role of technologies such as Twitter and instant messaging in the spread
of democracy movements in Middle Eastern countries.
Explanatory research seeks explanations of observed phenomena, problems, or
behaviors. While descriptive research examines the what, where, and when of a phenomenon,
explanatory research seeks answers to why and how types of questions. It attempts to “connect
the dots” in research, by identifying causal factors and outcomes of the target phenomenon.
Examples include understanding the reasons behind adolescent crime or gang violence, with
the goal of prescribing strategies to overcome such societal ailments. Most academic or
doctoral research belongs to the explanatory category, though some amount of exploratory
and/or descriptive research may also be needed during initial phases of academic research.
Seeking explanations for observed events requires strong theoretical and interpretation skills,
along with intuition, insights, and personal experience. Those who can do it well are also the
most prized scientists in their disciplines.
History of Scientific Thought
Before closing this chapter, it may be interesting to go back in history and see how
science has evolved over time and identify the key scientific minds in this evolution. Although
instances of scientific progress have been documented over many centuries, the terms
“science,” “scientists,” and the “scientific method” were coined only in the 19th century. Prior to
this time, science was viewed as a part of philosophy, and coexisted with other branches of
philosophy such as logic, metaphysics, ethics, and aesthetics, although the boundaries between
some of these branches were blurred.
In the earliest days of human inquiry, knowledge was usually recognized in terms of
theological precepts based on faith. This was challenged by Greek philosophers such as Plato,
Aristotle, and Socrates during the 5th and 4th centuries BC, who suggested that the fundamental nature of
being and the world can be understood more accurately through a process of systematic logical
reasoning called rationalism. In particular, Aristotle’s classic work Metaphysics (literally
meaning “beyond physical [existence]”) separated theology (the study of Gods) from ontology
(the study of being and existence) and universal science (the study of first principles, upon
which logic is based). Rationalism (not to be confused with “rationality”) views reason as the
source of knowledge or justification, and suggests that the criterion of truth is not sensory but
rather intellectual and deductive, often derived from a set of first principles or axioms (such as
Aristotle’s “law of non-contradiction”).
The next major shift in scientific thought occurred during the 16th century, when British
philosopher Francis Bacon (1561-1626) suggested that knowledge can only be derived from
observations in the real world. Based on this premise, Bacon emphasized knowledge
acquisition as an empirical activity (rather than as a reasoning activity), and developed
empiricism as an influential branch of philosophy. Bacon’s works led to the popularization of
inductive methods of scientific inquiry, the development of the “scientific method” (originally
called the “Baconian method”), consisting of systematic observation, measurement, and
experimentation, and may have even sowed the seeds of atheism or the rejection of theological
precepts as “unobservable.”
Empiricism continued to clash with rationalism throughout the Middle Ages, as
philosophers sought the most effective way of gaining valid knowledge. French philosopher
Rene Descartes sided with the rationalists, while British philosophers John Locke and David
Hume sided with the empiricists. Other scientists, such as Galileo Galilei and Sir Isaac Newton,
attempted to fuse the two ideas into natural philosophy (the philosophy of nature), to focus
specifically on understanding nature and the physical universe, which is considered to be the
precursor of the natural sciences. Galileo (1564-1642) was perhaps the first to state that the
laws of nature are mathematical, and contributed to the field of astronomy through an
innovative combination of experimentation and mathematics.
In the 18th century, German philosopher Immanuel Kant sought to resolve the dispute between empiricism and rationalism in his book Critique of Pure Reason, by arguing that experience is purely subjective and that processing it using pure reason, without first delving into the subjective nature of experience, will lead to theoretical illusions. Kant’s ideas led to the
development of German idealism, which inspired later development of interpretive techniques
such as phenomenology, hermeneutics, and critical social theory.
At about the same time, French philosopher Auguste Comte (1798-1857), founder of
the discipline of sociology, attempted to blend rationalism and empiricism in a new doctrine
called positivism. He suggested that theory and observations have circular dependence on
each other. While theories may be created via reasoning, they are only authentic if they can be
verified through observations. The emphasis on verification started the separation of modern
science from philosophy and metaphysics and further development of the “scientific method” as
the primary means of validating scientific claims. Comte’s ideas were expanded by Emile
Durkheim in his development of sociological positivism (positivism as a foundation for social
research) and Ludwig Wittgenstein in logical positivism.
In the early 20th century, strong accounts of positivism were rejected by interpretive
sociologists (antipositivists) belonging to the German idealism school of thought. Positivism
was typically equated with quantitative research methods such as experiments and surveys, without any explicit philosophical commitments, while antipositivism employed qualitative
methods such as unstructured interviews and participant observation. Even practitioners of
positivism, such as American sociologist Paul Lazarsfeld, who pioneered large-scale survey
research and statistical techniques for analyzing survey data, acknowledged potential problems
of observer bias and structural limitations in positivist inquiry. In response, antipositivists
emphasized that social actions must be studied through interpretive means, based upon an understanding of the meaning and purpose that individuals attach to their personal actions, which
inspired Georg Simmel’s work on symbolic interactionism, Max Weber’s work on ideal types,
and Edmund Husserl’s work on phenomenology.
In the mid-to-late 20th century, both positivist and antipositivist schools of thought were
subjected to criticisms and modifications. British philosopher Sir Karl Popper suggested that
human knowledge is based not on unchallengeable, rock solid foundations, but rather on a set
of tentative conjectures that can never be proven conclusively, but only disproven. Empirical
evidence is the basis for disproving these conjectures or “theories.” This metatheoretical
stance, called postpositivism (or postempiricism), amends positivism by suggesting that it is
impossible to verify the truth although it is possible to reject false beliefs, though it retains the
positivist notion of an objective truth and its emphasis on the scientific method.
Likewise, antipositivists have also been criticized for trying only to understand society
but not critiquing and changing society for the better. The roots of this thought lie in Das Kapital, written by German philosophers Karl Marx and Friedrich Engels, which critiqued capitalistic societies as being socially inequitable and inefficient, and recommended resolving this inequity through class conflict and proletarian revolutions. Marxism inspired social revolutions in countries such as Germany, Italy, Russia, and China, but generally failed to accomplish the social equality to which it aspired. Critical research (also called critical theory), propounded by
Max Horkheimer and Jurgen Habermas in the 20th century, retains similar ideas of critiquing
and resolving social inequality, and adds that people can and should consciously act to change
their social and economic circumstances, although their ability to do so is constrained by
various forms of social, cultural and political domination. Critical research attempts to uncover
and critique the restrictive and alienating conditions of the status quo by analyzing the
oppositions, conflicts and contradictions in contemporary society, and seeks to eliminate the
causes of alienation and domination (i.e., emancipate the oppressed class). More on these
different research philosophies and approaches will be covered in future chapters of this book.
Chapter 2
Thinking Like a Researcher
Conducting good research requires first retraining your brain to think like a researcher.
This requires visualizing the abstract from actual observations, mentally “connecting the dots”
to identify hidden concepts and patterns, and synthesizing those patterns into generalizable
laws and theories that apply to other contexts beyond the domain of the initial observations.
Research involves constantly moving back and forth from an empirical plane where
observations are conducted to a theoretical plane where these observations are abstracted into
generalizable laws and theories. This is a skill that takes many years to develop, is not
something that is taught in graduate or doctoral programs or acquired in industry training, and
is by far the biggest deficit amongst Ph.D. students. Some of the mental abstractions needed to
think like a researcher include unit of analysis, constructs, hypotheses, operationalization,
theories, models, induction, deduction, and so forth, which we will examine in this chapter.
Unit of Analysis
One of the first decisions in any social science research is the unit of analysis of a
scientific study. The unit of analysis refers to the person, collective, or object that is the target
of the investigation. Typical units of analysis include individuals, groups, organizations,
countries, technologies, objects, and such. For instance, if we are interested in studying people’s
shopping behavior, their learning outcomes, or their attitudes to new technologies, then the
unit of analysis is the individual. If we want to study characteristics of street gangs or teamwork
in organizations, then the unit of analysis is the group. If the goal of research is to understand
how firms can improve profitability or make good executive decisions, then the unit of analysis
is the firm. In this case, even though decisions are made by individuals in these firms, these
individuals are presumed to represent their firm’s decision rather than their personal decisions.
If research is directed at understanding differences in national cultures, then the unit of analysis
becomes a country. Even inanimate objects can serve as units of analysis. For instance, if a
researcher is interested in understanding how to make web pages more attractive to their users, then the unit of analysis is a web page (and not users). If we wish to study how knowledge transfer occurs between two firms, then our unit of analysis becomes the dyad (the pair of firms that are sending and receiving knowledge).
Understanding the units of analysis can sometimes be fairly complex. For instance, if we
wish to study why certain neighborhoods have high crime rates, then our unit of analysis
becomes the neighborhood, and not crimes or criminals committing such crimes. This is
because the object of our inquiry is the neighborhood and not criminals. However, if we wish to
compare different types of crimes in different neighborhoods, such as homicide, robbery,
assault, and so forth, our unit of analysis becomes the crime. If we wish to study why criminals
engage in illegal activities, then the unit of analysis becomes the individual (i.e., the criminal).
Likewise, if we want to study why some innovations are more successful than others, then our unit
of analysis is an innovation. However, if we wish to study how some organizations innovate
more consistently than others, then the unit of analysis is the organization. Hence, two related
research questions within the same research study may have two entirely different units of
analysis.
Understanding the unit of analysis is important because it shapes what type of data you
should collect for your study and who you collect it from. If your unit of analysis is a web page,
you should be collecting data about web pages from actual web pages, and not surveying people
about how they use web pages. If your unit of analysis is the organization, then you should be
measuring organizational-level variables such as organizational size, revenues, hierarchy, or
absorptive capacity. This data may come from a variety of sources such as financial records or
surveys of Chief Executive Officers (CEOs), who are presumed to be representing their organization (rather than themselves). Some variables such as CEO pay may seem like individual-level variables, but in fact, they can also be viewed as organizational-level variables because each organization has only one CEO, and hence one CEO pay, at any given time. Sometimes, it is possible to collect data from a
lower level of analysis and aggregate that data to a higher level of analysis. For instance, in
order to study teamwork in organizations, you can survey individual team members in different
organizational teams, and average their individual scores to create a composite team-level
score for team-level variables like cohesion and conflict. We will examine the notion of
“variables” in greater depth in the next section.
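As a minimal sketch of the aggregation idea just described, assuming a hypothetical survey table with one row per team member (the column names and numbers are illustrative only), individual responses can be averaged into team-level scores as follows:

    import pandas as pd

    # Hypothetical individual-level survey responses (1-7 Likert scale);
    # 'team' identifies the organizational team each respondent belongs to.
    responses = pd.DataFrame({
        "team":     ["A", "A", "A", "B", "B", "C", "C", "C"],
        "cohesion": [6, 5, 7, 3, 4, 5, 6, 5],
        "conflict": [2, 3, 1, 5, 6, 3, 2, 3],
    })

    # Aggregate to the team level (the unit of analysis) by averaging
    # individual scores within each team.
    team_scores = responses.groupby("team")[["cohesion", "conflict"]].mean()
    print(team_scores)

Here the data are collected from individuals, but every statistic of interest is computed at the level of the team, which remains the unit of analysis.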
Concepts, Constructs, and Variables
We discussed in Chapter 1 that although research can be exploratory, descriptive, or
explanatory, most scientific research tends to be of the explanatory type in that it searches for potential explanations of observed natural or social phenomena. Explanations require the development of concepts, or generalizable properties or characteristics associated with objects,
events, or people. While objects such as a person, a firm, or a car are not concepts, their specific
characteristics or behavior such as a person’s attitude toward immigrants, a firm’s capacity for
innovation, and a car’s weight can be viewed as concepts.
Knowingly or unknowingly, we use different kinds of concepts in our everyday
conversations. Some of these concepts have been developed over time through our shared
language. Sometimes, we borrow concepts from other disciplines or languages to explain a
phenomenon of interest. For instance, the idea of gravitation borrowed from physics can be
used in business to describe why people tend to “gravitate” to their preferred shopping
destinations. Likewise, the concept of distance can be used to explain the degree of social
separation between two otherwise collocated individuals. Sometimes, we create our own
concepts to describe a unique characteristic not described in prior research. For instance,
technostress is a new concept referring to the mental stress one may face when asked to learn a
new technology.
Concepts may also have progressive levels of abstraction. Some concepts such as a
person’s weight are precise and objective, while other concepts such as a person’s personality
may be more abstract and difficult to visualize. A construct is an abstract concept that is
specifically chosen (or “created”) to explain a given phenomenon. A construct may be a simple
concept, such as a person’s weight, or a combination of a set of related concepts such as a
person’s communication skill, which may consist of several underlying concepts such as the
person’s vocabulary, syntax, and spelling. The former instance (weight) is a unidimensional
construct, while the latter (communication skill) is a multi-dimensional construct (i.e., it
consists of multiple underlying concepts). The distinction between constructs and concepts is clearer in multi-dimensional constructs, where the higher-order abstraction is called a construct and the lower-order abstractions are called concepts. However, this distinction tends to blur in
the case of unidimensional constructs.
Constructs used for scientific research must have precise and clear definitions that others can use to understand exactly what they mean and what they do not mean. For instance, a
seemingly simple construct such as income may refer to monthly or annual income, before-tax
or after-tax income, and personal or family income, and is therefore neither precise nor clear.
There are two types of definitions: dictionary definitions and operational definitions. In the
more familiar dictionary definition, a construct is often defined in terms of a synonym. For
instance, attitude may be defined as a disposition, a feeling, or an affect, and affect in turn is
defined as an attitude. Such definitions of a circular nature are not particularly useful in
scientific research for elaborating the meaning and content of that construct. Scientific research
requires operational definitions that define constructs in terms of how they will be
empirically measured. For instance, the operational definition of a construct such as
temperature must specify whether we plan to measure temperature in Celsius, Fahrenheit, or
Kelvin scale. A construct such as income should be defined in terms of whether we are
interested in monthly or annual income, before-tax or after-tax income, and personal or family
income. One can imagine that constructs such as learning, personality, and intelligence can be
quite hard to define operationally.
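As a small illustration of how an operational definition removes this ambiguity, the hypothetical function below commits to one specific measurement rule for income, namely annual, after-tax, family income in US dollars (the rule and the 25% tax rate are assumptions made purely for this sketch):

    def operationalize_income(monthly_pretax_personal, monthly_pretax_spouse, tax_rate=0.25):
        """Operational definition (assumed for illustration): income is measured
        as annual, after-tax, family income in US dollars."""
        annual_pretax_family = 12 * (monthly_pretax_personal + monthly_pretax_spouse)
        return annual_pretax_family * (1 - tax_rate)

    # Example: a respondent earning $4,000/month whose spouse earns $3,000/month.
    print(operationalize_income(4000, 3000))   # 63000.0

Any other choice (monthly, before-tax, personal) would be an equally legitimate operational definition; the point is that the choice is stated explicitly and applied consistently.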
Figure 2.1. The theoretical and empirical planes of research
A term frequently associated with, and sometimes used interchangeably with, a
construct is a variable. Etymologically speaking, a variable is a quantity that can vary (e.g., from
low to high, negative to positive, etc.), in contrast to constants that do not vary (i.e., remain
constant). However, in scientific research, a variable is a measurable representation of an
abstract construct. As abstract entities, constructs are not directly measurable, and hence, we
look for proxy measures called variables. For instance, a person’s intelligence is often measured
as his or her IQ (intelligence quotient) score, which is an index generated from an analytical and
pattern-matching test administered to people. In this case, intelligence is a construct, and IQ
score is a variable that measures the intelligence construct. Whether IQ scores truly measure one’s intelligence is anyone’s guess (though many believe that they do), and depending on how well they measure intelligence, the IQ score may be a good or a poor measure of the
intelligence construct. As shown in Figure 2.1, scientific research proceeds along two planes: a
theoretical plane and an empirical plane. Constructs are conceptualized at the theoretical
(abstract) plane, while variables are operationalized and measured at the empirical
(observational) plane. Thinking like a researcher implies the ability to move back and forth
between these two planes.
Depending on their intended use, variables may be classified as independent,
dependent, moderating, mediating, or control variables. Variables that explain other variables
are called independent variables, those that are explained by other variables are dependent
variables, those that are explained by independent variables while also explaining dependent
variables are mediating variables (or intermediate variables), and those that influence the
relationship between independent and dependent variables are called moderating variables.
As an example, if we state that higher intelligence causes improved learning among students,
then intelligence is an independent variable and learning is a dependent variable. There may be
other extraneous variables that are not pertinent to explaining a given dependent variable, but
may have some impact on the dependent variable. These variables must be controlled for in a
scientific study, and are therefore called control variables.
Figure 2.2. A nomological network of constructs
To understand the differences between these different variable types, consider the
example shown in Figure 2.2. If we believe that intelligence influences (or explains) students’
academic achievement, then a measure of intelligence such as an IQ score is an independent
variable, while a measure of academic success such as grade point average is a dependent
variable. If we believe that the effect of intelligence on academic achievement also depends on
the effort invested by the student in the learning process (i.e., between two equally intelligent
students, the student who puts in more effort achieves higher academic achievement than one
who puts in less effort), then effort becomes a moderating variable. Incidentally, one may also
view effort as an independent variable and intelligence as a moderating variable. If academic
achievement is viewed as an intermediate step to higher earning potential, then earning
potential becomes the dependent variable for the independent variable academic achievement,
and academic achievement becomes the mediating variable in the relationship between
intelligence and earning potential. Hence, variables are defined as independent, dependent, moderating, or mediating based on their nature of association with each other. The
overall network of relationships between a set of related constructs is called a nomological
network (see Figure 2.2). Thinking like a researcher requires not only being able to abstract
constructs from observations, but also being able to mentally visualize a nomological network
linking these abstract constructs.
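For readers who find it easier to see such relationships in code, here is a minimal sketch, using simulated data and hypothetical coefficients, of how the moderating role of effort in Figure 2.2 shows up as an interaction term in a regression model (this is only an illustration of the idea, not an analysis from the book):

    import numpy as np

    rng = np.random.default_rng(0)
    n = 200

    # Simulated variables: IQ (independent), effort (moderator), GPA (dependent).
    iq = rng.normal(100, 15, n)
    effort = rng.normal(0, 1, n)
    # GPA depends on IQ, effort, and their interaction, plus random noise.
    gpa = 1.0 + 0.02 * iq + 0.3 * effort + 0.01 * iq * effort + rng.normal(0, 0.3, n)

    # Fit GPA = b0 + b1*IQ + b2*effort + b3*(IQ x effort) by ordinary least squares;
    # a nonzero b3 indicates that effort moderates the IQ-GPA relationship.
    X = np.column_stack([np.ones(n), iq, effort, iq * effort])
    coefs, *_ = np.linalg.lstsq(X, gpa, rcond=None)
    print(dict(zip(["intercept", "iq", "effort", "iq_x_effort"], coefs.round(3))))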
Propositions and Hypotheses
Figure 2.2 shows how theoretical constructs such as intelligence, effort, academic
achievement, and earning potential are related to each other in a nomological network. Each of
these relationships is called a proposition. In seeking explanations to a given phenomenon or
behavior, it is not adequate just to identify key concepts and constructs underlying the target
phenomenon or behavior. We must also identify and state patterns of relationships between
these constructs. Such patterns of relationships are called propositions. A proposition is a
tentative and conjectural relationship between constructs that is stated in a declarative form.
An example of a proposition is: “An increase in student intelligence causes an increase in their
academic achievement.” This declarative statement does not have to be true, but must be
empirically testable using data, so that we can judge whether it is true or false. Propositions are
generally derived based on logic (deduction) or empirical observations (induction).
Because propositions are associations between abstract constructs, they cannot be
tested directly. Instead, they are tested indirectly by examining the relationship between
corresponding measures (variables) of those constructs. The empirical formulation of
propositions, stated as relationships between variables, is called hypotheses (see Figure 2.1).
Since IQ scores and grade point average are operational measures of intelligence and academic
achievement respectively, the above proposition can be specified in the form of the hypothesis: “An increase in students’ IQ scores causes an increase in their grade point average.” Propositions are
specified in the theoretical plane, while hypotheses are specified in the empirical plane. Hence,
hypotheses are empirically testable using observed data, and may be rejected if not supported
by empirical observations. Of course, the goal of hypothesis testing is to infer whether the
corresponding proposition is valid.
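As an illustration of what “empirically testable” means in practice, the sketch below generates simulated IQ and grade point average data and checks whether the hypothesized positive relationship holds in that data; the data, the correlation test, and the 0.05 threshold are all assumptions of the sketch rather than prescriptions from the book:

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(1)
    n = 150

    # Simulated observations of the two variables in the hypothesis.
    iq = rng.normal(100, 15, n)
    gpa = 2.0 + 0.015 * iq + rng.normal(0, 0.4, n)   # built-in positive relationship

    # Test the hypothesis "students' IQ scores are positively related to their GPA".
    r, p_value = stats.pearsonr(iq, gpa)
    print(f"correlation r = {r:.2f}, p = {p_value:.4f}")
    if r > 0 and p_value < 0.05:
        print("Data are consistent with the hypothesized positive relationship.")
    else:
        print("Hypothesis not supported by these data.")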
Hypotheses can be strong or weak. “Students’ IQ scores are related to their academic
achievement” is an example of a weak hypothesis, since it indicates neither the directionality of
the hypothesis (i.e., whether the relationship is positive or negative), nor its causality (i.e.,
whether intelligence causes academic achievement or academic achievement causes
intelligence). A stronger hypothesis is “students’ IQ scores are positively related to their
academic achievement”, which indicates the directionality but not the causality. A still better
hypothesis is “students’ IQ scores have positive effects on their academic achievement”, which
specifies both the directionality and the causality (i.e., intelligence causes academic
achievement, and not the reverse). The signs in Figure 2.2 indicate the directionality of the
respective hypotheses.
Also note that scientific hypotheses should clearly specify independent and dependent
variables. In the hypothesis, “students’ IQ scores have positive effects on their academic
achievement,” it is clear that intelligence is the independent variable (the “cause”) and academic
achievement is the dependent variable (the “effect”). Further, it is also clear that this
hypothesis can be evaluated as either true (if higher intelligence leads to higher academic
achievement) or false (if higher intelligence has no effect on or leads to lower academic
achievement). Later on in this book, we will examine how to empirically test such cause-effect
relationships. Statements such as “students are generally intelligent” or “all students can
achieve academic success” are not scientific hypotheses because they do not specify
independent and dependent variables, nor do they specify a directional relationship that can be
evaluated as true or false.
Theories and Models
A theory is a set of systematically interrelated constructs and propositions intended to
explain and predict a phenomenon or behavior of interest, within certain boundary conditions
and assumptions. Essentially, a theory is a systemic collection of related theoretical
propositions. While propositions generally connect two or three constructs, theories represent
a system of multiple constructs and propositions. Hence, theories can be substantially more
complex and abstract and of a larger scope than propositions or hypotheses.
I must note here that people not familiar with scientific research often view a theory as
a speculation or the opposite of fact. For instance, people often say that teachers need to be less
theoretical and more practical or factual in their classroom teaching. However, practice or fact
are not opposites of theory, but in a scientific sense, are essential components needed to test
the validity of a theory. A good scientific theory should be well supported using observed facts
and should also have practical value, while a poorly defined theory tends to be lacking in these
dimensions. Famous organizational researcher Kurt Lewin once said, “Theory without practice is
sterile; practice without theory is blind.” Hence, both theory and facts (or practice) are
essential for scientific research.
Theories provide explanations of social or natural phenomena. As emphasized in
Chapter 1, these explanations may be good or poor. Hence, there may be good or poor theories.
Chapter 3 describes some criteria that can be used to evaluate how good a theory really is.
Nevertheless, it is important for researchers to understand that theory is not “truth,” there is
nothing sacrosanct about any theory, and theories should not be accepted just because they
were proposed by someone. In the course of scientific progress, poorer theories are eventually
replaced by better theories with higher explanatory power. The essential challenge for
researchers is to build better and more comprehensive theories that can explain a target
phenomenon better than prior theories.
A term often used in conjunction with theory is a model. A model is a representation of
all or part of a system that is constructed to study that system (e.g., how the system works or
what triggers the system). While a theory tries to explain a phenomenon, a model tries to
represent a phenomenon. Models are often used by decision makers to make important
decisions based on a given set of inputs. For instance, marketing managers may use models to
decide how much money to spend on advertising for different product lines based on
parameters such as prior year’s advertising expenses, sales, market growth, and competing
products. Likewise, weather forecasters can use models to predict future weather patterns
based on parameters such as wind speeds, wind direction, temperature, and humidity. While
these models are useful, they may not necessarily explain advertising expenditure or weather
forecasts. Models may be of different kinds, such as mathematical models, network models, and
path models. Models can also be descriptive, predictive, or normative. Descriptive models are
frequently used for representing complex systems and for visualizing variables and relationships in
such systems. An advertising expenditure model may be a descriptive model. Predictive
models (e.g., a regression model) allow forecasts of future events. Weather forecasting models
are predictive models. Normative models are used to guide our activities along commonly
accepted norms or practices. Models may also be static, if they represent the state of a system at
one point in time, or dynamic, if they represent a system’s evolution over time.
The process of theory or model development may involve inductive and deductive
reasoning. Recall from Chapter 1 that deduction is the process of drawing conclusions about a
phenomenon or behavior based on theoretical or logical reasons and an initial set of premises.
As an example, if a certain bank enforces a strict code of ethics for its employees (Premise 1)
and Jamie is an employee at that bank (Premise 2), then Jamie can be trusted to follow ethical
practices (Conclusion). In deduction, the conclusions must be true if the initial premises and
reasons are correct.
In contrast, induction is the process of drawing conclusions based on facts or observed
evidence. For instance, if a firm spent a lot of money on a promotional campaign (Observation
1), but the sales did not increase (Observation 2), then possibly the promotion campaign was
poorly executed (Conclusion). However, there may be rival explanations for poor sales, such as
economic recession or the emergence of a competing product or brand or perhaps a supply
chain problem. Inductive conclusions are therefore only hypotheses, and may be disproven.
Deductive conclusions generally tend to be stronger than inductive conclusions, but a deductive
conclusion based on an incorrect premise is also incorrect.
As shown in Figure 2.3, inductive and deductive reasoning go hand in hand in theory
and model building. Induction occurs when we observe a fact and ask, “Why is this happening?”
In answering this question, we advance one or more tentative explanations (hypotheses). We
then use deduction to narrow down the tentative explanations to the most plausible
explanation based on logic and reasonable premises (based on our understanding of the
phenomenon under study). Researchers must be able to move back and forth between
inductive and deductive reasoning if they are to propose extensions or modifications to a given
model or theory, or build better ones, which is the essence of scientific research.
Figure 2.3. The model-building process
Chapter 3
The Research Process
In Chapter 1, we saw that scientific research is the process of acquiring scientific
knowledge using the scientific method. But how is such research conducted? This chapter
delves into the process of scientific research, and the assumptions and outcomes of the research
process.
Paradigms of Social Research
Our design and conduct of research is shaped by our mental models or frames of
reference that we use to organize our reasoning and observations. These mental models or
frames (belief systems) are called paradigms. The word “paradigm” was popularized by
Thomas Kuhn (1962) in his book The Structure of Scientific Revolutions, where he examined the
history of the natural sciences to identify patterns of activities that shape the progress of
science. Similar ideas are applicable to social sciences as well, where a social reality can be
viewed by different people in different ways, which may constrain their thinking and reasoning
about the observed phenomenon. For instance, conservatives and liberals tend to have very
different perceptions of the role of government in people’s lives, and hence, have different
opinions on how to solve social problems. Conservatives may believe that lowering taxes is the
best way to stimulate a stagnant economy because it increases people’s disposable income and
spending, which in turn expands business output and employment. In contrast, liberals may
believe that governments should invest more directly in job creation programs such as public
works and infrastructure projects, which will increase employment and people’s ability to
consume and drive the economy. Likewise, Western societies place greater emphasis on
individual rights, such as one’s right to privacy, right of free speech, and right to bear arms. In
contrast, Asian societies tend to balance the rights of individuals against the rights of families,
organizations, and the government, and therefore tend to be more communal and less
individualistic in their policies. Such differences in perspective often lead Westerners to
criticize Asian governments for being autocratic, while Asians criticize Western societies for
being greedy, having high crime rates, and creating a “cult of the individual.” Our personal
paradigms are like “colored glasses” that govern how we view the world and how we structure
our thoughts about what we see in the world.
Paradigms are often hard to recognize, because they are implicit, assumed, and taken
for granted. However, recognizing these paradigms is key to making sense of and reconciling
differences in people’s perceptions of the same social phenomenon. For instance, why do
liberals believe that the best way to improve secondary education is to hire more teachers, but
conservatives believe that privatizing education (using such means as school vouchers) is
more effective in achieving the same goal? Because conservatives place more faith in
competitive markets (i.e., in free competition between schools competing for education dollars),
while liberals believe more in labor (i.e., in having more teachers and schools). Likewise, in
social science research, if one were to understand why a certain technology was successfully
implemented in one organization but failed miserably in another, a researcher looking at the
world through a “rational lens” will look for rational explanations of the problem such as
inadequate technology or poor fit between technology and the task context where it is being
utilized, while another researcher looking at the same problem through a “social lens” may seek
out social deficiencies such as inadequate user training or lack of management support, while
those seeing it through a “political lens” will look for instances of organizational politics that
may subvert the technology implementation process. Hence, subconscious paradigms often
constrain the concepts that researchers attempt to measure, their observations, and their
subsequent interpretations of a phenomenon. However, given the complex nature of social
phenomena, it is possible that all of the above paradigms are partially correct, and that a fuller
understanding of the problem may require an understanding and application of multiple
paradigms.
Two popular paradigms today among social science researchers are positivism and
post-positivism. Positivism, based on the works of French philosopher Auguste Comte (1798-
1857), was the dominant scientific paradigm until the mid-20th century. It holds that science or
knowledge creation should be restricted to what can be observed and measured. Positivism
tends to rely exclusively on theories that can be directly tested. Though positivism was
originally an attempt to separate scientific inquiry from religion (where the precepts could not
be objectively observed), positivism led to empiricism or a blind faith in observed data and a
rejection of any attempt to extend or reason beyond observable facts. Since human thoughts
and emotions could not be directly measured, they were not considered to be legitimate topics
for scientific research. Frustrations with the strictly empirical nature of positivist philosophy
led to the development of post-positivism (or postmodernism) during the mid- to late-20th century. Post-positivism argues that one can make reasonable inferences about a phenomenon
by combining empirical observations with logical reasoning. Post-positivists view science as
not certain but probabilistic (i.e., based on many contingencies), and often seek to explore these
contingencies to understand social reality better. The post-positivist camp has further
fragmented into subjectivists, who view the world as a subjective construction of our minds
rather than as an objective reality, and critical realists, who believe that there is an
external reality that is independent of a person’s thinking but we can never know such reality
with any degree of certainty.
Burrell and Morgan (1979), in their seminal book Sociological Paradigms and
Organizational Analysis, suggested that the way social science researchers view and study social
phenomena is shaped by two fundamental sets of philosophical assumptions: ontology and
epistemology. Ontology refers to our assumptions about how we see the world, e.g., does the
world consist mostly of social order or constant change. Epistemology refers to our
assumptions about the best way to study the world, e.g., should we use an objective or
subjective approach to study social reality. Using these two sets of assumptions, we can
categorize social science research as belonging to one of four categories (see Figure 3.1).
If researchers view the world as consisting mostly of social order (ontology) and hence
seek to study patterns of ordered events or behaviors, and believe that the best way to study
such a world is by using an objective approach (epistemology) that is independent of the person
conducting the observation or interpretation, such as by using standardized data collection
tools like surveys, then they are adopting a paradigm of functionalism. However, if they
believe that the best way to study social order is through the subjective interpretation of
participants involved, such as by interviewing different participants and reconciling differences
among their responses using their own subjective perspectives, then they are employing an
interpretivism paradigm. If researchers believe that the world consists of radical change and
seek to understand or enact change using an objectivist approach, then they are employing a
radical structuralism paradigm. If they wish to understand social change using the subjective
perspectives of the participants involved, then they are following a radical humanism
paradigm.
Figure 3.1. Four paradigms of social science research
(Source: Burrell and Morgan, 1979)
To date, the majority of social science research has emulated the natural sciences, and
followed the functionalist paradigm. Functionalists believe that social order or patterns can be
understood in terms of their functional components, and therefore attempt to break down a
problem into small components and study one or more components in detail using
objectivist techniques such as surveys and experimental research. However, with the
emergence of post-positivist thinking, a small but growing number of social science researchers
are attempting to understand social order using subjectivist techniques such as interviews and
ethnographic studies. Radical humanism and radical structuralism continue to represent a
negligible proportion of social science research, because scientists are primarily concerned with
understanding generalizable patterns of behavior, events, or phenomena, rather than
idiosyncratic or changing events. Nevertheless, if you wish to study social change, such as why
democratic movements are increasingly emerging in Middle Eastern countries, or why this
movement was successful in Tunisia, took a longer path to success in Libya, and is still not
successful in Syria, then perhaps radical humanism is the right approach for such a study.
Social and organizational phenomena generally consist of elements of both order and change.
For instance, organizational success depends on formalized business processes, work
procedures, and job responsibilities, while being simultaneously constrained by a constantly
changing mix of competitors, competing products, suppliers, and customer base in the business
environment. Hence, a holistic and more complete understanding of social phenomena such as
why some organizations are more successful than others requires an appreciation and
application of a multi-paradigmatic approach to research.
Overview of the Research Process
So how do our mental paradigms shape social science research? At its core, all scientific
research is an iterative process of observation, rationalization, and validation. In the
observation phase, we observe a natural or social phenomenon, event, or behavior that
interests us. In the rationalization phase, we try to make sense of the observed
phenomenon, event, or behavior by logically connecting the different pieces of the puzzle that
we observe, which in some cases, may lead to the construction of a theory. Finally, in the
validation phase, we test our theories using a scientific method through a process of data
collection and analysis, and in doing so, possibly modify or extend our initial theory. However,
research designs vary based on whether the researcher starts at observation and attempts to
rationalize the observations (inductive research), or whether the researcher starts at an ex ante
rationalization or a theory and attempts to validate the theory (deductive research). Hence, the
observation-rationalization-validation cycle is very similar to the induction-deduction cycle of
research discussed in Chapter 1.
Most traditional research tends to be deductive and functionalistic in nature. Figure 3.2
provides a schematic view of such a research project. This figure depicts a series of activities to
be performed in functionalist research, categorized into three phases: exploration, research
design, and research execution. Note that this generalized design is not a roadmap or flowchart
for all research. It applies only to functionalistic research, and it can and should be modified to
fit the needs of a specific project.
Figure 3.2. Functionalistic research process
The first phase of research is exploration. This phase includes exploring and selecting
research questions for further investigation, examining the published literature in the area of
inquiry to understand the current state of knowledge in that area, and identifying theories that
may help answer the research questions of interest.
The first step in the exploration phase is identifying one or more research questions
dealing with a specific behavior, event, or phenomenon of interest. Research questions are
specific questions about a behavior, event, or phenomenon of interest that you wish to seek
answers for in your research. Examples include what factors motivate consumers to purchase
goods and services online without knowing the vendors of these goods or services, how we can
make high school students more creative, and why some people commit terrorist acts.
Research questions can delve into issues of what, why, how, when, and so forth. More
interesting research questions are those that appeal to a broader population (e.g., “how can
firms innovate” is a more interesting research question than “how can Chinese firms innovate in
the service sector”), address real and complex problems (in contrast to hypothetical or “toy”
problems), and have answers that are not obvious. Narrowly focused research questions
(often with a binary yes/no answer) tend to be less useful, less interesting, and less suited to
capturing the subtle nuances of social phenomena. Uninteresting research questions generally
lead to uninteresting and unpublishable research findings.
The next step is to conduct a literature review of the domain of interest. The purpose
of a literature review is three-fold: (1) to survey the current state of knowledge in the area of
inquiry, (2) to identify key authors, articles, theories, and findings in that area, and (3) to
identify gaps in knowledge in that research area. Literature review is commonly done today
using computerized keyword searches in online databases. Keywords can be combined using
“and” and “or” operations to narrow down or expand the search results. Once a shortlist of
relevant articles is generated from the keyword search, the researcher must then manually
browse through each article, or at least its abstract section, to determine the suitability of that
article for a detailed review. Literature reviews should be reasonably complete, and not
restricted to a few journals, a few years, or a specific methodology. Reviewed articles may be
summarized in the form of tables, and can be further structured using organizing frameworks
such as a concept matrix. A well-conducted literature review should indicate whether the initial
research questions have already been addressed in the literature (which would obviate the
need to study them again), whether there are newer or more interesting research questions
available, and whether the original research questions should be modified or changed in light of
findings of the literature review. The review can also provide some intuitions or potential
answers to the questions of interest and/or help identify theories that have previously been
used to address similar questions.
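To make the keyword-combination step concrete, here is a small Python sketch; the article records, fields, and keywords are invented for illustration, and a real literature search would run against an online database rather than an in-memory list.

```python
# A hypothetical sketch of combining keywords with "and"/"or" logic to shortlist
# articles for a literature review. The article records below are invented.
articles = [
    {"title": "Online trust and e-commerce adoption", "abstract": "consumer trust in online vendors"},
    {"title": "Creativity in high schools", "abstract": "fostering student creativity"},
    {"title": "Trust, risk, and online purchasing", "abstract": "perceived risk and purchase intention"},
]

def matches(article, all_of=(), any_of=()):
    """Keep an article only if it contains every 'and' keyword and at least one 'or' keyword."""
    text = (article["title"] + " " + article["abstract"]).lower()
    return all(k in text for k in all_of) and (not any_of or any(k in text for k in any_of))

# Narrow with AND terms ("trust", "online"); expand with OR terms ("purchase", "adoption").
shortlist = [a for a in articles if matches(a, all_of=("trust", "online"), any_of=("purchase", "adoption"))]
print([a["title"] for a in shortlist])  # the two trust-related articles survive the filter
```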
Since functionalist (deductive) research involves theory-testing, the third step is to
identify one or more theories that can help address the desired research questions. While the
literature review may uncover a wide range of concepts or constructs potentially related to the
phenomenon of interest, a theory will help identify which of these constructs is logically
relevant to the target phenomenon and how. Forgoing theories may result in measuring a wide
range of less relevant, marginally relevant, or irrelevant constructs, while also reducing the
chances of obtaining results that are meaningful rather than due to pure chance. In functionalist
research, theories can be used as the logical basis for postulating hypotheses for empirical
testing. Obviously, not all theories are well-suited for studying all social phenomena. Theories
must be carefully selected based on their fit with the target problem and the extent to which
their assumptions are consistent with that of the target problem. We will examine theories and
the process of theorizing in detail in the next chapter.
The next phase in the research process is research design. This process is concerned
with creating a blueprint of the activities to take in order to satisfactorily answer the research
questions identified in the exploration phase. This includes selecting a research method,
operationalizing constructs of interest, and devising an appropriate sampling strategy.
Operationalization is the process of designing precise measures for abstract
theoretical constructs. This is a major problem in social science research, given that many of
the constructs, such as prejudice, alienation, and liberalism are hard to define, let alone
measure accurately. Operationalization starts with specifying an “operational definition” (or
“conceptualization”) of the constructs of interest. Next, the researcher can search the literature
to see if there are existing prevalidated measures matching their operational definition that can
be used directly or modified to measure their constructs of interest. If such measures are not
available or if existing measures are poor or reflect a different conceptualization than that
intended by the researcher, new instruments may have to be designed for measuring those
constructs. This means specifying exactly how the desired construct will be measured
(e.g., how many items, what items, and so forth). This can easily be a long and laborious
process, with multiple rounds of pretests and modifications before the newly designed
instrument can be accepted as “scientifically valid.” We will discuss operationalization of
constructs in a future chapter on measurement.
Simultaneously with operationalization, the researcher must also decide what research
method they wish to employ for collecting data to address their research questions of interest.
Such methods may include quantitative methods such as experiments or survey research or
qualitative methods such as case research or action research, or possibly a combination of both.
If an experiment is desired, then what is the experimental design? If a survey is desired, do you plan a mail
survey, telephone survey, web survey, or a combination? For complex, uncertain, and multi-
faceted social phenomena, multi-method approaches may be more suitable, which may help
leverage the unique strengths of each research method and generate insights that may not be
obtained using a single method.
Researchers must also carefully choose the target population from which they wish to
collect data, and a sampling strategy to select a sample from that population. For instance,
should they survey individuals or firms or workgroups within firms? What types of individuals
or firms do they wish to target? Sampling strategy is closely related to the unit of analysis in a
research problem. While selecting a sample, reasonable care should be taken to avoid a biased
sample (e.g., sample based on convenience) that may generate biased observations. Sampling is
covered in depth in a later chapter.
At this stage, it is often a good idea to write a research proposal detailing all of the
decisions made in the preceding stages of the research process and the rationale behind each
decision. This multi-part proposal should address what research questions you wish to study
and why, the prior state of knowledge in this area, theories you wish to employ along with
hypotheses to be tested, how to measure constructs, what research method to be employed and
why, and desired sampling strategy. Funding agencies typically require such a proposal in
order to select the best proposals for funding. Even if funding is not sought for a research
project, a proposal may serve as a useful vehicle for seeking feedback from other researchers
and identifying potential problems with the research project (e.g., whether some important
constructs were missing from the study) before starting data collection. This initial feedback is
invaluable because it is often too late to correct critical problems after data is collected in a
research study.
Having decided who to study (subjects), what to measure (concepts), and how to collect
data (research method), the researcher is now ready to proceed to the research execution
phase. This includes pilot testing the measurement instruments, data collection, and data
analysis.
Pilot testing is an often overlooked but extremely important part of the research
process. It helps detect potential problems in your research design and/or instrumentation
(e.g., whether the questions asked are intelligible to the targeted sample), and to ensure that the
measurement instruments used in the study are reliable and valid measures of the constructs of
interest. The pilot sample is usually a small subset of the target population. After a successful
pilot testing, the researcher may then proceed with data collection using the sampled
population. The data collected may be quantitative or qualitative, depending on the research
method employed.
Following data collection, the data is analyzed and interpreted for the purpose of
drawing conclusions regarding the research questions of interest. Depending on the type of
data collected (quantitative or qualitative), data analysis may be quantitative (e.g., employ
statistical techniques such as regression or structural equation modeling) or qualitative (e.g.,
coding or content analysis).
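As a toy illustration of the qualitative branch of this step, the short Python sketch below counts how often two predefined codes occur in invented interview excerpts; the excerpts, codebook, and keyword-matching shortcut are hypothetical and far simpler than real coding or content analysis.

```python
# A hypothetical sketch of a simple coding/content-analysis pass: counting how
# often predefined codes appear in interview excerpts. All text is invented.
from collections import Counter

excerpts = [
    "Management support was missing, and training felt rushed.",
    "The training sessions helped, but management never followed up.",
    "Without proper training we avoided the new system.",
]
codebook = {"training": "user training", "management": "management support"}

counts = Counter()
for text in excerpts:
    lowered = text.lower()
    for keyword, code in codebook.items():
        if keyword in lowered:
            counts[code] += 1

print(dict(counts))  # {'user training': 3, 'management support': 2}
```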
The final phase of research involves preparing the final research report documenting
the entire research process and its findings in the form of a research paper, dissertation, or
monograph. This report should outline in detail all the choices made during the research
process (e.g., theory used, constructs selected, measures used, research methods, sampling, etc.)
and why, as well as the outcomes of each phase of the research process. The research process
must be described in sufficient detail so as to allow other researchers to replicate your study,
test the findings, or assess whether the inferences derived are scientifically acceptable. Of
course, having a ready research proposal will greatly simplify and quicken the process of
writing the finished report. Note that research is of no value unless the research process and
outcomes are documented for future generations; such documentation is essential for the
incremental progress of science.
Common Mistakes in Research
The research process is fraught with problems and pitfalls, and novice researchers often
find, after investing substantial amounts of time and effort into a research project, that their
research questions were not sufficiently answered, or that the findings were not interesting
enough, or that the research was not of “acceptable” scientific quality. Such problems typically
result in research papers being rejected by journals. Some of the more frequent mistakes are
described below.
Insufficiently motivated research questions. Oftentimes, we choose “pet”
problems that are interesting to us but not to the scientific community at large, i.e., they do not
generate new knowledge or insight about the phenomenon being investigated. Because the
research process involves a significant investment of time and effort on the researcher’s part,
the researcher must be certain (and be able to convince others) that the research questions
they seek to answer in fact deal with real problems (and not hypothetical problems) that affect
a substantial portion of a population and have not been adequately addressed in prior research.
Pursuing research fads. Another common mistake is pursuing “popular” topics with
limited shelf life. A typical example is studying technologies or practices that are popular today.
Because research takes several years to complete and publish, it is possible that popular
interest in these fads may die down by the time the research is completed and submitted for
publication. A better strategy may be to study “timeless” topics that have always persisted
through the years.
Unresearchable problems. Some research problems may not be answered adequately
based on observed evidence alone, or using currently accepted methods and procedures. Such
problems are best avoided. However, some unresearchable, ambiguously defined problems
may be modified or fine tuned into well-defined and useful researchable problems.
Favored research methods. Many researchers have a tendency to recast a research
problem so that it is amenable to their favorite research method (e.g., survey research). This is
an unfortunate trend. Research methods should be chosen to best fit a research problem, and
not the other way around.
Blind data mining. Some researchers have the tendency to collect data first (using
instruments that are already available), and then figure out what to do with it. Note that data
collection is only one step in a long and elaborate process of planning, designing, and executing
research. In fact, a series of other activities are needed in a research process prior to data
collection. If researchers jump into data collection without such elaborate planning, the data
collected will likely be irrelevant, imperfect, or useless, and their data collection efforts may be
entirely wasted. An abundance of data cannot make up for deficits in research planning and
design, and particularly, for the lack of interesting research questions.
Chapter 4
Theories in Scientific Research
As we know from previous chapters, science is knowledge represented as a collection of
“theories” derived using the scientific method. In this chapter, we will examine what a theory is,
why we need theories in research, what the building blocks of a theory are, how to evaluate
theories, and how to apply theories in research, and we will also present illustrative
examples of five theories frequently used in social science research.
Theories
Theories are explanations of a natural or social behavior, event, or phenomenon. More
formally, a scientific theory is a system of constructs (concepts) and propositions (relationships
between those constructs) that collectively presents a logical, systematic, and coherent
explanation of a phenomenon of interest within some assumptions and boundary conditions
(Bacharach 1989).
1. Bacharach, S. B. (1989). “Organizational Theories: Some Criteria for Evaluation,” Academy of Management Review (14:4), 496-515.
Theories should explain why things happen, rather than just describe or predict. Note
that it is possible to predict events or behaviors using a set of predictors, without necessarily
explaining why such events are taking place. For instance, market analysts predict fluctuations
in the stock market based on market announcements, earnings reports of major companies, and
new data from the Federal Reserve and other agencies, based on previously observed
correlations. Prediction requires only correlations. In contrast, explanations require causation,
or an understanding of cause-effect relationships. Establishing causation requires three
conditions: (1) correlations between two constructs, (2) temporal precedence (the cause must
precede the effect in time), and (3) rejection of alternative hypotheses (through testing).
Scientific theories are different from theological, philosophical, or other explanations in that
scientific theories can be empirically tested using scientific methods.
Explanations can be idiographic or nomothetic. Idiographic explanations are those
that explain a single situation or event in idiosyncratic detail. For example, you did poorly on an
exam because: (1) you forgot that you had an exam on that day, (2) you arrived late to the exam
due to a traffic jam, (3) you panicked midway through the exam, (4) you had to work late the
previous evening and could not study for the exam, or even (5) your dog ate your text book.
The explanations may be detailed, accurate, and valid, but they may not apply to other similar
situations, even involving the same person, and are hence not generalizable. In contrast,
nomothetic explanations seek to explain a class of situations or events rather than a specific
situation or event. For example, students who do poorly in exams do so because they did not
spend adequate time preparing for exams or because they suffer from nervousness, attention-
deficit, or some other medical disorder. Because nomothetic explanations are designed to be
generalizable across situations, events, or people, they tend to be less precise, less complete,
and less detailed. However, they explain economically, using only a few explanatory variables.
Because theories are also intended to serve as generalized explanations for patterns of events,
behaviors, or phenomena, theoretical explanations are generally nomothetic in nature.
While understanding theories, it is also important to understand what theory is not.
Theory is not data, facts, typologies, taxonomies, or empirical findings. A collection of facts is
not a theory, just as a pile of stones is not a house. Likewise, a collection of constructs (e.g., a
typology of constructs) is not a theory, because theories must go well beyond constructs to
include propositions, explanations, and boundary conditions. Data, facts, and findings operate
at the empirical or observational level, while theories operate at a conceptual level and are
based on logic rather than observations.
There are many benefits to using theories in research. First, theories provide the
underlying logic of the occurrence of natural or social phenomena by explaining what the
key drivers and key outcomes of the target phenomenon are and why, and what underlying
processes are responsible for driving that phenomenon. Second, they aid in sense-making by
helping us synthesize prior empirical findings within a theoretical framework and reconcile
contradictory findings by discovering contingent factors influencing the relationship between
two constructs in different studies. Third, theories provide guidance for future research by
helping identify constructs and relationships that are worthy of further research. Fourth,
theories can contribute to cumulative knowledge building by bridging gaps between other
theories and by causing existing theories to be reevaluated in a new light.
However, theories can also have their own share of limitations. As simplified
explanations of reality, theories may not always provide adequate explanations of the
phenomenon of interest based on a limited set of constructs and relationships. Theories are
designed to be simple and parsimonious explanations, while reality may be significantly more
complex. Furthermore, theories may impose blinders or limit researchers’ “range of vision,”
causing them to miss out on important concepts that are not defined by the theory.
Building Blocks of a Theory
David Whetten (1989) suggests that there are four building blocks of a theory:
constructs, propositions, logic, and boundary conditions/assumptions. Constructs capture the
“what” of theories (i.e., what concepts are important for explaining a phenomenon),
propositions capture the “how” (i.e., how are these concepts related to each other), logic
represents the “why” (i.e., why are these concepts related), and boundary
conditions/assumptions examine the “who, when, and where” (i.e., under what circumstances
will these concepts and relationships work). Though constructs and propositions were
previously discussed in Chapter 2, we describe them again here for the sake of completeness.
Constructs are abstract concepts specified at a high level of abstraction that are chosen
specifically to explain the phenomenon of interest. Recall from Chapter 2 that constructs may
be unidimensional (i.e., embody a single concept), such as weight or age, or multi-dimensional
(i.e., embody multiple underlying concepts), such as personality or culture. While some
constructs, such as age, education, and firm size, are easy to understand, others, such as
creativity, prejudice, and organizational agility, may be more complex and abstruse, and still
others such as trust, attitude, and learning, may represent temporal tendencies rather than
steady states. Nevertheless, all constructs must have a clear and unambiguous operational
definition that specifies exactly how the construct will be measured and at what level of
analysis (individual, group, organizational, etc.). Measurable representations of abstract
constructs are called variables. For instance, intelligence quotient (IQ score) is a variable that
is purported to measure an abstract construct called intelligence. As noted earlier, scientific
research proceeds along two planes: a theoretical plane and an empirical plane. Constructs are
conceptualized at the theoretical plane, while variables are operationalized and measured at
the empirical (observational) plane. Furthermore, variables may be independent, dependent,
mediating, or moderating, as discussed in Chapter 2. The distinction between constructs
(conceptualized at the theoretical level) and variables (measured at the empirical level) is
shown in Figure 4.1.
Figure 4.1. Distinction between theoretical and empirical concepts
Propositions are associations postulated between constructs based on deductive logic.
Propositions are stated in declarative form and should ideally indicate a cause-effect
relationship (e.g., if X occurs, then Y will follow). Note that propositions may be conjectural but
MUST be testable, and should be rejected if they are not supported by empirical observations.
However, like constructs, propositions are stated at the theoretical level, and they can only be
tested by examining the corresponding relationship between measurable variables of those
constructs. The empirical formulations of propositions, stated as relationships between
variables, are called hypotheses. The distinction between propositions (formulated at the
theoretical level) and hypotheses (tested at the empirical level) is depicted in Figure 4.1.
The third building block of a theory is the logic that provides the basis for justifying the
propositions as postulated. Logic acts like a “glue” that connects the theoretical constructs and
provides meaning and relevance to the relationships between these constructs. Logic also
represents the “explanation” that lies at the core of a theory. Without logic, propositions will be
ad hoc, arbitrary, and meaningless, and cannot be tied into a cohesive “system of propositions”
that is the heart of any theory.
Finally, all theories are constrained by assumptions about values, time, and space, and
boundary conditions that govern where the theory can be applied and where it cannot be
applied. For example, many economic theories assume that human beings are rational (or
boundedly rational) and employ utility maximization based on cost and benefit expectations as
a way of understanding human behavior. In contrast, political science theories assume that people
are more political than rational, and try to position themselves in their professional or personal
environment in a way that maximizes their power and control over others. Given the nature of
their underlying assumptions, economic and political theories are not directly comparable, and
researchers should not use economic theories if their objective is to understand the power
structure or its evolution in an organization. Likewise, theories may have implicit cultural
assumptions (e.g., whether they apply to individualistic or collective cultures), temporal
assumptions (e.g., whether they apply to early stages or later stages of human behavior), and
spatial assumptions (e.g., whether they apply to certain localities but not to others). If a theory
is to be properly used or tested, all of its implicit assumptions that form the boundaries of that
theory must be properly understood. Unfortunately, theorists rarely state their implicit
assumptions clearly, which leads to frequent misapplications of theories to problem situations
in research.
Attributes of a Good Theory
Theories are simplified and often partial explanations of complex social reality. As such,
there can be good explanations or poor explanations, and consequently, there can be good
theories or poor theories. How can we evaluate the “goodness” of a given theory? Different
criteria have been proposed by different researchers, the more important of which are listed
below:
Logical consistency: Are the theoretical constructs, propositions, boundary conditions,
and assumptions logically consistent with each other? If some of these “building blocks”
of a theory are inconsistent with each other (e.g., a theory assumes rationality, but some
constructs represent non-rational concepts), then the theory is a poor theory.
Explanatory power: How much does a given theory explain (or predict) reality? Good
theories obviously explain the target phenomenon better than rival theories, as often
measured by the variance explained (R-square) value in regression equations (a small illustrative calculation appears after this list).
Falsifiability: British philosopher Karl Popper stated in the 1940’s that for theories to
be valid, they must be falsifiable. Falsifiability ensures that the theory is potentially
disprovable, if empirical data does not match with theoretical propositions, which
allows for their empirical testing by researchers. In other words, theories cannot be
theories unless they are empirically testable. Tautological statements, such as “a day
with high temperatures is a hot day” are not empirically testable because a hot day is
defined (and measured) as a day with high temperatures, and hence, such statements
cannot be viewed as a theoretical proposition. Falsifiability requires the presence of rival
explanations, constructs that are adequately measurable, and so forth.
However, note that saying that a theory is falsifiable is not the same as saying that a
theory should be falsified. If a theory is indeed falsified based on empirical evidence,
then it was probably a poor theory to begin with!
Parsimony: Parsimony examines how much of a phenomenon is explained with how
few variables. The concept is attributed to the 14th-century English logician Father William
of Ockham (and hence called “Ockham’s razor” or “Occam’s razor”), which states that
among competing explanations that sufficiently explain the observed evidence, the
simplest theory (i.e., one that uses the smallest number of variables or makes the fewest
assumptions) is the best. Explanation of a complex social phenomenon can always be
increased by adding more and more constructs. However, such an approach defeats the
purpose of having a theory, which is intended to be a “simplified” and generalizable
explanation of reality. Parsimony relates to the degrees of freedom in a given theory.
Parsimonious theories have higher degrees of freedom, which allow them to be more
easily generalized to other contexts, settings, and populations.
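The following Python sketch, referenced in the explanatory power criterion above, shows how the variance explained (R-square) of a simple regression can be computed; the data points are hypothetical and the example is illustrative only.

```python
# A minimal sketch of "variance explained": fitting a simple least-squares line
# and computing R-square by hand. The data below are hypothetical.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])      # explanatory variable
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])      # outcome variable

slope, intercept = np.polyfit(x, y, deg=1)   # ordinary least-squares fit
y_hat = intercept + slope * x

ss_res = np.sum((y - y_hat) ** 2)            # residual (unexplained) variation
ss_tot = np.sum((y - y.mean()) ** 2)         # total variation in the outcome
r_square = 1 - ss_res / ss_tot               # proportion of variance explained

print(f"R-square = {r_square:.3f}")          # closer to 1 indicates higher explanatory power
```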
Approaches to Theorizing
How do researchers build theories? Steinfield and Fulk (1990) recommend four such
approaches. The first approach is to build theories inductively based on observed patterns of
events or behaviors. Such an approach is often called “grounded theory building,” because the
theory is grounded in empirical observations. This technique is heavily dependent on the
observational and interpretive abilities of the researcher, and the resulting theory may be
subjective and non-confirmable. Furthermore, observing certain patterns of events will not
necessarily make a theory, unless the researcher is able to provide consistent explanations for
the observed patterns. We will discuss the grounded theory approach in a later chapter on
qualitative research.
The second approach to theory building is to conduct a bottom-up conceptual analysis
to identify different sets of predictors relevant to the phenomenon of interest using a
predefined framework. One such framework may be a simple input-process-output framework,
where the researcher may look for different categories of inputs, such as individual,
organizational, and/or technological factors potentially related to the phenomenon of interest
(the output), and describe the underlying processes that link these factors to the target
phenomenon. This is also an inductive approach that relies heavily on the inductive abilities of
the researcher, and interpretation may be biased by the researcher’s prior knowledge of the
phenomenon being studied.
The third approach to theorizing is to extend or modify existing theories to explain a
new context, such as by extending theories of individual learning to explain organizational
learning. While making such an extension, certain concepts, propositions, and/or boundary
conditions of the old theory may be retained and others modified to fit the new context. This
deductive approach leverages the rich inventory of social science theories developed by prior
theoreticians, and is an efficient way of building new theories by building on existing ones.
The fourth approach is to apply existing theories in entirely new contexts by drawing
upon the structural similarities between the two contexts. This approach relies on reasoning by
analogy, and is probably the most creative way of theorizing using a deductive approach. For
instance, Markus (1987) used analogic similarities between a nuclear explosion and
uncontrolled growth of networks or network-based businesses to propose a critical mass
theory of network growth. Just as a nuclear explosion requires a critical mass of radioactive
material to sustain itself, Markus suggested that a network requires a critical
mass of users to sustain its growth, and without such critical mass, users may leave the
network, causing an eventual demise of the network.
2. Steinfield, C. W. and Fulk, J. (1990). “The Theory Imperative,” in Organizations and Communications Technology, J. Fulk and C. W. Steinfield (eds.), Newbury Park, CA: Sage Publications.
3. Markus, M. L. (1987). “Toward a ‘Critical Mass’ Theory of Interactive Media: Universal Access, Interdependence, and Diffusion,” Communication Research (14:5), 491-511.
Examples of Social Science Theories
In this section, we present brief overviews of a few illustrative theories from different
social science disciplines. These theories explain different types of social behaviors, using a set
of constructs, propositions, boundary conditions, assumptions, and underlying logic. Note that
the following represents just a simplistic introduction to these theories; readers are advised to
consult the original sources of these theories for more details and insights on each theory.
Agency Theory. Agency theory (also called principal-agent theory), a classic theory in
the organizational economics literature, was originally proposed by Ross (1973) to explain
two-party relationships (such as those between an employer and its employees, between
organizational executives and shareholders, and between buyers and sellers) whose goals are
not congruent with each other. The goal of agency theory is to specify optimal contracts and the
conditions under which such contracts may help minimize the effect of goal incongruence. The
core assumptions of this theory are that human beings are self-interested individuals,
boundedly rational, and risk-averse, and the theory can be applied at the individual or
organizational level.
The two parties in this theory are the principal and the agent; the principal employs the
agent to perform certain tasks on its behalf. While the principal’s goal is quick and effective
completion of the assigned task, the agent’s goal may be working at its own pace, avoiding risks,
and seeking self-interest (such as personal pay) over corporate interests; hence the goal
incongruence. Compounding the problem may be information asymmetry
problems caused by the principal’s inability to adequately observe the agent’s behavior or
accurately evaluate the agent’s skill sets. Such asymmetry may lead to agency problems where
the agent may not put forth the effort needed to get the task done (the moral hazard problem)
or may misrepresent its expertise or skills to get the job but not perform as expected (the
adverse selection problem). Typical contracts that are behavior-based, such as a monthly salary,
cannot overcome these problems. Hence, agency theory recommends using outcome-based
contracts, such as commissions or a fee payable upon task completion, or mixed contracts that
combine behavior-based and outcome-based incentives. An employee stock option plan is
an example of an outcome-based contract, while employee pay is a behavior-based contract.
Agency theory also recommends tools that principals may employ to improve the efficacy of
behavior-based contracts, such as investing in monitoring mechanisms (such as hiring
supervisors) to counter the information asymmetry caused by moral hazard, designing
renewable contracts contingent on the agent’s performance (performance assessment makes the
contract partially outcome-based), or by improving the structure of the assigned task to make it
more programmable and therefore more observable.
Theory of Planned Behavior. Postulated by Ajzen (1991), the theory of planned
behavior (TPB) is a generalized theory of human behavior in the social psychology literature
that can be used to study a wide range of individual behaviors. It presumes that individual
behavior represents conscious reasoned choice, and is shaped by cognitive thinking and social
pressures. The theory postulates that behaviors are based on one’s intention regarding that
behavior, which in turn is a function of the person’s attitude toward the behavior, subjective
norm regarding that behavior, and perception of control over that behavior (see Figure 4.2).
4. Ross, S. A. (1973). “The Economic Theory of Agency: The Principal’s Problem,” American Economic Review (63:2), 134-139.
5. Ajzen, I. (1991). “The Theory of Planned Behavior,” Organizational Behavior and Human Decision Processes (50), 179-211.
Attitude is defined as the individual's overall positive or negative feelings about performing the
behavior in question, which may be assessed as a summation of one's beliefs regarding the
different consequences of that behavior, weighted by the desirability of those consequences.
Subjective norm refers to one’s perception of whether people important to that person expect
the person to perform the intended behavior, and is represented as a weighted combination of the
expected norms of different referent groups such as friends, colleagues, or supervisors at work.
Behavioral control is one's perception of internal or external controls constraining the behavior
in question. Internal controls may include the person’s ability to perform the intended behavior
(self-efficacy), while external control refers to the availability of external resources needed to
perform that behavior (facilitating conditions). TPB also suggests that sometimes people may
intend to perform a given behavior but lack the resources needed to do so, and therefore
posits that behavioral control can have a direct effect on behavior, in addition to
the indirect effect mediated by intention.
TPB is an extension of an earlier theory called the theory of reasoned action, which
included attitude and subjective norm as key drivers of intention, but not behavioral control.
The latter construct was added by Ajzen in TPB to account for circumstances when people may
have incomplete control over their own behaviors (such as not having high-speed Internet
access for web surfing).
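To make the structure of TPB described above more tangible, here is a purely hypothetical numerical sketch in Python; TPB does not prescribe these particular weights, scales, or a simple additive formula, and in empirical studies the relative weights are estimated from data.

```python
# A hypothetical numerical sketch of the TPB structure. All weights and scores
# below are invented; TPB does not prescribe these numbers or this exact formula.
beliefs = [0.8, 0.6, -0.2]            # beliefs about consequences of the behavior
desirability = [0.9, 0.5, 0.7]        # evaluation (desirability) of each consequence
attitude = sum(b * d for b, d in zip(beliefs, desirability))

referent_norms = {"friends": 0.7, "colleagues": 0.4, "supervisor": 0.9}
motivation_to_comply = {"friends": 0.8, "colleagues": 0.3, "supervisor": 0.6}
subjective_norm = sum(referent_norms[r] * motivation_to_comply[r] for r in referent_norms)

perceived_control = 0.5               # combines self-efficacy and facilitating conditions

# Intention modeled here as a weighted sum of the three antecedents (illustrative weights).
w_att, w_norm, w_ctrl = 0.5, 0.3, 0.2
intention = w_att * attitude + w_norm * subjective_norm + w_ctrl * perceived_control
print(f"attitude={attitude:.2f}, subjective_norm={subjective_norm:.2f}, intention={intention:.2f}")
```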
Figure 4.2. Theory of planned behavior
Innovation diffusion theory. Innovation diffusion theory (IDT) is a seminal theory in
the communications literature that explains how innovations are adopted within a population
of potential adopters. The concept was first studied by French sociologist Gabriel Tarde, but the
theory was developed by Everett Rogers in 1962 based on observations of 508 diffusion
studies. The four key elements in this theory are: innovation, communication channels, time,
and social system. Innovations may include new technologies, new practices, or new ideas, and
adopters may be individuals or organizations. At the macro (population) level, IDT views
innovation diffusion as a process of communication where people in a social system learn about
a new innovation and its potential benefits through communication channels (such as mass
media or prior adopters) and are persuaded to adopt it. Diffusion is a temporal process; the
diffusion process starts off slow among a few early adopters, then picks up speed as the
innovation is adopted by the mainstream population, and finally slows down as the adopter
population reaches saturation. The cumulative adoption pattern is therefore an S-shaped curve,
as shown in Figure 4.3, and the adopter distribution follows a normal distribution. Not all
adopters are identical; they can be classified into innovators, early adopters, early
majority, late majority, and laggards based on the time of their adoption. The rate of diffusion
also depends on characteristics of the social system such as the presence of opinion leaders
(experts whose opinions are valued by others) and change agents (people who influence others’
behaviors).
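As an illustration of the S-shaped cumulative adoption pattern, the sketch below uses a logistic function; IDT itself does not prescribe a specific equation, and the market size, growth rate, and inflection point are hypothetical.

```python
# An illustrative (not IDT-prescribed) S-shaped cumulative adoption curve using
# a logistic function. Market size, growth rate, and midpoint are hypothetical.
import math

def cumulative_adopters(t, market_size=1000, growth_rate=0.8, midpoint=10):
    """Slow early growth, rapid mainstream adoption, then saturation."""
    return market_size / (1 + math.exp(-growth_rate * (t - midpoint)))

for t in (0, 5, 10, 15, 20):
    print(f"period {t:2d}: ~{cumulative_adopters(t):6.0f} cumulative adopters")
```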
At the micro (adopter) level, Rogers (1995) suggests that innovation adoption is a
process consisting of five stages: (1) knowledge: when adopters first learn about an innovation
from mass-media or interpersonal channels, (2) persuasion: when they are persuaded by prior
adopters to try the innovation, (3) decision: their decision to accept or reject the innovation, (4)
implementation: their initial utilization of the innovation, and (5) confirmation: their decision
to continue using it to its fullest potential (see Figure 4.4). Five innovation characteristics are
presumed to shape adopters’ innovation adoption decisions: (1) relative advantage: the
expected benefits of an innovation relative to prior innovations, (2) compatibility: the extent to
which the innovation fits with the adopter’s work habits, beliefs, and values, (3) complexity: the
extent to which the innovation is difficult to learn and use, (4) trialability: the extent to which
the innovation can be tested on a trial basis, and (5) observability: the extent to which the
results of using the innovation can be clearly observed. The last two characteristics have since
been dropped from many innovation studies. Complexity is negatively correlated to innovation
adoption, while the other four factors are positively correlated. Innovation adoption also
depends on personal factors such as the adopter’s risk-taking propensity, education level,
cosmopolitanism, and communication influence. Early adopters are venturesome, well
educated, and rely more on mass media for information about the innovation, while later
adopters rely more on interpersonal sources (such as friends and family) as their primary
source of information. IDT has been criticized for having a “pro-innovation bias,” that is, for
presuming that all innovations are beneficial and will eventually be diffused across the entire
population, and because it does not allow for inefficient innovations such as fads or fashions to
die off quickly without being adopted by the entire population or being replaced by better
innovations.
Figure 4.3. S-shaped diffusion curve
6. Rogers, E. (1962). Diffusion of Innovations. New York: The Free Press. Other editions 1983, 1996, 2005.
Figure 4.4. Innovation adoption process
Elaboration Likelihood Model. Developed by Petty and Cacioppo (1986), the
elaboration likelihood model (ELM) is a dual-process theory of attitude formation or change in
the psychology literature. It explains how individuals can be influenced to change their attitude
toward a certain object, event, or behavior and the relative efficacy of such change strategies.
The ELM posits that one’s attitude may be shaped by two “routes” of influence, the central route
and the peripheral route, which differ in the amount of thoughtful information processing or
“elaboration” required of people (see Figure 4.5). The central route requires a person to think
about issue-related arguments in an informational message and carefully scrutinize the merits
and relevance of those arguments, before forming an informed judgment about the target
object. In the peripheral route, subjects rely on external “cues” such as number of prior users,
endorsements from experts, or likeability of the endorser, rather than on the quality of
arguments, in framing their attitude towards the target object. The latter route is less
cognitively demanding, and the routes of attitude change are typically operationalized in the
ELM using the argument quality and peripheral cues constructs respectively.
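The dual-route logic can be made concrete with a toy numerical sketch in Python. This is not part of the ELM itself; it simply illustrates, under the assumption of a simple linear weighting, how elaboration likelihood might shift influence from peripheral cues to argument quality. All scores and weights are hypothetical.

def attitude_change(argument_quality, peripheral_cues, elaboration_likelihood):
    # Toy blend of the two routes: high elaboration weights argument quality,
    # low elaboration weights peripheral cues (purely illustrative).
    return (elaboration_likelihood * argument_quality
            + (1 - elaboration_likelihood) * peripheral_cues)

# Same message (strong arguments, weak cues) under high versus low elaboration:
print(round(attitude_change(0.9, 0.3, elaboration_likelihood=0.8), 2))  # 0.78: central route dominates
print(round(attitude_change(0.9, 0.3, elaboration_likelihood=0.2), 2))  # 0.42: peripheral route dominates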
Figure 4.5. Elaboration likelihood model
Whether people will be influenced by the central or peripheral routes depends upon
their ability and motivation to elaborate the central merits of an argument. This ability and
motivation to elaborate is called elaboration likelihood. People in a state of high elaboration
likelihood (high ability and high motivation) are more likely to thoughtfully process the
information presented and are therefore more influenced by argument quality, while those in
the low elaboration likelihood state are more motivated by peripheral cues. Elaboration
likelihood is a situational characteristic and not a personal trait. For instance, a doctor may
employ the central route for diagnosing and treating a medical ailment (by virtue of his or her
expertise in the subject), but may rely on peripheral cues from an auto mechanic to understand
the problems with his or her car. As such, the theory has widespread implications for how to enact
attitude change toward new products or ideas, and even for social change.
7 Petty, R. E., and Cacioppo, J. T. (1986). Communication and Persuasion: Central and Peripheral Routes to Attitude Change. New York: Springer-Verlag.
General Deterrence Theory. Two utilitarian philosophers of the eighteenth century, Cesare
Beccaria and Jeremy Bentham, formulated General Deterrence Theory (GDT) as both an explanation
of crime and a method for reducing it. GDT examines why certain individuals engage in deviant,
anti-social, or criminal behaviors. This theory holds that people are fundamentally rational (for
both conforming and deviant behaviors), and that they freely choose deviant behaviors based
on a rational cost-benefit calculation. Because people naturally choose utility-maximizing
behaviors, deviant choices that engender personal gain or pleasure can be controlled by
increasing the costs of such behaviors in the form of punishments (countermeasures) as well as
increasing the probability of apprehension. Swiftness, severity, and certainty of punishments
are the key constructs in GDT.
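Because GDT frames deviance as a rational cost-benefit calculation, the deterrence logic lends itself to a small numerical sketch in Python. This is purely illustrative rather than a formal statement of the theory; the benefit, apprehension probability, and punishment cost values are hypothetical.

def expected_net_gain(benefit, p_apprehension, punishment_cost):
    # Expected payoff of a deviant act: gain minus the expected cost of punishment.
    return benefit - p_apprehension * punishment_cost

# Raising certainty (p_apprehension) or severity (punishment_cost) lowers the payoff:
print(expected_net_gain(benefit=2000, p_apprehension=0.25, punishment_cost=5000))  # 750.0
print(expected_net_gain(benefit=2000, p_apprehension=0.5, punishment_cost=5000))   # -500.0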
While classical positivist research in criminology seeks generalized causes of criminal
behaviors, such as poverty, lack of education, and psychological conditions, and recommends
strategies to rehabilitate criminals, such as providing them with job training and medical
treatment, GDT focuses on the criminal decision making process and the situational factors that
influence that process. Hence, a criminal’s personal situation (such as his personal values, his
affluence, and his need for money) and the environmental context (such as how well protected the
target is, how efficient the local police are, and how likely criminals are to be apprehended) play key
roles in this decision making process. The focus of GDT is not how to rehabilitate criminals and
avert future criminal behaviors, but how to make criminal activities less attractive and
therefore prevent crimes. To that end, several measures are considered effective in preventing
crimes: “target hardening,” such as installing deadbolts and building self-defense skills; legal
deterrents, such as eliminating parole for certain crimes, “three strikes” laws (mandatory
incarceration upon a third offense, even if the offenses are minor and not individually worth
imprisonment), and the death penalty; increasing the chances of apprehension through means
such as neighborhood watch programs, special task forces on drug or gang-related crimes, and
increased police patrols; and educational programs, such as highly visible notices that
“trespassers will be prosecuted.”
interesting implications not only for traditional crimes, but also for contemporary white-collar
crimes such as insider trading, software piracy, and illegal sharing of music.
Chapter 5
Research Design
Research design is a comprehensive plan for data collection in an empirical research
project. It is a “blueprint” for empirical research aimed at answering specific research
questions or testing specific hypotheses, and must specify at least three processes: (1) the data
collection process, (2) the instrument development process, and (3) the sampling process. The
instrument development and sampling processes are described in the next two chapters, and the
data collection process (which is often loosely called research design) is introduced in this
chapter and is described in further detail in Chapters 9-12.
Broadly speaking, data collection methods can be grouped into two categories:
positivist and interpretive. Positivist methods, such as laboratory experiments and survey
research, are aimed at theory (or hypotheses) testing, while interpretive methods, such as
action research and ethnography, are aimed at theory building. Positivist methods employ a
deductive approach to research, starting with a theory and testing theoretical postulates using
empirical data. In contrast, interpretive methods employ an inductive approach that starts
with data and tries to derive a theory about the phenomenon of interest from the observed
data. Oftentimes, these methods are incorrectly equated with quantitative and qualitative
research. Quantitative and qualitative methods refer to the type of data being collected
(quantitative data involve numeric scores, metrics, and so on, while qualitative data include
interviews, observations, and so forth) and analyzed (i.e., using quantitative techniques such as
regression or qualitative techniques such as coding). Positivist research uses predominantly
quantitative data, but can also use qualitative data. Interpretive research relies heavily on
qualitative data, but can sometimes benefit from including quantitative data as well.
Sometimes, joint use of qualitative and quantitative data may help generate unique insights into
a complex social phenomenon that are not available from either type of data alone, and hence,
mixed-mode designs that combine qualitative and quantitative data are often highly desirable.
Key Attributes of a Research Design
The quality of research designs can be defined in terms of four key design attributes:
internal validity, external validity, construct validity, and statistical conclusion validity.
Internal validity, also called causality, examines whether the observed change in a
dependent variable is indeed caused by a corresponding change in the hypothesized independent
variable, and not by variables extraneous to the research context. Causality requires three
conditions: (1) covariation of cause and effect (i.e., if cause happens, then effect also happens;
and if cause does not happen, effect does not happen), (2) temporal precedence: cause must
precede effect in time, and (3) no plausible alternative explanation (or spurious correlation).
Certain research designs, such as laboratory experiments, are strong in internal validity by
virtue of their ability to manipulate the independent variable (cause) via a treatment and
observe the effect (dependent variable) of that treatment after a certain point in time, while
controlling for the effects of extraneous variables. Other designs, such as field surveys, are poor
in internal validity because of their inability to manipulate the independent variable (cause),
and because cause and effect are measured at the same point in time, which defeats temporal
precedence and makes it equally likely that the expected effect might have influenced the expected
cause rather than the reverse. Although higher in internal validity compared to other methods,
laboratory experiments are by no means immune to threats to internal validity, and are
susceptible to history, testing, instrumentation, regression, and other threats that are discussed
later in the chapter on experimental designs. Nonetheless, different research designs vary
considerably in their respective level of internal validity.
External validity or generalizability refers to whether the observed associations can be
generalized from the sample to the population (population validity), or to other people,
organizations, contexts, or time (ecological validity). For instance, can results drawn from a
sample of financial firms in the United States be generalized to the population of financial firms
(population validity) or to other firms within the United States (ecological validity)? Survey
research, where data is sourced from a wide variety of individuals, firms, or other units of
analysis, tends to have broader generalizability than laboratory experiments where artificially
contrived treatments and strong control over extraneous variables render the findings less
generalizable to real-life settings where treatments and extraneous variables cannot be
controlled. The variation in internal and external validity for a wide range of research designs
is shown in Figure 5.1.
[Figure 5.1 plots internal validity (horizontal axis) against external validity (vertical axis) for a range of designs (math proofs, single and multiple lab experiments, field experiments, single and multiple case studies, cross-sectional and longitudinal field surveys, simulation, and ethnography), and marks a "validity frontier" and a "cone of validity."]
Figure 5.1. Internal and external validity
Some researchers claim that there is a tradeoff between internal and external validity:
higher external validity can come only at the cost of internal validity and vice-versa. But this is
not always the case. Research designs such as field experiments, longitudinal field surveys, and
multiple case studies have higher degrees of both internal and external validities. Personally, I
prefer research designs that have reasonable degrees of both internal and external validities,
i.e., those that fall within the cone of validity shown in Figure 5.1. But this should not suggest
that designs outside this cone are any less useful or valuable. Researchers’ choice of designs is
ultimately a matter of their personal preference and competence, and the level of internal and
external validity they desire.
Construct validity examines how well a given measurement scale is measuring the
theoretical construct that it is expected to measure. Many constructs used in social science
research such as empathy, resistance to change, and organizational learning are difficult to
define, much less measure. For instance, construct validity must assure that a measure of
empathy is indeed measuring empathy and not compassion, which may be difficult since these
constructs are somewhat similar in meaning. Construct validity is assessed in positivist
research based on correlational or factor analysis of pilot test data, as described in the next
chapter.
Statistical conclusion validity examines the extent to which conclusions derived using
a statistical procedure are valid. For example, it examines whether the right statistical method
was used for hypotheses testing, whether the variables used meet the assumptions of that
statistical test (such as sample size or distributional requirements), and so forth. Because
interpretive research designs do not employ statistical tests, statistical conclusion validity is not
applicable to such analyses. The different kinds of validity and where they exist at the
theoretical/empirical levels are illustrated in Figure 5.2.
Figure 5.2. Different Types of Validity in Scientific Research
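One routine statistical conclusion validity check is verifying that the data meet the assumptions of the chosen test. Below is a minimal Python sketch, assuming simulated data and using a Shapiro-Wilk normality check as just one example of such an assumption test.

import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
scores = rng.normal(loc=50, scale=10, size=40)  # hypothetical sample of 40 scores

w_stat, p_value = stats.shapiro(scores)         # Shapiro-Wilk test of normality
if p_value > 0.05:
    print("Normality not rejected: a parametric test (e.g., a t-test) may be appropriate")
else:
    print("Normality rejected: consider a non-parametric alternative")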
Improving Internal and External Validity
The best research designs are those that can assure high levels of internal and external
validity. Such designs would guard against spurious correlations, inspire greater faith in the
hypotheses testing, and ensure that the results drawn from a small sample are generalizable to
the population at large. Controls are required to assure internal validity (causality) of research
designs, and can be accomplished in five ways: (1) manipulation, (2) elimination, (3) inclusion,
(4) statistical control, and (5) randomization.
In manipulation, the researcher manipulates the independent variables in one or more
levels (called “treatments”), and compares the effects of the treatments against a control group
where subjects do not receive the treatment. Treatments may include a new drug or different
dosage of a drug (for treating a medical condition), a teaching style (for students), and so forth.
This type of control is achieved in experimental or quasi-experimental designs but not in non-
experimental designs such as surveys. Note that if subjects cannot distinguish adequately
between different levels of treatment manipulations, their responses across treatments may not
be different, and manipulation would fail.
The elimination technique relies on eliminating extraneous variables by holding them
constant across treatments, such as by restricting the study to a single gender or a single socio-
economic status. In the inclusion technique, the role of extraneous variables is considered by
including them in the research design and separately estimating their effects on the dependent
variable, such as via factorial designs where one factor is gender (male versus female). Such
technique allows for greater generalizability but also requires substantially larger samples. In
statistical control, extraneous variables are measured and used as covariates during the
statistical testing process.
Finally, the randomization technique is aimed at canceling out the effects of extraneous
variables through a process of random sampling, if it can be assured that these effects are of a
random (non-systematic) nature. Two types of randomization are: (1) random selection,
where a sample is selected randomly from a population, and (2) random assignment, where
subjects selected in a non-random manner are randomly assigned to treatment groups.
Randomization also assures external validity, allowing inferences drawn from the
sample to be generalized to the population from which the sample is drawn. Note that random
assignment is mandatory when random selection is not possible because of resource or access
constraints. However, generalizability across populations is harder to ascertain since
populations may differ on multiple dimensions and you can only control for few of those
dimensions.
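The two randomization techniques described above can be sketched in a few lines of Python. This is a minimal illustration with a hypothetical population list and arbitrary sample and group sizes.

import random

random.seed(1)
population = [f"person_{i}" for i in range(1000)]   # hypothetical sampling frame

# (1) Random selection: draw a random sample from the population
sample = random.sample(population, k=60)

# (2) Random assignment: shuffle the sample and split it into treatment and control groups
random.shuffle(sample)
treatment_group, control_group = sample[:30], sample[30:]
print(len(treatment_group), len(control_group))     # 30 30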
Popular Research Designs
As noted earlier, research designs can be classified into two categories, positivist and
interpretive, depending on their goal in scientific research. Positivist designs are meant for
theory testing, while interpretive designs are meant for theory building. Positivist designs seek
generalized patterns based on an objective view of reality, while interpretive designs seek
subjective interpretations of social phenomena from the perspectives of the subjects involved.
Some popular examples of positivist designs include laboratory experiments, field experiments,
field surveys, secondary data analysis, and case research while examples of interpretive designs
include case research, phenomenology, and ethnography. Note that case research can be used
for theory building or theory testing, though not at the same time. Not all techniques are suited
for all kinds of scientific research. Some techniques such as focus groups are best suited for
exploratory research, others such as ethnography are best for descriptive research, and still
others such as laboratory experiments are ideal for explanatory research. Following are brief
descriptions of some of these designs. Additional details are provided in Chapters 9-12.
Experimental studies are those that are intended to test cause-effect relationships
(hypotheses) in a tightly controlled setting by separating the cause from the effect in time,
administering the cause to one group of subjects (the “treatment group”) but not to another
group (“control group”), and observing how the mean effects vary between subjects in these
two groups. For instance, if we design a laboratory experiment to test the efficacy of a new drug
in treating a certain ailment, we can get a random sample of people afflicted with that ailment,
randomly assign them to one of two groups (treatment and control groups), administer the
drug to subjects in the treatment group, and give only a placebo (e.g., a sugar pill with no
medicinal value) to subjects in the control group. More complex designs may include multiple
treatment groups, such as low versus high dosage of the drug, or multiple treatments, such as combining drug administration
with dietary interventions. In a true experimental design, subjects must be randomly
assigned between each group. If random assignment is not followed, then the design becomes
quasi-experimental. Experiments can be conducted in an artificial or laboratory setting such
as at a university (laboratory experiments) or in field settings such as in an organization where
the phenomenon of interest is actually occurring (field experiments). Laboratory experiments
allow the researcher to isolate the variables of interest and control for extraneous variables,
which may not be possible in field experiments. Hence, inferences drawn from laboratory
experiments tend to be stronger in internal validity, but those from field experiments tend to be
stronger in external validity. Experimental data is analyzed using quantitative statistical
techniques. The primary strength of the experimental design is its strong internal validity due
to its ability to isolate, control, and intensively examine a small number of variables, while its
primary weakness is limited external generalizability since real life is often more complex (i.e.,
involves more extraneous variables) than contrived lab settings. Furthermore, if the researcher
does not identify relevant extraneous variables ex ante and control for them, this lack
of controls may hurt internal validity and may lead to spurious correlations.
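The comparison of mean effects between treatment and control groups can be sketched as follows in Python. The outcome scores are simulated, and the two-sample t-test shown is one common way to compare group means, not the only analysis an experimenter might use.

import numpy as np
from scipy import stats

rng = np.random.default_rng(7)
treatment = rng.normal(loc=72, scale=8, size=30)   # hypothetical outcome scores (drug group)
control = rng.normal(loc=65, scale=8, size=30)     # hypothetical outcome scores (placebo group)

t_stat, p_value = stats.ttest_ind(treatment, control)
print(f"Mean difference: {treatment.mean() - control.mean():.1f}, t = {t_stat:.2f}, p = {p_value:.3f}")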
Field surveys are non-experimental designs that do not control for or manipulate
independent variables or treatments, but measure these variables and test their effects using
statistical methods. Field surveys capture snapshots of practices, beliefs, or situations from a
random sample of subjects in field settings through a survey questionnaire or less frequently,
through a structured interview. In cross-sectional field surveys, independent and dependent
variables are measured at the same point in time (e.g., using a single questionnaire), while in
longitudinal field surveys, dependent variables are measured at a later point in time than the
independent variables. The strengths of field surveys are their external validity (since data is
collected in field settings), their ability to capture and control for a large number of variables,
and their ability to study a problem from multiple perspectives or using multiple theories.
However, because of their non-temporal nature, internal validity (cause-effect relationships)
is difficult to infer, and surveys may be subject to respondent biases (e.g., subjects may
provide a “socially desirable” response rather than their true response) which further hurts
internal validity.
Secondary data analysis is an analysis of data that has previously been collected and
tabulated by other sources. Such data may include data from government agencies such as
employment statistics from the U.S. Bureau of Labor Statistics or development statistics by
country from the United Nations Development Program, data collected by other researchers
(often used in meta-analytic studies), or publicly available third-party data, such as financial
data from stock markets or real-time auction data from eBay. This is in contrast to most other
research designs where collecting primary data for research is part of the researcher’s job.
Secondary data analysis may be an effective means of research where primary data collection is
too costly or infeasible, and secondary data is available at a level of analysis suitable for
answering the researcher’s questions. The limitations of this design are that the data might not
have been collected in a systematic or scientific manner and may hence be unsuitable for scientific
research; that, since the data was collected for a presumably different purpose, it may not
adequately address the research questions of interest to the researcher; and that internal validity is
problematic if the temporal precedence between cause and effect is unclear.
Case research is an in-depth investigation of a problem in one or more real-life settings
(case sites) over an extended period of time. Data may be collected using a combination of
interviews, personal observations, and internal or external documents. Case studies can be
positivist in nature (for hypotheses testing) or interpretive (for theory building). The strength
of this research method is its ability to discover a wide variety of social, cultural, and political
factors potentially related to the phenomenon of interest that may not be known in advance.
Analysis tends to be qualitative in nature, but heavily contextualized and nuanced. However,
interpretation of findings may depend on the observational and integrative ability of the
researcher, lack of control may make it difficult to establish causality, and findings from a single
case site may not be readily generalized to other case sites. Generalizability can be improved by
replicating and comparing the analysis in other case sites in a multiple case design.
Focus group research is a type of research that involves bringing in a small group of
subjects (typically 6 to 10 people) at one location, and having them discuss a phenomenon of
interest for a period of 1.5 to 2 hours. The discussion is moderated and led by a trained
facilitator, who sets the agenda and poses an initial set of questions for participants, makes sure
that ideas and experiences of all participants are represented, and attempts to build a holistic
understanding of the problem situation based on participants’ comments and experiences.
Internal validity cannot be established due to lack of controls and the findings may not be
generalized to other settings because of small sample size. Hence, focus groups are not
generally used for explanatory or descriptive research, but are more suited for exploratory
research.
Action research assumes that complex social phenomena are best understood by
introducing interventions or “actions” into those phenomena and observing the effects of those
actions. In this method, the researcher is usually a consultant or an organizational member
embedded within a social context such as an organization, who initiates an action such as new
organizational procedures or new technologies, in response to a real problem such as declining
profitability or operational bottlenecks. The researcher’s choice of actions must be based on
theory, which should explain why and how such actions may cause the desired change. The
researcher then observes the results of that action, modifying it as necessary, while
simultaneously learning from the action and generating theoretical insights about the target
problem and interventions. The initial theory is validated by the extent to which the chosen
action successfully solves the target problem. Simultaneous problem solving and insight
generation is the central feature that distinguishes action research from all other research
methods, and hence, action research is an excellent method for bridging research and practice.
This method is also suited for studying unique social problems that cannot be replicated outside
that context, but it is also subject to researcher bias and subjectivity, and the generalizability of
findings is often restricted to the context where the study was conducted.
Ethnography is an interpretive research design inspired by anthropology that
emphasizes that a research phenomenon must be studied within the context of its culture. The
researcher is deeply immersed in a certain culture over an extended period of time (8 months
to 2 years), and during that period, engages, observes, and records the daily life of the studied
culture, and theorizes about the evolution and behaviors in that culture. Data is collected
primarily via observational techniques, formal and informal interaction with participants in
that culture, and personal field notes, while data analysis involves sense-making. The
researcher must narrate her experience in great detail so that readers may experience that
same culture without necessarily being there. The advantages of this approach are its
sensitivity to the context, the rich and nuanced understanding it generates, and minimal
respondent bias. However, this is also an extremely time and resource-intensive approach, and
findings are specific to a given culture and less generalizable to other cultures.
Selecting Research Designs
Given the above multitude of research designs, which design should researchers choose
for their research? Generally speaking, researchers tend to select those research designs that
they are most comfortable with and feel most competent to handle, but ideally, the choice
should depend on the nature of the research phenomenon being studied. In the preliminary
phases of research, when the research problem is unclear and the researcher wants to scope
out the nature and extent of a certain research problem, a focus group (for individual unit of
analysis) or a case study (for organizational unit of analysis) is an ideal strategy for exploratory
research. As one delves further into the research domain and finds that there are no good
theories to explain the phenomenon of interest, and wants to build a theory to fill that gap,
interpretive designs such as case research or ethnography may be useful
designs. If competing theories exist and the researcher wishes to test these different theories or
integrate them into a larger theory, positivist designs such as experimental design, survey
research, or secondary data analysis are more appropriate.
Regardless of the specific research design chosen, the researcher should strive to collect
quantitative and qualitative data using a combination of techniques such as questionnaires,
interviews, observations, documents, or secondary data. For instance, even in a highly
structured survey questionnaire, intended to collect quantitative data, the researcher may leave
some room for a few open-ended questions to collect qualitative data that may generate
unexpected insights not otherwise available from structured quantitative data alone. Likewise,
while case research employs mostly face-to-face interviews to collect qualitative data, the
potential and value of collecting quantitative data should not be ignored. As an example, in a
study of organizational decision making processes, the case interviewer can record numeric
quantities such as how many months it took to make certain organizational decisions, how
many people were involved in that decision process, and how many decision alternatives were
considered, which can provide valuable insights not otherwise available from interviewees’
narrative responses. Irrespective of the specific research design employed, the goal of the
researcher should be to collect as much and as diverse data as possible that can help generate
the best possible insights about the phenomenon of interest.
Chapter 6
Measurement of Constructs
Theoretical propositions consist of relationships between abstract constructs. Testing
theories (i.e., theoretical propositions) requires measuring these constructs accurately, correctly,
and in a scientific manner, before the strength of their relationships can be tested.
Measurement refers to careful, deliberate observations of the real world and is the essence of
empirical research. While some constructs in social science research, such as a person’s age,
weight, or a firm’s size, may be easy to measure, other constructs, such as creativity, prejudice,
or alienation, may be considerably harder to measure. In this chapter, we will examine the
related processes of conceptualization and operationalization for creating measures of such
constructs.
Conceptualization
Conceptualization is the mental process by which fuzzy and imprecise constructs
(concepts) and their constituent components are defined in concrete and precise terms. For
instance, we often use the word “prejudice” and the word conjures a certain image in our mind;
however, we may struggle if we were asked to define exactly what the term meant. If someone
says bad things about other racial groups, is that racial prejudice? If women earn less than men
for the same job, is that gender prejudice? If churchgoers believe that non-believers will burn
in hell, is that religious prejudice? Are there different kinds of prejudice, and if so, what are
they? Are there different levels of prejudice, such as high or low? Answering all of these
questions is the key to measuring the prejudice construct correctly. The process of
understanding what is included and what is excluded in the concept of prejudice is the
conceptualization process.
The conceptualization process is all the more important because of the imprecision,
vagueness, and ambiguity of many social science constructs. For instance, is “compassion” the
same thing as “empathy” or “sentimentality”? If you have a proposition stating that
“compassion is positively related to empathy”, you cannot test that proposition unless you can
conceptually separate empathy from compassion and then empirically measure these two very
similar constructs correctly. If deeply religious people believe that some members of their
society, such as nonbelievers, gays, and abortion doctors, will burn in hell for their sins, and
forcefully try to change the “sinners” behaviors to prevent them from going to hell, are they
acting in a prejudicial manner or a compassionate manner? Our definition of such constructs is
not based on any objective criterion, but rather on a shared (“inter-subjective”) agreement
between our mental images (conceptions) of these constructs.
While defining constructs such as prejudice or compassion, we must understand that
sometimes, these constructs are not real and do not exist independently, but are simply imaginary
creations in our mind. For instance, there may be certain tribes in the world who lack prejudice
and who cannot even imagine what this concept entails. But in real life, we tend to treat this
concept as real. The process of regarding mental constructs as real is called reification, which is
central to defining constructs and identifying measurable variables for measuring them.
One important decision in conceptualizing constructs is specifying whether they are
unidimensional or multidimensional. Unidimensional constructs are those that are expected
to have a single underlying dimension. These constructs can be measured using a single
measure or test. Examples include simple constructs such as a person’s weight, wind speed,
and probably even complex constructs like self-esteem (if we conceptualize self-esteem as
consisting of a single dimension, which, of course, may be an unrealistic assumption).
Multidimensional constructs consist of two or more underlying dimensions. For instance, if
we conceptualize a person’s academic aptitude as consisting of two dimensions (mathematical
and verbal ability), then academic aptitude is a multidimensional construct. Each of the
underlying dimensions in this case must be measured separately, say, using different tests for
mathematical and verbal ability, and the two scores can be combined, possibly in a weighted
manner, to create an overall value for the academic aptitude construct.
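A weighted combination of separately measured dimensions can be sketched as follows in Python; the weights and test scores are hypothetical, and equal weighting is only one of many defensible choices.

def academic_aptitude(math_score, verbal_score, math_weight=0.5):
    # Weighted combination of the two underlying dimensions into one overall score.
    return math_weight * math_score + (1 - math_weight) * verbal_score

print(academic_aptitude(math_score=80, verbal_score=60))                    # 70.0 (equal weights)
print(academic_aptitude(math_score=80, verbal_score=60, math_weight=0.75))  # 75.0 (math weighted more)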
Operationalization
Once a theoretical construct is defined, exactly how do we measure it?
Operationalization refers to the process of developing indicators or items for measuring
these constructs. For instance, if an unobservable theoretical construct such as socioeconomic
status is defined as the level of family income, it can be operationalized using an indicator that
asks respondents the question: what is your annual family income? Given the high level of
subjectivity and imprecision inherent in social science constructs, we tend to measure most of
those constructs (except a few demographic constructs such as age, gender, education, and
income) using multiple indicators. This process allows us to examine the closeness amongst
these indicators as an assessment of their accuracy (reliability).
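The closeness among multiple indicators can be examined through inter-item correlations or an internal-consistency estimate such as Cronbach's alpha, one common reliability measure. A minimal Python sketch with hypothetical responses to three indicators of the same construct:

import numpy as np

# Rows are respondents, columns are three hypothetical indicators of one construct
items = np.array([
    [4, 5, 4],
    [2, 2, 3],
    [5, 5, 5],
    [3, 4, 3],
    [1, 2, 1],
])

k = items.shape[1]
item_variances = items.var(axis=0, ddof=1)        # sample variance of each indicator
total_variance = items.sum(axis=1).var(ddof=1)    # variance of the summed scale
alpha = (k / (k - 1)) * (1 - item_variances.sum() / total_variance)

print(np.corrcoef(items, rowvar=False).round(2))  # inter-item correlation matrix
print(f"Cronbach's alpha: {alpha:.2f}")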
Indicators operate at the empirical level, in contrast to constructs, which are
conceptualized at the theoretical level. The combination of indicators at the empirical level
representing a given construct is called a variable. As noted in a previous chapter, variables
may be independent, dependent, mediating, or moderating, depending on how they are
employed in a research study. Also, each indicator may have several attributes (or levels), and
each attribute represents a value. For instance, a “gender” variable may have two attributes:
male or female. Likewise, a customer satisfaction scale may be constructed to represent five
attributes: “strongly dissatisfied”, “somewhat dissatisfied”, “neutral”, “somewhat satisfied” and
“strongly satisfied”. Values of attributes may be quantitative (numeric) or qualitative (non-
numeric). Quantitative data can be analyzed using quantitative data analysis techniques, such
as regression or structural equation modeling, while qualitative data require qualitative data
analysis techniques, such as coding. Note that many variables in social science research are
qualitative, even when represented in a quantitative manner. For instance, we can create a
customer satisfaction indicator with five attributes: strongly dissatisfied, somewhat dissatisfied,
neutral, somewhat satisfied, and strongly satisfied, and assign numbers 1 through 5
respectively for these five attributes, so that we can use sophisticated statistical tools for
quantitative data analysis. However, note that the numbers are only labels associated with
respondents’ personal evaluation of their own satisfaction, and the underlying variable
(satisfaction) is still qualitative even though we represented it in a quantitative manner.
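The numeric coding of the satisfaction attributes described above can be sketched as a simple label-to-number mapping in Python; the responses listed are hypothetical.

satisfaction_scale = {
    "strongly dissatisfied": 1,
    "somewhat dissatisfied": 2,
    "neutral": 3,
    "somewhat satisfied": 4,
    "strongly satisfied": 5,
}

responses = ["neutral", "strongly satisfied", "somewhat dissatisfied"]  # hypothetical answers
coded = [satisfaction_scale[r] for r in responses]
print(coded)  # [3, 5, 2]: the numbers are only labels; the variable itself remains qualitative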
Indicators may be reflective or formative. A reflective indicator is a measure that
“reflects” an underlying construct. For example, if religiosity is defined as a construct that
measures how religious a person is, then attending religious services may be a reflective
indicator of religiosity. A formative indicator is a measure that “forms” or contributes to an
underlying construct. Such indicators may represent different dimensions of the construct of
interest. For instance, if religiosity is defined as comprising a belief dimension, a devotional
dimension, and a ritual dimension, then indicators chosen to measure each of these different
dimensions will be considered formative indicators. Unidimensional constructs are measured
using reflective indicators (even though multiple reflective indicators may be used for
measuring abstruse constructs such as self-esteem), while multidimensional constructs are
measured as a formative combination of the multiple dimensions, even though each of the
underlying dimensions may be measured using one or more reflective indicators.
Levels of Measurement
The first decision to be made in operationalizing a construct is the intended level of
measurement. Levels of measurement, also called rating scales, refer to the
values that an indicator can take (but says nothing about the indicator itself). For example,
male and female (or M and F, or 1 and 2) are two levels of the indicator “gender.” In his seminal
article titled "On the theory of scales of measurement" published in Science in 1946,
psychologist Stanley Smith Stevens (1946) defined four generic types of rating scales for
scientific measurements: nominal, ordinal, interval, and ratio scales. The statistical properties
of these scales are shown in Table 6.1.
Scale | Central Tendency | Statistics | Transformations
Nominal | Mode | Chi-square | One-to-one (equality)
Ordinal | Median | Percentile, non-parametric statistics | Monotonic increasing (order)
Interval | Arithmetic mean, range, standard deviation | Correlation, regression, analysis of variance | Positive linear (affine)
Ratio | Geometric mean, harmonic mean | Coefficient of variation | Positive similarities (multiplicative, logarithmic)
Note: All higher-order scales can use any of the statistics for lower order scales.
Table 6.1. Statistical properties of rating scales
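The central tendency column of Table 6.1 can be illustrated with Python's statistics module; the small data lists below are hypothetical, and each statistic is matched to the lowest scale for which it is meaningful.

import statistics

gender = ["M", "F", "F", "M", "F"]          # nominal: mode is the appropriate central tendency
ranks = [1, 3, 3, 4, 5, 2]                  # ordinal: median
temperature_f = [68.0, 70.5, 71.0, 69.5]    # interval: arithmetic mean
incomes = [40_000, 55_000, 120_000]         # ratio: geometric mean is also meaningful

print(statistics.mode(gender))                     # F
print(statistics.median(ranks))                    # 3.0
print(statistics.mean(temperature_f))              # 69.75
print(round(statistics.geometric_mean(incomes)))   # geometric mean of the three incomes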
Nominal scales, also called categorical scales, measure categorical data. These scales