Friday, September 5, 2014


The Research Process and the Hypothesis Formulation 

1. Introduction to Research

Research is present in every walk of life.  The concept of research in different forms existed from time immemorial.  It assumed significance after the formal education and organizations came into existence.  It does not mean that the formal education and organizations gave birth to research.  Certainly the process of research was used by different civilizations to know the past, understand the present and predict the future.  We could define research as the process used to move from unorganized way of thinking and decision making to the organized way of thinking and decision making.  Formally, research could be defined as the process and tools used in decision making that reduce the risk of wrong decisions. Research is a systematic inquiry into a phenomenon to provide information to the decision maker in any field.  Research is undertaken in all the subjects of inquiry.  While scientific research involves systematic and scientific experimentation which could be replicated and verified, the replication and verifiability may not be central to social, political, economic, commerce, commercial and business research.  Business research involving commerce and management can be defined as the systematic inquiry into a phenomenon to provide information to the managerial process of decision making.  A large part of the research in accounting today is empirical and has focused on different issues related to accounting concepts, principles, the treatment of specific items of income, expenses, assets and liabilities etc. 
Research has helped the human kind in all walks of life.  The importance of research grew as organizations, both ‘for profit’ and ‘not for profit’, grew in size and started to recognize that research can provide useful inputs into the decision making.  Countries started to recognize the importance of research as the economies became more complex and many variables started to influence the outcome of a scheme/policy.  Scientific research has been of immense help in understanding the most complex things in science and technology.  Research in medical field has solved many a problems of the human and animal life.  Without the basic research, human life would have been miserable today.  It is because research which is abstract in nature was undertaken by scientists, we have been able to view the TV, travel from place to place, get electricity and light in all the houses, save the human and animal life from deadly diseases, launch the satellites into geostationary orbits and organise the working of business organisations and find out the results of the operations for each year.  While the basic or pure research may not look at the immediate application of the research results in any field, the applied research usually looks for solving the current or immediate future problems.  It is not uncommon for the researchers in the educational institutions to be at the receiving end whenever some problems crop up in a state/country.  The policy makers immediately criticize that the researchers have not been able to solve their problems.  Unfortunately what they do not realize is that the problem which arises may have to be adequately researched before the tentative solutions are found.  These types of research are undertaken to address the current problems and find policy measures to solve the problems.
Every research involves systematic steps.  Understanding of these steps is essential to know the best ways of doing research.  This paper has the limited objective of discussing the research process and the hypothesis formulation.  The paper is organised in five parts.  Part 2 focuses on the research process, part 3 discusses the issues involved in hypothesis formulation and testing, part 4 discusses the definitions of variables used in the research and the last part presents the conclusions.   

2. The Research Process

Research can be pure or basic research or applied research.  Empiricism is assuming a lot of importance in the recent times.  Most of the research in commerce, economics and management is empirical in nature.  Therefore, many researchers use the data to either describe a phenomenon or prove or disprove a relationship.  When a survey research involves collection of qualitative and quantitative data, researchers would first define the problem, then set the objectives and then go on to collect the data relevant for their work.  Generally, the research process involve the following steps
2.1  Idea generation:  A research idea may be generated by general/specific problem that is being faced currently, the problem that may be imagined by either the researcher or others.  It could also emerge from a brain storming session, Delphi method, researcher’s reading, researcher’s/others’ imagination.  The problems currently being encountered could be related to the economy, industry, company, society, a social organization, an administration division of the government, world level organisations etc.  For example, India is facing the problem of higher than expected food inflation today.  Therefore, the relevant idea could be what causes food inflation.  We have seen the fall of multinational financial institutions during the financial crisis that originated in the US and spread to the entire world.  The research idea may come from this crisis where the researcher may want to study the causes of the fall of the institutions of that magnitude.
2.2  Problem identification:  Once the researcher generates research idea based from different sources, the researcher needs to identify the problem in concrete terms.  An idea may be a hazy one, but that needs to be refined while identifying the problem.  The problem identification will define the scope of the research and sets the boundary line for the research.  If the problem is too broad and cannot be approached in a single research, then we may not arrive at any conclusions.  Therefore, we need to identify the problem that the researcher wants to address in his/her work.
2.3   Problem definition:  At this stage the researcher not only knows the problems that needs to be addressed but also defines it.  The problem definition clearly identifies the phenomenon that needs to be investigated, the variables needed to define the problem, the relationship that needs to be established tentatively.  The problem definition is done with the help of reading of the related materials and works.
2.4  Review of related concepts, theories and the literature:  Once the research problem is tentatively defined, the researcher needs to review the related concepts, theories and the empirical and theoretical works that have already been carried out.  This stage is crucial to understand whether or not to undertake the the proposed research work.  If the review of related concepts, theories and literature reveals that the problem under investigation has already been sufficiently investigated and robust conclusions have been drawn, the problem need not be researched again.  Alternatively, this stage may reveal that the researcher needs to modify the problem and redefine in the light of the evidences that are already available. The review also helps the researcher to understand the problems that have already been investigated, the methodology that are followed, the variables used and the phenomenon investigated and the conclusions that are drawn.   A critical review of the literature would establish the methodological problems, the data collection problems and the need for refining and reinvestigating the problems.  The literature review can be compared to the flow of a river.  The river starts from some place in a small way, collects water from small streams and tributaries before becoming a big river and establishing its own identity and then flowing in some definite direction.  The literature review would also reveal what are the different dimensions of the problem that is being investigated and what are the ways in which different researchers have defined and investigated the phenomenon.   The review would also give an idea of different variables that are need to be studied, the type of relationship between different variables/attributes that are to be studied, and the tentative relationships that could be formulated as a possible explanation of the problem under investigation.
2.5  Formulation of hypotheses:  The problem identification, definition and review of related literature would reveal the different dimensions of the problems, the variables used in the study, the methodologies followed and the conclusions drawn by various researchers.  A critical review of the studies should also reveal the gaps in the problems, the problems of drawing conclusions using the limited data and the questionable methodologies used in the studies.  It would also establish the new variables that need to be considered, the refinements or new methodologies to be used and the data collection tools to be applied while taking up the researcher’s current research problem.  This should be the basis on which the researcher should formulate the hypotheses.  Each of the hypotheses should have a basis and be supported by either the earlier established conclusions of different studies or established practices etc.  The hypothesis formulation is crucial as the entire data collection work of the researcher hinges on this.  Therefore, the researcher should exercise extreme care in formulating and stating the hypotheses.
2.6  Research Design including sampling Design:  While research design encompasses the entire research work starting from research idea generation and ending with the drawing of conclusions, sampling design deals with the data and the data collection techniques that are employed by the researcher.  Sampling design is the process of defining the population, sample, the sampling frame and the sampling techniques.  This process helps make the data a representative one for the entire population.  Since the entire population cannot be studied by the researcher for want of time and cost of investigation, he/she uses sample to draw conclusion about the entire population.  Therefore, it is extremely important that the researcher uses the correct sampling techniques to make the sample representative of the population. The use of statistical techniques for analysis of the data assumes that the sample is a random one and is representative of the entire population. Therefore, if this condition is not satisfied, the entire conclusions will be wrong.
2.7  Data collection:  The researcher uses the appropriate data collection instrument and administers it to the sampling units chosen in the above step. The data collection instrument should be carefully designed to minimize the sampling and non-sampling errors.  Further, hallow effect should be taken care while finalizing the data collection instrument.  The researcher may use pilot study to redraft and revise the data collection instrument.  It has to be ensured that there are sufficient numbers of observations from the data collected to draw conclusions.  Use of statistical techniques would also call for a minimum number of observations.  If this is not ensured in data collection stage, the analysis of data poses problems.
2.8  Data checking and validation:  If sufficient care is taken at the data collection stage, the problem of checking and validation of the data is easy.  If the data collection is not done properly, a lot of inconsistencies may be encountered while reading the data.  The first thing the researcher should do after collecting the data is to put the data to “eye ball test”(EBT).    EBT involves observing the data carefully by the researcher for the possible errors, omissions, inaccuracies and inconsistencies.  A careful observation of the data may reveal that there are certain problems with the data collected.  Before, proceeding further, the researcher should correct mistakes in the data and clean it.  The process of ensuring the accuracy of data leads to data validation.  This stage would avoid the possible blunders the researcher would commit by using the wrong data and subjecting it to mountain of statistics.  No sophisticated statistical tests can rectify the wrong conclusions arising out of the inaccurate data.  Therefore, it is necessary that the researcher first cleanses the data before subjecting it to any statistical tools.
2.9  Data analysis:  The cleaned data may be summarized and analysed subjecting it to statistical tools.  The statistical tools may be broadly classified into two categories for the purpose of data analysis.  The first category of the tools like measures of central tendency, dispersion, skewness and kurtosis may be used to summarise the mass data that would help the researcher to get a limited number of parameters that would describe the characteristics of the data that has been collected.  The second category of the statistical tools like the measures of correlation, regression, the various statistical tests can be used for establishing and testing the relationship/hypotheses.  These tools would help to know whether or not the research hypothesis can be accepted or rejected.
2.10    Interpretation:  Most of the researchers stop by saying whether or not the hypotheses formulated are either accepted or rejected.  Mere statements of acceptance or rejection of hypotheses is not the purpose of research.  The interpretation of the results of the data analysis and knowing whether the hypotheses are accepted or rejected and then stating the results in terms of the original problem is what is intended in the interpretation stage.  At this stage the researcher should also give possible interpretations for either acceptance or rejection of the hypotheses.  This is also the stage at which the researcher should compare his/her results with those of the works studied in the literature review.
2.11    Conclusions:  The researcher should draw the final conclusions of the research in terms of the original problem being investigated.  The researcher should also highlight the problems of generalizations and the possible problems that could have been investigated as part of his/her research but could not be done for various reasons.  This discussion would lead to the problems that other researcher could undertake.  The researcher should also discuss the implications of the conclusions drawn.  Merely stating the conclusion is not sufficient.

3. Hypothesis Formulation

Most of the research students in economics, commerce and management and in other fields would be using empirical research.  While the empirical work in science involves experimentation in a laboratory; the entire economy, industry, companies, social organizations etc are the laboratories for students in economics, commerce and management and in other fields in social sciences.  When we use the empirical research, the researchers are defining the problems in terms of the variables/attributes and their tentative relationships.  Therefore, when the relationships between different variables/attributes are defined and formulated as a statement to explain a phenomenon, it should emerge from some source.  That source could be the observation, literature, theory, an exploratory study undertaken specifically for this purpose, current problems being faced, the problems that are likely to be faced, the abstract formulation of the relationship that may or may not be relevant in the current context, the phenomenon that was investigated for a long time but conclusions could not be arrived at.  Therefore, the sources of hypotheses could be the previous literature, the existing theory, law, an exploratory study, social/economic/demographic/scientific phenomenon etc.  It is not necessary that every research study should have a hypothesis.  However, when a hypothesis is formulated, it should be done in a scientific way.  This is what is being discussed here.  Before, proceeding further we need to define hypothesis.
3.1 What is a hypothesis?
A hypothesis is a tentative statement that establishes a possible relationship between the variables, which in turn, explains some phenomenon or relationship.  It can be defined as ‘a statement about the population’.  This statement could be stated in terms of the relationship between the variables/attributes, cause and effect relationship, a phenomenon that explains a problem etc.  A hypothesis is a statement which may include a prediction.  A fact cannot be a hypothesis.  A hypothesis should not be confused with facts.  For example, the population of India consists of male and female.  It is a statement about the population of a country but it is not a hypothesis.  It is a fact.  The hypothesis should not be confused with the established theories.  Theories are general explanations based on a large amount of data and established by repeated investigations.  For example, the theory of evolution applies to all living things and is based on wide range of observations. However, there are many things about evolution that are not fully understood such as gaps in the fossil record.  Many hypotheses have been proposed and tested on these issues.  If we can take the example from economics, India is experiencing the food inflation as one of the problems now.  What causes food inflation can be put in terms of different reasoning.  Therefore, they are only tentative statements which need to be tested before they are accepted.  These can be the hypotheses.  ‘Demand induces supply’ is not normally a hypothesis.  However, if one has the reason to believe that something other than demand induces supply, then it can be put in the form of hypothesis.  A hypothesis should be capable of being investigated and proved or disproved.  Established truths cannot become hypothesis even when they can be investigated.  This is because after the investigation you will conclude the same established truth.  Therefore, only when there is possibility of either accepting or rejecting the statement, it becomes the hypothesis.     
3.2. How to formulate a hypothesis?
A hypothesis cannot emerge from vacuum.  It has to have a strong basis.  The source of the hypothesis could be the current problem, social and demographic phenomenon, economic problems, country’s problems, scientific problems etc.  The hypothesis should have basis of either the previous literatures or strong prevailing notions that are challenged by people, policies, practices, law, social customs, scientific experiments, nature etc.  The basis of research hypothesis is usually the critical review of previous literature.  The critical review of the literature would reveal that there are problems about the conclusions drawn and therefore, there is a further need for investigation.  This conclusion could give rise to the new hypothesis. Therefore, a researcher should formulate each hypothesis on the basis of the literature review he does and the discussion of the reviews. 
3.3. Examples of what are not Hypotheses
The following cannot be the hypotheses.
  • Commercial Banks in India accept the deposits from the general public.
  • The purpose of Commercial Banks in India is to lend money to the general public and other organizations.
  • The customers of a company consist of male and female.
  • A large number of customers visit bus terminus every day.
  • Money supply influences the inflation.
  • The voters in India are the male and female.
  • The age of the voters in India is greater than or equal to 18 years.
  • The government of India derives revenue from the public sector undertakings.
  • Export oriented companies earn foreign exchange for India.
  • The industrial units located in Special Economic Zones (SEZs) are eligible for fiscal incentives.
  • China is the second largest economy in the world.
  • The financial crisis originated in the US and spread to other parts of the world.
  • WTO ensures the free trade in goods and services across the member countries.
  • India has initiated economic reforms.
  • The women in India like jewellery.

3.4. Examples of what are Hypotheses
  • The following can be the hypotheses.
  • The capital structure of a firm influences the value of the firm.
  • The value of dividend paying firm is more than that of firms not paying dividends.
  • There is a positive relationship between wages and production
  • Training methods influence the productivity of firms.
  • The products produced by the MNCs are superior to that of Indian companies.
  • The TV viewing improves the IQ of children.
  • The use of chemical fertilizers may adversely affect the fertility of the soil.
  • The capital structure of a firm influences the cost of capital
  • Accounting earnings influence the share prices.
  • Publicly available earnings information is absorbed by the stock market instantaneously.
  • There is no significant difference between the Indian Accounting Standards and International Accounting Standards.
  • Accountants involve in earnings manipulation when their pay packages are based on the accounting earnings.
  • Accountants chose those accounting principles that maximize their personal wealth as opposed to firm’s wealth.

The following example shows how to formulate a Hypothesis.  We know that there are different theories that explain the capital structure of firm.  Many variables could influence the capital structure of a firm.  A definite answer to the capital structure puzzle is yet to be found.  Therefore, researcher could examine the factors that influence the capital structure of a firm.  A variable that is used to measure the firm’s ability to generate capital internally as examined by Kester (1986), Titman and Wessels (1988), Wald (1999) and Pandey et al (2000), among others, is earnings before interest and taxes (EBIT) scaled by total assets.  Profitability is found to be negatively correlated with leverage.  Mallikarjunappa and Goveas (2007) tested the factors influencing the capital structure of a firm and concluded that profitability do not have significant relationship with the debt ratio.  However, a significant negative relationship between profitability and debt ratio supports the pecking order hypothesis that the firms with liquid assets and internal accruals would use less debt.  Accordingly, we can hypothesize that firm with high profitability will use less debt capital in the capital structure.  Therefore, we can hypothesize as follows. 
Hypothesis:     Profitability is inversely related to debt ratios.
3.5. How do we test the hypothesis?
The following steps are followed in hypothesis testing.
3.5.1.      Formulation of hypotheses
3.5.2.      Sampling design
3.5.3.      Deciding sampling distributions
3.5.4.      Setting the level of significance.
3.5.5.      Deciding the acceptance and rejection region.
3.5.6.      Data collection
3.5.7.      Computation of the values of test statistic based on the appropriate sampling distribution and the data collected.
3.5.8.      Deciding whether the computed value of test statistic falls in the acceptance or rejection region.
3.5.9.      Deciding whether to accept or reject the hypothesis.
3.5.10.  Drawing conclusions in terms of original problem.
3.5.11.  Interpreting the implications of conclusions.

4. Operational definitions of variables used in the research

It is very important that we provide operational definitions of the variables used in the hypotheses.  For example, if the hypothesis is: ‘morale and production are positively related’; it important that we define morale and the way it is measured.  Similarly production should be defined and measurement should be specified.  If the survey work is involved, the data collection instrument should contain questions related to the operational definitions provided by the researcher.  This should be done before the data collection and not after the data collection.  Most often researchers do not even know about this and will end up with problems after the data collection.  Therefore, every variable/attribute in the hypotheses should be clearly defined.  Further, measurements of these variables/attributes should also be specified and incorporated into the data collection instrument.  It is always a good idea to use the pilot survey and put the data collected to the statistical tests to know and understand whether there are any problems with definitions and measurements.  It should also be ensured that the variables are understood by all the respondents in the similar way.  There should not be any ambiguity in the variable measurements.  For example, if a respondent in the production department is asked about morale, he may understand morale in his own way and others may understand morale in their own way.  Therefore, it is preferable to give the indicators of high or low morale so that the responses are correct and comparable. This task is easy in case of variable like production as we can define it in terms of number of units.  Here also it is important to specify that production is measured in terms of number of units and even the units should be specified.  However, it may not be so easy in case variables like morale.  It is here that the researcher should search adequate literature and clearly define the variables and their measurements.  Consider another hypothesis: education increases income of the people.  This hypothsis specifies a positive relationship between the concepts “education” and “income.”  This abstract statement of hypothesis cannot be tested without the operational definitions of the two variables or concepts.  First, it must be operationalized or situated in the real world by rules of interpretation. To test this hypothesis, the abstract meaning of education and income must be derived or operationalized. The concepts should be capble of being understood uniformly and measured. Education could be defined and measured by years of schooling, professional eduction, general eduction, technical eduction, or highest degree completed etc. Income could be measured by hourly rate of pay, monthly income, yearly salary, etc.  Similarly if the hypothesis to be tested is ‘accounting earnings influences the stock prices’, we need to define the two variables, accounting earnings and stock prices.  Accounting earnings may have different meanings.  These could be net profit, gross profit, profit before taxes, profit before taxes and interest, cash profit, operating profit, operating cash profit, profit before taxes, interest and depreciation, etc.  Similarly stock prices may be the opening price, closing price, highest price, lowest price, unadjusted price, adjusted price etc.  We need to clearly define these two and clarify that accounting earnings are measured as profit before taxes, interest and depreciation and stock prices are the closing adjusted prices.  This makes the entire hypothesis clear and unambiguous.  In the absence of operational definitions the researchers and the readers may form their own opinion about the variables and understand the conclusions drawn.  To avoid differences in the understandings the variables and concepts need to be defined after the formulation of the hypotheses and before the data collection.  Without the operational definitions of the variables/concepts, the researcher will end up with erroneous analysis, interpretation and conclusions.  This can damage the foundations of a good research work.

5. Conclusion

Research is a long process.  It involves commitment of time, cost and dedication.  Many a time the researcher is prepared to spend time and money but do not have dedication.  It is lack of dedication that leads to a lot of gaps in the research.  Even the dedicated researchers may go wrong if they do not have proper direction.  This paper is intended to show the process involved in the research and the hypothesis formulation.  There are many other issues related to hypothesis formulation and testing.  All these cannot be discussed in short paper like this.  The interested readers should read a good book on research methodology.  This paper is intended to clarify the issues related to research process and hypothesis formulation.  The detailed discussion on the testing process and the concepts involved in this is beyond the scope of this paper.  I hope that this paper will help the researchers to clearly understand the process of research and hypothesis formulation.  I have examined many PhD and MPhil theses and found that the researchers have erred in this stage.  Therefore, this short paper has been written to help the researchers to improve their skills in research.  Obviously this cannot be a final say on the issue.  Researchers can discover their own innovative ways of formulating the hypothesis. The research process can be improved by the collective wisdom of the researchers over a period of time.  I do hope that the researchers will share this collective wisdom with the young researchers.








No comments:

Post a Comment

Model Exam on E-Business, Module V - M Com S2 March - April 2020

DEPARTMENT OF COMMERCE MAHATMA GANDHI COLLEGE, THIRUVANANTHAPURAM Second Semester M.Com. Degree Examination, April 2020 Paper – I: ...