International Journal of
Library and Information Science

  • Abbreviation: Int. J. Lib. Inf. Sci.
  • Language: English
  • ISSN: 2141-2537
  • DOI: 10.5897/IJLIS
  • Start Year: 2009
  • Published Articles: 246

Full Length Research Paper

Bibliometric analysis of cancer publications in Nigeria during 2008 – 2012

Salisu, Sulaiman Akorede*
  • Salisu, Sulaiman Akorede*
  • Africa Regional Centre for Information, University of Ibadan, Nigeria.
  • Google Scholar
Ojoye, Opeyemi Tawakalit
  • Ojoye, Opeyemi Tawakalit
  • Africa Regional Centre for Information, University of Ibadan, Nigeria.
  • Google Scholar


  •  Received: 21 January 2015
  •  Accepted: 23 February 2015
  •  Published: 31 March 2015

 ABSTRACT

The bibliographic data on cancer literature of Nigeria was drawn from the Medline of National Library of Medicine (NLM), USA. This research work examines the conformity of Lotka’s Law to authorship distribution during 2008-2012.Totally, 677 articles in cancer produced by 1,854 authors, was compiled for the analysis. Lotka’s inverse power Law was used in testing the hypotheses. That is,  where the values of the exponent n and the constant c were computed; and Kolmogorov-Smimov (K-S) goodness-of-fit test was applied. The results suggested that all categories of author productivity distribution in this research work fit into Lotka’s generalized inverse square law.
 
Key words: Bibliographic data, publication, cancer, impact factor.


 INTRODUCTION

There exists a lack of knowledge regarding the quantity and quality of scientific yield in relation to cancer in Nigeria. Representation of cancer-related articles in the medical literature has not been well-established. The rapid increase in medical research publications has been facilitated by the development of the internet, integrated search engines, and on-line publishing (Glynn et al., 2010). The two principal repositories for medical research publications are the Web of Science (WoS) (Thompson Reuters), and PubMed (the National Library of Medicine (NLM)); the latter recognised as the most frequently used source for information in the medical field (Falagas et al., 2008).

Bibliometrics is a systematic method for evaluating research output which can help map changes in the interest of a scientific community over time and can provide insights into both qualitative and quantitative research trends. The bibliometrics indicator most commonly used to undertake qualitative analysis is the journal impact factor (IF) which is based on two elements; the numerator, which is the number of citations in the current year to items published in the previous 2 years, and the denominator, which is the number of substantive articles and reviews published in the same 2 years.

The principal focus of this study is to measure the quantity, authors’ productivity, quality and relevance of cancer-related articles. This was achieved by retrieving relevant articles on the subject matter from the Medline database of the National Library of Medicine (NLM), USA, 2008 – 2012 with search terms CANCER AND NIGERIA

AND ("2008/01/01”: "2012/12/31").


 LITERATURE REVIEW

Bibliometrics is derived from two different words: biblion, the Greek word for ‘book’, (which can be any literature) and metrics, the use of mathematical and statistical concepts, models and laws to analyse quantities (Hussain and Fatima, 2011). Thus, bibliometrics is a type of research method that uses quantitative analysis and statistics to describe patterns of publication within a given field or body of literature (Chuang et al., 2011). Janeving (2009) opined that with the bibliometric methods, the evaluation of research productivity for both individuals and institutions is made possible.

Lotka (1926) examined author productivity, i.e. the publication contributions of authors to a given discipline; he later propounded the Law of Scientific Productivity which states that the number of authors who have published a specific number of papers is approximately equal to the inverse square of that number multiplied by the number of authors who have published one paper only.

Nwagwu (2006), while testing the validity of Lotka’s law of productivity on four author categories of Nigeria’s biomedical research for the period 1967 to 2002, namely ‘all authors’, ‘first authors’, ‘non-collaborative authors’ and ‘co-authors’, discovered that it was only the data on the co-author category that did not conform with the law. Furthermore, he posited that many developing countries are yet to utilize bibliometric methods in scientific information gathering. Pulgarin (2012) proved that the two parameters of Lotka's law, the exponent n and the constant c, were influenced by the subject area's productivity and growth, by the type of area, the country, the time period, and the length of that period. A year later, a study by Sivakumar et al. (2013) averred that even though Lotka’s law holds true in many scientific disciplines but same cannot be said about biology literature of central universities in India. In 2014, Kumar (2014) analysed the distribution of productivity of authors in the field of Human Computer Interaction (HCI) research from Science Citation Index-Expanded for 2006 – 2011. In his study, he tested Lotka’s law of authors’ productivity by comparing two different methodologies.  He discovered that literature in the field of HCI research studies does conform to Lotka’s law and concluded that the law is a standardized means of measuring authors’ publication productivity in HCI research. However, Mahmood (2014) studied the exactness of Bibliometric laws through applications of Lotka’s law to library and information science (LIS) and revealed that Lotka’s law was not applicable to Pakistani LIS authors. The results of the study also indicated that majority of authors produced only one item in their entire professional career.

A review of bibliographic information from the field of public health nutrition in West Africa by Grant et al. (2010) concluded that research output from this area was too low, given the magnitude of nutrition problems in the region. Other challenges facing developing countries’ scholars, especially Africa, in the area of scientific research include low participation in international journals, publishing in journals with low impact factors, exclusion from international bibliographic databases, small circulations, inadequate funding, inadequate electronic resources, inadequate review and production staff, and difficulty in maintaining publication frequency (Tanya, 2005; Siegfried et al., 2006; Grant et al., 2010; Ajuwon et al., 2011; van Ejik et al., 2012).

While perusing biomedical literature, a bibliometric analysis of cancer literature is yet to be encountered. Nwagwu (2006) conducted analysis on Nigerian biomedical literature and concluded that research activities in local universities have shrunk over the years due to the several challenges previously mentioned. In the area of diabetes, Harande (2011) examined Nigerian literature between the years 1996 and 2009; he said that even though there is a rapid growth in the publication of diabetes-related literature in the country, there remains a need for all health and health-allied workers to collabo-rate on how to effectively combat the disease. In the area of HIV/AIDS, Uthman (2008) conducted a seminar work which discovered that articles with international collaboration appeared in journals with higher impact factors and also received more citations. Publication trend in biomedical research among African authors has indicated that Nigeria has produced more publications than other sub-Saharan countries excluding South Africa, due to efforts put into peer visibility and career advancement (Owolabi et al., 2007; Uthman and Uthman, 2007; Hofman et al., 2009).


 METHODOLOGY

This research which extracted and analysed publications on cancer within the geographical location Nigeria considering a window period of 5 years (that is 2008 to 2012) was carried out using the Medline database of the National Library of Medicine (NLM), USA. A total of 677 articles by 1854 authors were retrieved using the advanced search and custom settings tool of the database with the keywords CANCER AND NIGERIA AND ("2008/01/01”: "2012/12/31").

The dataset was analysed into the quantity of articles per year as shown in Figure 1. Also, in order to estimate the productivity of authors on the subject, authors were categorised into four groups (Nwagwu, 2006):

a). All authors: Here, authors for each article were extracted and counted. This is in conformity with the author counting method used in bibliometric analysis in which the number of occurrence of an author in the dataset equals the total credit to be awarded to the author.

b). First author: Here, counting was done considering only the first author of each publication where there are more than one authors. This was achieved using the first author column provided by the database.

c). Co-authors: This was done by considering only articles written by more than one authors. Then the authors for these articles were counted.

d). Non-collaborative authors: This was achieved by considering only single-authored articles. Credits were awarded by counting these authors.

 

 

The estimation of the productivity of the literature on this area of interest was done using the Lotka's law ( ) with aid of LOTKA software. This is a computer program for fitting a power law distribution such as Lotka’s law. It basically follows Nicholl’s methodology, using a maximum likelihood approach to estimate parameters, and a Kolmogorov-Smirnov test for goodness-of-fit.

The bibliometric variables used for the analysis are: number of authors, place of publication, year of publication, impact factor, institutional affiliation, number of pages, first authors, co-authors, non-collaborative authors and single-authored paper.

 

Hypotheses

1. All authors’ category distribution on cancer within Medline (2008 – 2012) fits Lotka’s Law.

2.  First author category distribution on cancer within Medline (2008 – 2012) fits Lotka’s Law.

3. Non-collaborative author category distribution on cancer within Medline (2008 – 2012) fits Lotka’s Law.

4. Co-author category distribution on cancer within Medline (2008 – 2012) fits Lotka’s Law

 

Impact factor of journal

The journals in the dataset for this study were ranked in the order of number of publications within the period of consideration. Then, the first ten ranked journals were selected which consist of fourteen (14) journals.  In order to calculate the Impact Factor (IF) of these fourteen (14) journals, the Publish or Perish (PoP) software by Harzing (2007) was used.

The total number of citations that each journal received per year was obtained by summing up the total number of citations obtained by all the articles published in a given journal per year for the entire period of study, i.e. 2008-2012. The same approach used to calculate the impact factor (IF) for ISI-indexed journals was adopted in order to obtain the IF of the 14 journals evaluated in this study.

The ISI (Thomson Reuters, 2009) uses the following formula in calculating a given journal's.

This study considered the number of citations earned by each journal in 2012 from articles published in the respective journals in 2008 to 2011 for purpose of calculating the journal Impact Factor (IF). Having obtained citations received in 2012 and the number of publications in a given journal in 2008, 2009, 2010, 2011 or 2012, the above formula was used to calculate each journal's IF.


 RESULTS

A total of 677 articles (Table 1) by 1854 authors were retrieved for analysis. It was observed that year 2009 witnessed the highest number of articles (162 papers) written on cancer while year 2012 witnessed the lowest number of publications with 109 articles (Figure 1). In order to establish the pattern of the distribution of the dataset, a graph of the four categories of authors was plotted. Figure 2 shows the number of authors against their contributions (x). The reverse j-shape of the curves shows that the distribution of authors in each category is in conformity with the bibliometric distribution (a social distribution).

 

 

 

 

The productivity of researchers on the subject matter was estimated using Lotka’s law by organizing the data as a distribution of articles over authors for each of the categories of authors, as shown in Table 1. From Table 1b, it can be observed that the co-author category has the highest number of authors with 74.20%, which is an indication of high level of collaboration on the subject matter. The non-collaborative author category has the lowest number of authors with 2.78% of the total number of authors, which also confirms the last statement.  While the first author category contributed 23.03%. It should be noted that the non-collaborative author category indicates authors who singly authored an article and that this dataset shows that sole-authorship or single-authorship is not really a common practice on research on cancer.

 

We did not just stop there as we went further to estimate the productivity of the category of author of the dataset using Lotka’s law and also testing whether this dataset can be explained using this law. The outcome is as shown in Table 2 and explained below. We tested for the hypotheses of the productivity pattern of each of the different author categories using Kolmogorov-Smirnov benchmark.

 

 

All authors

This category of authors which consist of 1,843 authors is the total number of authors in the dataset. An examination of the distribution of articles in this category (Table 1b) shows that about 76% of the scientists contributed just one article each in the bibliography while about 13% contributed only two items each, and about 5.3% contributed to three articles each. On the other hand, only about 0.3% of the authors in the bibliography contributed at least 9 items each. When this outcome was tested with the productivity pattern of the Lotka’s law it was found out that this category fits the law with the value α = 2.5973 and k = 76.55% against Lotka’s own benchmark in which k = 60.79.

The Kolmogorov-Smirnov statistic output indicates that the maximum absolute difference, Dmax is less than the critical value at all levels of significance (Table 2 – All authors). This as shown by Dmax = 0.0103 (sig. level (1%) = 0.0379; sig. level (5%) = 0.0316; sig. level (10%) = 0.0283) showing that Lotka’s law is valid in this case. Therefore, we do not reject the null hypothesis and conclude that this category of authors follows Lotka’s inverse power law.

 

First authors

This category of authors  has  473  authors  where  about  76% authors wrote just one article, 15% made 2 contributions and 0.4% made has the highest number of articles which is 6.  From Table 3, we have α = 2.6992 and K = 78.48% (a value higher than Lotkas’ 60.79) where Dmax = 0.0274 is lower that KS = 0.0749, 0.0625 and 0.056 at 1, 5 and 10% levels of significance respectively. We can therefore infer here also that we do not reject the null hypothesis and that this category of authors fits Lotka’s inverse power law.

 

 

Non-collaborative authors

57 authors appear in this category of single author. The result obtained here suggests that this category of authors like the previous two categories also conform to Lotka’s law of author productivity and generalised power inverse law. This as shown in table where α = 3.6709 and K = 89.82%. Since the maximum deviation, Dmax = 0.0316 falls within the critical values KS = 0.2159, 0.1801, 0.1616, 5 and 10% significance level  respect-tively, then we do not reject the null hypothesis and we can conclude that this category conforms with Lotka’s Law.

 

Co-authors

In this category, the total number of authors, 1524 in which about 79% of the authors in this category made a single contribution each while about 12% made 2 contributions each, about 5% made 3 contributions each and more than 4% contributed between 4 and 9 articles. Just as before, we compared this dataset with Lotka’s law and found out that this category also agreed with the law as shown in Table 3 with the parameters Dmax = 0.011, α = 2.621, K = 79.29%, KS = 0.0417, 0.0348, 0.0312 at 1, 5 and 10% level of significance, respectively.

 

Journals and authors’ impact

The impact factors of the fourteen (14) Journals that fall within the 1st ten ranks are as shown in Table 4. Here we considered number of citations earned by each journal in 2012 from paper published in the respective journals in 2008, 2009, 2010, or 2011.

 

 

Psycho-Oncology is ranked first in terms of the impact factor having an impact factor of 12, followed by Nigerian journal of medicine (IF = 3.92) and American Journal of Obstetrics and Gynaecology occupying the third position with IF = 3.64. While, Annals of African Medicine, Pan African Medical Journal (PAMJ) and African Journal of Paediatric Surgery (AJPS) have the lowest impact factor value of 0.71, 0.27 and 0.2 respectively.

Akang E. E. has the highest the number contributions of 20 articles in the dataset, followed closely by Abdulkareem F. B. and Banjo A. A. with 18 and 17 articles respectively. While Mohammed A., Niu Q. and Huo D. show high h-index of 106, 101 and 90 respectively.


 DISCUSSION

The distribution of cancer publications on Nigeria within the period 2008–2012 reflects certain regularities which are often the characteristics of a bibliometric distribution. All the categories of authors used for this study showed such regularities which are abound in the social and biomedical sciences (Nwagwu, 2006). This is reflected in the reverse j-shape of the distribution. Observably, as the number of articles published increases, authors produc-ing that many publications become less frequent.

The findings of this study show that at the various significance levels, all the four categories of authors did conform with the Lotka’s inverse power law at different parameters α = 2.5973, 2.699, 3.6709, 2.621 and k = 76.55%, 78.48%, 89.82%, 79.29% for ‘all authors’, ‘first authors’, ‘non-collaborative’ and co-authors’ respectively. This shows that the productivity coefficient of ‘all authors’ and co-author categories (approximately 2.6) is an indication of a very high productivity of authors in these categories. Also, the research publications on cancer witness high number of collaborative effort as shown by high number of co-authored article when compared to single-authored article.

It will be interesting to note that this study when com-pared with Nwagwu (2006), there is great similarity among the various categories of authors where he conducted a study on the productivity pattern of biomedical authors of Nigeria during 1967–2002, except for the ‘co-authors’ category where he discovered that this category did not conform with the inverse power law.

Since impact factors (IF) are generally perceived to be measures of quality or influence, Psycho-Oncology is of very high quality (IF = 12), followed by Nigerian journal of medicine (IF = 3.92) and American Journal of Obstetrics & Gynaecology (IF = 3.64). Those that are of low quality are the Annals of African Medicine (IF = 0.71), The Pan African Medical Journal (IF = 0.27) and African Journal of Paediatric Surgery (IF = 0.2).


 CONCLUSION

Lotka’s Law of scientific productivity is one of the laws of bibliometrics used to characterise authors’ productivity.  

This study adopted the generalized inverse form of this law in estimating and understanding the productivity pattern of authors’contributions on the disease, cancer, within the window period 2008 – 2012 using the Medline database of the National Library of Medicine (NLM), USA. This study revealed that all the categories of authors conformed with the Lotka’s law and thus applicable to studying authors’ productivity of cancer publications in Nigeria.



 REFERENCES

 

Ajuwon G, Austone L, Raghavan R, Kotzin S, Hoffman K (2011). Assessment of Scholarly Publications of Nigerian Health Sciences Researchers in MEDLINE/Pubmed (1996 – 2007). Sierra Leone J. Biomed. Res. 3(2):89-96.
Crossref

 

 

Chuang K, Chuang Y, Ho M, Ho Y (2011). Bibliometric Analysis of Public Health Research in Africa: The Overall Trend and Regional Comparisons. South Afr. J. Sci. 107 (6):309-315.

 

Falagas ME, Pitsouni EI, Malietzis GA, Pappas G (2008). Comparison of PubMed, Scopus, Web of Science, and Google Scholar: strengths and weaknesses. Faseb J. 22:338-342. doi: 10.1096/fj.07-9492LSF.
Crossref

 

Glynn RW, Chin JZ, Kerin MJ, Sweeney KJ (2010). Representation of Cancer in the Medical Literature - A Bibliometric Analysis. PLoS ONE 5(11):e13902. http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0013902. 10th June, 2013

 

Grant JA, Shelby EW, Kenneth HB (2010). Bibliographic Analysis of Scientific Research on Selected Topics in Public Health Nutrition in West Africa: A Review of Articles Published from 1998 to 2008. Global Public Health 5(51):S42-S57.

 

Harande Y (2011). Exploring the Literature of Diabetes in Nigeria: A Bibliometric Study. Afr. J. Diabetes Med. 19(2).

 

Harzing AW (2007). Publish or Perish. http://www.harzing.com/pop.htm. 6th May, 2013.

 

Hofman KJ, Kanyengo CW, Rapp BA, Kotzin S (2009). Mapping the health research landscape in Sub-Saharan Africa: a study of trends in biomedical publications. J. Med. Libr. Assoc. 97(1):41.
Crossref
PubMed; PubMed Central

 

Hussain A, Fatima N (2011). A bibliometric analysis of the 'Chinese Librarianship: an International Electronic Journal,(2006-2010)'. Chinese Librarianship: an International electronic journal, 31.

 

Janeving B (2009). The Publication Activity of Region VastraGotaland: A Bibliometric Study of an Administrative and Political Swedish Region during the Period 1998 – 2006. Inform. Resour. 14:397.

 

Kumar S (2014). Author productivity in the field of Human Computer Interaction (HCI) research. Annals Libr. Inform. Stud. 16:273-285.

 

Lotka AJ (1926). The Frequency Distribution of Scientific Productivity. J. Washington Acad. Sci. 16:317-323.

 

Naseer MM, Mahmood K (2014). Applicability of Lotka's law to Pakistani LIS authors. Pak. Libr. Inform. Sci. J. 45(2).

 

Nwagwu WE (2006). A Bibliometric Analysis of Productivity Patterns of Biomedical Authors of Nigeria during 1967 – 2002. Scientometrics, 69(2):259-269.
Crossref

 

Owolabi MO, Bower JH, Ogunniyi A (2007). Mapping Africa's Way into Prominence in the Field of Neurology. Archeaol. Neurol. 64:1696-1700.
Crossref
PubMed

 

Pulgarín A (2012). Dependence of Lotka's law parameters on the scientific area.Malaysian J. Libr. Inform. Sci. 17(1):41-50.

 

Siegfried N, Busgeeth K, Certain E (2006). Scope and Geographical Distribution of African Medical Journals Active in 2005. South Afr. Med. J. 96(6):533-538.
PubMed

 

Sivakumar N, Sivaraman P, Tamilselvan N (2013). Application of Lotka's law in biology literature of Central Universities in India. Int. J. Libr. Inform. Sci. 2(1):61-70.

 

Tanya T (2005). Global Collaboration Gives Greater Voice to African Journals. Environ. Health Perspect. 113 (7):A452-A454.
Crossref

 

Uthman AO, Uthman MB (2007). Geography of African Biomedical Publications: An Analysis of 1996 – 2005 PubMed Papers. Int. J. Health Geography. 6(46).
PubMed; PubMed Central

 

Uthman OA (2008). HIV/AIDS in Nigeria: A Bibliometric Analysis. BMC Infectious Diseases 2008. 8(19).
Crossref
PubMed; PubMed Central

 

Van Eijk AM, Hill J, Povall S, Reynolds A, Wong H, TerKuile FO (2012). The Malaria in Pregnancy Library: A Bibliometric Review. Malaria Journal, 2012, 11 (362).

 




          */?>