research article using chi square test

  • Get new issue alerts Get alerts
  • Submit a Manuscript

Secondary Logo

Journal logo.

Colleague's E-mail is Invalid

Your message has been successfully sent to your colleague.

Save my selection

Chi-square Test and its Application in Hypothesis Testing

Rana, Rakesh; Singhal, Richa

Statistical Section, Central Council for Research in Ayurvedic Sciences, Ministry of AYUSH, GOI, New Delhi, India

Address for correspondence: Dr. Richa Singhal, Central Council for Research in Ayurvedic Sciences, Ministry of AYUSH, GOI, New Delhi, India. E-mail: [email protected]

This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.

In medical research, there are studies which often collect data on categorical variables that can be summarized as a series of counts. These counts are commonly arranged in a tabular format known as a contingency table. The chi-square test statistic can be used to evaluate whether there is an association between the rows and columns in a contingency table. More specifically, this statistic can be used to determine whether there is any difference between the study groups in the proportions of the risk factor of interest. Chi-square test and the logic of hypothesis testing were developed by Karl Pearson. This article describes in detail what is a chi-square test, on which type of data it is used, the assumptions associated with its application, how to manually calculate it and how to make use of an online calculator for calculating the Chi-square statistics and its associated P -value.

The logic of hypothesis testing was first invented by Karl Pearson (1857-1936), a renaissance scientist, in Victorian London in 1900. [ 1 ] Pearson's Chi-square distribution and the Chi-square test also known as test for goodness-of-fit and test of independence are his most important contribution to the modern theory of statistics. The importance of Pearson's Chi-square distribution was that, the statisticians could use the statistical methods that did not depend on the normal distribution to interpret the findings. He invented the Chi-square distribution to mainly cater the needs of biologists, economists, and psychologists. His paper in 1900 published in Philosophical magazine elaborates the invention of Chi-square distribution and goodness of fit test. [ 2 , 3 ]

Chi-square test is a nonparametric test used for two specific purpose: (a) To test the hypothesis of no association between two or more groups, population or criteria (i.e. to check independence between two variables); (b) and to test how likely the observed distribution of data fits with the distribution that is expected (i.e., to test the goodness-of-fit). It is used to analyze categorical data (e.g. male or female patients, smokers and non-smokers, etc.), it is not meant to analyze parametric or continuous data (e.g., height measured in centimeters or weight measured in kg, etc.).

For example if we want to test that in a health camp attended by 50 persons the one who exercise regularly are having lesser body mass index (BMI) by taking their actual BMI values, than it cannot be tested using a Chi-square test. However, if we divide the same set of 50 persons into two categories as obese with BMI ≥ 30 and nonobese with BMI < 30, than the same data can be tested using a Chi-square test by counting the number of obese and nonobese persons across two groups, the one who exercise regularly and the one who does not. A 2×2 contingency table also known as cross tables can be constructed for calculating a Chi-square statistic [ Table 1 ].

T1-17

ASSUMPTIONS UNDERLYING A CHI-square TEST

  • The data are randomly drawn from a population
  • The values in the cells are considered adequate when expected counts are not <5 and there are no cells with zero count [ 4 , 5 ]
  • The sample size is sufficiently large. The application of the Chi-square test to a smaller sample could lead to type II error (i.e. accepting the null hypothesis when it is actually false). There is no expected cut-off for the sample size; however, the minimum sample size varies from 20 to 50
  • The variables under consideration must be mutually exclusive. It means that each variable must only be counted once in a particular category and should not be allowed to appear in other category. In other, words no item shall be counted twice.

HOW TO CALCULATE A CHI-square STATISTICS?

The formula for calculating a Chi-square statistic is:

F1-17

O stands for the observed frequency,

E stands for the expected frequency.

Expected count is subtracted from the observed count to find the difference between the two. Then the square of the difference is calculated to get rid of the negative vales (as the squares of 2 and −2 are, of course, both 4). Then the square of the difference is divided by the expected count to normalize bigger and smaller values (because we don't want to get bigger Chi-square values just because we are working on large data sets). The sigma sign in front of them denotes that we have, to sum up, these values calculated for each cell.

As an example, suppose we want to find out that whether there is an association between smoking and lung disease.

The null and alternative hypothesis will be:

H 0 : There is no association between smoking and lung disease.

H 1 : There is an association between smoking and lung disease.

The basic step for calculating a Chi-square test is setting up a 2 × 2 contingency table [ Table 2 ].

T2-17

The general formula for calculating the expected counts from observed count for a particular cell is [(corresponding row total * corresponding column total) /Total no. of patients] [ Table 3 ].

T3-17

Before we proceed further, we need to know how many degrees of freedom (df) we have. When a comparison is made between one sample and another, a simple rule is that the df equals (number of columns − 1) × (number of rows − 1) excluding the rows and column containing the total. Hence, in our example df = (2−1) × (2−1) = 1.

Hypothetical data for calculating the Chi-square test for our example of testing an association between smoking and lung disease is given in Table 4 . Chi-square test can be calculated manually by using the formula described above. Refer [ Table 5 and Table 6 ] for manual calculations. Chi-square value for our example as shown in Table 6 is 3.42, df = 1. If we want to test our hypothesis at 5% level of significance than our predetermined alpha level of significance is 0.05. Looking into the Chi-square distribution table [ Table 7 ] with 1 degree of freedom and reading along the row we find our value of χ 2 (3.42) lies between 2.706 and 3.841. The corresponding probability is between the 0.10 and 0.05 probability levels. That means that the P value is above 0.05 (it is actually 0.065). Since a P value of 0.065 is greater than the conventionally accepted significance level of 0.05 (i.e., P > 0.05) we fail to reject the null hypothesis or in other words we accept our null hypothesis and conclude that there is no association between smoking and lung disease.

T4-17

HOW TO USE A CHI-square DISTRIBUTION TABLE TO APPROXIMATE P VALUE?

Scientists and statisticians use large tables of values to calculate the P value for their experiment. These tables are generally set up with the vertical axis on the left corresponding to df and the horizontal axis on the top corresponding to P value. Use these tables by first finding our df, then reading that row across from the left to the right until we find the first value bigger than our Chi-square value. Look at the corresponding P value at the top of the column. Chi-square distribution tables are available from a variety of sources-they can easily be found online or in science and statistics textbooks.

USING AN ONLINE CHI-square CALCULATOR

The Chi-square statistics and its associated P value can be calculated through online calculators also which are easily available on the internet. For user-friendly online calculator, you may visit this uniform resource locator http://www.socscistatistics.com/tests/chisquare/default2.aspx . Many more online calculators are available on the World Wide Web. The basic step for using an online calculator is to correctly fill in your data into it.

Step by step procedure of using an online calculator is described below:

  • Step 1: For our example of finding an association between smoking and lung disease we have to fill in the observed values in the cells of an online calculator [ Figure 1 ]

F2-17

  • Step 2: Click on the next button. Another screen will pop up as shown in Figure 2

F3-17

  • Step 3: Click on the Calculate Chi^2 button. And you are done with your calculation Output of the Chi-square test will be as shown in Figure 3 .

F4-17

The image above shows the Chi-square value as 3.4177 and its associated P value as 0.0645 which is actually greater than P value of 0.05, hence no significant difference has been observed. To conclude, there is no association between smoking and lung disease.

WHAT DOES A CHI-square TEST TELL AND WHAT IT DOES NOT?

It may be clearly understood that Chi-square test only tells us the probability of independence of a distribution of data or in simple terms it will only test that whether two variables are associated with each other or not. It will not tell us that how closely they are associated. For instance in the above example, the Chi-square test will only tell us that whether there is any relation between smoking and lung disease. It will not tell us that how likely it is, that smokers are prone to lung disease. However, once we got to know that there is a relation between these two variables, we can explore other methods to calculate the amount of association between them.

Source of Support:

Conflict of interest:.

None declared.

  • Cited Here |

Categorical data analysis; Chi-square test; hypothesis testing; online calculator

  • + Favorites
  • View in Gallery

Readers Of this Article Also Read

The story of heart transplantation: from cape town to cape comorin, the odds ratio: principles and applications, how to use medical search engines, tools for placing research in context, t</em>-tests in medical research', 'pandey r. m.', 'journal of the practice of cardiovascular sciences', 'may-aug 2015', '1', '2' , 'p 185-188');" onmouseout="javascript:tooltip_mouseout()" class="ejp-uc__article-title-link">commonly used t -tests in medical research.

  • Open access
  • Published: 30 September 2021

Chi-square test under indeterminacy: an application using pulse count data

  • Muhammad Aslam   ORCID: orcid.org/0000-0003-0644-1950 1  

BMC Medical Research Methodology volume  21 , Article number:  201 ( 2021 ) Cite this article

3014 Accesses

12 Citations

Metrics details

The data obtained from the counting process is known as the count data. In practice, the counting can be done at the same time or the time of the count is not the same. To test either the K counts are differed significantly or not, the Chi-square test for K counts is applied.

The paper presents the Chi-square tests for K counts under neutrosophic statistics. The test statistic of the proposed test when K counts are recorded at the same time and different time are proposed. The testing procedure of the proposed test is explained with the help of pulse count data.

Conclusions

From the analysis of pulse count data, it can be concluded that the proposed test suggests the cardiologists use different treatment methods on patients. In addition, the proposed test gives more information than the traditional test under uncertainty.

Peer Review reports

The data obtained from the counting process is known as the count data. In practice, the counting can be done at the same time or the time of the count is not the same. To test either the K counts are differed significantly or not, the Chi-square test for K counts is applied. This test is applied to test the null hypothesis either the same training or methods should be applied on K counts against the alternative hypothesis that different training or methods should be applied on K counts. The Chi-square test for K counts under classical statistics is applied under the assumption that K counts are obtained under comparable conditions, see [ 1 , 2 ]. Worked on the test for testing two means of Poisson distribution [ 3 , 4 , 5 , 6 , 7 , 8 , 9 , 10 ] presented applications of test for count data in a variety of fields.

According to [ 11 ], “statistical data are frequently not precise numbers but more or less non-precise also called fuzzy. Measurements of continuous variables are always fuzzy to a certain degree”. Similarly, the counting data is not always exact but may be in intervals or unclear. For example, the weather record data and pulse count data are expressed in intervals than the exact values. In these situations, the existing Chi-square test for K counts under classical statistics may mislead the decision-makers. The fuzzy-based tests may be an alternative to being applied when the count data is intervals. The applications of statistical tests under fuzzy logic can be seen in [ 12 , 13 , 14 , 15 , 16 , 17 , 18 , 19 ].

The fuzzy logic is a special case of neutrosophic logic [ 20 , 21 ] showed the efficiency of the neutrosophic logic over the fuzzy logic and analysis based on the interval approach. For the application of neutrosophic logic, the reader may refer to [ 22 , 23 , 24 , 25 , 26 ]. The neutrosophic statistics was proposed using the idea of neutrosophic logic by [ 27 ]. This is a branch of mathematical statistics that provides the presentation, analysis, and inference of neutrosophic, fuzzy, interval, and indeterminate data. Classical statistics was considered a special case of neutrosophic statistics [ 28 , 29 , 30 , 31 , 32 , 33 ] discussed various applications of neutrosophic statistics.

The existing Chi-square test for K counts under classical statistics cannot be applied when the counts are in intervals. In this paper, the design of the Chi-square test for K counts under neutrosophic statistics will be given. We will extend the statistic for counts at the same time and at different times under neutrosophic statistics. The testing of the hypothesis will be given. The application of the proposed test will be given using the pulse counts data. By proposing the test, it is expected that the proposed test will be effective, flexible, informative, and adequate to be applied under uncertainty.

The pulse rate is theoretically is considered a discrete variable. The application of the proposed test is given using the counts of the pulse rate of 11 patients. The first 11 values of data are obtained from [ 34 ] and the next values are generated by simulation. The data is shown in Table  1 . A cardiologist is interested to see either the same treatment should be applied to all patients or not. Therefore, the null hypothesis for this case is that the same treatment method should be applied vs. the alternative hypothesis that different methods of treatment should be applied to all patients. As the pulse counts are noted in the same period of time, therefore, the statistic \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) is given in Eq. ( 1 ) a suitable statistic to apply for testing the given hypothesis. The neutrosophic form of \({\overline{N}}_{iN}\epsilon \left[{\overline{N}}_{iL},{\overline{N}}_{iU}\right]\) using the given data is given as \({\overline{N}}_{iN}=73.54+91.8{I}_{i\overline{N}};{I}_{i\overline{N}}\epsilon \left[\mathrm{0,0.1989}\right]\) .The test statistic \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) for the given data is computed as \({\chi}_{1N}^2=\sum_{i=1}^K\frac{{\left({N}_{iN}-{\overline{N}}_{iN}\right)}^2}{{\overline{N}}_{iN}}=\left[\mathrm{146.97,121.78}\right]\) .The neutrosophic form of the statistic \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) using the data is given as \({\chi}_{1N}^2=146.97-121.78{I}_{\chi_{1N}^2};{I}_{\chi_{1N}^2}\epsilon \left[\mathrm{0,0.2068}\right]\) . The proposed test will be implemented as follows

Step-1: State the null H 0 : all patients should be treated by the same method and alternative hypothesis  H 1 : patients should be treated with different methods.

Step-2: Let α  = 5% and critical values are 34.76 and 67.5.

Step-3: Reject H 0 as the values of \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) fall in the rejection region.

From the study, it can be concluded that cardiologists should use different methods of treatment for the patients.

As mentioned earlier, the neutrosophic form of the proposed test consists of two parts. The first and the second parts presented classical statistics and indeterminate, respectively. The neutrosophic form of \({\chi}_{1N}^2={\chi}_{1L}^2+{\chi}_{1U}^2{I}_{\chi_{1N}^2};{I}_{\chi_{1N}^2}\epsilon \left[{I}_{\chi_{1L}^2},{I}_{\chi_{1U}^2}\right]\) reduces to statistic under classical statistics when \({I}_{\chi_{1L}^2}=0\) . The efficiency of the proposed test will be compared with the existing test in terms of uncertainty measures. The neutrosophic form of \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) of pulse count data is \({\chi}_{1N}^2=146.97-121.78{I}_{\chi_{1N}^2};{I}_{\chi_{1N}^2}\epsilon \left[\mathrm{0,0.2068}\right]\) . When \({I}_{\chi_{1L}^2}=0\) , the value 146.97 presents the existing test statistic. The part \(121.78{I}_{\chi_{1N}^2}\) is an indeterminate part and \({I}_{\chi_{1U}^2}\) = 0.2068 is the uncertainty measure associated with statistic \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) . From the neutrosophic form, it can be seen that the proposed test statistic can be expressed in interval rather than the exact value. Under uncertainty, the value of the test statistic is from 146.97 to 121.78. From this study, it can be seen that the proposed test gives the results in the indeterminate interval that is expecting under uncertainty. On the other, the proposed statistic gives information about indeterminacy. Under an indeterminate environment, the proposed test has the interpretation like: when  α  = 5%, the probability of committing a type-I error is 0.05, the probability of accepting H 0 is 0.95 and the chance of indeterminacy about the accepting or rejecting H 0 is 0.2068. Let β is the probability of rejecting H 0 when it is true. To study the power of test (1 −  β ) for the proposed test and the existing test, various values of the level of significance α are considered. The neutrosophic data is generated from 45 to 55 and the values of \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) are computed and compared with the critical values at various levels of  α . The probability of rejecting H 0 when it is true ( β ) is calculated and used to calculate the power of the test (1 −  β ). The values of (1 −  β ) at various values of α are shown in Table  2 and plotted in Fig.  1 . From Table 1 , it can be seen that as the value of α increases from 0.1 to 0.99, the power of the test also increases. Figure  1 clearly indicates that the power curve of the proposed test is higher than the power curve of the existing test. The comparative study shows that the proposed test is efficient, revealing, and stretchy than the existing test. In addition, the proposed test provides higher values of power of the test as compared to the existing test.

figure 1

Power curves of the proposed and existing tests at various values of α

The existing test for K counts under classical statistics is applied under the assumption that the data of counts must be noted under comparable conditions. The existing test to investigate the existing test between K counts can be applied only when the counts are exact, precise, and clear. In this section, the proposed test for K counts will be introduced when the count data is in indeterminate intervals, unclear and vagueness. The proposed test will be designed when the time of counts is equal and not equal. The method of the proposed test when times of counts are equal is designed first. Let N iN  =  N iL  +  N iU I iN ; I iN ϵ [ I iL ,  I iU ] be neutrosophic counts at ith time, where N iL presents exact counts, N iU I iN presents inexact or indeterminate counts and I iN ϵ [ I iL ,  I iU ] be a measure of indeterminacy associated with the counts. Suppose that \({\overline{N}}_{iN}={\overline{N}}_{iL}+{\overline{N}}_{iU}{I}_{i\overline{N}};{I}_{i\overline{N}}\epsilon \left[{I}_{i\overline{L}},{I}_{i\overline{U}}\right]\) be a neutrosophic average of K counts, where \({\overline{N}}_{iL}\) and \({\overline{N}}_{iU}{I}_{i\overline{N}}\) are the determined and indeterminate part of neutrosophic average and \({I}_{i\overline{N}}\epsilon \left[{I}_{i\overline{L}},{I}_{i\overline{U}}\right]\) be the measure of indeterminacy associated with the neutrosophic average. The proposed test will be applied for testing the null hypothesis H 0  :  N iN = constant, when  i  = 1, 2, 3, …, K . The test statistic under neutrosophic statistics say \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) can be written as follows.

The neutrosophic form of the proposed test \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) can be written as.

Note that the proposed statistic \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) is the extension of the test statistic under classical statistics. The proposed test statistic \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) reduces to classical statistic \({\chi}_{1L}^2\) when \({I}_{\chi_{1L}^2}=0\) . The second part \({\chi}_U^2{I}_{\chi_{1N}^2}\) presents the indeterminate part and \({I}_{\chi_{1N}^2}\epsilon \left[{I}_{\chi_{1L}^2},{I}_{\chi_{1U}^2}\right]\) is the measure of uncertainty.

Suppose now that the time to record ith neutrosophic count is  t i . The test statistic, say \({\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]\) under neutrosophic statistics for this case is given by.

where \({\overline{R}}_N=\sum {N}_{iL}/\sum {t}_{iL}+\sum {N}_{iU}/\sum {t}_{iU}{I}_{{\overline{R}}_N};{I}_{{\overline{R}}_N}\epsilon \left[{I}_{{\overline{R}}_L},{I}_{{\overline{R}}_U}\right]\) and \({I}_{{\overline{R}}_N}\epsilon \left[{I}_{{\overline{R}}_L},{I}_{{\overline{R}}_U}\right]\) is a measure of indeterminacy. The neutrosophic form of test statistic \({\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]\) is expressed as follows.

Note that the proposed statistic \({\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]\) is the extension of the test statistic under classical statistics. The proposed test statistic \({\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{1U}^2\right]\) reduces to classical statistic \({\chi}_{2L}^2\) when \({I}_{\chi_{2L}^2}=0\) . The second part \({\chi}_U^2{I}_{\chi_{2N}^2}\) presents the indeterminate part and \({I}_{\chi_{2N}^2}\epsilon \left[{I}_{\chi_{2L}^2},{I}_{\chi_{2U}^2}\right]\) is the measure of uncertainty. The proposed test will be implemented in the following steps

Step-1: State the null H 0 and alternative hypothesis  H 1 .

Step-2: State the level of significance  α and decide about the critical region using the Chi-square table.

Step-3: Reject H 0 if \({\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]\) or \({\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]\) falls in the rejection area, otherwise accept H 1 .

In this paper the Chi-square tests for K counts under neutrosophic statistics was presented. The test statistic of the proposed test when K counts were recorded at the same time and different times was proposed. The proposed test was the modified version of the existing test for K counts. The testing of the hypothesis procedure was explained with the help of a real example. From the pulse count data, it is concluded that the proposed test is effective to apply in uncertainty. In addition, the proposed test provides higher values of the power of the test. The proposed test guides the cardiologists to apply different treatment methods for patients. The proposed test using big data can be extended us future research.

Availability of data and materials

All data generated or analysed during this study are included in this published article.

Kanji GK. 100 statistical tests. London: Sage Publications; 2006. https://doi.org/10.4135/9781849208499 .

Krishnamoorthy K, Thomson J. A more powerful test for comparing two Poisson means. J Stat Plan Inference. 2004;119:23–35.

Article   Google Scholar  

Hilbe JM. The statistical analysis of count data/El análisis estadístico de los datos de recuento. Cult Educ. 2017;29:409–60.

Puig P, Weiß CH. Some goodness-of-fit tests for the Poisson distribution with applications in Biodosimetry. Comput Stat Data Anal. 2020;144:106878.

White GC, Bennetts RE. Analysis of frequency count data using the negative binomial distribution. Ecology. 1996;77:2549–57.

Coxe S, West SG, Aiken LS. The analysis of count data: a gentle introduction to Poisson regression and its alternatives. J Pers Assess. 2009;91:121–36.

Salinas-Rodriguez A, Manrique-Espinoza B, Sosa-Rubi SG. Statistical analysis for count data: use of healthcare services applications. Salud Publica de Mexico. 2009;51:397–406.

Pham TV, Jimenez CR. An accurate paired sample test for count data. Bioinformatics. 2012;28:i596–602.

Article   CAS   Google Scholar  

Hawinkel S, Rayner J, Bijnens L, Thas O. Sequence count data are poorly fit by the negative binomial distribution. PLoS One. 2020;15:e0224909.

Böhning, D. & Sangnawakij, P. Count outcome meta-analysis for comparing treatments by fusing mixed data sources: comparing interventions using across report information. AStA Adv Stat Anal. 2020;1–11.

Viertl R. Univariate statistical analysis with fuzzy data. Comput Stat Data Anal. 2006;51:133–47.

Filzmoser P, Viertl R. Testing hypotheses with fuzzy data: the fuzzy p-value. Metrika. 2004;59:21–9.

Tsai C-C, Chen C-C. Tests of quality characteristics of two populations using paired fuzzy sample differences. Int J Adv Manuf Technol. 2006;27:574–9.

Taheri SM, Arefi M. Testing fuzzy hypotheses based on fuzzy test statistic. Soft Comput. 2009;13:617–25.

Jamkhaneh, E. B. & Ghara, A. N. in 2010 International Conference on Intelligent Computing and Cognitive Informatics . 86–89 (IEEE).

Chachi, J., Taheri, S. M. & Viertl, R. Testing statistical hypotheses based on fuzzy confidence intervals. Aust J Stat. 2012;41, 267–286–267–286.

Kalpanapriya D, Pandian P. Statistical hypotheses testing with imprecise data. Appl Math Sci. 2012;6:5285–92.

Google Scholar  

Montenegro, M., Casals, M. A. R., Lubiano, M. a. A. & Gil, M. a. A. Two-sample hypothesis tests of means of a fuzzy random variable. Inf Sci. 2001; 133:89–100.

Park S, Lee S-J, Jun S. Patent big data analysis using fuzzy learning. Int J Fuzzy Syst. 2017;19:1158–67.

Smarandache F. Neutrosophy. Neutrosophic probability, set, and logic, ProQuest Information & Learning. Ann Arbor. 1998;105:118–23.

Smarandache, F. Introduction to neutrosophic measure, neutrosophic integral, and neutrosophic probability. Infinite Study, 2013.

Broumi, S. & Smarandache, F. in Applied Mechanics and Materials . Trans Tech Publ 511–517.

Guo Y, Sengur A. NCM: Neutrosophic c-means clustering algorithm. Pattern Recogn. 2015;48:2710–24.

Broumi, S., Bakali, A., Talea, M. & Smarandache, F. Bipolar neutrosophic minimum spanning tree. Infinite Study, 2018.

Abdel-Baset M, Chang V, Gamal A. Evaluation of the green supply chain management practices: a novel neutrosophic approach. Comput Ind. 2019;108:210–20.

Abdel-Basset M, Mohamed M, Elhoseny M, Chiclana F, Zaied AE-NH. Cosine similarity measures of bipolar neutrosophic set for diagnosis of bipolar disorder diseases. Artif Intell Med. 2019;101:101735.

Smarandache F. Introduction to neutrosophic statistics. Infinite Study, 2014. https://doi.org/10.13140/2.1.2780.1289 .

Chen J, Ye J, Du S. Scale effect and anisotropy analyzed for neutrosophic numbers of rock joint roughness coefficient based on neutrosophic statistics. Symmetry. 2017;9:208.

Chen J, Ye J, Du S, Yong R. Expressions of rock joint roughness coefficient using neutrosophic interval statistical numbers. Symmetry. 2017;9:123.

Aslam, M. Neutrosophic analysis of variance: application to university students. Complex Intelligent Systems. 2019;1–5.

Aslam M. Neutrosophic analysis of variance: application to university students. Complex Intelligent Syst. 2019;5:403–7.

Aslam M, Albassam M. Application of Neutrosophic logic to evaluate correlation between prostate Cancer mortality and dietary fat assumption. Symmetry. 2019;11:330.

Aslam M. A new method to analyze rock joint roughness coefficient based on neutrosophic statistics. Measurement. 2019;146:65–71.

Gioia F, Lauro CN. Basic statistical methods for interval data. Statistica Applicata. 2005;17:75–104.

Download references

Acknowledgements

We are thankful to the editor and reviewers for their valuable suggestions to improve the quality of the paper.

Author information

Authors and affiliations.

Department of Statistics, Faculty of Science, King Abdulaziz University, Jeddah, 21551, Saudi Arabia

Muhammad Aslam

You can also search for this author in PubMed   Google Scholar

Contributions

MA wrote the paper. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Muhammad Aslam .

Ethics declarations

Ethics approval and consent to participate, consent for publication, competing interests, additional information, publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article.

Aslam, M. Chi-square test under indeterminacy: an application using pulse count data. BMC Med Res Methodol 21 , 201 (2021). https://doi.org/10.1186/s12874-021-01400-z

Download citation

Received : 02 August 2021

Accepted : 15 September 2021

Published : 30 September 2021

DOI : https://doi.org/10.1186/s12874-021-01400-z

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Chi-square test
  • Pulse count
  • Classical statistics
  • Uncertainty

BMC Medical Research Methodology

ISSN: 1471-2288

research article using chi square test

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • Dtsch Arztebl Int
  • v.107(19); 2010 May

Choosing Statistical Tests

Jean-baptist du prel.

1 Institut für Epidemiologie, Universität Ulm

Bernd Röhrig

2 Medizinischer Dienst der Krankenversicherung Rheinland-Pfalz (MDK), Referat Rehabilitation/Biometrie

Gerhard Hommel

3 Institut für Medizinische Biometrie, Epidemiologie und Informatik (IMBEI) Universitätsmedizin Mainz

Maria Blettner

The interpretation of scientific articles often requires an understanding of the methods of inferential statistics. This article informs the reader about frequently used statistical tests and their correct application.

The most commonly used statistical tests were identified through a selective literature search on the methodology of medical research publications. These tests are discussed in this article, along with a selection of other standard methods of inferential statistics.

Results and conclusions

Readers who are acquainted not just with descriptive methods, but also with Pearson’s chi-square test, Fisher’s exact test, and Student’s t test will be able to interpret a large proportion of medical research articles. Criteria are presented for choosing the proper statistical test to be used out of the most frequently applied tests. An algorithm and a table are provided to facilitate the selection of the appropriate test.

Medical knowledge is increasingly based on empirical studies and the results of these are usually presented and analyzed with statistical methods. It is therefore an advantage for any physician if he/she is familiar with the frequently used statistical tests, as this is the only way he or she can evaluate the statistical methods in scientific publications and thus correctly interpret their findings. The present article will therefore discuss frequently used statistical tests for different scales of measurement and types of samples. Advice will be presented for selecting statistical tests—on the basis of very simple cases.

Statistical tests used frequently in medical studies

In order to assess which statistical tests are most often used in medical publications, 1828 publications were taken from six medical journals in general medicine, obstetrics and gynecology, or emergency medicine. The result showed that a reader who is familiar with descriptive statistics, Pearson’s chi-square test, Fisher’s exact test and the t-test, should be capable of correctly interpreting the statistics in at least 70% of the articles ( 1 ). This confirmed earlier studies on frequently used statistical tests in medical scientific literature ( 2 , 3 ). There have however been changes over time in the spectrum of the tests used. A survey of the analytical statistical procedures used in publications of the journal Pediatrics in the first six months of 2005 found that the proportion of inferential methods had increased from 48 to 89% between 1982 and 2005 ( 4 ). There was also a trend towards more complex test procedures. Nevertheless, here too, the most frequent tests were the t-test, the chi-square test, and Fisher’s exact test. This article will accordingly discuss these tests and their proper application, together with other important statistical tests. If the reader is familiar with this limited number of tests, he/she will be capable of interpreting a large proportion of medical publications. Information about the rarer statistical tests can be found in the corresponding articles, in advanced literature ( 5 – 7 ), or by consulting an experienced statistician.

What is the purpose of statistical tests?

Clinical studies [for example, [ 5 , 8 ]) often compare the efficacy of a new preparation in a study group with the efficacy of an established preparation, or a placebo, in a control group. Aside from a pure description ( 9 ), we would like to know whether the observed differences between the treatment groups are just random or are really present. This is because differences can be due to chance variability (scatter) in a parameter, such as the success of the treatment in the study group.

If a scientific question is to be examined by comparing two or more groups, one can perform a statistical test. This means that a null hypothesis must be formulated, which can in principle be rejected. Moreover, a suitable test parameter must be identified ( 10 , 11 ).

For example, a clinical study might investigate whether an antihypertensive drug works better than placebo. The test variable may then be the reduction in diastolic blood pressure, calculated from the mean difference in blood pressure between the active treatment and placebo groups. The null hypothesis is then: “There is no difference between the active treatment and the placebo with respect to antihypertensive activity” (effect = 0).

A statistical test then calculates the probability of obtaining the observed data (or even more extreme data), if the null hypothesis is correct. A small p-value means that this probability is slight. The null hypothesis is rejected if the p-value is less than a level of significance which has been defined in advance. A test variable (test statistic) is calculated from the observed data and this forms the basis of the statistical test. In our case, this might be the difference in mean blood pressure after six months. If specific assumptions are made about the distribution of the data (for example, normal distribution), the theoretical (expected) distribution of the test variable can be calculated.

The value of the test variable calculated from the observations is then compared with the distribution expected if the null hypothesis were correct ( 5 ). If this value is greater or less than a specific limit, it is unlikely that the null hypothesis is correct and the null hypothesis is accordingly rejected. The result is then “statistically significant at the level α”. The statistical test is thus a decision whether the observed value can be explained by chance, or whether it is greater than chance (statistically significant). The terms “level of significance” and the principle of the interpretation of p-values have already been discussed ( 10 , 11 ). The underlying steps in a statistical test are shown once again in the Box .

Steps in a statistical test

  • Statement of the question to be answered by the study
  • Formulation of the null and alternative hypotheses
  • Decision for a suitable statistical test
  • Specification of the level of significance (for example, 0.05)
  • Performance of the statistical test analysis: calculation of the p-value
  • p<0.05 leads to rejection of the null hypothesis and acceptance of the alternative hypothesis
  • p≥0.05 leads to retention of the null hypothesis
  • Interpretation of the test result

It is possible to be mistaken, either in the rejection or in the retention of the null hypothesis. The reason for this is that the values exhibit scatter, as, for example, not all patients react equally to a drug. An “error of the first type” is the mistaken rejection of the null hypothesis; the maximal probability of this error is the level of significance α. This is often chosen to be 5% ( 10 , 11 ). An “error of the second type” is the mistaken retention of the null hypothesis; the probability of this is ß, which is the same as 1 minus the power of the study. The power of the study is specified before the study starts and depends on the sample size, as well as other factors. A power of 80% is often selected ( 10 , 11 ).

Important steps in the decision for a statistical test

The decision for a statistical test is based on the scientific question to be answered, the data structure, and the study design. Before the data are recorded and the statistical test is selected, the question to be answered and the null hypothesis must be formulated. The test and the level of significance must be specified in the study protocol before the study is performed. It must be decided whether the test should be one-tailed or two-tailed. If the test is two-tailed, this means that the direction of the expected difference is unclear. One does not know whether there is a difference between the new drug and placebo with respect to efficacy. It is unclear in which direction the difference may be. (The new drug might even work less well than the placebo). A one-tailed test should only be performed when there is clear evidence that the intervention should only act in one direction.

The outcome variable (endpoint) is defined at the same time the question to be answered is formulated. Two criteria are decisive for the selection of the statistical test:

  • The scale of measurement of the test variable (continuous, binary, categorical)
  • The type of study design (paired or unpaired).

Scales of measurement: continuous, categorical, or binary

The different scales of measurement have already been discussed in the articles on study design and descriptive statistics, under the selection of suitable measures and methods of illustration ( 9 , 12 ).

For example, in the comparison of two antihypertensives, the endpoint can be the antihypertensive activity in the two treatment groups. The reduction in blood pressure is a continuous endpoint. It is also necessary to distinguish whether a continuous endpoint is (approximately) normally distributed or not.

If however one only considers whether the diastolic blood pressure falls under 90 mm Hg or not, the endpoint is then categorical. It is even binary, as there are only two possibilities. If there is a meaningful sequence in the categorical endpoints, this can be described as an “ordinal endpoint.”

Paired and unpaired study designs

A statistical test is used to compare the results of the endpoint under different test conditions (such as treatments). There are often two therapies.

If results can be obtained for each patient under all experimental conditions, the study design is paired (dependent). For example, two times of measurement may be compared, or the two groups may be paired with respect to other characteristics.

Typical examples of pairs are studies performed on one eye or on one arm of the same person. Typical paired designs include comparisons before and after treatment. “Matched pairs,” for example in case-control studies, are a special case. This involves selecting persons from one group with the same specified characteristics as persons in another group. The data are then no longer independent and should be treated as if they were paired observations from one group ( 5 ).

With an unpaired or independent study design, results for each patient are only available under a single set of conditions. The results of two (or more) groups are then compared. There may be differences in the sizes of the groups.

Common statistical tests

The most important statistical tests are listed in the Table . A distinction is always made between “categorical or continuous” and “paired or unpaired.” If the endpoint is continuous, normal and non-normal distributions are distinguished ( Table ).

Fisher’s exact testSuitable for binary data in unpaired samples: the 2 x 2 table is used to compare treatment effects or the frequencies of side effects in two treatment groups
Chi-square testSimilar to Fisher’s exact test (albeit less precise). Can also compare more than two groups or more than two categories of the outcome variable. Preconditions: sample size >ca. 60. Expected number in each field ≥5.
McNemar testPreconditions similar to those for Fisher’s exact test, but for paired samples
Student’s t-testTest for continuous data. Investigates whether the expected values for two groups are the same, assuming that the data are normally distributed. The test can be used for paired or unpaired groups.
Analysis of varianceTest preconditions as for the unpaired t-test, for comparison of more than two groups. The methods of analysis of variance are also used to compare more than two paired groups.
Wilcoxon’s rank sum test (also known as the unpaired Wilcoxon rank sum test or the Mann-Whitney U test)Test for ordinal or continuous data. In contrast to Student’s t-test, does not require the data to be normally distributed. This test too can be used for paired or unpaired data.
Kruskal-Wallis testTest preconditions as for the unpaired Wilcoxon rank sum test for comparing more than two groups
Friedman testComparison of more than two paired samples, at least ordinally scaled data
Log rank testTest of survival time analysis to compare two or more independent groups
Pearson correlation testTests whether two continuous normally distributed variables exhibit linear correlation
Spearman correlation testTests whether there is a monotonous relationship between two continuous, or at least ordinal, variables

Group comparison of two categorical endpoints

The group comparison for two categorical endpoints is illustrated here with the simplest case of a 2 x 2 table (four field table) ( Figure 1 ). However, the procedure is similar for the group comparison of categorical endpoints with multiple values ( Table ).

An external file that holds a picture, illustration, etc.
Object name is Dtsch_Arztebl_Int-107-0343_001.jpg

Test selection for group comparison with two categorical endpoints;

*1 Preconditions: sample size

>ca. 60. Expected number in each field ≥5

If the frequency of success in two treatment groups is to be compared, Fisher’s exact test is the correct statistical test, particularly with small samples ( 7 ). For large samples (about n >60), the chi-square test can also be used ( Table ).

One example of the use of this test would be an intervention within a group at two anatomical sites, such as the implantation of two different sorts of IOL lenses in the right and left eyes, with the endpoint “Operation successful: yes or no.” The samples to be compared are paired. In such a case, one has to perform the McNemar test ( 7 ).

Continuous and at least ordinally scaled variables

Figure 2 shows a decision algorithm for test selection.

An external file that holds a picture, illustration, etc.
Object name is Dtsch_Arztebl_Int-107-0343_002.jpg

Algorithm for test selection for group comparison of a continuous endpoint

Normally distributed variables—parametric tests: So-called parametric tests can be used if the endpoint is normally distributed.

Where subjects in both groups are independent of each other (persons in first group are different from those in second group), and the parameters are normally distributed and continuous, the unpaired t-test is used. If a comparison is to be made of a normally distributed continuous parameter in more than two independent (unpaired) groups, analysis of variance (ANOVA) can be used. One example would be a study with three or more treatment arms. ANOVA is a generalization of the unpaired t-test. ANOVA only informs you whether the groups differ, but does not say which groups. This requires methods of multiple testing ( 11 ).

The paired t-test is used for normally distributed continuous parameters in two paired groups. If a normally distributed continuous parameter is compared in more than two paired groups, methods based on analysis of variance are also suitable. The factor describes the paired groups—for example, more than two points of measurement in the use of a therapy.

Non-normally distributed variables—non-parametric tests: If the parameter of interest is not normally distributed, but at least ordinally scaled, non-parametric statistical tests are used. One of these tests (the “rank test”) is not directly based on the observed values, but on the resulting rank numbers. This necessitates putting the values in order of size and giving them a running number. The test variable is then calculated from these rank numbers. If the necessary preconditions are fulfilled, parametric tests are more powerful than non-parametric tests. However, the power of parametric tests may sink drastically if the conditions are not fulfilled.

The Mann-Whitney U test (also known as the Wilcoxon rank sum test) can be used for the comparison of a non-normally distributed, but at least ordinally scaled, parameter in two unpaired samples ( 5 ). If more than two unpaired samples are to be compared, the Kruskal-Wallis test can be used as a generalization of the Mann-Whitney U test ( 13 ).

The Wilcoxon signed rank test can be used for the comparison of two paired samples of non-normally distributed, but at least ordinally scaled, parameters ( 13 ). Alternatively, the sign test should be used when the two values are only distinguished on a binary scale—for example, improvement versus deterioration ( 7 ). If more than two paired samples are being compared, the Friedman test can be used as a generalization of the sign test.

Other test procedures

Survival time analysis.

If the point of interest is not the endpoint itself, but the time till it is reached, survival time analysis is the most suitable procedure. This compares two or more groups with respect to the time when an endpoint is reached (within the period of observation) ( 13 ). One example is the comparison of the survival time of two groups of cancer patients given different therapies. The endpoint here is death, although it could just as well be the occurrence of metastases. In contrast to the previous tests, it almost never happens that all subjects reach the endpoint in survival time analysis, as the period of observation is limited. For this reason, the data are also described as (right) censored, as it is still unclear when all subjects will reach the endpoint when the study ends. The log rank test is the usual statistical test for the comparison of the survival functions between two groups. A formula is used to calculate the test variable from the observed and the expected numbers of events. This value can be compared with the known distribution which would have been expected if the null hypothesis were correct—the chi-square distribution in this case. A p-value can thus be calculated. A rule can then be given for deciding for or against the null hypothesis.

Correlation analysis

Correlation analysis examines the strength of the correlation between two test variables, for example, the strength of the correlation between the body weight of a neonate and its body length. The selection of a suitable measure of association depends on the scale of measurement and the distribution of the two parameters. The parametric variant (Pearson correlation coefficient) exclusively tests for a linear correlation between continuous parameters. On the other hand, the non-parametric variant—the Spearman correlation coefficient—solely tests for monotonous relationships for at least ordinally scaled parameters. The advantages of the latter are its robustness to outliers and skew distributions. Correlation coefficients measure the strength of association and can have values between –1 and +1. The closer they are to 1, the stronger is the association. A test variable and a statistical test can be constructed from the correlation coefficient. The null hypothesis to be tested is then that there is no linear (or monotonous) correlation.

The null hypotheses for these statistical tests described in this article are that the groups are equal. These commonly used tests are also known as “inequality tests”. There are however other types of test. “Trend tests” examine whether there is a tendency for increasing or decreasing values in at least three groups. There are also “superiority tests”, “non-inferiority tests,” and “equivalence tests.” For example, a superiority test examines whether an expensive new drug is better than the conventional standard medication by a specific and medically relevant difference. A non-inferiority test might examine whether a cheaper new medicine is not much worse than a conventional medicine. The acceptable level of activity is specified before the start of the study on the basis of expert medical knowledge. An equivalence test is intended to show that a medication has approximately the same activity as a conventional standard medication. The advantages of the new medication might be simpler administration, fewer side effects, or a lower price.

The methods of regression analysis and the related statistical tests will be discussed in more detail in the course of this series on the evaluation of scientific publications.

The present selection of statistical tests is obviously incomplete. Our intention has been to make it clear that the selection of a suitable test procedure is based on criteria such as the scale of measurement of the endpoint and its underlying distribution. We would like to recommend Altman’s book ( 5 ) to the interested reader as a practical guide. Bortz et al. ( 7 ) present a comprehensive overview of non-parametric tests (in German).

The selection of the statistical test before the study begins ensures that the study results do not influence the test selection. Moreover, the necessary sample size depends on the test procedure selected. Problems in planning sample size will be discussed in more detail later in this series.

Finally, the point must be made that a statistical test is not necessary for every study. Statistical testing can be dispensed with in purely descriptive studies ( 12 ) or when the interrelationships are based on scientific plausibility or logical arguments. Statistical tests are also usually not helpful when investigating the quality of a diagnostic test procedure or rater agreement (for example, in the form of a Bland-Altman diagram) ( 14 ). Because of the probability of error, statistical tests should be used “as often as necessary, but as little as possible.” The risk of purely chance results is especially high with multiple testing ( 11 ).

Acknowledgments

Translated from the original German by Rodney A. Yeates, M.A., Ph.D.

Conflict of interest statement

The authors declare that no conflict of interest exists according to the guidelines of the International Committee of Medical Journal Editors.

  • Research article
  • Open access
  • Published: 19 September 2005

Aerobic exercise and its impact on musculoskeletal pain in older adults: a 14 year prospective, longitudinal study

  • Bonnie Bruce 1 ,
  • James F Fries 1 &
  • Deborah P Lubeck 2  

Arthritis Research & Therapy volume  7 , Article number:  R1263 ( 2005 ) Cite this article

41k Accesses

42 Citations

65 Altmetric

Metrics details

We studied the long term impact of running and other aerobic exercise on musculoskeletal pain in a cohort of healthy aging male and female seniors who had been followed for 14 years. We conducted a prospective, longitudinal study in 866 Runners' Association members (n = 492) and community controls (n = 374). Subjects were also categorized as Ever-Runners (n = 565) and Never-Runners (n = 301) to include runners who had stopped running. Pain was the primary outcome measure and was assessed in annual surveys on a double-anchored visual analogue scale (0 to 100; 0 = no pain). Baseline differences between Runners' Association members and community controls and between Ever-Runners versus Never-Runners were compared using chi-square and t-tests. Statistical adjustments for age, body mass index (BMI), gender, health behaviors, history of arthritis and comorbid conditions were performed using generalized estimating equations. Runner's Association members were younger (62 versus 65 years, p < 0.05), had a lower BMI (22.9 versus 24.2, p < 0.05), and less arthritis (35% versus 41%, p > 0.05) than community controls. Runners' Association members averaged far more exercise minutes per week (314 versus 123, p < 0.05) and miles run per week (26 versus 2, p < 0.05) and tended to report more fractures (53% versus 47%, p > 0.05) than controls. Ever-Runners were younger (62 versus 66 years, p < 0.05), had lower BMI (23.0 versus 24.3, p < 0.05), and less arthritis (35% versus 43%, p < 0.05) than Never-Runners. Ever-Runners averaged more exercise minutes per week (291 versus 120, p < 0.05) and miles run per week (23 versus 1, p < 0.05) and reported a few more fractures (52% versus 48%, p > 0.05) than Never-Runners. Exercise was associated with significantly lower pain scores over time in the Runners' Association group after adjusting for gender, baseline BMI, and study attrition (p < 0.01). Similar differences were observed for Ever-Runners versus Never-Runners. Consistent exercise patterns over the long term in physically active seniors are associated with about 25% less musculoskeletal pain than reported by more sedentary controls, either by calendar year or by cumulative area-under-the-curve pain over average ages of 62 to 76 years.

Introduction

The prevalence of older adults in the United States is growing at a substantial rate. By 2030, nearly one-fifth of Americans will be in their sixties or older [ 1 ], which will have a considerable impact on public health. Numerous epidemiological and clinical studies have established that older adults who participate in regular physical activity are healthier and have a better quality of life than those who are inactive [ 2 – 4 ]. Regular exercise has also been shown to reduce pain in patients with knee osteoarthritis [ 5 , 6 ] and to help prevent mechanical low back pain [ 7 ]. In contrast, inactivity has been associated with greater pain with injury and has been associated with lower bone density and muscle tone [ 8 ]. On the other hand, some aerobic activities, such as running, have been found to result in increased risk for stress or other fractures [ 9 , 10 ]. Recurring trauma to soft tissue resulting from excessive physical activity conceivably could increase pain and disability [ 11 ]. Few studies have addressed the relationship between aerobic exercise and the perception of pain with advancing age.

To study the effect of exercise on disability and pain, our group [ 10 ] had investigated the relationship of running and its impact on musculoskeletal pain and disability in cohorts of Runners' Association members and community controls and Ever-Runners and Never-Runners who were followed prospectively for six years. In that study, no increase in joint pain or stiffness with age was observed in subjects who exercised often and intensely compared with their more sedentary counterparts. Pain was reduced, however, at all time points by about 25% in the exercising group. In fact, there was a slight decrease in pain for women who exercised over time.

In this investigation, we have extended that research in those cohorts. We have evaluated the association of vigorous physical activity with pain with advancing age after 14 years of follow-up. We hypothesized that those who regularly participated in running or other aerobic activity would report less musculoskeletal pain rather than more over the long term than did their inactive counterparts.

Materials and methods

Sample selection and data collection methodology have been detailed previously [ 10 ]. Subjects were drawn from two groups: the Fifty-Plus Runners' Association with members across the United States and a Stanford University community-based random sample from the Lipid Research Clinics Study (community controls) which provided access to a sample that was similar in age to the Runners' Association. In this analysis, all subjects with at least two annual questionnaires were included. A total of 961 men and women (538 Runners' Association members and 423 community controls) who met eligibility criteria of being at least 50 years old, had at least a high school education, and used English as their primary language were initially enrolled in 1984. A major re-recruitment effort in 1991 targeted subjects who had dropped out in the first years of the study. To attenuate self-selection bias and exercise effects due to the exercisers among the community controls, we also created groups of Ever-Runners and Never-Runners based on responses to the question at baseline: "Have you ever run for exercise for a period greater than one month?" The study was approved by the Stanford University Investigational Review Board, and each subject gave their informed consent.

Data collection

Each subject completed annual, mailed health assessment questionnaires [ 12 , 13 ]. The questionnaire includes items on medical history, health status, exercise habits, history of musculoskeletal injuries, health care utilization, and demographic variables, such as height and weight, smoking, and alcohol use.

Assessment of physical activity

Physical activity data were obtained from responses to the question: "How many minutes each week do you exercise vigorously (vigorous exercise will cause you to sweat, and your pulse, if taken, will be above 120). Include periods of rapid walking at work and in daily activities." Subjects indicated their participation in running, jogging, swimming, bicycling/stationary bike, aerobic dance/exercises, stair steppers, brisk walking, hiking/treadmill, racket sports, and other.

Assessment of pain

Annually since 1987, pain was assessed using a visual analog scale (VAS) where 0 = no pain and 100 = worst pain. From 1987 through 1989, subjects responded to the question "How bad has pain or stiffness been in the past week?" and marked their response on the VAS. In 1987, the VAS anchors were: 0 = no pain or stiffness; and 100 = very severe pain or stiffness. In 1988 and 1989, the VAS anchors were: 0 = no pain; and 100 = severe pain. Beginning in 1990, a closed-end stem about the presence or absence of pain was added. If the patient affirmed they had pain, then they rated their amount of pain on the VAS. If they responded "No", then their pain score was assigned a value of zero.

Statistical analysis

Differences between groups at first evaluation (baseline) were compared using chi-square and t-tests. Results are reported as mean (SE) or proportion. Longitudinal data were analyzed using generalized estimating equations (GEE) [ 14 ]. Separate analyses were conducted in which repeated measurements were coded by calendar year for questionnaire response and by age. Main-effect predictors were exercise group (Runners' Association/community controls), gender, baseline age, baseline body mass index (BMI; kg/m 2 ), years of education, number of hospital days in the past year, and dichotomous variables for smoking, and history of arthritis, fractures, and cancer at baseline. Baseline values y t = 1987 were defined as weighted means [ 15 ],

For all analyses, exercise group and gender were combined to form a four-level classification factor. Two-way interactions of each predictor with this classification factor were also included. To reduce collinearity, each continuous predictor was dichotomized about its mean. This dichotomization also produced estimates of gender and exercise group main effects that were more meaningful.

Four analytic approaches were employed to help reduce the impact of possible self-selection bias. First, we conducted a separate analysis by Ever-Runner and Never-Runner classification as well as by Runners' Association and community controls groups (from original enrollment). This grouping expanded the cases to include individuals who self-selected to run at an earlier age and who stopped running because of pain or other reasons before entering the study. Second, we used covariate adjustment to account for baseline differences between groups. Direct standardization [ 16 ] was used to produce covariate-adjusted mean VAS pain by exercise group and by study year for statistically significant predictors that were identified by the above regression analyses. Data for the community controls group from the first year that VAS pain was observed (1987) served as the reference standard. In addition to providing adjustment for differences on covariates between Runners' Association and community controls groups, use of a standard taken at baseline also permitted adjustment for possible attrition bias. Third, we assessed attrition bias (because any differential attrition between groups could result in a secondary form of self-selection bias) by performing separate analyses limited to study subjects who completed all questionnaires in addition to all subjects (completers versus all). Finally, we used a longitudinal study design in which an adverse stimulus is expected to eventually result in a poor outcome regardless of initial self-selection bias if groups differ sufficiently in exposure to the stimulus. If running creates damage through accumulated trauma, then runners with about ten-fold the amount of exposure to such trauma should have increased pain over time, and any initial differences due to self-selection should narrow as the study progresses.

In 1987, the first year that VAS pain was assessed, 811 subjects returned questionnaires (458 Runners' Association members; 353 community controls). The 1991 re-recruitment efforts increased study enrollment to 881 subjects (496 Runners' Association members; 385 community controls). Fifteen subjects were excluded from these analyses because classification data for Ever-Runner versus Never-Runner were not available. Over the study duration, subject retention averaged more than 95% on an annual basis (and 98% of living subjects each year) as shown in Fig. 1 . The mean (SE) years of follow up for Runners' Association and community controls were 11.4 (0.17) and 10.1 (0.22) (p < 0.05 for difference), respectively, and for Ever-Runners and Never-Runners it was 11.4 (0.16) and 10.5 (0.25) (p < 0.05 for difference), respectively. Data for this study are based on 866 subjects (492 Runners' Association and 374 community controls), who were also grouped as Ever-Runners (n = 565) and as Never-Runners (n = 301), for whom data were available.

figure 1

Sample size over time for Runners' Association members and community controls.

Demographic characteristics at the beginning of the study are presented for the four study groups in Table 1 . Overall, subjects were similarly well educated, but Runners' Association members and Ever-Runners were statistically younger, had lower BMI and baseline pain scores, ran more miles, exercised more minutes per week, and smoked less (all p < 0.05) relative to community controls and Never-Runners. History of arthritis was lower in Runners' Association members and Ever-Runners than community controls and Never-Runners, but statistically significant only for Ever-Runners versus Never-Runners. In quintiles of baseline exercise minutes/week, less than a fourth (22%, n = 37) of Ever-Runners and less than a tenth (7%, n = 12) of Runners' Association members were inactive, exercising less than 70 minutes a week (data not shown). In contrast, at the highest quintile, 88% (n = 155) of Runners' Association members and 91% (n = 159) of the Ever-Runners exercised between 355 and 2,119 minutes/week, indicating that subjects in both of these groups were very physically active. At the end of the study period, the groups had maintained similar levels of exercise minutes/week.

Baseline demographic characteristics for the two sets of groups by gender are shown in Table 2 . For both Runners' Association members and Ever-Runners, the greater majority of subjects were male (approximately 83%), whereas in community controls and Never-Runners the sex ratios were more evenly split (56% and 50%, respectively). In females, Runners' Association members were younger, weighed less, reported less pain, fewer hospital days during the past year, ran more miles and exercised more minutes a week (all p < 0.05) than female community controls. Female Runners' Association members were also better educated, smoked less, and drank less alcohol than their community control counterparts (all p > 0.05) and there was a higher proportion of females with a history of arthritis and a lower proportion of females with a history of fractures in the community controls and Never-Runners compared to their counterparts, but these differences were statistically indistinguishable. Characteristics of male Runners' Association members versus community controls followed similar patterns, although differences in VAS pain and hospital days were not statistically significant, whereas education, smoking and alcohol were.

Pain scores over time, adjusted for group, gender and baseline BMI, are presented in Fig. 2 for each study group (Runners' Association members, community controls, Ever-Runners, and Never-Runners). The statistically significant covariates, excluding time and group, were gender (p < 0.01), baseline BMI (p < 0.01), cigarette packs/day and number of hospital days (p = 0.02). For both comparison groups of runners (Runners' Association and Ever-Runners), pain scores remained significantly lower over time (p < 0.01) when compared with community controls or Never-Runners. The dip in scores between 1987 and 1991 is a result of the rephrasing and coding of the pain question as described earlier. Pain scores were consistently about 25% less in the exercising group throughout the period of observation.

figure 2

Adjusted mean visual analog scale pain scores over time by study group. For both comparison groups of runners (Runners' Association members and Ever-Runners), pain scores remained significantly lower over time (p < 0.01) when compared with community controls or Never-Runners.

Because gender was a significant covariate, pain scores over time are presented by gender in Fig. 3 , adjusted for covariates. Significant covariates, excluding time and group, are fracture in past year (p = 0.026) and the presence of arthritis (p < 0.001). As observed previously, community controls have more pain over time; however, female controls have the greatest self-reported pain, with female and male controls reporting significantly more pain than either female runners (p = 0.048) or male runners (p = 0.004). Similar results were observed for VAS pain scores by gender for Never-Runners and Ever-Runners (data not shown).

figure 3

Adjusted mean visual analog scale pain scores over time by gender and study group. After adjusting for covariates, the community controls have more pain over time; however, female controls have the greatest self-reported pain, with female and male controls reporting significantly more pain than either female runners (p = 0.0048) or male runners (p = 0.004).

To evaluate the impact of study attrition and the possibility that withdrawal from the study might be associated with increased pain, we repeated the analyses by group and gender for study completers only. There were 61 female runners, 253 male runners, 84 female community controls and 116 male community controls who completed all questionnaires. In addition to time, group, and gender, presence of arthritis (p < 0.001) and education years (p = 0.036) were significant covariates. Similar to results of analyses using all available data, reduced levels of pain for male and female runners were observed in completers, although the only statistically significant difference over time is between female runners and male and female controls (p < 0.05).

Our final analyses tested the extent to which exercise and pain were affected by increasing age (Fig. 4 ). As in previous analyses, a history of arthritis and fractures were significant covariates (p < 0.001). There were significant increases in pain scores for female and male community controls and runners with increasing age, although the rates of increase are relatively modest. Older female runners tended to have the greatest beneficial impact, although these associations were statistically equivalent to differences in male runners and controls (p = 0.51).

figure 4

Adjusted mean visual analog scale pain scores by age, gender and study group. Significant increases in pain scores for female and male community controls and runners were found with increasing age, although rates of increase are relatively modest. Older female runners tend to have the greatest benefit, although these associations are statistically equivalent to differences in male and female runners and controls (p = 0.51).

This paper addresses the issue as to whether consistent vigorous exercise patterns over the long term are associated with greater or reduced musculoskeletal pain. In these cohorts, runners had substantially reduced pain levels compared with controls, which persisted over average ages of 62 to 76 years. Exercise was associated with a substantial and significant reduction in pain even after adjusting for gender, baseline BMI and attrition, and despite the fact that fractures, a significant predictor of pain, were slightly more common among runners. This relationship held as well when study completers only were evaluated.

Previous studies have indicated that men and women can differ in levels of self-reported pain and its importance [ 17 – 20 ]. In this study, male community controls reported less pain than their female counterparts. Female community controls and Never-Runners tended to report the highest levels of pain on average, whereas female runners appeared to receive the greatest benefit in reduced pain.

This study does not provide insight into the mechanisms that might underlie these results, although we have previously ruled out self-report bias between runners and non-runners by differential validation against spousal values [ 21 ]. A trend toward more frequent reports of a history of arthritis in controls could have played a role; fractures, however, were more common in runners and should have worked in the opposite direction. Other possible mechanisms include endorphin release, exercise protection against secondary fibromyalgia, increased resistance to musculoskeletal micro-injury, psychologically based increase in pain threshold, innately high pain threshold influencing decision to exercise vigorously, or other psychological mechanisms.

Musculoskeletal pain has also been shown to be associated with disability in older individuals and individuals with some chronic diseases [ 22 , 23 ]. But in a study of persons with rheumatoid arthritis, Ward and Leigh [ 22 ] noted that pain was a larger contributor to measurement of overall health status than physical disability and among both older male and female individuals. Leveille and colleagues [ 23 ] evaluated the presence of pain in women over 65 years of age and observed widespread musculoskeletal pain.

The primary finding from this investigation is that while pain does increase with age in subjects in all study groups, there was no progressive increase in musculoskeletal pain in older adults who participated in regular vigorous exercise, including running, compared with those who did not. Initial differences favoring exercisers were shown to be maintained over time. As pain and disability are linked, our findings add to the evidence that morbidity associated with aging can be reduced by participating in regular aerobic activity.

Abbreviations

body mass index

generalized estimating equations

visual analog scale.

Centers for Disease Control and Prevention (CDC): Trends in aging: United States and worldwide. MMWR Morb Mortal Wkly Rep. 2003, 52: 101-104.

Google Scholar  

Centers for Disease Control: The State of Aging and Health in America. 2004, Washington: Merck Institute of Aging and Health

Macera CA, Hootman JM, Sniezek JE: Major public health benefits of physical activity. Arthritis Rheum. 2003, 49: 122-128. 10.1002/art.10907.

Article   PubMed   Google Scholar  

Pate RR, Pratt M, Blair SN, Haskell WL, Macera CA, Bouchard C, Buchner D, Ettinger W, Heath GW, King AC, et al: Physical activity and public health. A recommendation from the Centers for Disease Control and Prevention and the American College of Sports Medicine. JAMA. 1995, 273: 402-407. 10.1001/jama.273.5.402.

Article   CAS   PubMed   Google Scholar  

Brady TJ, Kruger J, Helmick CG, Callahan LF, Boutaugh ML: Intervention programs for arthritis and other rheumatic diseases. Health Educ Behav. 2003, 30: 44-63. 10.1177/1090198102239258.

Ettinger WH, Burns R, Messier SP, Applegate W, Rejeski WJ, Morgan T, Shumaker S, Berry MJ, O'Toole M, Monu J, Craven T: A randomized trial comparing aerobic exercise and resistance exercise with a health education program in older adults with knee osteoarthritis. The Fitness Arthritis and Seniors Trial (FAST). JAMA. 1997, 277: 25-31. 10.1001/jama.277.1.25.

Vuori I: Exercise and physical health: musculoskeletal health and functional capabilities. Res Q Exerc Sport. 1995, 66: 276-285.

Taimela S, Diederich C, Hubsch M, Heinricy M: The role of physical exercise and inactivity in pain recurrence and absenteeism from work after active outpatient rehabilitation for recurrent or chronic low back pain: a follow-up study. Spine. 2000, 25: 1809-1816. 10.1097/00007632-200007150-00012.

Daffner RH, Martinez S, Gehweiler JA: Stress fractures in runners. JAMA. 1982, 247: 1039-1041. 10.1001/jama.247.7.1039.

Fries JF, Singh G, Morfeld D, O'Driscoll P, Hubert H: Relationship of running to musculoskeletal pain with age. A six-year longitudinal study. Arthritis Rheum. 1996, 39: 64-72.

McAlindon TE, Cooper C, Kirwan JR, Dieppe PA: Determinants of disability in osteoarthritis of the knee. Ann Rheum Dis. 1993, 52: 258-262.

Article   PubMed Central   CAS   PubMed   Google Scholar  

Fries JF, Spitz P, Kraines RG, Holman HR: Measurement of patient outcome in arthritis. Arthritis Rheum. 1980, 23: 137-145.

Bruce B, Fries J: The Stanford health assessment questionnaire (HAQ): a review of its history, issues, progress, and documentation. J Rheumatol. 2003, 30: 167-178.

PubMed   Google Scholar  

Liang KY, Zeger SL: Longitudinal data analysis using generalized linear models. Biometrika. 1986, 73: 13-22.

Article   Google Scholar  

Sen A, Srivastava M: Regression Analysis: Theory, Method and Application. 1990, New York: Springer-Verlag

Rosner B: Fundamentals of Biostatistics. 1995, Belmont, CA: Wadsworth Publishing Co

Herr KA, Garand L: Assessment and measurement of pain in older adults. Clin Geriatr Med. 2001, 17: 457-478. 10.1016/S0749-0690(05)70080-X.

Kelly AM: Does the clinically significant difference in visual analog scale pain scores vary with gender, age, or cause of pain?. Acad Emerg Med. 1998, 5: 1086-1090.

Unruh AM: Gender variations in clinical pain experience. Pain. 1996, 65: 123-167. 10.1016/0304-3959(95)00214-6.

Unruh AM, Ritchie J, Merskey H: Does gender affect appraisal of pain and pain coping strategies?. Clin J Pain. 1999, 15: 31-40. 10.1097/00002508-199903000-00006.

Ward MM, Leigh JP: The relative importance of pain and functional disability to patients with rheumatoid arthritis. J Rheumatol. 1993, 20: 1494-1499.

CAS   PubMed   Google Scholar  

Leveille SG, Ling S, Hochberg MC, Resnick HE, Bandeen-Roche KJ, Won A, Guralnik JM: Widespread musculoskeletal pain and the progression of disability in older disabled women. Ann Intern Med. 2001, 135: 1038-1046.

Download references

Acknowledgements

This research was supported by a grant from the National Institutes of Health (5R01-AG15815) to Stanford University, Department of Immunology/Rheumatology (JFF, Principal Investigator).

Author information

Authors and affiliations.

Department of Immunology/Rheumatology, Stanford University, Palo Alto, CA, 94304, USA

Bonnie Bruce & James F Fries

Health Economics, Genentech/MS 241A, South San Francisco, CA, 94080, USA

Deborah P Lubeck

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Bonnie Bruce .

Additional information

Competing interests.

The authors declare that they have no competing interests.

Authors' contributions

BB performed statistical analyses, interpretation of data, and drafting of the manuscript; JFF participated in study design, interpretation of data, and drafting of the manuscript; DL participated in study design, statistical analyses, interpretation of data, and drafting of the manuscript. All authors have read and approved the final manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2, authors’ original file for figure 3, authors’ original file for figure 4, rights and permissions.

Reprints and permissions

About this article

Cite this article.

Bruce, B., Fries, J.F. & Lubeck, D.P. Aerobic exercise and its impact on musculoskeletal pain in older adults: a 14 year prospective, longitudinal study. Arthritis Res Ther 7 , R1263 (2005). https://doi.org/10.1186/ar1825

Download citation

Received : 20 May 2005

Revised : 23 August 2005

Accepted : 24 August 2005

Published : 19 September 2005

DOI : https://doi.org/10.1186/ar1825

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Musculoskeletal Pain
  • Lower Body Mass Index
  • Visual Analog Scale Pain
  • Community Control

View archived comments (1)

Arthritis Research & Therapy

ISSN: 1478-6362

research article using chi square test

Easy Sociology

  • Books, Journals, Papers
  • Guides & How To’s
  • Life Around The World
  • Research Methods
  • Functionalism
  • Postmodernism
  • Social Constructionism
  • Structuralism
  • Symbolic Interactionism
  • Sociology Theorists
  • General Sociology
  • Social Policy
  • Social Work
  • Sociology of Childhood
  • Sociology of Crime & Deviance
  • Sociology of Art
  • Sociology of Dance
  • Sociology of Food
  • Sociology of Sport
  • Sociology of Disability
  • Sociology of Economics
  • Sociology of Education
  • Sociology of Emotion
  • Sociology of Family & Relationships
  • Sociology of Gender
  • Sociology of Health
  • Sociology of Identity
  • Sociology of Ideology
  • Sociology of Inequalities
  • Sociology of Knowledge
  • Sociology of Language
  • Sociology of Law
  • Sociology of Anime
  • Sociology of Film
  • Sociology of Gaming
  • Sociology of Literature
  • Sociology of Music
  • Sociology of TV
  • Sociology of Migration
  • Sociology of Nature & Environment
  • Sociology of Politics
  • Sociology of Power
  • Sociology of Race & Ethnicity
  • Sociology of Religion
  • Sociology of Sexuality
  • Sociology of Social Movements
  • Sociology of Technology
  • Sociology of the Life Course
  • Sociology of Travel & Tourism
  • Sociology of Violence & Conflict
  • Sociology of Work
  • Urban Sociology
  • Changing Relationships Within Families
  • Conjugal Role Relationships
  • Criticisms of Families
  • Family Forms
  • Functions of the Family
  • Featured Articles
  • Privacy Policy
  • Terms & Conditions

How to Conduct a Chi-Square Test

Mr Edwards

In social science research, one of the common tasks researchers undertake is analyzing relationships between categorical variables. Understanding how variables like gender, ethnicity, or occupation are distributed across different categories can reveal significant insights about patterns of inequality , social behavior, or institutional biases. One powerful statistical tool used to analyze such relationships is the Chi-Square test. This test allows sociologists to explore whether the observed frequencies in a categorical dataset deviate from what would be expected under a given hypothesis. In this article, we will explore how to conduct a Chi-Square test step-by-step, breaking down the concepts and calculations in a way that is accessible to undergraduate sociology students.

Understanding the Chi-Square Test

The Chi-Square test is a statistical method designed to examine the association between two or more categorical variables. These variables represent data that can be categorized into distinct groups or categories. For instance, gender (male, female, other) and level of education (high school, college, graduate) are examples of categorical variables that may be of interest in sociological research. The test compares the observed frequencies of different categories against the expected frequencies, under the assumption that there is no association between the variables.

There are two primary types of Chi-Square tests: the Chi-Square test for independence and the Chi-Square test for goodness of fit. The test for independence examines whether two categorical variables are related, while the goodness-of-fit test determines if the observed distribution of a single categorical variable matches an expected distribution. In this article, we will focus on the Chi-Square test for independence, as it is more commonly used in sociology research.

When to Use a Chi-Square Test

Before diving into the details of how to conduct a Chi-Square test, it is essential to understand when it is appropriate to use this statistical tool. The Chi-Square test is best suited for scenarios where the data is categorical and the researcher is interested in testing the relationship between two variables. Some common sociological research questions that can be addressed using a Chi-Square test include:

  • Is there a relationship between gender and voting behavior?
  • Are educational attainment levels related to employment status ?
  • Does racial or ethnic background correlate with access to healthcare?

To use the Chi-Square test effectively, the data must meet certain conditions:

  • Independence : The observations in each category must be independent of each other. This means that no individual or case should appear in more than one category.
  • Expected Frequencies : Each cell in the contingency table (which we will discuss shortly) should have an expected frequency of at least 5. If the expected frequencies are too small, the results of the test may be unreliable.
  • Sample Size : The test is more reliable with larger sample sizes. While there is no strict rule, having at least 30 observations is generally recommended for a Chi-Square test.

If these conditions are met, the Chi-Square test is an appropriate method for testing relationships between categorical variables.

Steps for Conducting a Chi-Square Test

1. formulating hypotheses.

As with any statistical test, conducting a Chi-Square test begins with formulating two hypotheses: the null hypothesis (H₀) and the alternative hypothesis (H₁).

  • Null Hypothesis (H₀) : This hypothesis states that there is no relationship between the two categorical variables. In other words, the variables are independent of each other.
  • Alternative Hypothesis (H₁) : The alternative hypothesis suggests that there is a relationship between the variables, meaning that they are not independent.

For example, if you are studying the relationship between gender and voting behavior, your hypotheses might be:

  • H₀ : There is no relationship between gender and voting behavior.
  • H₁ : There is a relationship between gender and voting behavior.

2. Collecting and Organizing Data

Next, gather data on the variables you are interested in examining. This data should be categorical, with each observation falling into one category for each variable. Once you have collected the data, organize it into a contingency table , which displays the frequencies of observations for each combination of categories.

For instance, if you are analyzing the relationship between gender and voting behavior, your contingency table might look like this:

VotedDid Not VoteTotal
Male451560
Female552580
Non-Binary10515
Total11045155

In this example, the table shows the observed frequencies of individuals who voted or did not vote, broken down by gender.

3. Calculating Expected Frequencies

The next step involves calculating the expected frequencies for each cell in the contingency table, assuming that the null hypothesis is true (i.e., there is no relationship between the variables). The expected frequency for each cell is calculated using the following formula:

E ij = (Row Total of Row i × Column Total of Column j) / Grand Total

  • E ij is the expected frequency for cell (i,j),
  • Row Total of Row i is the total number of observations in the i-th row,
  • Column Total of Column j is the total number of observations in the j-th column,
  • Grand Total is the total number of observations in the entire dataset.

Let’s calculate the expected frequencies for the first cell (Male, Voted) using the table above. The row total for males is 60, the column total for those who voted is 110, and the grand total is 155.

E (Male, Voted) = (60 × 110) / 155 = 42.58

You would repeat this process for each cell in the contingency table to obtain the expected frequencies.

4. Computing the Chi-Square Statistic

Once you have the observed and expected frequencies, the next step is to calculate the Chi-Square statistic. This statistic measures the difference between the observed and expected frequencies for each cell in the contingency table. The formula for the Chi-Square statistic is:

χ 2 = ∑ [(O ij – E ij ) 2 / E ij ]

  • χ 2 is the Chi-Square statistic,
  • O ij is the observed frequency for cell (i,j),
  • The sum is taken over all cells in the contingency table.

For each cell, you subtract the expected frequency from the observed frequency, square the result, and divide by the expected frequency. After calculating this value for all cells, you sum the results to obtain the overall Chi-Square statistic.

5. Determining Degrees of Freedom

Degrees of freedom (df) are a critical component in determining the significance of the Chi-Square statistic. In the case of a Chi-Square test for independence, the degrees of freedom are calculated using the formula:

df = (r – 1) × (c – 1)

  • r is the number of rows in the contingency table,
  • c is the number of columns in the contingency table.

In our example, there are 3 rows (Male, Female, Non-Binary) and 2 columns (Voted, Did Not Vote). Therefore, the degrees of freedom would be:

df = (3 – 1) × (2 – 1) = 2

6. Interpreting the Results

To determine whether the relationship between the variables is statistically significant, compare the calculated Chi-Square statistic to a critical value from the Chi-Square distribution table. The critical value depends on two factors: the degrees of freedom and the chosen significance level (often set at 0.05, or 5%).

If the calculated Chi-Square statistic is greater than the critical value, you can reject the null hypothesis, indicating that there is a significant relationship between the variables. If the Chi-Square statistic is less than the critical value, you fail to reject the null hypothesis, meaning that there is no evidence of a relationship between the variables.

7. Reporting the Results

When reporting the results of a Chi-Square test in a research paper or article, it is important to provide a clear summary of the findings. Typically, this includes the following information:

  • The observed Chi-Square statistic,
  • The degrees of freedom,
  • The p-value (the probability that the observed association occurred by chance),
  • Whether the result is statistically significant (i.e., whether you reject the null hypothesis),
  • A brief interpretation of the findings in the context of the research question.

For example, you might report your results as follows:

“A Chi-Square test for independence was performed to examine the relationship between gender and voting behavior. The test revealed a significant association between the two variables, χ 2 (2, N = 155) = 6.12, p = 0.047, indicating that voting behavior is not independent of gender.”

The Chi-Square test is an invaluable tool in sociology, enabling researchers to explore the relationships between categorical variables and to test hypotheses about social behavior and structures. By following the steps outlined in this article—formulating hypotheses, collecting data, calculating expected frequencies, computing the Chi-Square statistic, determining degrees of freedom, and interpreting the results—sociologists can rigorously test whether observed patterns in their data reflect significant associations or are merely the result of random chance.

Understanding and correctly applying the Chi-Square test helps sociologists draw meaningful conclusions about the social world, contributing to broader discussions about inequality, social behavior, and institutional practices. As a fundamental part of the sociologist’s toolkit, mastering the Chi-Square test is essential for undergraduate students who wish to engage critically with empirical research.

Mr Edwards has a PhD in sociology and 10 years of experience in sociological knowledge

Related Articles

A man clearing snow using a road gritting machine

Snowballing Technique in Sociological Research

The snowballing technique, also known as snowball sampling, is a non-probability sampling method widely used in qualitative research within the...

An abstract image

Central Tendency in Research: An Outline and Explanation in Sociology

Learn about the concept of central tendency in sociological research. Discover the definition and measures of central tendency, and understand...

A conceptual illustration showing an individual at the center surrounded by representations of different social institutions.

Institutionalization: An Overview

black and white shot of a row of jail cells

Total Institutions Explained

An aristocratic building

Institutionalism Explained

Get the latest sociology.

Would you be interested in enrolling in courses from Easy Sociology?

Recommended

The intellectual game of chess

Understanding Game Theory: Strategic Decision-Making and Social Interactions

a structuralism modernity building

Understanding Secondary Technical Schools in Sociology

24 hour trending.

An army helmet

Understanding Conflict Theories in Sociology

The symbolic interactionist view of education: a detailed outline and explanation, the effect of neoliberalism on education, pierre bourdieu’s symbolic capital in sociology, the role and functions of the education system: exploring its relationship to the economy and class structure.

Easy Sociology makes sociology as easy as possible. Our aim is to make sociology accessible for everybody. © 2023 Easy Sociology

© 2023 Easy Sociology

  • Open access
  • Published: 12 September 2024

Postoperative sensitivity of composites using novel Bacillus subtilis nanofortified adhesives: a triple-blind study

  • Nehal Amir 1 ,
  • Afsheen Mansoor 2 , 3 ,
  • Nabiha Eeman 4 ,
  • Muhammad Nouman Ahmed 5 ,
  • Emaan Mansoor 6 ,
  • Khadim Hussain 7 , 10 &
  • Paulo J. Palma 8 , 9  

BMC Oral Health volume  24 , Article number:  1077 ( 2024 ) Cite this article

Metrics details

Nanotechnology

is the art and science of dealing with nanoscale particles. This has transformed contemporary dental practices through myriad contributions to biomaterial science. Titanium dioxide nanoparticles procured from Bacillus subtilis , an eco-friendly and biogenic source, can significantly magnify the physiochemical attributes of dental materials. However, postoperative sensitivity is a major drawback of composite restorations. The incorporation of these nanoparticles into dental adhesives can greatly benefit clinical dentistry by resolving this issue. This trial aimed to evaluate the effectiveness of a novel titanium dioxide nanofortified adhesive on the postoperative sensitivity of composite restorations.

This triple-blind, parallel-group randomized controlled trial was conducted at the Department of Operative Dentistry and Endodontics, School of Dentistry, Islamabad, from May 15, 2023, to November 25, 2023. Participants ( n  = 60) with Class I and II primary carious lesions with a minimum cavity depth of 3–5 mm were randomly assigned to two groups ( n  = 30). After obtaining informed consent, the restorative procedure was accomplished using a minimally invasive approach and etch-and-rinse adhesive strategy. In group A, a nanofortified adhesive was used for composite restoration, whereas in group B, an adhesive without nanoparticles was used. Postoperative sensitivity was evaluated using the Visual Analog Scale (VAS) score at follow-up periods: of one day, one week, two weeks and one month. A Chi-square test was used to compare postoperative sensitivity between the two groups. The level of significance was set at p  < 0.05.

A noteworthy association was observed between sensitivity and the group variable at all four evaluation periods: after one day ( p  = 0.002), 1 week ( p  = 0.002), 2 weeks ( p  = 0.007) and one month. In conclusion, participants who underwent restorative intervention using titanium dioxide nanoreinforced adhesives reported a notable reduction in sensitivity at all time intervals. Hence, the occurrence and severity of postoperative sensitivity are significantly reduced using Bacillus subtilis-procured nanofortified adhesives as compared to conventional adhesives without nanoparticles.

Trial registration

This trial was retrospectively registered at ClinicalTrials.gov (ID: NCT06242184) on 03/02/2024. All procedures involving human participants were performed in conformance with this protocol.

Peer Review reports

Nanotechnology is the implementation of scientific principles to substantially manipulate matter at the nanoscale to attain the prerequisites of dimensions- and structure-dependent attributes that are distinct from those associated with individual atoms, molecules, or extrapolation from larger sizes of the same material. It has gained immense popularity in clinical dentistry owing to its multitudinous contributions to dental biomaterials [ 1 , 2 ]. The pinnacle of nanoscience is nanoparticles, which are composed of atoms or molecules that congregate to form nanostructured agglomerates and provide dental materials with remarkable physiochemical and optical-magnetic qualities [ 3 ].

Nanoparticles are categorized as having organic, inorganic, or carbon-based lineages. Metal and metal oxide-based materials are exemplary inorganic nanoparticles. The outstanding attributes of metal oxide-based nanoparticles include their dimensions, which range from 10 to 100 nm; their configuration, which is spherical and cylindrical; and their surface qualities, which include pore size, enhanced surface-area-to-volume ratio, and surface charge. Titanium dioxide, also known as titania, is one of the most well-characterized metal-oxide nanoparticles. It has illustrious optical, magnetic, physicochemical, rheological, and biological properties. Titanium dioxide has a high refractive index, biocompatibility, and cost-effectiveness, making it an ideal choice for utilization in dental materials [ 4 , 5 ].

Composite resin, an esthetic adhesive material, is a paragon in contemporary restorative dentistry. This has transformed the paradigm to minimally invasive dentistry. Among the advantages of composite restorations are their distinguished aesthetics, adhesion to tooth structure, and preservation of healthy dental substrates [ 6 ]. Nonetheless, they have explicit shortcomings that jeopardize the long-term viability of posterior restorations. Polymerization shrinkage in composites is primarily responsible for these unfavorable outcomes, including cuspal deflection, postoperative sensitivity, secondary caries, marginal discoloration, restorative fracture, and microleakage at the tooth-restorative junction [ 7 ]. Post-operative sensitivity in composite restorations is caused by a multitude of reasons including iatrogenic pulpal damage during caries excavation, inadequate coverage and penetration of adhesives into the dentinal tubules, and the impact of various placement strategies, bonding techniques, curing modes, and the choice of composite material used. Incomplete hybridization is a major cause of sensitivity in composites. This predicament can be resolved by the prospective sealing potential of dental adhesives to occlude patent dentinal tubules at the dentin-adhesive interface. Nevertheless, because of the degeneration and long-term instability of the hybrid layer, this attribute of adhesives is susceptible to time-dependent decline [ 6 , 7 , 8 ].

Appreciable improvements in the physio-chemical properties and long-term stability of the tooth-adhesive interface can be achieved by fortifying dental adhesives with nanoparticles.Nano-particles, when incorporated into dental adhesives, enormously magnify the bonding capability of resins into dental substrate by its deeper penetration and sealing capability of the tubular network of dentin. Thus, it could massively reduce post-operative sensitivity in composite restorations [ 9 , 10 ]. The rationale of this study was to enhance the tubule-sealing ability of dental adhesives by enhancing the bonding performance of resins through the incorporation of nanoparticles, thus minimizing the issue of sensitivity. The objective of this trial was to evaluate the effectiveness of a novel titanium dioxide nanofortified adhesive on the postoperative sensitivity of composite restorations. The hypothesis stated that the occurrence and severity of postoperative sensitivity in composites were lessened using Bacillus subtilis-procured novel nanofortified adhesives compared to dental adhesives without nanoparticles.

Study design

The present study followed the CONSORT (Consolidated Standards of Reporting Trials) 2010 guidelines. This study was formulated as a triple-blind, parallel-group, randomized clinical trial with a 1:1 allocation ratio. The operator, data analyst, and participants were all blinded to the groups to which they were assigned. The participants were divided into two groups. Group A: Nano-fortified adhesive was employed for restorative treatment, whereas in Group B: Adhesive without nanoparticles was used. All participants were briefed about the study, along with its potential merits and demerits, and a consent form was validated by the principal investigator from each participant.

Sample size calculation

The sample size was calculated as 60 using the WHO calculator (30 participants in each interventional group). The test had a power of 80% and significance level of 5%. The population mean test result was 1.38, but the population standard deviation was calculated to be 1.6, and the expected population mean was 3.69 [ 11 ].

Patient selection and blinding

Participants fulfilling the inclusion criteria were selected from the outpatient department using a consecutive non-probability sampling method and were randomly divided by a trial-autonomous researcher in each study group using a Lottery method . For allocation concealment, an opaque sealed envelope with designated coding was employed for each study group. The dental adhesive used in this intervention had identical bottles with the assigned codes (A and B) for each interventional group. In this study, a trial-autonomous investigator performed both the blinding and allocation concealment. Table  1 presents the inclusion and exclusion criteria of this study.

Synthesis of microbial titanium dioxide nanoparticles

The strain of Bacillus Subtilis ( Accessions No; ATCC ® 6633TM-Catalog No/ 0486-SPR ) was obtained from National Institute of Health Islamabad , Pakistan for preparing the Titanium dioxide nanoparticles. They were fabricated via the methodology available in the reported literature [ 9 ]. The supernatant was prepared from fresh Bacillus Subtilis culture after centrifuging it at 10,000 RmP for about 10 min. Then, 80 ml of supernatant and 20 ml of 00.025 m Ti(-OH)2 (Americans Element, 10,884.0 WeYburn AVE/ Los Angeles; CA/90024.0,USA) were mixed together and were placed on metal plate at 60 o C for about 20.0 min where white crystal like deposits appeared at the bottom of the flask confirming the Titanium dioxide nanoparticles formation. The characterization techniques involved to identify its original phase, peak form, size, shape, texture, roughness, morphology, elemental composition, functional groups and biocompatibility were performed through equipments: DMAX/2400 Diffractometer (Rigaaku Corpooration: Akishimau Tokyo’s company Japan-), Scanning electron microscope-(NOVA Nanoseme 4300, FEI-company-HillsBoro, ORA, USA), Atomic force microscope (Quesant/Universal SPMS, Ambios-Technology: SantA Cruz’s, C-A, USA), Transmission electron microscope (Jeols JEMS-200CXX-Bioz Star’s: Tokoyo, Japan), Fourier transform infrared spectrophotometery (JASCOS FT/IRR-6600; Utrecht-T, Netherlands) and Spectrometer-(Lambda-A 9500, Perkins Elmer’s, Waltham-MA, USA) [ 4 , 9 , 12 ].

Properties of microbial titanium dioxide nanoparticles

The Titanium dioxide nanoparticles prepared by the microbes displayed mixed anatase-rutile phase having particle size of about 70.00 nm arranged in aggloremates. They possessed smooth texture with minimal roughness in their morphology. Moreover, these nanoparticles revealed pure titanium and oxygen in their elemental composition and confirmed the presence of carboxyl groups, amine groups and titanium-oxygen bendings in their spectrum. These were found to be biocompatible against fibroblast cell lines that enhanced their unique characteristics [ 4 , 9 , 12 ].

Incorporation of nanoparticles into dental adhesive

In this study, synthesized nanoparticles were mixed into a commercially available 5 mL bottle of adhesive (One Coat Bond SL Coltene, Switzerland) at a mass fraction of 30%.

Clinical procedure

The study was conducted at the Department of Operative Dentistry and Endodontics , School of Dentistry , Shaheed Zulfiqar Ali Bhutto Medical University , Islamabad , from May 15 , 2023 to November 25 , 2023 , after acceptance from the Institutional Ethical Committee ( SOD/ERB/2023/31 − 01 ). Before the initiation of the procedure, circumstantial medical and dental history was recorded. Each participant was anesthetized to ensure comfort during the procedure using the local anesthetic solution Lignospan Special-lidocaine hydrochloride 2% with 1:80,000 epinephrine (Septodont, USA). Under rubber dam isolation, carious lesions were excavated using a high-speed handpiece, the cavity design was delineated by the extent of the carious lesion, and the depth of the prepared cavity was estimated using a periodontal probe. Before initiating restorative treatment, a Palodent Plus Sectional Matrix System (Dentsply Sirona, USA) was used to obtain the desired proximal contour of the restoration.

Acid etching was performed using Phosphoric acid 35% (Etchant gel S Coltene, Switzerland) for 15–30 s. The enamel surface was etched for 30 s while dentin substrate was etched for 15 s. Afterwards, the area was rinsed with water for 20 s and allowed to air-dry for 5 s. The subsequent steps were executed in conformance with the fundamentals of adhesive dentistry (Fig.  1 ).

Group A (nanofortified dental adhesive): In this interventional group, titanium dioxide nanoparticles were incorporated in 5 mL adhesive (One Coat Bond SL Coltene, Switzerland). Using a microbrush, the etched surface was vigorously coated with adhesive for 20 s. This was followed by another application for 20 s. For 40 s, the adhesive coating was light-cured using an LED curing light (Woodpecker DTE LUX E Plus, Woodpecker, China). Subsequently, the restorative material (Brilliant NG Universal Nano Composite Coltene, Switzerland) was placed using the incremental packing strategy and light-cured for 40 s using a curing light Woodpecker DTE LUX E Plus (Woodpecker, China).

Group B (adhesive without nanoparticles): In this experimental group, a commercially available adhesive (One Coat Bond SL Coltene, Switzerland) without nanoparticles was employed, followed by the application of composite (Brilliant NG Universal Nano Composite Coltene, Switzerland) using the incremental packing strategy and light-curing for 40 s with curing light Woodpecker DTE LUX E Plus (Woodpecker, China).

Subsequently, the rubber dam was removed, the restoration was assessed for any occlusal modifications and the adequacy of inter-proximal contacts. Afterwards, the finishing was completed. The restoration was polished using a Jiffy Original Composite System (Ultradent, USA).

figure 1

Clinical intervention for the restorative treatment

Assessment of postoperative sensitivity

The Visual Analog Scale (VAS) score was used to assess sensitivity after one day, one week, two weeks, and one month. VAS is a psychometric tool for measuring subjective attributes that cannot be measured. It is a 10 cm long scale with markings from 0 to 10, calibrated for sensitivity scores of 0 (none), 1–3 (mild), 4–6 (moderate), and 7–10 (severe) (Fig.  2 ) [ 11 , 13 , 14 ]. Moreover, participants were directed to rate whether their sensitivity was spontaneous or stimulated. They were instructed to mention the provoking stimuli.

figure 2

Visual Analog Scale (VAS) score

Statistical analysis

The data were analyzed using SPSS software version 26. A Chi-square test was employed to compare the postoperative sensitivity between the two groups at the follow-up period. The level of significance was set at p  < 0.05.

A total of 112 participants were enrolled in this interventional study. Of these, 30 participants did not satisfy the inclusion criteria, 10 refused participation, and 12 had other reservations about this trial. The study comprised 60 participants, who were designated into two groups ( n  = 30). Table  2 provides a detailed overview of the critical variables based on the 60 participants. Figure  3 illustrates the CONSORT 2010 Flow Diagram.

figure 3

CONSORT 2010 flow diagram

Postoperative sensitivity levels after one day, one week, two weeks, and one month show distinct patterns. After one day, the sensitivities ranged from none (25%) to severe (6.7%). After one and two weeks, the patterns of none, mild, moderate, and severe sensitivity persisted. After one month, 81.7% indicated no sensitivity, 15% indicated mild sensitivity, and 3.3% indicated moderate sensitivity. Table  3 depicts the cross-tabulations and chi-square tests that investigated the relationship between sensitivity at different time intervals and several categorical variables.

A noteworthy correlation was found between sensitivity after one day and the group variable ( p  = 0.002) (Fig.  4 ). The majority of participants in the “Adhesive with Nanoparticles” group reported no or mild sensitivity, whereas the “Adhesive without Nanoparticles” group reported a higher number of participants experiencing moderate and severe sensitivity. The analysis of sensitivity after one day and Black classification showed no statistically significant relationship between these variables ( p  = 0.65). The distribution of sensitivity levels was comparable between the Class I and Class II restorations. Analysis of sensitivity after one day and tooth distribution revealed no statistically significant correlation ( p  = 0.497). The sensitivity levels appeared to be similar for both the premolar and molar tooth distributions.

figure 4

Distribution of sensitivity between group variables after one day

Sensitivity after one week in the group showed a significant correlation ( p  = 0.002). The group utilizing the “Adhesive with Nanoparticles” has a greater proportion of participants reporting no or mild sensitivity in comparison to the group using the “Adhesive without Nanoparticles” Adhesive without nanoparticles. Sensitivity after one week and black classification were analyzed using cross-tabulation. (Fig.  5 ) The results showed no statistically significant association between the two variables ( p  = 0.493). Sensitivity after one week did not show a statistically significant association with tooth distribution ( p  = 0.193). The sensitivity levels appear to be similar in both premolar and molar tooth distributions.

figure 5

Distribution of sensitivity between group variables after one week

There were notable connections between sensitivity after two weeks and the group, as indicated by the statistical analysis ( p  = 0.007) (Fig.  6 ). The majority of participants in the “Adhesive with Nanoparticles” group reported either no sensitivity or mild sensitivity. In contrast, the “Adhesive without nanoparticles group had a higher percentage of participants experienced moderate sensitivity. The contingency table analysis for sensitivity after two weeks and black categorization revealed a weakly significant correlation ( p  = 0.05). However, there was no significant correlation between tooth distribution and sensitivity after two weeks ( p  = 0.66).

figure 6

Distribution of sensitivity between group variables after two weeks

A strong correlation between the sensitivity after one month and the group (p  = 0.001) was observed. The majority of the participants in the “Adhesive with Nanoparticles” group reported no sensitivity. Moreover, sensitivity after one month does not show any significant correlation with black classification ( p  = 0.936) and tooth distribution ( p  = 0.989 ) (Fig.  7 ).

figure 7

Distribution of sensitivity between the group variables after 1 month

Table  4 shows that participants who used “adhesives with nanoparticles” consistently exhibited reduced levels of sensitivity in comparison to those who used “adhesives without nanoparticles” at all four time points. After one day, a significant percentage of patients who had restorative treatment with nano-reinforced adhesives claimed that they did not experience any sensitivity (13 versus 2), and the group that did not have nanoparticles was the only one that experienced severe sensitivity. At the end of one week, the pattern was maintained, with a greater number of participants expressing no sensitivity (17 rather than 5), while a smaller number reported moderate sensitivity (2 rather than 12). At two weeks, this tendency became more obvious, with 22 patients with nanofortified adhesives reporting no sensitivity, compared to 11 participants in the group that did not use nanoparticles, and there were no reports of moderate sensitivity in the group that used titanium dioxide nanoparticles. Based on the results of this study, it can be deduced that adhesives containing titanium dioxide reinforced nanoparticles resulted in significantly decreased levels of postoperative sensitivity over time when compared to adhesives without nanoparticle fortification.

With the inception of resin composites in clinical practice, the field of restorative dentistry has blossomed. Resin composites, which embody the fundamentals of minimally invasive and adhesive dentistry, provide high-quality esthetic and durable posterior restorations. Despite its long-track record of clinical success, it has several shortcomings. Polymerization shrinkage during the free radical addition reaction produces many undesirable outcomes, one of which is postoperative sensitivity. The drawback of sensitivity subsequent to composite restoration is primarily explained by the hydrodynamic theory, as the fluctuations in tubular hydrostatic pressure cause stimulation of nerve endings in the dental pulp. Many strategies have been employed over the decades to counteract this pitfall of resin composites, such as modifications in placement techniques, variations in restorative materials, utilization of low-viscosity materials, and alterations in curing modes. However, no single treatment modality has completely eliminated this issue [ 15 , 16 , 17 ].

Amid this predicament, a silver lining of hope has emerged from the instigation of nanoscience in clinical dentistry. The philosophy of nanoscience has proven to be a ground-breaking step in the development of dental biomaterials. A multitude of advancements has been made in dental materials through the incorporation of fundamental principles of nanotechnology. Reinforced composites of nano-fortified adhesives encompass the field of restorative dentistry. The core of a successful composite restoration is the establishment of a durable hybrid layer. Incomplete infiltration of adhesives into demineralized dentin is the main culprit for nanoleakage, which is mainly responsible for the postoperative sensitivity of composites. Thus, by fortifying the adhesives with nanoparticles, significant enhancements in physio-chemical attributes can be attained. These reinforced adhesives have better permeability into the dentin matrix, resulting in longevity and stability in the hybrid layer [ 18 , 19 , 20 ].

Thus, the principles of nanoscience can resolve the issues of postoperative sensitivity in composites by utilizing the superlative permeability and mechanical attributes of nanofillers. In this interventional study, nanoparticles were mixed with a commercially available adhesive (One Coat Bond SL Coltene, Switzerland) at a mass fraction of 30% [ 20 , 21 ]. This was attributed to the fact that reinforced nanofillers have a significant impact on the viscoelasticity and flowability of adhesives into dentinal tubules [ 4 , 11 ]. Hence, the percentage of incorporated titanium dioxide nanoparticles was deliberately set at moderate levels to avoid any detrimental impact on the dentin adhesives while upgrading their physio-chemical properties. In this study, titanium dioxide nanoparticles procured from a biogenic source, Bacillus subtilis , were employed to reinforce dental adhesives. The superior physio-chemical, biological, and optical properties of titanium dioxide nanoparticles have already been elucidated in various studies conducted recently [ 22 , 23 , 24 ]. Moreover, an in vitro study has supported the safety profile of these biogenically synthesized nanoparticles as being non-cytotoxic and eco-friendly [ 25 ].

In accordance with the findings of this study, a noticeable association was observed between sensitivity and the group variable at all four evaluation periods: after one day ( p  = 0.002), one week ( p  = 0.002), two weeks ( p  = 0.007) and one month ( p  = 0.001). The majority of participants in the “Adhesive with Nanoparticles” group reported none or mild sensitivity, whereas the “Adhesive without Nanoparticles” group reported a higher number of participants experiencing moderate to severe sensitivity. The results of this trial go hand in gloves with a study conducted at the Department of Operative Dentistry, Faculty of Dentistry, Cairo University [ 11 ] that shows a significant reduction in postoperative sensitivity using bioglass-incorporated nanoparticles in dentin adhesives.

An etch-and-rinse adhesive strategy was used in this interventional study. This was chiefly due to the increased prevalence of postoperative sensitivity in the composites when this adhesive technique was employed. Several studies have shown promising results in favor of the self-etch adhesive technique in terms of sensitivity [ 26 , 27 ]. However, the findings of the systemic review negated the impact of the adhesive strategy on postoperative sensitivity in posterior composites [ 28 , 29 ]. Additionally, an incremental placement technique was utilized for this intervention owing to the proposition that the bulk-fill strategy has a more successful clinical outcome and reduced sensitivity compared to the incremental layering technique [ 14 , 30 ]. This was mainly used to judge the impact of reinforced nanoparticles in adhesives on the sensitivity of the composites. Nonetheless, various studies have highlighted that the placement techniques do not influence the postoperative sensitivity of composite restorations [ 31 , 32 ].

The most striking attribute of this intervention was its novelty. No previous study has utilized Bacillus subtilis-procured nanoparticles in dentin adhesives. This study is the first of its kind to demonstrate the impact of biogenically synthesized titanium dioxide nanoparticles on the sensitivity of composite restorations. Titanium dioxide nanoparticles can significantly improve the properties of adhesives. Due to their biogenic origin, these nanoparticles are eco-friendly, and their cytotoxicity profile has also been authenticated [ 33 ]. Additionally, naturally prepared titanium dioxide nanoparticles displayed outstanding antimicrobial features that might play a significant role in the adhesive dentistry as well [ 34 , 35 ]. Moreover, the triple-blind nature of this intervention rendered the study unbiased, and ratified its authenticity. However, this study had certain limitations. One of the major shortcomings of this trial is that one month could be a short evaluation period to remark the appreciable impact of nano-reinforced adhesives on post-restorative sensitivity in composites. This trial did not evaluate the influence of variables such as cavity depth, arch distribution, adhesive techniques, or different placement strategies on postoperative sensitivity after composite restorations. Moreover, this trial was conducted in a single hospital setting. To authenticate the findings and practicability of this study, it should have been executed at multiple clinical setups. Thus, further research is warranted to validate the efficacy of novel titanium dioxide nanofortified adhesives in restorative dentistry.

Nanotechnology is a cynosure in contemporary science. This study paves the way for further research on nanoscience to synthesize ecologically secure nanoparticles and incorporate them into biomaterials. It can drastically improve the final treatment outcome and benefit healthcare practitioners by providing high-quality dental care to patients.

Clinical relevance

This study shed light on one of the major issues in restorative dentistry, which is post-operative sensitivity. By embodying composite restorative material with nanoparticles, significant enhancement in bonding performance could be achieved. This could enormously improve the longevity of restorations by providing premium dental treatment to patients.

A noteworthy association was observed between sensitivity and the group variable at all four evaluation periods: after one day ( p  = 0.002), one week ( p  = 0.002), two weeks ( p  = 0.007) and one month ( p  = 0.001). The majority of participants in the “Adhesive with Nanoparticles” group reported none or mild sensitivity whereas the “Adhesive without Nanoparticles” group had a higher number of participants experiencing moderate to severe sensitivity. It could be deduced that participants who underwent restorative intervention using titanium dioxide nanoreinforced adhesives reported a notable reduction in sensitivity at all time intervals. Hence, the incidence and severity of postoperative sensitivity are significantly reduced by using Bacillus subtilis-procured nano-fortified adhesives as compared to conventional adhesives without nanoparticles.

Limitations of the study

A limited sample size was used in this clinical trial to investigate patient sensitivity. Therefore, future studies with a larger number of patients involved in the clinical trial are required to clarify any ambiguity regarding the limited sample size.

Data availability

All data generated or analyzed during the study is available in this manuscript.

Malik S, Waheed Y. Emerging applications of nanotechnology in dentistry. Dent J (Basel). 2023;11(11):266. https://doi.org/10.3390/dj11110266 .

Article   PubMed   Google Scholar  

Bapat RA, Joshi CP, Bapat P, Chaubal TV, Pandurangappa R, Jnanendrappa N, Gorain B, Khurana S, Kesharwani P. The use of nanoparticles as biomaterials in dentistry. Drug Discov Today. 2019;24(1):85–98. Epub 2018 Aug 31. PMID: 30176358.

Article   CAS   PubMed   Google Scholar  

Balhaddad AA, Garcia IM, Mokeem L, Alsahafi R, Collares FM, Sampaio de Melo MA. Metal oxide nanoparticles and nanotubes: ultrasmall nanostructures to engineer antibacterial and improved dental adhesives and composites. Bioengineering. 2021;8(10):146. https://doi.org/10.3390/bioengineering8100146 .

Article   CAS   Google Scholar  

Mansoor A, Khurshid Z, Khan MT, Mansoor E, Butt FA, Jamal A, Palma PJ. Medical and dental applications of Titania nanoparticles: an overview. Nanomaterials (Basel). 2022;12(20):3670. https://doi.org/10.3390/nano12203670 .

Jandt KD, Watts DC. Nanotechnology in dentistry: present and future perspectives on dental nanomaterials. Dent Mater. 2020;36(11):1365–78.

Article   CAS   PubMed   PubMed Central   Google Scholar  

Sabbagh J, Fahd JC, McConnell RJ. Post-operative sensitivity and posterior composite resin restorations: a review. Dent Update. 2018;45(3):207–13.

Article   Google Scholar  

Hayashi M. Adhesive Dentistry: understanding the science and achieving clinical success. Dent Clin North Am. 2020;64(4):633–43. https://doi.org/10.1016/j.cden.2020.05.001 .

Demarco FF, Collares K, Correa MB, Cenci MS, Moraes RR, Opdam NJ. Should my composite restorations last forever? Why are they failing? Braz Oral Res. 2017;31(suppl 1):e56.doi: 10.1590/1807-3107BOR-2017.vol31.0056.

Mansoor A, Khan MT, Mehmood M, Khurshid Z, Ali MI, Jamal A. Synthesis and characterization of titanium oxide nanoparticles with a novel biogenic process for dental application. Nanomaterials (Basel). 2022;12(7):1078.

Bastos NA, Bitencourt SB, Martins EA, De Souza GM. Review of nanotechnology applications in resin-based restorative materials. J Esthet Restor Dent. 2021;33(4):567–82.

Aboelenein AZ, Riad MI, Haridy MF. Effect of a self-etch adhesive containing nanobioglass on postoperative sensitivity of posterior composite restorations - a randomized trial. Open Access Maced J Med Sci. 2019;7(14):2313–20.

Article   PubMed   PubMed Central   Google Scholar  

Kirthi A, Rahuman AA, Rajakumar G, Marimuthu S, Santhoshkumar T, Jayaseelan C, Elango G, Zahir AA, Kamaraj C, Bagavan A. Biosynthesis of titanium dioxide nanoparticles using bacterium Bacillus subtilis . Mater Lett. 2011;65:2745–7. [CrossRef].

Elkady M, Abdelhakim SH, Riad M. Impact of repeated preheating of bulk-fill resin composite on postoperative hypersensitivity; a randomized controlled clinical trial. BMC Oral Health. 2024;24(1):453. https://doi.org/10.1186/s12903-024-04170-4 .

Channa S, Rajput F, Bilgrami A, Javed F, Faraz H, Madiha. Evaluation of post operative sensitivity of nanofilled composite Versus bulk filled resin composite in posterior class 2 restoration: post-operative sensitivity of nanofilled composite. Pakistan J Health Sci. 2023;4(04).

Cristina I, Cristina I. Post-operative sensitivity in direct resin composite restorations: Clinical practice guidelines. 2012.

Cieplik F, Hiller KA, Buchalla W, Federlin M, Scholz KJ. Randomized clinical split-mouth study on a novel self-adhesive bulk-fill restorative vs. a conventional bulk-fill composite for restoration of class II cavities - results after three years. J Dent. 2022;125:104275.

Tardem C, Albuquerque EG, Lopes LS, Marins SS, Calazans FS, Poubel LA, Barcelos R, Barceleiro MO. Clinical time and postoperative sensitivity after use of bulk-fill (syringe and capsule) vs. incremental filling composites: a randomized clinical trial. Braz Oral Res. 2019;33(0):e089.

Azmy E, Al-Kholy MRZ, Fattouh M, Kenawi LMM, Helal MA. Impact of nanoparticles additions on the strength of dental composite resin. Int J Biomater. 2022;2022:1165431.

Vellappally S, Hashem M, Altinawi A, Fouad H. Nanoparticle incorporated dentin bonding agent to caries effected dentin treated by photodynamic therapy, laser or chlorhexidine. Photodiagnosis Photodyn Ther. 2021;36:102495.

Liang K, Weir MD, Reynolds MA, Zhou X, Li J, Xu HHK. Poly (amido amine) and nano-calcium phosphate bonding agent to remineralize tooth dentin in cyclic artificial saliva/lactic acid. Mater Sci Eng C Mater Biol Appl. 2017;72:7–17.

Mansoor A, Mansoor E, Khan MT, Mehmood M, Hassan SMU, Shah AU, Asjid U, Rai A, Palma PJ. Synthesis and clinical efficacy of novel jasmine titania tooth whitening gel on color, surface roughness and morphology. Nano-structures &Nano-objects. 2024;38:101206. https://doi.org/10.1016/j.nanoso.2024.101206 .

Melo MA, Cheng L, Zhang K, Weir MD, Rodrigues LK, Xu HH. Novel dental adhesives containing nanoparticles of silver and amorphous calcium phosphate. Dent Mater. 2013;29(2):199–210. https://doi.org/10.1016/j.dental.2012.10.005 .

Ramos-Tonello CM, Lisboa-Filho PN, Arruda LB, Tokuhara CK, Oliveira RC, Furuse AY, Rubo JH, Borges AFS. Titanium dioxide nanotubes addition to self-adhesive resin cement: Effect on physical and biological properties. Dent Mater. 2017;33(7):866–75.

Stürmer M, Garcia IM, Souza VS, Visioli F, Scholten JD, Samuel SMW, Leitune VCB, Collares FM. Titanium dioxide nanotubes with triazine-methacrylate monomer to improve physicochemical and biological properties of adhesives. Dent Mater. 2021;37(2):223–35.

Mansoor A, Khurshid Z, Mansoor E, Khan MT, Ratnayake J, Jamal A. Effect of currently available nanoparticle synthesis routes on their biocompatibility with fibroblast cell lines. Molecules. 2022;27(20):6972.

Sancakli HS, Yildiz E, Bayrak I, Ozel S. Effect of different adhesive strategies on the post-operative sensitivity of class I composite restorations. Eur J Dent. 2014;8(1):15–22. https://doi.org/10.4103/1305-7456.126234 . PMID: 24966741; PMCID: PMC4054027.

Yousaf A, Aman N, Manzoor MA, Shah JA, Dilrasheed. Postoperative sensitivity of self etch versus total etch adhesive. J Coll Physicians Surg Pak. 2014;24(6):383–6.

PubMed   Google Scholar  

Costa T, Rezende M, Sakamoto A, Bittencourt B, Dalzochio P, Loguercio AD, Reis A. Influence of sdhesive type and placement technique on postoperative sensitivity in posterior composite restorations. Oper Dent. 2017 Mar/Apr;42(2):143–54.

Reis A, Dourado Loguercio A, Schroeder M, Luque-Martinez I, Masterson D, Cople Maia L. Does the adhesive strategy influence the post-operative sensitivity in adult patients with posterior resin composite restorations? A systematic review and meta-analysis. Dent Mater. 2015;31(9):1052–67.

Loguercio AD, Ñaupari-Villasante R, Gutierrez MF, Gonzalez MI, Reis A, Heintze SD. 5-year clinical performance of posterior bulk-filled resin composite restorations: a double-blind randomized controlled trial. Dent Mater. 2023;39(12):1159–68. https://doi.org/10.1016/j.dental.2023.10.018 . Epub 2023 Oct 13. PMID: 37839995.

Arbildo-Vega HI, Lapinska B, Panda S, Lamas-Lara C, Khan AS, Lukomska-Szymanska M. Clinical effectiveness of bulk-fill and conventional resin composite restorations: systematic review and meta-analysis. Polym (Basel). 2020;12(8):1786.

Sengupta A, Naka O, Mehta SB, Banerji S. The clinical performance of bulk-fill versus the incremental layered application of direct resin composite restorations: a systematic review. Evid Based Dent. 2023;24(3):143.

Mansoor A, Mansoor E, Mehmood M, Hassan SMU, Shah AU, Asjid U, Ishtiaq M, Jamal A, Rai A, Palma PJ. Novel microbial synthesis of titania nanoparticles using probiotic Bacillus coagulans and its role in enhancing the microhardness of glass ionomer restorative materials. Odontology. 2024;30. https://doi.org/10.1007/s10266-024-00921-5 .

Mansoor A, Mehmood M, Hassan SMU, Ali MI, Badshah M, Jamal A, History A. Antibacterial effect of titanium oxide nanoparticles and their application as alternative to antibiotics. Pak Vet J. 2023;43(2):269–275. https://doi.org/10.29261/pakvetj/2023.039 .

Mansoor A, Mehmood M, Ishtiaq M, Jamal A. Synthesis of TiO2 nanoparticles and demonstration of their antagonistic properties against selected dental caries promoting bacteria. Pak J Med Sci. 2023;39(5):1249–54. https://doi.org/10.12669/pjms.39.5.7851 .

Download references

Acknowledgements

Authors acknowledge School of Dentistry for their help and support in the research.

Author information

Authors and affiliations.

Department of Operative Dentistry and Endodontics, School of Dentistry, Shaheed Zulfiqar Ali Bhutto Medical University, Islamabad, Pakistan

Department of Microbiology, Quaid-i-Azam University, Islamabad, 45320, Pakistan

Afsheen Mansoor

Department of Dental Material Sciences, School of Dentistry, Shaheed Zulfiqar Ali Bhutto Medical University, Islamabad, 44080, Pakistan

Medical Student, Rawalpindi Medical University, Rawalpindi, Pakistan

Nabiha Eeman

Medical Student, Army Medical College, National University of Medical Sciences, Rawalpindi, Pakistan

Muhammad Nouman Ahmed

Islamic International Dental College, Riphah International University Islamabad, Islamabad, 46000, Pakistan

Emaan Mansoor

CRS Agriculture Department, Islamia University, Bahawalpur, Pakistan

Khadim Hussain

Faculty of Medicine, Center for Innovation and Research in Oral Sciences (CIROS), University of Coimbra, Coimbra, 3000-075, Portugal

Paulo J. Palma

Faculty of Medicine, Institute of Endodontics, University of Coimbra, Coimbra, 3000-075, Portugal

Department of Statistics, Islamia University, Bahawalpur, Pakistan

You can also search for this author in PubMed   Google Scholar

Contributions

Conceptualization: N.A, A.M.; Methodology: N.A., A.M.; Data Collection: N.A., N.E., M.N.A., E.M.; Data Processing: N.A., N.E., M.N.A., E.M, K.H.; Data Analysis & Interpretation: K.H, N.A.; Writing - Original Draft: N.A.; Writing - Review & Editing: A.M., P.J.P.; Validation and Critical Review: A.M., P.J.P.; Supervised by: A.M.

Corresponding authors

Correspondence to Afsheen Mansoor or Paulo J. Palma .

Ethics declarations

Ethics approval and consent to participate.

The study was conducted after the approval from Institutional Ethical Research Committee , School of Dentistry , Shaheed Zulfiqar Ali Bhutto Medical University with the reference number (SOD/ERB/2023/31 − 01) . An informed consent was obtained from all of the participants included in this trial.

Consent for publication

A consent for publication was obtained from allthe participants involved in the study.

Conflict of interest

The authors declare no conflicts of interest.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions.

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ .

Reprints and permissions

About this article

Cite this article.

Amir, N., Mansoor, A., Eeman, N. et al. Postoperative sensitivity of composites using novel Bacillus subtilis nanofortified adhesives: a triple-blind study. BMC Oral Health 24 , 1077 (2024). https://doi.org/10.1186/s12903-024-04825-2

Download citation

Received : 01 July 2024

Accepted : 28 August 2024

Published : 12 September 2024

DOI : https://doi.org/10.1186/s12903-024-04825-2

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Composite resins
  • Dentin sensitivity
  • Nanoparticles
  • Restoration
  • Titanium dioxide

BMC Oral Health

ISSN: 1472-6831

research article using chi square test

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings
  • My Bibliography
  • Collections
  • Citation manager

Save citation to file

Email citation, add to collections.

  • Create a new collection
  • Add to an existing collection

Add to My Bibliography

Your saved search, create a file for external citation management software, your rss feed.

  • Search in PubMed
  • Search in NLM Catalog
  • Add to Search

McNemar chi2 test revisited: comparing sensitivity and specificity of diagnostic examinations

Affiliation.

  • 1 Gama Filho University, School of Medicine, Rio de Janeiro, Brazil. [email protected]
  • PMID: 18224558
  • DOI: 10.1080/00365510701666031

When evaluating a novel diagnostic examination for clinical use, it should be compared with a reference standard, defined as the best available examination, which may include clinical and laboratory criteria. The novel examination and reference standard's results are usually presented in the form of a 2 x 2 table, which allows calculation of sensitivity, specificity and accuracy. It has been recommended that the measures of statistical uncertainty should be reported, such as the 95% confidence interval, when evaluating the accuracy of diagnostic examinations. Comparing the difference in sensitivity or specificity of a novel examination with the reference standard is important when evaluating its usefulness. The McNemar chi(2) test, used to compare discordance of two dichotomous responses, can be applied for this purpose. However, applying the McNemar test to a 2 x 2 table for comparing the accuracy of examinations is not recommended, since this test is sensitive to the proportion of positive versus negative subjects. Moreover, if the novel examination has higher sensitivity than the one considered as the reference standard, constructing a classic 2 x 2 table would result in low specificity of the novel examination. Thus, in order to compare sensitivities and specificities between examinations, this table is inappropriate and an independent reference standard is necessary. In this article, we propose the use of the McNemar chi(2) test to compare sensitivities between examinations using a 2 x 2 table exclusively among diseased patients, defined by a set of criteria and follow-up of patients. Likewise, specificities can be compared applying the McNemar test among healthy individuals.

PubMed Disclaimer

Similar articles

  • Pleural fluid ADA, IgA-ELISA and PCR sensitivities for the diagnosis of pleural tuberculosis. Trajman A, Kaisermann C, Luiz RR, Sperhacke RD, Rossetti ML, Féres Saad MH, Sardella IG, Spector N, Kritski AL. Trajman A, et al. Scand J Clin Lab Invest. 2007;67(8):877-84. doi: 10.1080/00365510701459742. Scand J Clin Lab Invest. 2007. PMID: 17852820
  • The diagnostic utility of adenosine deaminase isoenzymes in tuberculous pleural effusions. Zemlin AE, Burgess LJ, Carstens ME. Zemlin AE, et al. Int J Tuberc Lung Dis. 2009 Feb;13(2):214-20. Int J Tuberc Lung Dis. 2009. PMID: 19146750
  • [Research on diagnostic tests in Medicina Clinica. A methodological assessment]. Ramos Rincón JM, Hernández Aguado I. Ramos Rincón JM, et al. Med Clin (Barc). 1998 Jul 4;111(4):129-34. Med Clin (Barc). 1998. PMID: 9717144 Spanish.
  • Diagnostic accuracy of flow rate testing in urology. Patel HR, Garcia-Montes F, Christopher N, Reeves BC, Emberton M. Patel HR, et al. BJU Int. 2003 Jul;92(1):58-63. BJU Int. 2003. PMID: 12823384 Review.
  • [Roaming through methodology. XXXII. False test results]. van der Weijden T, van den Akker M. van der Weijden T, et al. Ned Tijdschr Geneeskd. 2001 May 12;145(19):906-8. Ned Tijdschr Geneeskd. 2001. PMID: 11387865 Review. Dutch.
  • Comparison of AI-integrated pathways with human-AI interaction in population mammographic screening for breast cancer. Frazer HML, Peña-Solorzano CA, Kwok CF, Elliott MS, Chen Y, Wang C; BRAIx Team; Lippey JF, Hopper JL, Brotchie P, Carneiro G, McCarthy DJ. Frazer HML, et al. Nat Commun. 2024 Aug 30;15(1):7525. doi: 10.1038/s41467-024-51725-8. Nat Commun. 2024. PMID: 39214982 Free PMC article.
  • Evaluation metrics and statistical tests for machine learning. Rainio O, Teuho J, Klén R. Rainio O, et al. Sci Rep. 2024 Mar 13;14(1):6086. doi: 10.1038/s41598-024-56706-x. Sci Rep. 2024. PMID: 38480847 Free PMC article.
  • Retrospective validation of MetaSystems' deep-learning-based digital microscopy platform with assistance compared to manual fluorescence microscopy for detection of mycobacteria. Desruisseaux C, Broderick C, Lavergne V, Sy K, Garcia D-J, Barot G, Locher K, Porter C, Caza M, Charles MK. Desruisseaux C, et al. J Clin Microbiol. 2024 Mar 13;62(3):e0106923. doi: 10.1128/jcm.01069-23. Epub 2024 Feb 1. J Clin Microbiol. 2024. PMID: 38299829 Free PMC article.
  • Diagnostic accuracy of an artificial intelligence algorithm versus radiologists for fracture detection on cervical spine CT. van den Wittenboer GJ, van der Kolk BYM, Nijholt IM, Langius-Wiffen E, van Dijk RA, van Hasselt BAAM, Podlogar M, van den Brink WA, Bouma GJ, Schep NWL, Maas M, Boomsma MF. van den Wittenboer GJ, et al. Eur Radiol. 2024 Aug;34(8):5041-5048. doi: 10.1007/s00330-023-10559-6. Epub 2024 Jan 11. Eur Radiol. 2024. PMID: 38206401
  • A Novel Clinical Tool to Detect Severe Obstructive Sleep Apnea. Ye Y, Yan ZL, Huang Y, Li L, Wang S, Huang X, Zhou J, Chen L, Ou CQ, Chen H. Ye Y, et al. Nat Sci Sleep. 2023 Oct 17;15:839-850. doi: 10.2147/NSS.S418093. eCollection 2023. Nat Sci Sleep. 2023. PMID: 37869520 Free PMC article.

Publication types

  • Search in MeSH

Related information

  • PubChem Compound
  • PubChem Compound (MeSH Keyword)
  • PubChem Substance

Grants and funding

  • 5U2 R TW006883-03/TW/FIC NIH HHS/United States

LinkOut - more resources

Full text sources.

  • Taylor & Francis

full text provider logo

  • Citation Manager

NCBI Literature Resources

MeSH PMC Bookshelf Disclaimer

The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.

IEEE Account

  • Change Username/Password
  • Update Address

Purchase Details

  • Payment Options
  • Order History
  • View Purchased Documents

Profile Information

  • Communications Preferences
  • Profession and Education
  • Technical Interests
  • US & Canada: +1 800 678 4333
  • Worldwide: +1 732 981 0060
  • Contact & Support
  • About IEEE Xplore
  • Accessibility
  • Terms of Use
  • Nondiscrimination Policy
  • Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity. © Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.

  • Research Note
  • Open access
  • Published: 14 September 2024

Prevalence, types and outcome of injuries among abattoir workers in Ghana

  • Abigail Aban Tetteh 1 , 2 ,
  • Veronica Millicent Dzomeku 1 ,
  • Patience Achiamaa Barnie 1 ,
  • Adwoa Gyamfi 1 ,
  • Ato Kwamina Arhin 3 ,
  • Benjamin Noble Adjei 4 ,
  • Bernard Barnie 4 ,
  • Emmanuel Kwaku Nakua 4 ,
  • Charles Mock 5 &
  • Peter Donkor 6  

BMC Research Notes volume  17 , Article number:  265 ( 2024 ) Cite this article

Metrics details

In many places in the world, workers in the meat processing industry report high incidence of injuries. Details of such injuries are not well known for Ghana or much of Africa.

A cross-sectional survey involving 300 workers from three major meat processing facilities in the Kumasi metropolis of Ghana was carried out using a structured questionnaire from April to June 2023. The prevalence, types and outcome of injuries among workers were assessed. Test of association was established by Chi square analysis.

Over the prior 6 months, the prevalence of injury was 83.0%. Among the various injury types, lacerations had the highest prevalence (46.0%) followed by musculoskeletal pain (16.7%) bone fractures (14.0%), swelling (13.0%), burns and scalds (7.3%), and dislocations/sprains/strains (6.7%). More than half (58.9%) of injuries sustained were moderately severe (2–7 days of lost work) and nearly half (42.0%) required immediate medical attention. Gender, employment status, wages, availability and use of safety equipment were significantly associated with injuries among abattoir workers.

Conclusions

The incidence of injuries among abattoir workers in Kumasi, Ghana demonstrates a large public health burden requiring attention and improved enforcement through occupational safety interventions.

Peer Review reports

Introduction

Activities in the meat processing industry predispose worker to various form of injuries, such as deep lacerations, falls, fractures and bites from animals [ 1 , 2 , 3 , 4 , 5 ]. Non-fatal injuries such as sprains and strains are common often requiring time off work or job modifications based on severity [ 6 , 7 ]. Most studies on injuries in the meat processing industry come from high-income countries. But the few studies that have addressed injuries in the meat processing industry in Africa have shown a high prevalence of injuries among workers, often associated with factors such as dirty slippery floors, kicks and stamps from irate animals and sharp machinery [ 8 , 9 , 10 ].

Globally and in Ghana, abattoirs are obligated to have occupational safety policies that will mitigate the occurrence of injury. Regrettably these policies are inadequate, poorly coordinated or non-existent in most abattoirs [ 11 , 12 , 13 , 14 ].

In Ghana, there has been attention to other occupational safety issues, such as those in construction and mining, food safety and infectious disease risk from abattoirs [ 15 , 16 , 17 , 18 , 19 ]. However, there has been almost no attention paid to injury and safety risks for abattoir workers in Ghana. To support the development of occupational injury control strategies, it is imperative to obtain detailed information on injury characteristics. This study addressed the gap by assessing injury prevalence, types, and outcomes in abattoirs workers in Ghana.

Study design and setting

The study employed a quantitative research approach using descriptive cross-sectional design to solicit for information from abattoir workers in the Greater Kumasi Metropolis of Ghana. The city has a heterogeneous population and enhanced economic activities. It is the second largest city in Ghana with a population of 3,490,000 and a land size of 299km 2 [ 20 ]. Meat processing industries in Kumasi receive their supply of animals from different regions in the country, and neighboring countries like Mali and Niger [ 21 ].

Study population

The city has 3 major meat processing facilities and 17 smaller facilities with less than 20 workers each. This study was carried out in the three main meat processing facilities. This includes Kumasi Abattoir, the Subtui Musah Slaughterhouse and the Akwatia Line slaughter slab with worker population of 200, 560 and 290 respectively. These three facilities cut across the different grades of meat processing facilities in the country. Workers include those who move animals and work in lairage as well as butchers and slaughterers engaged in killing, singeing and processing of the meat. There are also retailers, administrators and general workers who dispense, inspect the site and keep the facility operating.

Data collection

A structured questionnaire adapted from previous injury literature was employed for the data collection [ 15 , 17 ]. It was divided into demographic characteristics, types and frequencies of injuries, and outcomes of the injuries occurring in the abattoir. A three-day training was done for the research assistants. The training focused on building understanding on the questionnaire and the objective of the study, conducting interviews and maintaining confidentiality. The questionnaire was in English but most of the respondents had low levels of education. Hence research assistants were trained on how to translate the questions into a language directly understood by the participants, mostly Twi and Hausa. The questionnaire asked about injuries over the prior six months. If a respondent had more than one injury event during that time, they were asked to report on whichever injury they chose.

The tool was pretested at a separate facility (Sofoline Slaughterhouse) not involved in the remainder of the study to assess the questions and the interview skills of the research assistants. Four questions were modified to assess the specific department in which the worker is engaged. Content validity assessment was done by seven experts, who have published extensively on injury related studies and necessary modifications were made before the actual data collection.

An estimated sample size of 300 was calculated using the Yamane formula [ 22 ]. Purposive sampling was used. After obtaining ethical clearance and administrative approval from authorities, the principal investigator and research assistants visited the three worksites for a total of eight days. During this time, they approached for interviews workers who were at the worksite that day. The principal investigator sought written informed consent from workers, explaining the objectives of the study. Workers who consented were interviewed. The principal investigator and research assistants interviewed as many workers as possible during the time allotted each day, up until the goal of 300 was achieved. All 300 workers approached agreed to be part of the study. Interviews were anonymous and no names or other identifying information about the respondents were collected. Data collection ran from April to June 2023.

Data analysis

Quantitative tools were employed in data analysis. Data were first cleaned and checked for completeness then exported to IBM SPSS Version 25.0, USA for analysis using descriptive statistics. Continuous variables were expressed as mean ± standard deviation (M ± SD) and the results were presented in tables, frequencies and percentages. Test of association was established by Chi square analysis.

Respondents’ demographic characteristics

A total sample of 300 respondents participated in the study. Table 1 summarizes their demographic characteristics. The largest single group of participants were between 40–49 years, representing 28.3% of the respondents. The industry is predominantly male (96.7%), and are married (78.0%). Similar proportions of the respondents have primary (24.0%), junior high (21.0%) and secondary education (25.0%) with only 9.0% having tertiary education. Nearly fifty percent of respondents have over 10 years of working experience in the abattoir. Majority (60.0%) are casual workers (46.3%) hired and paid daily wages.

Prevalence, types and outcome of injuries

In the 6 months prior to the study, 249 workers reported at least one injury for a prevalence of injury of 83.0% (Table  2 ). Laceration was the most frequent injury sustained by respondents, representing 46.0%, followed by musculoskeletal pain (16.7%) and bone fractures (14.0%). Other leading injuries included swelling of various body parts (13.0%), burns and scalds (7.3%), and dislocation, sprains, or strains (6.7%). Close to half (42.0%) of these injuries sustained required immediate medical attention (42.0%) at health facilities. Another large group (36.7%) were treated first aid by co-workers at the worksite. In terms of long-term outcome, more than half (58.9%) of injuries were moderately severe, leading to 2–7 days of lost work.

Association between socio-demographic characteristics, safety measures and injuries among workers in the abattoir

Table 3 shows a significant association between gender, employment status, wages and injuries among abattoir workers (p < 0.05). There was low availability of fire extinguisher (33.7%), first aid kits (16.8%), smoke detector (1.4%), emergency exit (12.1%), safety boots (63.2%) and injury incidence record book (4.1%). Only 30.1% of the respondents used PPEs at work. Use of PPEs, availability of fire extinguisher, first aid kits, and smoke detector respectively were significantly associated with injuries (p < 0.05).

This study reports the injury burden in the meat processing industry in Kumasi, an industry that has seen little attention in terms of research. The study assessed the prevalence, types and outcome of injuries sustained by workers. This study suggests that the prevalence of injury is high with the types being predominantly lacerations, followed by musculoskeletal pain. A significant number of workers sustained moderately severe injuries, losing 2–7 days of work time, as well as requiring medical attention, both of which represent financial losses for the workers. Gender, employment status, wages and availability and use of safety equipment were significantly associated with injuries.

Our findings need to be put into the context of other studies. In high-income countries, injury rates are generally much lower. For instance, injury incidence rates of 15.2 to 22.8 per 100 full-time employees and an annual total injury rates per 100 workers of 6.4% (poultry) and 13.2% (pork) have been reported in abattoir facilities in the United States [ 7 , 23 , 24 ]. All of these reports show far fewer injuries than the current study's finding of 83.0% of workers sustaining at least one injury during the past 6 months (approximately equivalent to 166 injuries per 100 workers per year).

The high burden of injury in Ghana is similar to what has been reported from other African countries. In a study of slaughterhouses in Kenya, Cook et al. found that 25% of workers reported an injury at least once per month, with 8% of workers still having a wound at the time of the interview [ 25 ]. In a different study, Makori et al. found that 85% of slaughterhouse workers in the Nairobi area had been injured in the past year [ 26 ]. Among 203 workers in five slaughter houses in Ilorin, Nigeria, 88% of workers reported having had at least one injury ever [ 27 ].

We examined the types of injuries sustained by workers and our data highlighted lacerations as the most dominate type of injury. It is unsurprising as workers in this sector are usually exposed to numerous hazards such as sharp cutting tools and bones. This finding agrees with studies, both in Africa and in countries elsewhere [ 9 , 23 , 25 , 28 , 29 , 30 ]. Musculoskeletal pain was the next common type of injury reported by this study possibly due to the repetitive movement and heavy lifting associated with abattoir operations [ 9 , 28 ]. However, in another study conducted in the United States, bovine related injuries dominated [ 31 ].

This study highlights an association between employment status, wages and injuries. Most of the workers in the current study were casual employees who are paid based on their daily wages irrespective of their decade working experience in the industry. Taking a day off is considered as absenteeism and no commission is earned by the worker [ 32 ]. It is possible that employees usually work shifts and overtime to make ends meet. Worker tiredness affects attention and reaction times and raises the risk of accidents. Studies have linked shift work and weariness to the probability of accidents [ 27 , 28 ]. This may possibly be a factor to the high prevalence of injury reported by this study.

Majority of injuries were moderately severe requiring between 2 and 7 days for treatment and recovery, with most respondents needing immediate medical attention. It can be anticipated that the lost wages and cost of treatment will have considerable negative economic consequences to these workers and their families. Most workers received daily pay and would not be paid while out of work. Also, the cost for treatment of even work-related injuries is usually borne by the worker and their families.

Gender of the worker was significantly associated with injuries in this study. It is evident from this and other studies that the industry is male dominated [ 1 , 7 , 25 ] and possibly puts this gender at risk for injuries. This mirrors findings from other studies where gender of the worker has been anticipated to be a major influencer on work place injury [ 33 ]. Although other studies suggest that the risk of injury is equivalent for both male and female [ 8 ].

The study highlights low PPE usage and the absence of safety equipment in the facilities, similar to findings of other studies in the industry [ 9 , 10 , 14 , 24 , 25 ]. Meanwhile there is a significant association between these factors and injuries. This finding agrees with other studies that using PPE properly, dramatically lowers the risk of injuries in the industry [ 27 , 34 ].

The types of injuries noted in the meat processing industry are mostly lacerations, musculoskeletal pain and bone fractures and the frequency of these injuries is high. Most of these injuries are moderately severe necessitating immediate medical treatment. This study highlights a high burden of injuries in the meat processing industries. This reinforces calls for enforcement of existing occupational health policies in this industry.

Strengths and limitations of the study

The study depended on self-report of injuries and there was no way of verifying answers about the types, frequencies and outcome of injuries sustained. A six-month recall period was used which could have led to recall bias for earlier injuries. Despite these limitations, the present study has several strengths. The sample size was large and also the study was conducted in three geographical locations in separate slaughterhouse facilities in the metropolis thereby increasing generalizability.

Availability of data and materials

Datasets used and analyzed in this study are safely kept with the corresponding author and will be available on reasonable request.

Dias NF, Tirloni AS, dos Reis DC, Moro AR. Risk of slaughterhouse workers developing work-related musculoskeletal disorders in different organizational working conditions. Int J Ind Ergon. 2020;1(76): 102929.

Article   Google Scholar  

Gómez MM. Prediction of work-related musculoskeletal discomfort in the meat processing industry using statistical models. Int J Ind Ergon. 2020;1(75): 102876.

Romanov D, Korostynska O, Lekang OI, Mason A. Towards human–robot collaboration in meat processing: challenges and possibilities. J Food Eng. 2022. https://doi.org/10.1016/j.jfoodeng.2022.111117 .

van Holland BJ, Soer R, de Boer MR, Reneman MF, Brouwer S. Preventive occupational health interventions in the meat processing industry in upper-middle and high-income countries: a systematic review on their effectiveness. Int Arch Occup Environ Health. 2015;88:389–402.

Article   PubMed   Google Scholar  

Vergara LG, Pansera TR. Ergonomics analysis of the activity of boning shoulder in a pig slaughter-house in the city of Ipiranga-SC. Work. 2012;41(Supplement 1):703–9.

Li S, Subbiah J, Dvorak B. Environmental and occupational impacts from US beef slaughtering are of same magnitude of beef foodborne illnesses on human health. Environ Int. 2019;1(129):507–16.

Leibler JH, Perry MJ. Self-reported occupational injuries among industrial beef slaughterhouse workers in the Midwestern United States. J Occup Environ Hyg. 2017;14(1):23–30.

Mogute JR. Work-related injuries among slaughterhouse workers in Nairobi city county, Kenya (Doctoral dissertation) Kenyatta University 2021.

Jerie S, Matunhira K. Occupational safety and health hazards associated with the slaughtering and meat processing industry in urban areas of Zimbabwe: a case study of the Gweru city Municipal Abattoir. Ghana J Geogr. 2022. https://doi.org/10.4314/gjg.v14i1.2 .

Johnson OE, Etokidem AJ. Occupational hazards and health problems among butchers in Uyo, Nigeria. Niger Med J. 2019;60(3):106–12.

Article   PubMed   PubMed Central   Google Scholar  

Meat and Livestock Australia. OHS Reference Guide Australian Meat Industry Part 4: common Hazards. 2001. http://mintrac-whs.com.au/wp-content/uploads/OHS-Reference-Guide-Part4 . Accessed 23 Aug 2024.

Annan JS, Addai EK, Tulashie SK. A call for action to improve occupational health and safety in Ghana and a critical look at the existing legal requirement and legislation. Saf Health Work. 2015;6(2):146–50.

Puplampu BB, Quartey SH. Key issues on occupational health and safety practices in Ghana: A review. Int J Business Soc Sci. 2012;3(19):151–6.

Google Scholar  

Osman F, Frederick A, Edmund K. An assessment of abattoirs, slaughterhouses and slaughter practices in the three Northern Regions of Ghana. Trop Veter. 2019;37(1):53–64.

Amissah J, Badu E, Agyei-Baffour P, Nakua EK, Mensah I. Predisposing factors influencing occupational injury among frontline building construction workers in Ghana. BMC Res Notes. 2019;12:1–8.

Gyedu A, Nakua EK, Otupiri E, Mock C, Donkor P, Ebel B. Incidence, characteristics and risk factors for household and neighbourhood injury among young children in semiurban Ghana: a population-based household survey. Inj Prev. 2015;21(e1):e71–9.

Nakua EK, Owusu-Dabo E, Newton S, Adofo K, Otupiri E, Donkor P, Mock C. Occupational injury burden among gold miners in Ghana. Int J Inj Contr Saf Promot. 2019;26(4):329–35.

Adonu RE, Dzokoto L, Salifu SI. Sanitary and hygiene conditions of slaughterhouses and its effect on the health of residents: a case study of Amasaman slaughterhouse in the Ga west municipality. Ghana Food Sci Qual Manag. 2017;65:11–5.

Asiam RI. Food safety knowledge and food safety practices of meat handlers in abattoirs and butcheries in Accra metropolis of Ghana (Doctoral dissertation, University of Education, Winneba).

Ghana Statistical Service. Ghana 2021 Population and Housing Census. Accra: Ghana Statistical Service; 2021.

Frimpong S, Gebresenbet G, Bosona T, Bobobee E, Aklaku E. Animal supply and logistics activities of abattoir chain in developing countries: the case of Kumasi Abattoir, Ghana. J Serv Sci Manag. 2021;5:1–8.

Uakarn C, Chaokromthong K, Sintao N. Sample size estimation using Yamane and Cochran and Krejcie and Morgan and Green formulas and Cohen statistical power analysis by G* power and comparisons. Apheit Int J. 2021;10(2):76–88.

Culp K, Brooks M, Rupe K, Zwerling C. Traumatic injury rates in meatpacking plant workers. J Agromed. 2008;13(1):7–16.

Kyeremateng-Amoah E, Nowell J, Lutty A, Lees PS, Silbergeld EK. Laceration injuries and infections among workers in the poultry processing and pork meatpacking industries. Am J Ind Med. 2014;57(6):669–82.

Cook EA, de Glanville WA, Thomas LF, Kariuki S, Bronsvoort BM, Fèvre EM. Working conditions and public health risks in slaughterhouses in western Kenya. BMC Public Health. 2017;17:1–2.

Makori CM, Warutere PN, Nguhiu PN. Factors associated with the injuries inflicted to workers in slaughterhouses and meat processing plants in Nairobi, Kenya. Int J Cur Res Life Sci. 2018;7:2020–3.

Odetokun IA, Ghali-Mohammed I, Alhaji NB, Nuhu AA, Oyedele HA, Ameen SA, Adetunji VO. Occupational health and food safety risks in Ilorin, Northcentral Nigeria: a cross-sectional survey of slaughterhouse workers. Food Protect Trends. 2020;40(4):241–50.

Slade J, Alleyne E. The psychological impact of slaughterhouse employment: a systematic literature review. Trauma Violence Abuse. 2023;24(2):429–40.

Abdullahi A, Hassan A, Kadarman N, Junaidu YM, Adeyemo OK, Lua PL. Occupational hazards among the abattoir workers associated with noncompliance to the meat processing and waste disposal laws in Malaysia. Risk Manag Healthc Policy. 2016;13:157–63.

Cook EA, de Glanville WA, Thomas LF, Kariuki S, de Clare Bronsvoort BM, Fèvre EM. Risk factors for leptospirosis seropositivity in slaughterhouse workers in western Kenya. Occup Environ Med. 2017;74(5):357–65.

Watts M, Meisel EM, Densie IK. Cattle-related trauma, injuries and deaths. Trauma. 2014;16(1):3.

Ribas V. On the Line: Slaughterhouse Lives and the Making of the New South (Oakland, CA, 2015; online edn, California Scholarship Online, 22 Sept. 2016), [cited 2024 Aug 23]. https://doi.org/10.1525/california/9780520282957.001.0001 .

Khan YA, Davis AL, Taylor JA. Ladders and lifting: how gender affects safety behaviors in the fire service. J Work Behav Health. 2017;32(3):206–25.

Alam MK, Keiko Y, Hossain MM. Present working conditions in slaughterhouses and meat selling centres and food safety of workers in two districts of bangladesh. Pertanika J Soc Sci Human. 2020;28(2):867–88.

Download references

Acknowledgements

We want to express appreciation to all the workers and management of the various abattoirs in Kumasi who were respondents of this study and to Mr. Joel Adusei-Gyamfi for the ideas and support.

This study was supported by grant D43 TW007267 from the Fogarty International Center at the US National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Author information

Authors and affiliations.

School of Nursing, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana

Abigail Aban Tetteh, Veronica Millicent Dzomeku, Patience Achiamaa Barnie & Adwoa Gyamfi

Ear, Nose and Throat Nursing School, Komfo Anokye Teaching Hospital, Kumasi, Ghana

Abigail Aban Tetteh

Department of Interdisciplinary Studies, The Akenten Appiah-Menka University of Skills Training and Entrepreneurial Development (AAMUSTED), Kumasi, Ghana

Ato Kwamina Arhin

School of Public Health, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana

Benjamin Noble Adjei, Bernard Barnie & Emmanuel Kwaku Nakua

Department of Surgery, University of Washington, Seattle, WA, USA

Charles Mock

Department of Surgery, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana

Peter Donkor

You can also search for this author in PubMed   Google Scholar

Contributions

Conceptualization and design of the study was by AAT, VD, PAB. EKN, screened the tool for data collection. VD, EKN, CM, BB, provided methodological insights. VD, PD, AAT, AG, AKA, coordinated data collection. BNA, AKA, AAT, VD, BB, CM carried out the initial analysis and drafted the initial manuscript. All authors discussed the results and critically reviewed its intellectual contents. VD, BB, EKN, PD and CM critically reviewed and revised the manuscript which as then approved by all other authors for submission.

Corresponding author

Correspondence to Abigail Aban Tetteh .

Ethics declarations

Ethics approval and consent to participate.

Ethical approval was obtained from the ethics committee of Kwame Nkrumah University of Science and Technology (CHRPE AP/203/23) with approval from the administrative wing of each of the facilities visited. Written and verbal consent was obtained from respondents after an explanation of every aspect of the research in their preferred language.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ .

Reprints and permissions

About this article

Cite this article.

Tetteh, A.A., Dzomeku, V.M., Barnie, P.A. et al. Prevalence, types and outcome of injuries among abattoir workers in Ghana. BMC Res Notes 17 , 265 (2024). https://doi.org/10.1186/s13104-024-06934-1

Download citation

Received : 27 May 2024

Accepted : 04 September 2024

Published : 14 September 2024

DOI : https://doi.org/10.1186/s13104-024-06934-1

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Occupational injuries
  • Lacerations

BMC Research Notes

ISSN: 1756-0500

research article using chi square test

The Chi square test: an introduction

  • December 1995
  • This person is not on ResearchGate, or hasn't claimed this research yet.

Bruce F Walker at Murdoch University

  • Murdoch University

Abstract and Figures

research article using chi square test

Discover the world's research

  • 25+ million members
  • 160+ million publication pages
  • 2.3+ billion citations

Ali Bakdur

  • Priscilla Rayanne E. Silva

Matias Noll

  • Ioannis Reklos
  • Kholoud Saad Alghamdi
  • Elena Simperl

Santosh Kumar Mahato

  • Franco Meggio
  • Soham Bandyopadhyay

Monalisa Sarma

  • Susmita Paul
  • Md. Al Mamun
  • Tony Greenfield
  • H. R. Neave
  • Antony Ugoni
  • Recruit researchers
  • Join for free
  • Login Email Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google Welcome back! Please log in. Email · Hint Tip: Most researchers use their institutional email address as their ResearchGate login Password Forgot password? Keep me logged in Log in or Continue with Google No account? Sign up

Advertisement

Advertisement

Pakistani students’ perceptions about knowledge, use and impact of artificial intelligence (AI) on academic writing: a case study

  • Published: 11 September 2024

Cite this article

research article using chi square test

  • Shaista Rashid 1 ,
  • Sadia Malik   ORCID: orcid.org/0000-0002-4989-2359 2 ,
  • Faheem Abbas 2 &
  • Javaria Ahmad Khan 3  

20 Accesses

Explore all metrics

Integrating artificial intelligence (AI) in language pedagogy can help learn and develop many skills. In this context, this study explores Pakistani students' perceptions and trends regarding the knowledge, use, and impact of AI on their academic writing. The data was collected using a quantitative method, using a questionnaire through cluster sampling of four faculties and random sampling of 229 students from Bahuddin Zakariya University, Multan, Pakistan. Data is subjected to frequency analysis, Kruskal–Wallis hypothesis test, and chi-square association test using SPSS. The findings reveal that most students agree regarding the knowledge, use, and impact of AI on their academic writing. For the Kruskal–Wallis test, significant variations are seen in semesters and age groups for all three variables; however, only the knowledge variable shows significant variation across faculties. Moreover, chi-square test results indicate an association among components of knowledge, use, and impact of AI. The research suggests that academia should introduce AI as a pedagogical tool to improve students' comprehension, productivity, and writing quality. Furthermore, trends indicate that comprehensive policy formulation should be implemented to equip students of all faculties, semesters, and age groups to use technology equally.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save.

  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime

Price includes VAT (Russian Federation)

Instant access to the full article PDF.

Rent this article via DeepDyve

Institutional subscriptions

research article using chi square test

Similar content being viewed by others

research article using chi square test

Artificial intelligence in higher education: the state of the field

research article using chi square test

Algorithmically-driven writing and academic integrity: exploring educators' practices, perceptions, and policies in AI era

research article using chi square test

Learning What Works in Improving Writing: A Meta-Analysis of Technology—Oriented Studies Across Saudi Universities

Explore related subjects.

  • Artificial Intelligence
  • Digital Education and Educational Technology

Data availability

The data was collected through questionnaire using Google Forms; SPSS file can be provided if requested.

Ahmed, M., Siddiqui, M., & Usman, T. (2024). Impact of artificial intelligence-based writing assistant on the academic writing skills of university faculty in Pakistan. International Journal of Human and Society, 4 (1), 539–545.

Google Scholar  

Aladini, A. (2023). AI applications impact on improving EFL university academic writing skills and their logical thinking. Al-’ulūm Al-Tarbawiyyaẗ, 31 (2), 25–44. https://doi.org/10.21608/ssj.2023.320166

Article   Google Scholar  

Ali, Z. (2020). Artificial intelligence (AI): A review of its uses in language teaching and learning. IOP Conference Series: Materials Science and Engineering, 769 (1), 012043. https://doi.org/10.1088/1757-899x/769/1/012043

Almusharraf, N., & Bailey, D. (2021). A regression analysis approach to measuring the influence of student characteristics on language learning strategies. International Journal of Instruction, 14 (4), 463–482. https://doi.org/10.29333/iji.2021.14428a

Beiki, M. (2022). Review of writing-related theories. Cultural Arts Research and Development, 2 (1), 27–33. https://doi.org/10.55121/card.v2i1.20

Burkhard. (2022). Student perceptions of AI-powered writing tools: Towards individualized teaching strategies. 19th International Conference on Cognition and Exploratory Learning in Digital Age (CELDA 2022) . https://doi.org/10.33965/celda2022_202207l010

Chaudhry, I. S., Sarwary, S. A. M., Refae, G. A. E., & Chabchoub, H. (2023). Time to revisit existing student’s performance evaluation approach in higher education sector in a new era of Chatgpt — a case study. Cogent Education, 10 (1), 2210461. https://doi.org/10.1080/2331186x.2023.2210461

Chen, L., Chen, P., & Lin, Z. (2020). Artificial intelligence in education: A review. IEEE Access, 8 , 75264–75278. https://doi.org/10.1109/access.2020.2988510

Cunningham, U., Rashid, S., & Van Le, T. (2019). The effect of learner training on the use of digital tools to support English writing skills. Asian EFL Journal , 21 , 27–49. http://uu.diva-portal.org/smash/record.jsf?pid=diva2:1300583

Dhara, S., Chatterjee, S., Chaudhuri, R., Goswami, A., & Ghosh, S. K. (2022). Artificial intelligence in assessment of students’ performance. In CRC Press eBooks (pp. 153–167). https://doi.org/10.1201/9781003184157-8

Dizon, G., & Gayed, J. M. (2021). Examining the impact of Grammarly on the quality of mobile L2 writing. The JALT CALL Journal, 17 (2), 74–92. https://doi.org/10.29140/jaltcall.v17n2.336

Feise, R. J. (2002). Do multiple outcome measures require p-value adjustment? BMC Medical Research Methodology, 2 (1), 8. https://doi.org/10.1186/1471-2288-2-8

Fitria, T. N. (2021). Grammarly as AI-powered English writing assistant: students’ alternative for writing English. Metathesis, 5 (1), 65–78. https://doi.org/10.31002/metathesis.v5i1.3519

Flavell, J. H. (1979). Metacognition and cognitive monitoring: A new area of cognitive-developmental inquiry. American Psychologist, 34 (10), 906–911. https://doi.org/10.1037/0003-066x.34.10.906

Ghafar, Z. N., Salh, H. F., Abdulrahim, M. A., Farxha, S. S., Arf, S. F., & Rahim, R. I. (2023). The role of artificial intelligence technology on English language learning: a literature review. Canadian Journal of Language and Literature Studies, 3 (2), 17–31. https://doi.org/10.53103/cjlls.v3i2.87

Ginting, P., Batubara, H. M., & Hasnah, Y. (2023). Artificial intelligence powered writing tools as adaptable aids for academic writing: Insight from EFL college learners in writing final project. Zenodo, 6 (10), 4640–4650. https://doi.org/10.5281/zenodo.8407887

Grájeda, A., Burgos, J., Olivera, P. C., & Sanjinés, A. (2023). Assessing student-perceived impact of using artificial intelligence tools: Construction of a synthetic index of application in higher education. Cogent Education, 11 (1), 2287917. https://doi.org/10.1080/2331186x.2023.2287917

Imran, M., & Almusharraf, N. (2023). Analyzing the role of ChatGPT as a writing assistant at higher education level: A systematic review of the literature. Contemporary Educational Technology, 15 (4), ep464. https://doi.org/10.30935/cedtech/13605

Kaur, P., Stoltzfus, J., & Yellapu, V. (2018). Descriptive statistics. Biostatistics, 4 (1), 60–67. https://doi.org/10.4103/ijam.ijam_7_18

Keleş, P. U., & Aydın, S. (2021). University students’ perceptions about artificial intelligence. Shanlax International Journal of Education, 9 , 212–220. https://doi.org/10.34293/education.v9is1-may.4014

Knowles, E. (2005). The Oxford dictionary of phrase and fable. In Oxford University Press eBooks . https://doi.org/10.1093/acref/9780198609810.001.0001

Kothari, C. R. (2004). Research methodology: Methods and techniques . New Age International.

LeCompte, M. D., & Schensul, J. J. (2015). Ethics in ethnography: A mixed methods approach . AltaMira Press.

Lei, H. (2022). High school students’ foreign language vocabulary acquisition in the era of artificial intelligence. Advances in Social Science, Education and Humanities Research, 637 , 669–662. https://doi.org/10.2991/assehr.k.220131.121

Li, A. W. (2023). Using Perceptive to support AI-based online writing assessment across the disciplines. Assessing Writing, 57 , 100746. https://doi.org/10.1016/j.asw.2023.100746

Mahmud, F. A. (2023). Investigating EFL students’ writing skills through artificial intelligence: Wordtune application as a tool. Journal of Language Teaching and Research, 14 (5), 1395–1404. https://doi.org/10.17507/jltr.1405.28

Malik, A. R., Pratiwi, Y., Andajani, K., Numertayasa, I. W., Suharti, S., Darwis, A., & Marzuki, M. (2023). Exploring artificial intelligence in academic essay: Higher education student’s perspective. International Journal of Educational Research Open, 5 , 100296. https://doi.org/10.1016/j.ijedro.2023.100296

Malik, S., Sadiq, U., & Khan, J. A. (2021). Belief, practices, and challenges of Pakistani primary grade government school teachers: Variable analysis affecting pronunciation and phonics teaching. Humanities & Social Sciences Reviews, 9 (1), 206–217. https://doi.org/10.18510/hssr.2021.9122

Marzuki, Widiati, U., Rusdin, D., Darwin, D., & Indrawati, I. (2023). The impact of AI writing tools on the content and organization of students’ writing: EFL teachers’ perspective. Cogent Education, 10 (2), 1–17. https://doi.org/10.1080/2331186x.2023.2236469

Mayo, D. G., & Cox, D. R. (2006). Frequentist statistics as a theory of inductive inference. In Institute of Mathematical Statistics eBooks (pp. 77–97). https://doi.org/10.1214/074921706000000400

McHugh, M. L. (2013). The chi-square test of independence. Biochemia Medica, 23 (2), 143–149. https://doi.org/10.11613/bm.2013.018

Nazari, N., Shabbir, M. S., & Setiawan, R. (2021). Application of artificial intelligence powered digital writing assistant in higher education: Randomized controlled trial. Heliyon, 7 (5), e07014. https://doi.org/10.1016/j.heliyon.2021.e07014

Ostertagová, E., Ostertag, O., & Kováč, J. (2014). Methodology and application of the Kruskal-Wallis test. Applied Mechanics and Materials, 611 , 115–120. https://doi.org/10.4028/www.scientific.net/amm.611.115

Pacheco-Mendoza, S., Guevara, C., Samaniego, J., & Fernandez, J. (2023). Artificial intelligence in higher education: A predictive model for academic performance. Education Sciences, 13 (10), 990. https://doi.org/10.3390/educsci13100990

Park, J. (2019). An AI-based English grammar checker vs. human raters in evaluating EFL learners’ writing. Multimedia-Assisted Language Learning , 22 (1), 112–131. https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART002449524

Rashid, S., Cunningham, U., Watson, K., & Howard, J. (2018). Revisiting the digital divide(s): Technology-enhanced English language practices at a university in Pakistan. Australian Journal of Applied Linguistics, 1 (2), 64–87. https://doi.org/10.29140/ajal.v1n2.7

Ribeiro, R. (2022). AI in English Language Learning | Cambridge English . World of Better Learning | Cambridge University Press. https://www.cambridge.org/elt/blog/2020/03/09/artificial-intelligence-english-language-learning/

Rothman, K. J. (1990). No adjustments are needed for multiple comparisons. Epidemiology, 1 (1), 43–46. https://doi.org/10.1097/00001648-199001000-00010

Rusmiyanto, R., Huriati, N., Fitriani, N., Tyas, N. K., Rofi’i, A., & Sari, M. N. (2023). The role of artificial intelligence (AI) in developing English language learner’s communication skills. Journal on Education, 6 (1), 750–757. https://doi.org/10.31004/joe.v6i1.2990

Saini, N. (2023). Research paper on artificial intelligence and its applications. International Journal for Research Trends and Innovation, 8 (4), 356–360.

Sharifi, A., Ahmadi, M., & Ala, A. (2021). The impact of artificial intelligence and digital style on industry and energy post-COVID-19 pandemic. Environmental Science and Pollution Research, 28 (34), 46964–46984. https://doi.org/10.1007/s11356-021-15292-5

Slimi, Z. (2023). The impact of artificial intelligence on higher education: An empirical study. European Journal of Educational Sciences . https://doi.org/10.19044/ejes.v10no1a17

Sumakul, D. T. Y. G., Hamied, F. A., & Sukyadi, D. (2022). Students’ perceptions of the use of AI in a writing class. Advances in Social Science, Education and Humanities Research, 624 , 52–57. https://doi.org/10.2991/assehr.k.220201.009

Toncic, J. (2020). Teachers, AI grammar checkers, and the newest literacies: Emending writing pedagogy and assessment. Digital Culture & Education, 12 (1), 26–51.

Turner, E. (2021). Causes for leaving jobs: A comparative analysis. 2nd International Conference on Research in Management . https://doi.org/10.33422/2nd.icrmanagement.2021.02.43

Utami, S. P. T., Andayani, A., Winarni, R., & Sumarwati, S. (2023). Utilization of artificial intelligence technology in an academic writing class: How do Indonesian students perceive? Contemporary Educational Technology, 15 (4), ep450. https://doi.org/10.30935/cedtech/13419

Verma, M. (2018). Artificial intelligence and its scope in different areas with special reference to the field of education. International Journal of Advanced Educational Research , 3 (1), 5–10. http://files.eric.ed.gov/fulltext/ED604401.pdf

Victorivna, K. L., Oleksandrovych, V. A., Oleksandrivna, K. I., & Oleksandrivna, K. N. (2022). Artificial intelligence in language learning: What are we afraid of? Arab World English Journal, 8 , 262–273. https://doi.org/10.24093/awej/call8.18

Wang, T., Lund, B., Marengo, A., Pagano, A., Mannuru, N. R., Teel, Z. A., & Pange, J. (2023). Exploring the potential impact of artificial intelligence (AI) on international students in higher education: Generative AI, chatbots, analytics, and international student success. Applied Sciences, 13 (11), 6716. https://doi.org/10.3390/app13116716

Zhao, X. (2022). Leveraging artificial intelligence (AI) technology for English writing: Introducing Wordtune as a digital writing assistant for EFL writers. RELC Journal, 54 (3), 890–894. https://doi.org/10.1177/00336882221094089

Download references

Acknowledgements

The authors would like to thank Prince Sultan University for its support.

No funding was sought from any external source for this research.

Author information

Authors and affiliations.

Prince Sultan University, Riyadh, Saudi Arabia

Shaista Rashid

Department of English, Bahauddin Zakariya University, Multan, Pakistan

Sadia Malik & Faheem Abbas

Department of Statistics, Bahauddin Zakariya University, Multan, Pakistan

Javaria Ahmad Khan

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Sadia Malik .

Ethics declarations

Conflict of interest.

There is no conflict of interest of any sort.

Ethical statement

All participants gave their informed consent for inclusion before they participated in the study. Their responses were evaluated anonymously as the data was presented in aggregate form.

Additional information

Publisher's note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Rashid, S., Malik, S., Abbas, F. et al. Pakistani students’ perceptions about knowledge, use and impact of artificial intelligence (AI) on academic writing: a case study. J. Comput. Educ. (2024). https://doi.org/10.1007/s40692-024-00338-7

Download citation

Received : 15 May 2024

Revised : 08 August 2024

Accepted : 14 August 2024

Published : 11 September 2024

DOI : https://doi.org/10.1007/s40692-024-00338-7

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Artificial intelligence
  • Academic writing
  • Higher education
  • Language pedagogy
  • Find a journal
  • Publish with us
  • Track your research

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here .

Loading metrics

Open Access

Peer-reviewed

Research Article

Migrants from Ukraine in the Polish labour market as perceived by Poles from rural areas and towns

Roles Conceptualization, Writing – original draft

* E-mail: [email protected]

Affiliation Faculty of Agriculture and Economics, University of Agriculture in Krakow, Krakow, Poland

ORCID logo

Roles Writing – original draft

Affiliation Faculty of Organization and Management, Silesian University of Technology, Gliwice, Poland

  • Wioletta Knapik, 
  • Lidia Luty, 
  • Monika Zioło, 
  • Monika Odlanicka-Poczobutt

PLOS

  • Published: September 13, 2024
  • https://doi.org/10.1371/journal.pone.0306895
  • Reader Comments

Fig 1

The article is devoted to presenting the topic of migration of Ukrainian nationals to Poland. The work makes use of a survey under a project carried out in Polish rural areas and small towns. Seven hundred interviews were held in total. We conducted a quantitative analysis of its results here. The employed methods involve variable frequency distribution. The independence of the features was tested with the non-parametric chi-square test of independence. The association of the investigated variables was determined with Cramér’s V. The research shows that the most numerous foreign nationals in the Polish labour market in 2021 were Ukrainians. The positive trend started in 2017. The respondents perceived the migration of Ukrainian nationals to Poland mostly positively, especially regarding seasonal work. They also emphasized that the Ukrainians performed work at variance with their qualifications. Only every fifth participant agreed that migrants took away jobs from Poles. Most of the respondents pointed out that small business owners benefited from employing Ukrainians. The overwhelming majority of the respondents noted an increase in migration from Ukraine after the full-scale invasion and that entire families of Ukrainians were coming to Poland. Nearly half of them agreed that the support system for Ukrainian migrants was a burden on municipal budgets.

Citation: Knapik W, Luty L, Zioło M, Odlanicka-Poczobutt M (2024) Migrants from Ukraine in the Polish labour market as perceived by Poles from rural areas and towns. PLoS ONE 19(9): e0306895. https://doi.org/10.1371/journal.pone.0306895

Editor: Justyna Dominika Kowalska, Medical University of Warsaw, POLAND

Received: January 10, 2024; Accepted: June 25, 2024; Published: September 13, 2024

Copyright: © 2024 Knapik et al. This is an open access article distributed under the terms of the Creative Commons Attribution License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data are in the manuscript and supporting information files.

Funding: This work was supported by the Ministry of Education and Science (grant number 552216/2022).There was no additional external funding received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Migration of Ukrainian nationals to Poland should be considered in two broader contexts, economic and sociocultural [ 1 – 3 ]. In 2022, the media announced Poland was facing a situation nobody would expect a few months back. Over two million Ukrainian nationals fleeing the Russian aggression were crossing the Polish border [ 4 ]. Over one million of them could stay for longer.

Many as 12 million people may want to leave Ukraine permanently and that refugee policies in potential destination countries are likely to have a substantial impact on the distribution of Ukrainian refugees between different countries [ 5 ].

The Polish government had no migration policy and the country was on the brink of an unparalleled migration crisis. No refugee camps were built in Poland but the relocation mechanism seemed to be necessary. Especially with the Polish parliamentary elections in 2023. Anti-immigrant spin is only to be expected during the campaign.

Between June and August 2022 there were conducted a survey of refugees in the border provinces of Podkarpackie voivodeship and Lubelskie voivodeship to identify their actual needs and expected scope of long-term assistance and support. The aim of the survey was to obtain basic information about refugees from Ukraine who left their country across the Polish-Ukrainian border in 2022. It was particularly important to identify their health needs.

The war forced Ukrainians from across the country to flee. Most of them (72%) lived over 500 km from a Polish border crossing. Every fifth refugee lived 101 km to 500 km from a Polish border crossing. Statistics on foreign nationals in the Polish labour market were completely dominated by Ukrainians in 2021. In 2021, Ukrainian nationals were granted 325,213 work permits (64.5% of all permits for foreign nationals), which was a 68.9% increase compared to 2017. The trend reversed in 2022. The number of issued permits for Ukrainian nationals shrunk by 74%, which is 240,000 permits. The change was not due to a smaller number of employees from Ukraine but the Act on aid to Ukrainian nationals [ 6 ], which allows for employment without a permit.

The economic cooperation is growing as Ukrainian businesses move to Poland because of the war, taking advantage of the geographic proximity. Those Ukrainian companies that had already been present in 2022, expand with new branches. Ukrainian nationals become sole proprietors or find employment with Polish companies. The success of these endeavours hinges on historical and cultural factors to a large extent. The number of businesses started by Ukrainian nationals in Poland has been gradually rising since the full-scale invasion. The period from March 2022 to the end of January 2023 saw a total of 17,764 new sole proprietorships. Most of the new Ukrainian businesses recorded with the Polish Central Registration and Information on Business are situated in Mazowieckie Voivodeship (24%), Dolnośląskie Voivodeship (15%), Małopolskie Voivodeship (14%), and Pomorskie Voivodeship (11%). Ukrainian businesswomen make up 41% of all the owners. The structure of new businesses by sex does not reflect the demographic structure of war refugees. The register of Ukrainian nationals and their families who have been granted refugee status under the Act (discussed later in the article) has over 985,000 people (as on 20.02.2023), 49% of which are women and 11% are working-age men [ 7 ].

Data from the Ministry of Labour and Social Policy, Central Statistical Office, and Social Insurance Institution show that recent years saw an increase in the number of foreign nationals employed in Poland. This means Poland has reached the primary goal of its immigration policy liberalised in 2007–2008 to increase the influx of foreigners to Poland and their employment. The grounds for the liberalising Acts assumed seasonal employment of foreign nationals from five Eastern European countries would occur in those professions that Poles find unattractive. The current data neither support nor negates the premise. Still, the hypothesis that the increase in foreign workforce did not affect the employment level of Poles seems justified considering employer declarations of entrusting work to foreign nationals and sectors manned by foreigners. It is evident from the relative employment stability of Poles in sectors where foreigners are most often employed. This suggests a complementary nature of seasonal employment of foreign nationals and Poles, which means the liberalisation of the immigration policy in Poland is [ 8 , 9 ].

To conduct a comprehensive analysis of labour migration from Ukraine to Poland, it is necessary to consider the employment of Ukrainians from 2010 to 2019. This segment of the Polish labour market has undergone significant changes. Although in 2010, 52% of Ukrainians in the Polish labour market worked in agriculture, mainly fruit-picking seasonal jobs, 23% in the construction industry, 21% in services, and merely 4% in industry, in 2019, 60% worked in services and only 12% in agriculture (14% in industry and construction each) [ 10 ].

A new type of seasonal work permit issued by district labour offices on behalf of District Starost was introduced on 1 January 2018 [ 11 ]. The number of seasonal work declarations issued to Ukrainian nationals gradually increased from 2013 to 2017. It declined by 15.67 pp. year-on-year in 2018. The fluctuations persisted until 2020. In 2021, the number rose nearly to the all-time high of 2017. The mean share of permits for Ukrainians compared to other nationals was 93.1% from 2013 to 2020. In 2021, it was much lower, around 82.5%. In a similar manner, the Act on aid to Ukrainian nationals related to the war in Ukraine of 12 March 2022 streamlines their access to the Polish labour market. An amendment to this act signed on 26 March 2022 by the President of the Republic of Poland effective retroactively of 24 February 2022 specifies that foreign nationals not covered by the Act who are displaced persons listed in the implementing decision of the EU Council may apply for the temporary protection under the Act on protection of foreign nationals in the territory of Poland:

  • by crossing the Polish border, Ukrainian nationals acquire the right to legal stay and access to the labour market in Poland for 18 months. Should the circumstances so demand, the period can be extended by another 18 months, totalling three years;
  • Ukrainian nationals can obtain a PESEL identification number, which streamlines official contacts, such as medical treatment and access to electronic medical documents;
  • the refugees have a right to family benefits, parenting benefits, Dobry Start school benefit, family care capital for every child aged 12 to 35 months, subsidy for daycare centre, children’s club, or daycare provider, and welfare benefits under general regulations;
  • Ukrainian nationals can start and conduct business in Poland just as Polish nationals. The prerequisite is to have a PESEL number [ 12 ].

The growth of the number of Polish population driven by migration has exacerbated problems that Poland faced even before the Russo-Ukrainian war [ 13 , 14 ]. Long-term integration of Ukrainians will require substantial expenses, while public engagement and the general spirit from the early days of the full-scale invasion are abating.

In addition, the anti-democratic policies of the Polish government, especially in the last months of 2023, have contributed to building an anti-Ukrainian sentiment [ 15 ].

The European Commission provided EUR 348 million for various aid programmes from 28 February 2022 (of the declared EUR 500 million). A humanitarian fund of EUR 11.6 billion is planned for 2021–2027. It is spent on food, water, basic household products, healthcare, psychosocial support, emergency shelters, and the basic needs of those in the worst situation. The humanitarian aid in Ukraine reached eleven million people as of today. Over 8.9 million people were given food, 4.4 million received medical interventions or supplies, and over 2 million, cash. Note that even before the invasion, the EU granted Ukraine various support loans amounting to EUR 5 billion from 2014 to 2021. After the invasion, the European Commission proposed a new EUR 1 billion macro-financial assistance operation for Ukraine in the form of long-term loans on favourable terms on 1 July 2022. It is the first part of the exceptional macro-financial assistance package of up to EUR 9 billion announced in the Commission’s communication on assistance to and reconstruction of Ukraine of 18 May 2022 and endorsed by the European Council on 23–24 June 2022 [ 16 ].

In March 2022, the EU triggered the Temporary Protection Directive, an EU emergency response system applied to mass influxes of migrants. The directive provides for:

  • immediate collective protection for displaced persons;
  • relieving national asylum systems in EU countries.

The rights of the beneficiaries of temporary protection include a residence permit, access to employment, access to accommodation or housing, access to medical care, and access to education for minors [ 17 ].

The Polish Act on aid to Ukrainian refugees provides for principles of regularising stay and benefits and discounts for Ukrainian nationals. Poles who host refugees from Ukraine will also receive compensation benefits. Institutions and individuals who provided accommodation and food to Ukrainian nationals who arrived in Poland from Ukraine due to the war, as well as Ukrainian nationals holding a Pole’s Card who arrived with their immediate family, will be eligible for a cash benefit. The Act was amended on April 30 to extend the benefit provision for up to 120 days after the Ukrainian national’s arrival in Poland. The benefit period may be extended due to exceptional circumstances. The benefit amount in PLN 40 a day per person [ 18 ].

Material and method

The research aims to investigate the situation of Ukrainian nationals in the seasonal labour market in rural areas and towns in Poland. The problem is presented before the full-scale Russian aggression on Ukraine and during the war against the backdrop of cultural and social conditions, mainly in the context of support for war refugees, their problems, and relationships between Poles and Ukrainians. The study is founded on surveys intended to gather information on seasonal work conditions and perception of Ukrainian immigrants. It reflects a subjective assessment of moods concerning the investigated problem among residents of rural and town areas in Poland. The research objectives are to diagnose the conditions of seasonal labour migration in three areas: 1. labour market, considering sex, age, education, and region of residence of the respondents; 2. characteristics of stay; and 3. institutional support for Ukrainian migrants ( Fig 1 ).

thumbnail

  • PPT PowerPoint slide
  • PNG larger image
  • TIFF original image

https://doi.org/10.1371/journal.pone.0306895.g001

In the field of the labour market, respondents were asked several questions related to their opinions on the negative impact of refugees on destabilizing the economy and taking jobs away from Poles. On the positive side, the interviewees were asked whether small enterprises benefit from additional labour force and whether Poland gains more seasonal workers. Additional questions encompassed consistency of current jobs of migrants with their qualifications and if they form long-term relations with their employers.

Next dimension which the survey encompassed was a characteristic of stay. In this aspect respondents were asked on the impact of war on the number of migrants, length of their stay and whether they settled in Poland alone or with their families.

The last aspect addressed was related to the institutional support for Ukrainian migrants. In this section, respondents were asked about various types of financial and non-financial assistance provided to migrants by the government, that they were aware of.

The employed methods involve variable frequency distribution. The independence of the features was tested with the non-parametric chi-square test of independence. The association of the investigated variables was determined with Cramér’s V. The analyses were conducted in Dell Statistica v. 13.1 (StatSoft Polska). Frequency analysis is used to represent information on the general number of occurrences and percentages of individual responses to reveal the data structure. This is how it has been employed in the article. Cramer’s V is a chi-squared measure of the degree of association between variables. The coefficient was calculated for those variables for which the chi-squared test of independence indicated a statistically significant association (at a significance level of 0.05).

Using the Likert scale popular in marketing, market, and social surveys as a model, the authors decided to measure responses on a bipolar, five-point scale. The respondents chose their answers on a scale from complete acceptance to complete rejection, with a neutral position in the middle.

The research is part of the project ‘National Identity of Poles in Light of Migration of Ukrainian Nationals. Prevention of Social Conflicts’ was part of the programme of the Ministry of Education and Science ‘Science for the Public’. The diagnosis of economic dimensions of seasonal migrations of Ukrainian nationals into Polish rural areas and towns employed an original survey questionnaire. The survey was conducted using a computer-assisted telephone interview (CATI) method. It involved a representative sample of adult residents of rural areas and towns. The sampling frame consisted of a database of telephone numbers from rural areas and towns with populations up to fifteen thousand residents. A total of 5,000 randomly selected telephone numbers were used, with a 14% response rate. In total, seven hundred interviews were conducted. The database was purchased legally from ZN Direct sp. z o.o. Up to five attempts were made to reach each respondent. The preliminary assumption was to interview a proportional share of residents in every NUTS 1 unit ( Fig 2 ). The actual sample was consistent with the planned structure (up to 3 pp. difference). The survey was conducted in March 2023.

thumbnail

A new macro-region scheme was introduced in Poland on 1 January 2018 with an additional macro-region, Mazowieckie Voivodeship with the capital of Poland, Warsaw. It is because of the strong impact of the agglomeration on statistics of a large central region with a different profile of urban areas much smaller than the capital.

https://doi.org/10.1371/journal.pone.0306895.g002

Both sexes had a similar representation in the sample. The largest group of respondents was aged 35–54, which was 45.7% of the sample. Most of the respondents held a high school or university diploma ( Table 1 ).

thumbnail

https://doi.org/10.1371/journal.pone.0306895.t001

The literature review covers research areas presented in the Results section, i.e. 1. labour market; 2. characteristics of stay; and 3. institutional support for Ukrainian migrants.

The liberalization of access to the Schengen zone for Ukraine nationals in 2017 allowed many Ukrainians to travel to the Entire European Union (except for Ireland and Great Britain), Iceland, Lichtenstein, Norway, and Switzerland visa-free. This event undoubtedly increased the popularity of Poland as the destination for Ukrainian nationals for whom geographic and cultural proximity are decisive factors. Ukrainian nationals can work without a visa only in Poland as long as they have a biometric passport and a declaration of employment. With a Polish work permit, Ukrainian employees gain protection under the Polish Labour Law. Polish employers are obliged to conform to all employment requirements, including registering Ukrainian employees with the Social Insurance Institution [ 19 ].

1. Labour market

Most of the respondents exhibited a positive perception of the Ukrainian migration to Poland. The majority highlighted the benefit of an additional workforce for seasonal work, a point emphasized by over 82% of the respondents. Only 20.6% of the participants agreed that migrants took away jobs from Poles. Over 60% of the interviewees pointed out that small business owners benefited from employing Ukrainians. Note that nearly 40% of the respondents believed that Ukrainians worked jobs not matching their qualifications. It may be because of the language barrier preventing migrants from taking up jobs in line with their trade, which hurts both them as they are employed beneath their qualifications and the Polish labour market as it cannot access the full potential of the new employees. Most respondents were unsure whether migrants who settled in Poland for an extended period returned to their previous employers.

The opinion of Poles on Ukrainian refugees was positive in each investigated aspect. Poles pointed out the benefits of the influx of additional labour. The youngest age group was also the most sceptical about the new employees, perhaps because of the greatest risk of losing jobs in favour of refugees ( Fig 3 ). Consequently, they more often believed that migrants from Ukraine were undercutting the economy and taking jobs away from Poles.

thumbnail

Symbols as per Table 1 .

https://doi.org/10.1371/journal.pone.0306895.g003

The educational background of the respondents is a substantial differentiating factor regarding the attitude of Poles towards Ukrainians ( Table 2 ). People with university degrees whose jobs are not threatened so much more often pointed out that the refugee influx was beneficial for small business owners and provided a workforce for seasonal increases in labour demand. The lower the education, the greater the probability of belief that refugees had a detrimental impact on the Polish labour market.

thumbnail

https://doi.org/10.1371/journal.pone.0306895.t002

The regional background also played a role in the perception of the refugee impact on the economy. Respondents from North and Mazowieckie Voivodeship less often agreed with the notion that the refugees were destabilising the economy. The Central and East macro-regions had more affirmative responses.

Macroregion influenced also the opinion of respondents on whether refugees form long-term relationships with their employers. People from the North-Western and Central regions were particularly likely to notice this phenomenon. Additionally, people from the Central region were the least likely to have no opinion on this matter, whereas interviewees from the North-Western region were the least likely to disagree with the statement that such relationships are formed.

However, it is worth pointing out that none of the relations were strong despite their statistical significance. Strength of effects in statistically significant relations between variables varied between 0,109 in the weakest relation, which was between Macro-region and statement that “Ukrainians destabilise the Polish economy” and 0,169 in the strongest relation education and statement that “Ukrainians take jobs away from Poles” ( Table 2 ). These results show, that none of the relations was particularly strong, as V-Cramer value below 0,3 means weak strength of effect.

2. Characteristics of stay

Most of the respondents (84.1%) noticed an increase in the influx of migrants from Ukraine after the Russian invasion ( Fig 4 ). Notably, 64.7% of them emphasized that the Ukrainians were bringing their families along, which may be beneficial to the future of demographics in Poland. It may also be linked to the fact that over 41.5% of the respondents believed that the migrants would stay in Poland for longer.

thumbnail

https://doi.org/10.1371/journal.pone.0306895.g004

Nevertheless, it is worth noting that all of the relationships should be described as weak despite their statistical significance. The strength of effects in relationships assessed as statistically significant, with p-values, ranged from 0.141 in the weakest relationship (between sex and ‘Settlement of labor migrants with their families’) to 0.210 in the strongest relationship (between education and ‘Increase in the number of refugees since the invasion’) ( Table 3 ). Looking at this result, one can state that none of the relationships were particularly strong, as all Cramér’s V values below 0.3 indicate a weak strength of effect.

thumbnail

https://doi.org/10.1371/journal.pone.0306895.t003

The strongest differentiating factor for the answers given to these questions was the educational background. Higher education entailed affirmative answers to question about increased number of refugees since the outbreak of war, where people with higher education were much more prone to notice higher influx of migrants ( Fig 5 ). It might be related to the fact, that they are more likely to be engaged in organisations working with migrants from Ukraine.

thumbnail

https://doi.org/10.1371/journal.pone.0306895.g005

There were no factors significantly differentiating the respondents regarding the length of stay of the migrants ( Table 3 ). Sex affected the answer regarding the settlement of migrants with their families ( Table 3 ). Women were more likely to state that migrants settle with their families, whereas men twice almost twice as often had no opinion in this matter. Age and macro-region significantly affected none of the answers tested in this group.

3. Institutional support for Ukrainian migrants

Nearly half of the respondents (49%) agreed that the support system for Ukrainian migrants was a burden on municipal budgets ( Fig 5 ). This opinion was expressed more often by women (55%) than men (43.7%). As much as 63% of the respondents in the young age group (18–34) agreed with that statement. The older age group (35–54) agreed to a lesser extent (49%), while the oldest group agreed in 45%. The educational background had a statistically significant impact on the selected answer. More than half of people with basic and secondary education (54%) believed migrant support to be a burden on municipal budgets. This statement resonated most with residents of South-West.

Response ‘I don’t know’ concerning the support measures used by migrants from Ukraine who stayed in the respondent’s municipality was given by 39.4% of the respondents ( Fig 6 ). The percentage varied across the macro-regions from 30.7% (Mazowieckie) to 43.9% (North).

thumbnail

https://doi.org/10.1371/journal.pone.0306895.g006

The respondents were the most aware of support benefits to cover maintenance, in particular food, clothing, footwear, personal hygiene products, and lodgings fees. Awareness of health insurance contributions covered by the state is slightly smaller.

The South-West and Mazowieckie Voivodeship macro-regions exhibited the highest level of awareness concerning support benefits for migrants from Ukraine (Figs 7 and 8 ). The least knowledgeable in this regard were residents of North and East.

thumbnail

https://doi.org/10.1371/journal.pone.0306895.g007

thumbnail

https://doi.org/10.1371/journal.pone.0306895.g008

It has been over a year since the largest exodus of Ukrainians fleeing the war in their country in the history of Poland and Europe. From 1.2 to 1.3 million of them settled in Poland in addition to those who lived there before. Researchers from the Faculty of Political Science and International Studies, University of Warsaw and University of Economics and Human Sciences in Warsaw probed the public perception of Ukrainian refugees for the second time in January 2023. The survey results show that the positive attitude of Poles towards Ukrainians and support for them—mainly military—have remained virtually the same since the invasion. Nevertheless, a quarter of the respondents gave the affirmative answer to an additional question: ‘Has your attitude towards migrants from Ukraine changed over the last six months from June 2022?’ Moreover, 68% of them declared the change was towards a negative perception, which meant a worse attitude. As the report reads, ‘the following are listed among the potential threats to Poland posed by migrants from Ukraine:

  • an adverse impact on the Polish economy / state budget / inflation;
  • an adverse impact on the labour market;

The impact of Ukrainian nationalism declined compared to April 2022, but the respondents mentioned entitlement mentality and different culture of Ukrainians more often [ 20 ].

According to Łodziński and Szonert [ 21 ], the migration policy is becoming a policy without politics: coherent efforts in various aspects of migration, such as the labour market, policy towards the international community of Poles, protection of borders, and refugee policy with no broader and official discussion concerning its long-term goals. In this policy without politics, priorities tend to congregate around the needs of the labour market, demographics, and migration, considering the actual political context instead of ideological and strategic presumptions.

The impact of stereotypes on the way to integration is an important social aspect [ 22 , 23 ]. The strength of negative stereotypes and negative narration of media [ 24 ] in both cultures (Polish and Ukrainian) stemming from past armed and social conflicts between the nations will most probably dwindle gradually as their coexistence continues [ 25 ]. Immigrants and refugees face the most significant challenges with transferring their skills and adapting to the host country’s labour market [ 26 ]. Economic immigrants, who base their decisions on how easily their human capital and other resources can be transferred to host countries, tend to be more favourably selected to enter the labour market [ 27 ]. Migrants tend to move to countries where their skills are relatively scarce, filling in gaps in the natives’ set of skills [ 28 ].

The Ukrainians’ continuing desire for their own homeland should be considered. So, finding one’s own place among the Polish community is a big challenge. More civic-oriented identity becomes the predominant approach to nation-building in Ukraine, citizenship will be a central part of the process [ 29 – 33 ].

As regards assistance to Ukrainians, the public is divided along another line as well. Nearly half of the respondents (48%) believed Poland should provide more support, but more than a third were against it. Some called for curbing welfare. The researchers with the University of Warsaw pointed out another trend that had been there all along and has deteriorated slightly recently. Approval for the Act on aid to Ukrainian nationals (which grants them benefits equal to those for EU citizens) has slightly declined by 4 pp. since April 2022. Although most Poles are in favour of aid to Ukrainians in the form of a single payment of PLN 300 (49%), access to a free healthcare system (62%), and admission of children of Ukrainian refugees to Polish schools (87%), there is no broad approval of family and parent benefits, like PLN 500 monthly per child or social welfare benefits. Five per cent more people perceive it negatively than in 2022. Up to 16 pp. more (36% today) of the respondents were not in favour of state-funded lodgings for Ukrainians. In February, the Polish government introduced partial payment for refugees who spent more than 120 days in Poland. The public is annoyed by the more widespread foreign language, mostly Russian, used by the refugees. It is perceived as a ‘language of the enemy’, and Poles become disoriented as to who they are [ 20 ].

Other important efforts include governmental, non-governmental, and business initiatives to help the refugees find employment, such as Polish language courses, vocational training (including IT), job board applications and websites for Ukrainians, including governmental pracawpolsce.gov.pl opened in summer 2022 [ 34 ].

Selected cash benefits available to Ukrainians after the invasion:

  • PLN 300, a one-time livelihood benefit;
  • PLN 500, a child benefit under the Rodzina 500+ scheme;
  • family care capital for every child aged 12 to 35 months;
  • PLN 95 to PLN 135 a month of a family benefit;
  • PLN 1,000, parent benefit;
  • PLN 215.84, nursing benefit;
  • nursing pay;
  • PLN 1,000 for childbirth;
  • care benefit;
  • up to PLN 400, subsidy for daycare centre, children’s club, or daycare provider;
  • PLN 300, Dobry Start scheme for schoolchildren;
  • PLN 90 to PLN 110, allowance for education and rehabilitation of a child with disabilities;
  • rent allowance;
  • welfare ticket;
  • meals for children and students from Ukraine;
  • reimbursable medicines;
  • PLN 500, Health4Ukraine medicines allowance;
  • benefits granted under specific conditions and for a specific time under the Act on aid to Ukrainian nationals related to the war in Ukraine of 12 March 2022.

The Polish government has provided substantial aid to the refugees, just like local governments, grassroots organizations, and the people. According to research by the Polish Economic Institute, the total (annual) public assistance and private aid (estimated for the first three months after the invasion) provided to the refugees is around PLN 25.4 billion, or 0.97 per cent of the Polish GDP for 2021 [ 7 ].

An additional mobility barrier concerns the large heterogeneity in social insurance rights across European countries. These rights, including old-age pensions, unemployment payments, and government-financed healthcare services, are determined at the national level, and programs differ strongly across countries [ 35 ].

Economic aspects of migration were also investigated in a survey among 808 employers from Małopolskie and Podkarpackie Voivodeships who employed migrants from Ukraine and had a business in one of the voivodeships between 2014 and 2018. The most represented sectors were construction (31.7%) and industry (15,8%). Employers representing services (15.5%) and trade (13.7%) were also rather numerous. The share of the other sectors did not exceed 10%. The survey results show that the main reason behind hiring Ukrainian employees is cost reduction through lower remuneration compared to Polish workers. This argument was provided by 57.3% of the surveyed employees. The result is not significantly varied across the population. The second most popular reason was permanent issues with finding employees with vocational profiles for specific existing vacancies in the local labour market. This reason was provided by 46.3% of the employers. The third reason was better efficiency, quality of work, availability, or work discipline of Ukrainian workers compared to their Polish colleagues. This factor was provided by 40.2% of the employers [ 36 ].

But, there is the distinct local character of the demands for foreign workers in the Mazovian region, and specifically in the direct surroundings of Warsaw. This is a region with relatively strong and market-oriented horticulture sector that has employed temporary migrant workers on a massive scale for over a decade [ 37 ].

Results of a 2022 survey on 1481 refugees from Ukraine who came to Opolskie Voivodeship (south-western Poland) revealed their relatively high competencies. An overwhelming majority of the refugees (62.7%) had both qualifications and professional experience. Moreover, nearly half of them (48%) had a university degree; they were mostly economists (18.9%) psychologists and educators (14.9%), engineers and technicians (10.8%), and physicians (8.4%) followed by representatives of humanities and law graduates (7.4%), personal services (7.5%), and management (6.1%). Trade, production, and transport professionals constituted a much smaller group: 4.2%, 4.6%, and 1.4%, respectively [ 38 ].

According to the portal Rynek Pracy [ 39 ], the number of foreign nationals, including Ukrainians, who get employment and social insurance is growing. In the first quarter of 2019, the Social Insurance Institution had 610,000 foreign nationals registered, including 455,000 Ukrainians. The number grew to 1,057,000 foreign nationals in the first quarter of 2023 with 738,000 Ukrainians. The number of Ukrainians with Polish social insurance grew significantly from 2019 to 2023.

In mid-February 2023, the EWL Migration Platform employment agency [ 40 ], Centre for East European Studies of the University of Warsaw [ 41 ], and the EWL Foundation for Support of Migrants on the Labour Market conducted a survey on 400 adult Ukrainians from across Poland. It demonstrated that 82% of adult refugees from Ukraine found employment in Poland, reaching 84% in the working-age group. The largest share (34%) started working within one to three months after arrival. This high level of employment within the first months of their stay reflects the determination of the refugees to find a job, the openness of Polish businesses to new employers, and the needs of the Polish market and economy.

Migrants from Ukraine fuelled a short-term spike in retail and private consumption in 2022, especially in Poland and Estonia. Moreover, as they grow more integrated, they will join the workforces of the host countries and improve production output in the medium and long term. Ukrainian migrants abroad drive private consumption in host countries, contributing to economic growth. According to National Bank of Ukraine data, Ukrainians abroad spent two billion dollars a month in 2022, over three times more than in the previous year. Estimates by the International Monetary Fund and study by the United Nations, controlled for all other conditions, forecast the contribution of Ukrainian migrants to improve production compared to a base scenario without migration by 2.2 per cent to 2.3 per cent in Estonia, Poland, and Czechia, and by 0.6 per cent to 0.65 per cent in Germany in 2026. Integration of migrants from Ukraine will affect labour markets in host countries [ 42 ].

The European Central Bank, in particular, expects the share of working-age Ukrainian migrants to reach 25% to 55% of the workforce in the Eurozone in the mid-term. Although in the short term, migrants pose additional challenges to the state budget, they will probably be beneficial to the finances and economy of host countries if they remain in the countries for more than several years and are active in the labour market [ 43 ].

Bird and Amaglobeli [ 44 ] estimated the short-term fiscal impact of migrants from Ukraine on the economies of EU states to reach EUR 30–37 billion, which is 0.19–0.23% of the EU’s GDP. Ukraine’s neighbours and Baltic states will spend the largest sums. According to the European Investment Bank [ 45 ], Latvia could spend 9% of its GDP on the adaptation of migrants, Estonia over 7%, and Hungary, Poland, and Czechia, 4–6%. Estimates by the International Monetary Fund expect the long-term net fiscal effect to be beneficial to Europe because Ukrainians actively integrate with the European labour market. Taxes paid by Ukrainian migrants in Poland are a particularly good example of their integration with the labour market.

Regarding Ukrainian nationals in the Polish labour market, 78% of respondents in a 2020 survey among 502 temporary workers from Ukraine working in Poland through OTTO Work Force companies were satisfied with working in Poland. It is 6 pp. more than in the previous year when the workers were more critical. The value for 2022 was virtually the same as for 2020 (79% and 78%, respectively). The percentage was the highest in 2017 (94%) which may be because June 2017 was the first time Ukrainian nationals could legally stay in Poland under the visa-free regime [ 46 ].

The Centre of Migration Research of the University of Warsaw estimated that Ukrainians paid PLN 10 billion in taxes in Poland (about USD 2.4 billion) [ 47 ].

In summary, the rebuilding of Ukraine after the war will probably offer an opportunity for Polish business [ 48 – 51 ]. The networks of enterprises that are being established today will be more effective in the new socioeconomic environment [ 52 – 54 ]. Demographics are an important determinant of the contribution of Ukrainian migrants to the economic growth of Poland [ 55 , 56 ]. Refugees coming to Poland are mainly women with children [ 57 ]. Their permanent settlement in Poland will improve demographic indicators (more children), which is of value in light of the ageing population. Migrants’ labour market integration is a topic of increasing public concern, particularly in the light of the recent refugee crisis faced by mainland Europe. However, non-economic migrants should not be seen as an inevitable burden but as further investments in the host country whose relevant skills can serve to close existing labour market gaps and improve their integration substantially [ 58 ]. Public opinion toward migrants is not positive and explicitly negative during economic downturns. However economic benefits of labour immigration overweight disadvantages. As the Brexit poses the first precedent since WWII when the economic and social ties in Europe loose [ 59 ].

The labour market situation of migrants from Ukraine is changing month by month. It is shaped by several factors: 1. Pre-war migrants active in the agricultural and horticulture industries (the primary targets of seasonal migration in rural and town areas in Poland) are mainly men. Some of them, especially younger ones, no longer migrate due to the war. 2. Before the war, work in Polish agricultural and horticulture industries had been more financially beneficial than during the war, as the geographic reach of migration is growing (towards Western Europe).3. The perception of Ukrainians in Poland is changing because of the war, aid to the migrants, and the growing burden of cultural differences. Therefore, future research should focus on seasonal migrations in the context of broader demographic and social characteristics of the migrants, the profitability of seasonal work in Poland for Ukrainians, and relationships between Poles and Ukrainians in the context of their stay in Poland.

Conclusions

Most foreign nationals in the Polish labour market in 2021 were Ukrainians. The positive trend started in 2017 when they could enter Poland without a visa for the first time. The number of Ukrainian workers in the Polish labour market declined compared to other foreign nationals from 2021 to 2022. The change was not due to a smaller number of Ukrainian employees but the Act, which allows for employment without a permit. The number of permits issued by individual macro-regions varied from 2017 to 2022. The leading macro-region changed from Mazowieckie Voivodeship to North-West (closer to the German border).

The main research objective is to diagnose the conditions of seasonal labour migration, considering sex, age, education, and region of residence of the respondents. The respondents perceived the migration of Ukrainian nationals to Poland mostly positively, especially regarding seasonal work. The youngest respondents and those from the Central and East macro-regions were the most sceptical about the positive impact of Ukrainian migration on the Polish economy. Those with higher education, living in the North, and from Mazowieckie Voivodeship were more optimistic. They also emphasized that the Ukrainians performed work at variance with their qualifications. Only every fifth participant agreed that migrants took away jobs from Poles. Most of the respondents pointed out that small business owners benefited from employing Ukrainians.

A specific objective includes characteristic of stay. The overwhelming majority of the respondents noted an increase in migration from Ukraine after the full-scale invasion and that entire families of Ukrainians were coming to Poland. Less than half of the interviewees believed the migrants would stay in Poland for longer. People with university degrees pointed out the increased number of migrants following the full-scale invasion. It may be because they are more often involved in the efforts of organisations that help migrants from Ukraine. Women more often stated that the migrants settled together with families.

A last specific objective concerns an institutional support for Ukrainian migrants. Nearly half of them agreed that the support system for Ukrainian migrants was a burden on municipal budgets. Most in this group were women and people aged 18–34. At the same time, 40% of the survey population did not know what support means were used by migrants from Ukraine. The respondents were the most aware of support to cover maintenance, in particular food, clothing, footwear, personal hygiene products, and lodgings fees. Participants from the Central and East macro-regions were not inclined to perceive Ukrainians as a growth opportunity for the Polish economy. Residents of East and North had poor awareness of welfare benefits migrants from Ukraine were eligible for.

The study is limited by the systemic solutions presented in the article, which fail to include actions dedicated to the specific characteristics of regions in Poland. The results demonstrate varied acceptance for migrants depending on the residence, sex, and age of the respondents. The diagnosis of the labour market and work conditions of seasonal migrants was conducted considering the historical memory of Poles and stereotypes affecting the opinions on support for Ukrainians. This conclusion and others offered here may be useful when defining objectives for a regional policy regarding seasonal immigrant labour. Researchers investigating the problem of immigrants in the seasonal labour market face the challenge of factoring in applicable labour policies, work conditions, remuneration, and other quantities considering employers and migrant employees. Such a broader research perspective can juxtapose the results with the effectiveness of governmental programmes aimed at supporting seasonal migrant workers. The public opinion survey presented in the article has to be expanded with future research considering respondents’ experience and professional contacts with Ukrainians and how they are supported in the labour market.

Supporting information

S1 checklist. human participants research checklist..

https://doi.org/10.1371/journal.pone.0306895.s001

  • View Article
  • Google Scholar
  • PubMed/NCBI
  • 6. Ustawa z dnia 12 marca 2022 roku o pomocy obywatelom Ukrainy w związku z konfliktem zbrojnym na terytorium tego państwa [„Polish Journal of Laws of 12 March (2022), Act on aid to Ukrainian nationals related to the war in Ukraine”]. Available from: https://orka.sejm.gov.pl/proc9.nsf/ustawy/2069_u.htm .
  • 7. Polish Economic Institute. 2023. Available from: https://pie.net.pl/en/nearly-14000-ukrainian-firms-were-established-in-poland-from-january-to-september-2022/ .
  • 10. Falkowski K. Polsko-ukraińskie stosunki gospodarcze w latach 2010–2019, ze szczególnym uwzględnieniem migracji zarobkowej [„Polish-Ukrainian economic relations in 2010–2019, with a special focus on labour migration”]. In Kowalski AM, Weresa MA, editors. Polska. Raport o konkurencyjności 2021. Bilateralna współpraca gospodarcza a przewagi konkurencyjne [„Poland. Competitiveness report 2021. Bilateral economic cooperation and competitive advantages”]. Warszawa: SGH Oficyna Wydawnicza; 2021. pp. 223–245.
  • 11. Foreigners on the Greater Poland labour market. 2023. Available from: https://wuppoznan.praca.gov.pl/cudzoziemcy .
  • 12. Sakson A. Ukraińscy imigranci zarobkowi w Polsce w czasie pandemii COVID-19 i wojny rosyjsko-ukraińskiej (2020–2022)” [“Ukrainian labour migrants in Poland during COVID-19 pandemic and Russian-Ukrainian war (2020–2022)”]. In Adamczyk A, Sakson A, Trosiak C, editors. Mniejszości, emigranci i uchodźcy, „stare” i „nowe” wyzwania [“Minorities, migrants and refugees, "old" and "new" challenges”]. Poznań: Wydawnictwo Naukowe Wydziału Nauk Politycznych i Dziennikarstwa Uniwersytetu im. Adama Mickiewicza w Poznaniu; 2022, pp. 59–167.
  • 13. Fedyuk O, Kindler M. Migration of Ukrainian nationals to Portugal: The visibility of a new migration. In: Fedyuk O, Kindler M, editors. Ukrainian migration to the European Union. Lessons from migration studies. Landscape Publisher: Springer International Publishing Editors; 2016, pp. 179–192.
  • 14. White A, Grabowska I, Kaczmarczyk P, Slany K. The impact of migration on Poland: EU mobility and social change, London: UCL Press; 2018.
  • 16. Economy and Finance, Ukraine, European Commission. 2022. Available from: https://economy-finance.ec.europa.eu/international-economic-relations/candidate-and-neighbouring-countries/neighbouring-countries-eu/neighbourhood-countries/ukraine_en .
  • 17. Infographic–Refugees from Ukraine in the EU. 2023. Available from: https://www.consilium.europa.eu/en/infographics/ukraine-refugees-eu/ .
  • 18. Wypłata świadczeń dla Polaków goszczących uchodźców–złóż wniosek [„Payment of benefits to Poles hosting refugees–apply”]. 2022. Available from: https://www.krakow.pl/aktualnosci/258278,26,komunikat,wyplata_swiadczen_dla_polakow_goszczacych_uchodzcow___co_trzeba_wiedziec.html .
  • 19. Koval N, Vaičiūnas L Reichardt I. Polacy i Ukraińcy w codziennych kontaktach [„Poles and Ukrainians in everyday contacts”]. Wrocław: Kolegium Europy Wschodniej im. Jana Nowaka-Jeziorańskiego; 2021.
  • 20. Kacprzak I. Badanie: zmienia się nastawienie Polaków do uchodźców z Ukrainy. [„Survey: Attitudes of Poles towards refugees from Ukraine are changing”] 2023. Available from: https://tiny.pl/wkmvx .
  • 22. Ager A, Strang A. Indicators of Integration: Final Report. London: Home Office; 2004.
  • 34. Błaszczak A. Ukraińscy uchodźcy odnaleźli się na polskim rynku pracy [“Ukrainian refugees have found their way into the Polish labour market”] 2023. Available from: https://www.rp.pl/rynek-pracy/art37993771-ukrainscy-uchodzcy-odnalezli-sie-na-polskim-rynku-pracy .
  • 35. European Commission. Coordination of Social Security Systems at a Glance. 2019 Statistical Report. Brussels: European Commission.
  • 36. Olszewska-Łabędź B. Znaczenie migracji z Ukrainy dla rynku pracy w Polsce [“The importance of migration from Ukraine for the labour market in Poland”]. In Jędrzejewska J, Szymczyk P, editors. Ekonomiczne, kulturowe i społeczne aspekty migracji [“Economic, cultural, and social aspects of migration”], Lublin: Wydawnictwo Naukowe Tygiel; 2022. pp. 96–115.
  • 37. Górny A, Kaczmarczyk P. Temporary farmworkers and migration transition: on a changing role of the agricultural sector in international labour migration to Poland. In: Rye J.F, O’Reilly K, editors. International labour migration to Europe’s rural regions, Routledge Taylor & Francis Group, London and New York: 2021, pp. 86–103.
  • 39. Rynek Pracy [“Labour Market”]. Available from: https://rynekpracy.pl/slownik/rynek-pracy .
  • 40. EWL Migration Platform. 2023. Available from: https://ewl.com.pl/en/ewl-migration-platform/ .
  • 41. Centre for East European Studies. University of Warsaw. 2023. Available from: https://english.studium.uw.edu.pl/
  • 42. Uchodźcy z Ukrainy w Polsce. Wyzwania i potencjał integracji [“Refugees from Ukraine in Poland. Challenges and potential for integration”]. 2022. Monitor Deloitte. Available from: https://www2.deloitte.com/content/dam/Deloitte/pl/Documents/Reports/pl-Uchodzcy-z-Ukrainy-w-Polsce-Report.pdf .
  • 43. Botelho V. The impact of the influx of Ukrainian refugees on the Euro area labour force.” ECB Economic Bulletin 4. 2022.
  • 45. European Investment Bank. Available from: https://www.eib.org/en/index .
  • 46. Satisfaction of Ukrainians with work in Poland increases. 2023. Available from: https://ottoworkforce.com/pl/en/satisfaction-of-ukrainians-with-work-in-poland-increases .
  • 47. Ukrainian World Congress. 2022. Ukrainian refugees uphold rather than burden Polish economy, 9 November. Available from: https://www.ukrainianworldcongress.org/ukrainian-refugees-uphold-rather-than-burden-polish-economy/ .
  • 52. Brunarska Z, Kindler M, Szulecka M, Toruńczyk-Ruiz S. Ukrainian migration to Poland: A local” mobility? In Fedyuk O, Kindler M, editors. Ukrainian migration to the European Union, IMISCOE Research Series. Springer, Cham; 2016. pp. 1–14.
  • 58. Zwysen W, Demireva N. Who benefits from host country skills? Evidence of heterogeneous labour market returns to host country skills by migrant motivation. ISER Working Paper Series, 2020; 06, University of Essex, Institute for Social and Economic Research (ISER), Colchester.

IMAGES

  1. R卡方检验(CHI-SQUARE TEST)_r方检验-CSDN博客

    research article using chi square test

  2. PPT

    research article using chi square test

  3. Chi Square Test

    research article using chi square test

  4. Chi-Square (χ2) Statistic: What It Is, Examples, How and When to Use the Test

    research article using chi square test

  5. How to Report a Chi Square Test from SPSS in APA Style

    research article using chi square test

  6. Chi-square test Question Example

    research article using chi square test

VIDEO

  1. Chi Square Test in Research Methodology

  2. Chi Square Test (Part 2 Test Of Independency questions from Book)(MBS, MBA, MPA Statistics Videos)

  3. 16. Data Mining: Chi-square test for Correlation analysis of nominal data

  4. Interpretation of association between two categorical variables using Chi square Test

  5. Chi-square test(χ2-test) of Goodness of fit for Normal Distribution

  6. 1

COMMENTS

  1. The Chi-square test of independence

    The Chi-square test is a non-parametric statistic, also called a distribution free test. ... The Chi-square is a valuable analysis tool that provides considerable information about the nature of research data. It is a powerful statistic that enables researchers to test hypotheses about variables measured at the nominal level. As with all ...

  2. Chi-square Tests in Medical Research

    In this issue of Anesthesia & Analgesia, Sharkey et al 1 report a randomized trial comparing the incidence of bradycardia after phenylephrine versus norepinephrine to prevent and treat spinal-induced hypotension in women undergoing cesarean delivery with spinal anesthesia. The authors used a chi-square (χ 2) test to compare the groups and ...

  3. Statistical notes for clinical researchers: Chi-squared test and Fisher

    The chi-squared test performs an independency test under following null and alternative hypotheses, H 0 and H 1, respectively.. H 0: Independent (no association). H 1: Not independent (association). The test statistic of chi-squared test: χ 2 = ∑ (0-E) 2 E ~ χ 2 with degrees of freedom (r - 1)(c - 1), Where O and E represent observed and expected frequency, and r and c is the number of ...

  4. Chi-square Test and its Application in Hypothesis Testing

    Hypothetical data for calculating the Chi-square test for our example of testing an association between smoking and lung disease is given in Table 4. Chi-square test can be calculated manually by using the formula described above. Refer [Table 5 and Table 6] for manual calculations. Chi-square value for our example as shown in Table 6 is 3.42 ...

  5. Chi-square test and its application in hypothesis testing

    Chi-square test is a nonparametric test used for two speci c. purpose: (a) T o test the hypothesis of no association between. two or more groups, population or criteria (i.e. to check ...

  6. The chi-square test of independence

    The Chi-square statistic is a non-parametric (distribution free) tool designed to analyze group differences when the dependent variable is measured at a nominal level. Like all non-parametric statistics, the Chi-square is robust with respect to the distribution of the data. Specifically, it does not require equality of variances among the study ...

  7. The Chi-Square Test: Often Used and More Often Misinterpreted

    Overall, less than half of the chi-square articles (43.75%; n = 14) had interpretations that were justified by the type of chi-square test used. All three articles in the American Journal of Evaluation included the correct usage of the chi-square test, whereas only a third (two out of six) of the articles in Educational Evaluation and Policy ...

  8. (PDF) The Chi-square test of independence

    The Chi-square test of independence. Mary L. McHugh. Department of Nursing, School of Health and Human Services, National University, Aero Court, San Diego, California, USA. Corresponding author ...

  9. Chi-square Tests in Medical Research

    Chi-square Tests in Medical Research. Chi-square Tests in Medical Research Anesth Analg. 2019 Nov;129(5):1193. doi: 10.1213/ANE.0000000000004410. Authors Patrick Schober 1 , Thomas R Vetter 2 Affiliations 1 From the Department of Anesthesiology, Amsterdam ...

  10. CoVID-19 symptoms analysis of deceased and recovered cases using Chi

    The Chi-square test is adopted with asymptotic significance level to show the strength of each symptom on recovered and deceased cases independently. The study found that the recovered cases are associated with different symptoms based on the patient history, where the deceased cases showed that high fever is not responsible for increasing the ...

  11. The Chi-Square Test of Distance Correlation

    Method-wise, the chi-squared test is nonparametric, extremely fast, and applicable to bias-corrected distance correlation using any strong negative type metric or characteristic kernel. The test exhibits a similar testing power as the standard permutation test, and can be used for K-sample and partial testing.

  12. Chi-square test under indeterminacy: an application using pulse count

    Background The data obtained from the counting process is known as the count data. In practice, the counting can be done at the same time or the time of the count is not the same. To test either the K counts are differed significantly or not, the Chi-square test for K counts is applied. Results The paper presents the Chi-square tests for K counts under neutrosophic statistics. The test ...

  13. PDF Karl Pearson's chi-square tests

    Chi-square test shall be taken into consideration in this study. The reason is that this test is commonly used by researchers compared to other non-parametric tests. This study deals with applications of Chi-square test and its use in educational sciences. Chi-square test . Chi-square test is used to find if there is any correlation

  14. A Practical Application of Chi-square Test in Hypothesis Testing

    The Chi Square is as a test used to test the relationships between categorical variables in. the same population. It is represented by χ2. It is used whenthe data is categorical. It. provides a ...

  15. Article The chi-square test: Its use in rehabilitation research

    The chi-square test: its use in rehabilitation research. Arch Phys Med Rehabil 1995;76:678-81. • Objective: This report examines the impact of collecting and analyzing sequential data from the same sample using the chi-square test. Researchers in rehabilitation frequently analyze categorical data collected repeatedly from the same sample.

  16. Choosing Statistical Tests

    Readers who are acquainted not just with descriptive methods, but also with Pearson's chi-square test, Fisher's exact test, and Student's t test will be able to interpret a large proportion of medical research articles. Criteria are presented for choosing the proper statistical test to be used out of the most frequently applied tests.

  17. Aerobic exercise and its impact on musculoskeletal pain in older adults

    Differences between groups at first evaluation (baseline) were compared using chi-square and t-tests. Results are reported as mean (SE) or proportion. Longitudinal data were analyzed using generalized estimating equations (GEE) . Separate analyses were conducted in which repeated measurements were coded by calendar year for questionnaire ...

  18. The Chi-Square Test: Often Used and More Often Misinterpreted

    The examination of cross-classified category data is common in evaluation and research, with Karl Pearson's family of chi-square tests representing one of the most utilized statistical analyses for answering questions about the association or difference between categorical variables.

  19. PDF The Chi Square Test

    Cell Counts Required for the Chi-Square Test You can safely use the chi-square test with critical values from the chi-square distribution when no more than 20% of the expected counts are less than 5 and all individual expected counts are 1 or greater. In particular, all four expected counts in a 2 2 table should be 5 or greater.

  20. When to Use a Chi-Square Test (With Examples)

    You should use the Chi-Square Goodness of Fit Test whenever you would like to know if some categorical variable follows some hypothesized distribution. Here are some examples of when you might use this test: Example 1: Counting Customers. A shop owner wants to know if an equal number of people come into a shop each day of the week, so he counts ...

  21. (PDF) Introduction to Chi square test

    7. McNemar's chi square test. • The McNemar's chi square test ( 2 McNemar) is used to. determine if there is a s tatistic ally significant diff erence in. proportions betw een paired data ...

  22. How to Conduct a Chi-Square Test

    In this article, we will focus on the Chi-Square test for independence, as it is more commonly used in sociology research. When to Use a Chi-Square Test. Before diving into the details of how to conduct a Chi-Square test, it is essential to understand when it is appropriate to use this statistical tool. The Chi-Square test is best suited for ...

  23. Postoperative sensitivity of composites using novel Bacillus subtilis

    A Chi-square test was used to compare postoperative sensitivity between the two groups. The level of significance was set at p &lt; 0.05. A noteworthy association was observed between sensitivity and the group variable at all four evaluation periods: after one day (p = 0.002), 1 week (p = 0.002), 2 weeks (p = 0.007) and one month.

  24. McNemar chi2 test revisited: comparing sensitivity and ...

    The McNemar chi (2) test, used to compare discordance of two dichotomous responses, can be applied for this purpose. However, applying the McNemar test to a 2 x 2 table for comparing the accuracy of examinations is not recommended, since this test is sensitive to the proportion of positive versus negative subjects.

  25. Online Signature Verification Using Chi-Square (χ2) Feature Selection

    Signature verification is a potential research area because of its social, legal and cultural acceptance since time immemorial. Hence unlike other biometrics it is more prone to forgery. So we have proposed an online signature verification system with user specific Chi-Square (χ2) feature selection to enhance the accuracy of our system. The aim of our work is to verify the signature of a ...

  26. Prevalence, types and outcome of injuries among abattoir workers in

    A cross-sectional survey involving 300 workers from three major meat processing facilities in the Kumasi metropolis of Ghana was carried out using a structured questionnaire from April to June 2023. The prevalence, types and outcome of injuries among workers were assessed. Test of association was established by Chi square analysis.

  27. The Chi square test: an introduction

    The SVM has attained an accuracy of 99.58% in classification using the five features chosen from the Chi-Square test. Lastly, SHAP, an explainable AI model, has been used to assess the classifier ...

  28. Pakistani students' perceptions about knowledge, use and impact of

    Moreover, chi-square test results indicate an association among components of knowledge, use, and impact of AI. The research suggests that academia should introduce AI as a pedagogical tool to improve students' comprehension, productivity, and writing quality. Furthermore, trends indicate that comprehensive policy formulation should be ...

  29. Migrants from Ukraine in the Polish labour market as perceived by Poles

    The independence of the features was tested with the non-parametric chi-square test of independence. The association of the investigated variables was determined with Cramér's V. The research shows that the most numerous foreign nationals in the Polish labour market in 2021 were Ukrainians. The positive trend started in 2017.