Brazilian Longitudinal Study of Adult Health (ELSA-Brasil) participant’s profile regarding self-rated health: a multiple correspondence analysis

Background Self-rated health (SRH) - one of the most common health indicators used to verify health conditions - can be influenced by several types of socioeconomic conditions, thereby reflecting health inequalities. This study aimed to evaluate the participant profiles regarding the association between self-rated health and social and occupational characteristics of the Brazilian Longitudinal Study of Adult Health (ELSA-Brasil). Methods Cross-sectional design, including 11,305 individuals. Self-rated health was categorized as good, fair, and poor. The relationship between socio-demographic, psychosocial work environment, health-related variables, and self-rated health was analyzed by multiple correspondence analysis (stratified by age: up to 49 years old and 50 years old or more). Results For both age strata, group composition was influenced by socioeconomic conditions. Poor SRH was related to lower socioeconomic conditions, being women, black self-declared race/ethnicity, being non-married/non-united, low decision authority, low skill discretion, and obesity. Conclusion To promote health, interventions should focus on reducing existing socioeconomic, race, and gender inequalities in Brazil.

imbalance between work and social life influence negatively on self-rated health.
All of these factors are related in a complex way to self-rated health, and it is important to verify how these relationships are influenced by the existing inequalities in Brazil and how they are associated with health in the working population. Different methods have been applied to investigate negative perceptions of health [10,12]. The large majority of studies have been using regression or multilevel models [8,12,13]. However, another way to examine this, which pays more attention to exploring and explaining relationships between categorical indicators, is through multiple correspondence analysis. The advantage of this statistical method is the absence of any assumption about probability distributions and the lack of need to establish predetermined relations among the variables.
Some studies show that the correspondence analysis is a technique that make it possible to illustrate the relationships between several categorical variables [14,15], and also allows for the "construction of complex visual maps whose structuring can be interpreted" [16]. Thus far, we have found few studies envolving self-rated health that used correspondence analysis [15,17,18]. Accordingly, the aim of this study was to evaluate participant profiles regarding the association of self-rated health and social and occupational characteristics in the Brazilian Longitudinal Study of Adult Health (ELSA-Brasil), using the multiple correspondence analysis technique.

Study population
This study used baseline data (2008)(2009)(2010) from the ELSA-Brasil study. The ELSA-Brasil is a longitudinal multicentric cohort study of 15,105 civil servants (35-74 years) conducted at six study research centres in three regions of the country, including the Northeast, South, and Southeast. These research centres are located in five federal universities and the Oswaldo Cruz Foundation [19,20].
The present study did not use information about retired participants, since they do not have information about occupational characteristics (socio-occupational category and psychosocial work environment). Also, participants that declared their race/ethnicity as Asian or Indigenous were excluded due to the small number of participants in each category (2.4 and 1%, respectively). Furthermore, the information of participants that declared their race/ethnicity as Asian is mainly centered in one of the research centres in São Paulo. The exclusion of Indigenous people was made considering that our participants are urban indigenous in a small number, and they do not represent the indigenous population. Finally, participants who had missing data for any of the study variables were also excluded ( Fig. 1).

Socio-demographic variables
The variables were age, sex, self-declared race/ethnicity (white, brown and black), marital status (married/united, non-married/non-unitedthis category includes single, divorced or separated, and widowed people), education (the categories were complete elementary school or less, completed high school, and completed university degree or more. This variable considers the highest level of completed education, with exception of elementary school, i.e. a participant that did not completed high school was considered in the complete elementary school or less category), month per capita household income (low -up to $234, medium -from $234 to $702, and high -more than $702. The cutoff points of this variable were based on the 2008 minimum wage in Brazil and the median income of our population. The low category considers one salary, the medium category considers from 1 to 3 salaries, and the high category considers more than 3 salaries. The median income of our population was $702), and socio-occupational category (manual, middle and higher). This last variable considers different forms of insertion in production (considering the position in the typical occupation), the occupation itself (which non-manual was qualified by the level of formal education required by the occupation and manual was qualified by the sector specialization), and the hierarchy in production. The categories were defined by the Center for Development and Regional Planning (CEDEPLAR), Faculty of Economic Sciences of the Federal University of Minas Gerais (UFMG), based on the literature [21,22].

Psychosocial work environment variable
The variable representing the psychosocial work environment was job stress (demand-control model). Job stress was accessed using the Swedish demand control support questionnaire (DCSQ). This questionnaire contains 17 items, five items refer to the psychological demand dimension, six items refer to the control dimension, and six items refer to the social support dimension. In our study, the repetitive work item (control dimension) and the social support dimension were not considered since the study about the dimensional structure of the DCSQ in the Brazilian context suggests this item exclusion and a better goodness-of-fit without the social support dimension [23]. The scores of the DCSQ (job demands, 5 items; skill discretion, 3 items; and decision authority, 2 items) were dichotomized into high and low at the median for these dimensions (14,11, and 6 points, respectively) [24].

Health related variables
The health variables used in analyses were self-rated health and body mass index. Self-rated health (SRH) was measured using the following question: "In general, compared to people of your age, how do you consider your state of health?". The response options were: "very good, good, fair, poor, or very poor". For the analyses, the answers were categorized as good self-rated health (very good and good), fair, and poor (poor and very poor). The body mass index (BMI) cutpoints were considered as: ≤ 24.9 kg/m 2 for underweight and normal weight (the categories of underweight, ≤ 18.5 kg/m 2 , and normal weight were grouped due to the small number of participants who were underweight, < 1%), between 25 and 29.9 kg/m 2 for overweight, and ≥ 30 kg/m 2 for obesity.

Statistical analyses
Proportions were used to describe population characteristics regarding self-rated health. Self-rated health, sex, self-declared race/ethnicity, marital status, education, per capita household income, body mass index, sociooccupational category, and job strain were analyzed by multiple correspondence analysis (MCA). Stratified analyses by age were conducted due to our consideration of aging as an effect modifier [7,12,25]. Since our average population age is 49.14 years, two age groups were created to stratify the analyses (up to 49 years old and 50 years old or more).
Correspondence analysis is an exploratory technique applied to categorical data. This analysis graphically illustrates the relationship within one set of variables, and the proximity of categories, in space, indicates a relationship or correspondence between them [26,27]. The advantage of this statistical method is the absence of any assumption about probability distributions and the lack of a need to establish predetermined relations among the variables, such as the unidirectional relationships estimated by regression models. This type of analysis provides total inertia, which means the percentage of variability explained by each dimension, and in this paper, the number of dimensions was chosen by analyzing the decline of adjusted inertias (eigenvalues) [27].
Scatterplots (formed by the coordinates of each category in each dimension) were analyzed with regard to dimensions, and clusters of categories were created to delineate different profiles in the sample. The results based on hierarchical cluster analysis (dendrogram) of the standard coordinates obtained in the correspondence analysis were confronted with the resulting clusters visualized in the multiple correspondence plot. The dendrogram provided a clear visualization of the categories of the variables in each group, and it is more useful in the case of many dimensions in the MCA (which become hard to use biplots).
The x-axis of the scatterplots represents the data variability explained by the first dimension, while the y-axis represents the data variability explained by the second dimension. The dots represent each variables categories.

Results
In the study population (11,305 participants), the largest proportion of individuals reported good SRH (81.6%), followed by fair (16.7%) and poor (1.7%) categories. The percentage of poor self-rated health was lower among men, married/united, white self-declared race/ethnicity, participants aging up to 49 years old, with completed university degree or more, with high per capita household income, with higher socio-occupational category, with normal weight, with high job demands, high skill discretion, and high decision authority ( Table 1).
The multiple correspondence analyses were stratified by age. The plot allowed for the identification of three groups for both age strata (Figs. 2 and 3). Figure 2 presents the plot of multiple correspondence analysis for participants up to 49 years old, and Fig. 3 for participants 50 years old or more. For the youngest group (up to 49 years old), the inertia of the two first dimensions was 80.5%. The first dimension explained 70.8% of data variability (x-axis of the graph) and the second 9.7% (yaxis of the graph). For the oldest group, the inertia of the two first dimensions was 83.5%. The first dimension explained 75.6% of data variability (x-axis of the graph) and the second 7.9% (y-axis of the graph).
The MCA results show modest, but relevant, differences between age groups. Figure 2 (up to 49 years old) shows that fair SRH and brown self-declared race/ethnicity were in the same group as better socio-economic conditions and good SRH. Figure 3 (50 years old or more) shows the same categories related to middle socio-economic conditions and poor SRH.
Besides fair SRH and brown self-declared race/ethnicity, there was no difference in the MCA results between the two age strata. Men, white self-declared race/ethnicity, married/united, completed university degree or more, higher socio-occupational category, high per capita household income, high decision authority, high skill discretion, high and low job demands, normal weight, and overweight were related to good SRH. Women, black self-declared race/ethnicity, non-married/ non-united, completed high school, middle sociooccupational category, medium per capita household income, low decision authority, low skill discretion, and obesity were related to poor SRH. One of the cluster groups was not related to SRH. This group (group 3 for both age strata) included participants with complete elementary school or less, low per capita household income, and manual socio-occupational category (Figs. 2  and 3).

Discussion
In this study, multiple correspondence analysis was used as a way to graphically represent and interpret the relationship between self-rated health and social and occupational characteristics. The composition of the different groups formed in the MCA reflected existing socioeconomic inequalities in Brazil. The results, similar for both age strata, show groups divided by better (group 2), average (group 1), and worse (group 3) socioeconomic conditions. The group with better conditions was related to good SRH, white self-declared race/ethnicity, and being men. Meanwhile, the average socioeconomic group was associated with poor SRH, black self-declared race/ethnicity, and being women. Lastly, the worst socioeconomic group was not related to social characteristics.
Since 2001, several policies have been presented by the Brazilian government [1,29], focused on increasing the educational level [30], equalizing the income distribution [29], reducing poverty [29], and improving access to health services [6]. Despite these advances, Brazil is still a country burdened by inequalities [4,6,31]. After 2016, socioeconomic inequality begun to increase again, and the "Continuous National Household Sample Survey (PNAD-contínua)" shows differences in average earnings according to levels of education [32], and important racial disparities in education, employment, and income between white and non-white population (black and brown) [2].
The ELSA-Brasil is composed of civil servants of higher education institutions, with a career path, in which occupations require a certain level of education. As expected, the MCA results show an association between education and socio-occupation category. Even with these population characteristics, the results of the current study add to the knowledge about working conditions and socioeconomic, racial, and gender inequalities in Brazil. In our study, low decision authority at work and low skill discretion were related to being women, black self-declared race/ethnicity, average socioeconomic conditions, and middle socio-occupational category.
Despite the actions by the Brazilian government to promote gender equality in the workplace [33] and increase women's access to education [30], Brazil [34] had the worst percentage of women in politics position (10,5%) among South American countries, and women with the same years of study and occupation as men, still receive lower wage [4,31]. Also, in Brazil, the non-white population (black and brown) have lower education, and when employed, they usually received half of the income that white population received [2]. Another Brazilian study with civil servants found similar results. Women had more job strain and psychological distress than men. However, occupational status did not have the same role in psychological distress for both genders, as men with routine-non-manual or manual work had a higher prevalence [35].
Independent of these differences, several occupational studies demonstrated the importance of working conditions for health [7,9,11]. Brazilian studies with the working population demonstrated an association between job strain and cardiovascular risk [36], metabolic syndrome [37], migraine [38], poor quality of life [39], poor self-rated health [40], job dissatisfaction [41], and sickness-absenteeism from the job [42]. Our study also adds to the knowledge about working and health conditions since low decision authority at work, and low skill discretion were related to obesity and poor self-rated health. The prevalence of obesity has been increasing over the years in Brazil [43], and during our baseline the prevalence increased from 13.4% in 2008 to 14.9% in 2010 [44]. Some studies have shown that high values of body mass index are associated with poor self-rated health [45,46], and despite the unclear relationship between job strain and the development of obesity [47], some longitudinal studies found an association between changes in BMI [48], abdominal obesity [49], and job strain.
Our results also show a small difference between "up to 49 years old" and "50 years old or more" groups' composition. In the oldest group, fair SRH and brown selfdeclared race/ethnicitywere associated with poor SRH. Aging is pointed out as an important condition for the deterioration of health over the years [12,13,25] even after adjustment for socioeconomic conditions (income, education, and occupation) [25], and aging itself is a possible explanation for older people considering that fair SRH is in the same group as poor. This result shows that self-rated health should perhaps not always be stratified as poor or good, as our study shows that fair SRH may represent different conditions depending on age.
Finally, our study had similar results to other international studies [50][51][52]. Better education, income, and socio-occupational category were related to good selfrated health. These results reinforce that self-rated health is a relevant indicator to analyse health conditions in different countries with different social backgrounds.
One of the limitations of the present study is the generalization of our findings to the non-worker population, as our results are from a cohort of civil servants. However, one of the advantages of this study was the possibility to describe the complex relationship between self-rated health and occupational characteristics, since correspondence analysis is a technique to explain these relationships. A limitation of this type of analysis consists of being an exploratory technique that provide only point estimates. However, this limitation allowed the participants' profiles identification without the results being affected by our sample size, which minor effects could lead to statistically significant tests [53]. In this way, the group composition of this study could be used in future studies that consider longitudinal analysis in a working population. Another limitation of our study is the exclusion of 3,3% of our sample due to missing values. However, these missing values did not differ between socio-demographic characteristics.

Conclusions
To conclude, our study reinforces the relevance of the non-dichotomization of self-rated health. The results show that our participant profiles regarding self-rated health are similar for both age groups, and existing gender, racial, socioeconomic, and workplace inequalities somehow affected the group compositions. It was also possible to observe the importance of the psychosocial work environment on self-rated health and obesity, suggesting that further longitudinal studies are necessary to understand the relationship between these health conditions and occupational characteristics. In this way, in addition to health promotion policies, more actions need to be done to continue reducing inequalities in Brazil, since it may have an important role in health conditions.