Skip to main content

Latent variable modeling to develop a robust proxy for sensitive behaviors: application to latrine use behavior and its association with sanitation access in a middle-income country



Globally, diarrhea is a leading cause of child morbidity and mortality. Although latrines are integral for reducing enteric pathogen transmission, several studies have shown no evidence that latrine ownership improved child health. There are a number of explanations for these results. One explanation is that latrine access does not equate to latrine use. Latrine use, however, is difficult to accurately ascertain, as defecation behavior is often stigmatized. To address this measurement issue, we measure latrine use as a latent variable, indicated by a suite of psychosocial variables.


We administered a survey of 16 defecation-related psychosocial questions to 251 individuals living in rural Ecuador. We applied latent class analysis (LCA) to these data to model the probability of latrine use as a latent variable. To account for uncertainty in predicted latent class membership, we used a pseudo-class approach to impute five different probabilities of latrine use for each respondent. Via regression modeling, we tested the association between household sanitation and each imputed latrine use variable.


The optimal model presented strong evidence of two latent classes (entropy = 0.86): consistent users (78%) and inconsistent users (22%), predicted by 5 of our 16 psychosocial variables. There was no evidence of an association between the probability of latrine use, predicted from the LCA, and household access to basic sanitation (OR = 1.1, 95% CI = 0.6–2.1). This suggests that home access to a sanitation facility may not ensure the use of the facility for every family member at all times.


Effective implementation and evaluation of sanitation programs requires accurate measurement of latrine use. Psychosocial variables, such as norms, perceptions, and attitudes may provide robust proxy-measures. Future longitudinal studies will help to strengthen the use of these surrogate measures, as many of these factors may be subject to secular trends. Additionally, subgroup analyses will elucidate how our  proxy indicators of latrine defecation vary by individual-level characteristics.

Peer Review reports


Diarrheal disease, which is both preventable and treatable, is a persistent leading cause of morbidity and mortality for children under 5 years of age throughout the globe. Diarrhea-related mortality accounts for nearly 500,000 child deaths per year, which are concentrated in resource-limited settings [1]. Millions of children worldwide are undernourished [2], and while malnutrition has a host of causes, enteric pathogen exposure is associated with both acute and chronic malnutrition [3, 4]. Water, sanitation, and hygiene (WASH) interventions reduce exposure to enteric pathogens that cause illness [5] and it is estimated that diarrheal-related child mortality could be reduced by more than one-third in low and middle-income countries (LMICs) if sanitation interventions were implemented at full-scale [6]. Sanitation blocks multiple transmission routes of enteric pathogens, reducing individual-level exposures as well as community-level exposures [7, 8]. Indeed, sanitation programs are viewed by governments as fundamental for achieving Sustainable Development Goal (SDG) 6.2, that is, to end open defecation and provide universal access to adequate and equitable sanitation [9]. Yet, results from two recent studies of latrine construction interventions did not find an effect on child health; one rationale for these results is limited uptake of latrine use [10, 11]. Moreover, an analysis of Demographic and Health Survey data has shown a null association between access to improved sanitation facilities and prevalence of diarrhea in children throughout much of the globe [12]. These results suggest a need to emphasize latrine use in programmatic sanitation interventions. Like others, we suggest that WASH program success can be better evaluated if behavior is incorporated as an indicator of WASH success rather than simply the presence of sanitation hardware [13].

Historically, sanitation intervention success has been measured as a the presence of a household sanitation facility (traditionally and synonymously referred to as “sanitation access” within the development field and the focus of the SDGs) rather than individual sanitation practices [14]. One reason that toilet presence has been the accepted exposure variable is that it is relatively easy to measure. Latrine use, however, is difficult to measure and no gold standard of measurement exists. A limited number of intervention studies include a metric of defecation behavior, and the methods used to capture latrine use behavior vary across studies [15]. The most commonly used measure is self-reported defecation practices; however, outside social contexts where report of open defecation is not stigmatized, such as in India [16], survey questions asking about a respondent’s sanitation behaviors present challenges, such as social desirability bias due to the sensitivity of the topic [17, 18]. More recent literature has sought to move beyond a binary self-report of defecation practices, such as Jenkins et al.’s 2014 scale of defecation [19]; or Dreibelbis et al.’s 2015 [20] and Lopez et al.’s 2019 [21] use of psychosocial factors to predict latrine use. Dreibelbis et al.’s study and Lopez et al.’s study, in particular, are promising examples of integrating social theory and quantitative assessment of WASH behavior. These studies, however, rely on self-reported defecation behaviors and/or report of other’s defecation behaviors as outcomes. Such limitations potentially add misclassification and bias into their study results.

Beyond self-report or proxies of population-level psychosocial factors, hygiene behavior is commonly captured through direct observation [22, 23], which is a time-consuming effort subject to the Hawthorne effect [24]. Other measures of behavior include use of sensors inside a latrine to quantify the number of times a latrine was entered [25] and observations of the latrine to assess whether it appears to have been used [26]. While sensors may supply a more objective estimate of overall latrine use than self-report [27], neither sensors nor observations provide information necessary for assessing individual-level latrine use, or the proportion of users on the household or community levels. Such details are integral for understanding exposure risk of negative health outcomes, such as diarrheal disease and undernutrition.

This measurement conundrum for latrine use behavior is not unique to the sanitation field. Rather, many sub-fields of public health that rely on self-report of sensitive behaviors face similar issues of misclassification and measurement error. For example, epidemiology studies focusing on sexually transmitted infections have long been challenged by measurement of behavior [28], with self-reported condom use, a key intervention, presenting misclassification within the study [29, 30]. Here, we take a latent variable approach to capturing latrine use in a LMIC population with high latrine access, variable latrine use, and stigmatized self-report of defecation practices. Our approach is rooted in health behavior theory that recognizes both demographic factors and social processes as determinates of individual-level behavior [31]. Using psychosocial indicators of latrine use behavior, we first apply latent variable modeling to create a proxy-indicator of latrine use at the individual level. Latent class modeling specifically refers to a group of techniques that identify one’s underlying propensities to respond to particular questions and classifies a person into a group given their responses [32]; thus, it lends itself well to measuring sensitive concepts that may not be easily captured by one direct question. We hypothesize that individuals in this population will generally belong to one of three classes: those who always use a latrine for defecation, those who sometimes use a latrine for defecation, and those who never use a latrine. With an assigned likelihood of latrine use, we next test the association between the latent variable measure of latrine use and household-level access to an improved, non-shared sanitation facility.



The data are from a cross-sectional sample as part of a longitudinal study examining the relationship between latrine use and child health. Data were collected between July and August 2016, which is the dry season of the geographical study site. We surveyed 251 individuals (in 98 households) 13 years and older living in 20 rural communities in Esmeraldas, Ecuador (see Table 1 for demographics). The communities selected for enrollment participated in previous enteric pathogen transmission studies [33]. Eligibility for household enrollment was based on whether a child 12 month of age or younger lived in the home. Within each selected community, a household census was conducted to determine if eligibility criteria was met; if so, the household was approached for participation. Of eligible households, 94% were contacted and of these, 94% participated. Each individual 13 years or older living in an enrolled household was also approached for participation. Ninety-two percent of eligible individuals were present during the field team’s visits and 81% of them consented to interview. The study communities are heterogeneous in terms of race/ethnicity; there are Afro-Ecuadorians, Chachi (an indigenous groups), and mestizos.

Table 1 Background characteristics of the study sample

Latrine use for defecation

Individuals in our study site practice a variety of sanitation behaviors. They range from private household latrine use, to public community latrine use, to open defecation [21]. Defecation practices vary by demographic factors such as individual age, gender, place in the family hierarchy, and time spent outside of the home, as well as external factors, such as personal safety and privacy. External factors influencing defecation extend beyond the individual level. Given the tropical climate of the study site, rainfall also influences when and where one defecates. Details regarding multidimensional drivers of latrine use for defecation are presented elsewhere [21].


Study participants were interviewed face-to-face by a local, trained survey team within the participant’s home. The interviews focused on the psychosocial determinants of defecation within latrines and sociodemographic information. The survey tool was designed using the Integrated Behavior Model of Water Sanitation and Hygiene (IBM-WASH), a health behavior framework that presents behavior as a byproduct of intersecting dimensions of technology, context, and psychosocial factors that operate through ecological levels [34]. We applied the IBM-WASH framework to develop indicators reflecting determinants of latrine defecation behavior as we conceptualize human behavior to be dynamic influenced by a multidimensional ecosystem.

The survey tools consisted of 16 indicators, each measured by a specific question, to assess psychosocial determinants of latrine use for defecation (Table 2). Each survey question had three possible responses: yes, no, or don’t know. We determined these indicators using ethnographic data from the study site as well as their demonstrated association with consistent defecation in a latrine in the communities of study. Details are summarized in a prior publication [21]. Some of these indicators reflect descriptive social norms of defecation behavior in the communities; for example, whether a respondent agrees or disagrees that men in their village use a latrine during the dry season. Other indicators reflect themes of personal safety, convenience, and anxiety related to latrine use, as well as attitudes towards sharing a latrine with others. We also asked about perceptions of the latrine cleanliness and maintenance, and about habitual latrine use.

Table 2 Descriptive statistics for latent class analysis indicators: the proportion of respondents that agree, disagree, or answer don’t know to each survey statement

The survey team documented the presence of a sanitation facility following WHO/UNICEF Joint-Monitoring Programme definitions. A sanitation facility is considered improved if it separates human excreta from contact with the facility user [35]. Improved sanitation facilities that are not shared with other households are defined as basic sanitation, whereas an improved facility that is shared with at least two households is considered limited sanitation. An unimproved facility does not provide hygienic separation of fecal matter from latrine users.

There are a number of important covariates for the analysis. Education and wealth, metrics of socioeconomic position [36], are associated with latrine access and the desire to own a latrine [37, 38]. Defecation practices, as well as the sanitation facility used, vary by race/ethnicity within the study site. Women and men display distinct patterns of latrine defecation that may not be captured by the other covariates [39, 40]. Because wealth ascertainment in LMIC research often reflects asset ownership, which proxies one’s access to resources [41], we measured asset ownership and whether one’s household walls were constructed with cement. Asset deprivation is a composite indicator considered within the Standard of Living component in the Multidimensional Poverty Index that accounts for access to three different types of assets: information, mobility, and livelihood [42]. Sociodemographic and asset covariates were measured via self-report; trained field staff observed household construction materials.

The Institutional Review Board at the University of Michigan and the Universidad San Francisco de Quito Bioethics Committee approved the data collection methods and study procedures.

Analytical approach

We used latent class analysis (LCA) to classify individuals according to their propensity to use a latrine for defecation. LCA is a modeling approach that enables estimation of the probability of an individual belonging to one of a small number of hypothesized latent (or unobserved) classes based on a set of measured indicators, and allows for measurement error to be explicitly accounted for when estimating the final categorical latent construct [43]. To model latrine use as a latent construct, we used the 16 psychosocial determinants of latrine defecation measured in the aforementioned survey tool, and employed full information maximum likelihood (FIML) with robust standard errors (see Additional file 1 for polychoric correlations between indicators). Of note, because there is no gold standard metric for latrine defecation, we chose not to compare the latent assignment of latrine use for defecation to any other metric. Thus, this approach does not validate the final classifications. Rather, we assess for validity by examining the final model entropy, considering the item probability for each indicator, comparing these patterns to prior research conducted in the study site, and finally, by evaluating the final model with the Hill criteria for causality.

Model building proceeded in a sequential process by first specifying the measurement model, where the 16 measured determinants were indicators of latent class membership. We then specified the hypothesized number of latent classes, first including three classes and then including two classes. In the three-class model, we expect individuals to be grouped into the following latrine user categories: never users, occasional users, and always users. A two-class model aggregates these categories into latrine users and non-users or consistent latrine users and inconsistent latrine users. Interpretation of classes beyond three groups is challenging; it is difficult to disaggregate meaning results between theoretical “less than always latrine users” and “more than never latrine users”. To determine the optimal class solution (that is, the number of classes within the sample), we compared the model fit between the three-class and two-class models. Good fitting models are characterized by: [1] a low Bayesian Information Criterion (BIC) value; [2] a statistically non-significant chi-square goodness-of-fit statistic that assessed the observed class distribution and expected probability distribution of classes; and [3] relative entropy above 0.80 [44,45,46,47] (see Additional file 2 for a comparison of class assignment in each of the models tested). Once the optimal class solution was determined, we examined the latent-class conditional item probabilities for each indicator (i.e., the probability of response to each survey question given class assignment). Uninformative indicators were removed from the model (see Additional file 3 for conditional item probabilities for all indicators). An indicator was considered uninformative if the item probability confidence interval overlapped between class assignments. Please also see the Additional file 4 for gender-stratified LCA models.

In addition to identifying latent classes of latrine users, we also described the characteristics of the individuals in each class. Because of the inherent uncertainty in predicted class membership via LCA, we employed a multiple imputation (or pseudo-class) approach [48], using five random draws from a uniform distribution to generate five simulated class memberships for each respondent (given their original predicted class probability). Demographic descriptive statistics are presented for the modal class membership for each individual from the five imputations.

Using each imputed class membership as the outcome to test the association between class membership (the dependent variable) and household-level access to a basic sanitation facility, we fit five logistic regression models. Each of the regression models was adjusted for: individual-level educational attainment (categorical), race/ethnicity (binary, as Afro-Ecuadorian and non-Afro-Ecuadorian), and gender (binary). While it is theoretically plausible that gender acts as an effect modifier, it was not possible to explore this given the small sample size. At the household-level, the following variables were also included as proxy-indicators to control for wealth: cement household walls (binary) and whether a household is asset- deprived (binary). We accounted for household clustering when computing standard errors and planned to run multinomial logistic models if the three-class outcome was selected and binary logistic regression if the two-class outcome was chosen.

Following the regression modeling, estimates from the five models fitted to the imputed data sets were combined following rules described by Rubin [49] (see Additional file 5 for demographic proportions for each imputation and for model results for each imputation). Combining the models allows for the standard error of the association between latrine access and latrine use for defecation to be calculated.

Analyses were conducted using the R (version 3.4.1) packages polycorr, poLCA, and survey; see Additional file 6 for R code.


The study sample was young (median age = 23), predominately female (65%) and Afro-Ecuadorian (63%). Most participants did not complete high school (81%), and of those that worked in the last year, it was most common to receive cash remittance. Slightly more than half of respondents live in households with non-cement walls (51%) and in households with access to basic sanitation (53%). Sixty-one percent of participants were from asset-deprived homes, and overall access to improved sanitation is high in the population (86%), with half of the sample having access to a privately owned, improved sanitation facility (i.e., basic sanitation) (Table 1).

The responses to the 16 indicators are presented in Table 2. Survey questions asking about within-home latrine sharing and about convenience of latrine use while outside of the home had more evenly split responses relative to the rest of the survey questions. While most survey questions generated a yes or no response, indicators asking about descriptive norms of latrine use by men and neighbors tended to have more don’t know responses.

Model fit

Using all 16 indicators, the 3-class and 2-class model solutions presented similar model fit statistics and predicted class memberships (Table 3). Nested group profiles are evident between the models. We thus label the three-class groups as “never users” (1%), “sometimes users” (25%), and “always users” (74%) and membership in the two-class model as “inconsistent latrine users” (22%) and “consistent latrine users” (78%). Within the 3-class model it is not feasible statistically to distinguish the 1% of the sample (n = 3, i.e., the “never users”) from the remainder in further multivariable analyses; thus, the 3-class solution was rejected.

Table 3 Model fit information and distribution of predicted classes probabilities using all 16 indicators

After removing uninformative indicators from the two-class model, five indicators remained. The chi-squared (395) and BIC (1527) estimates of the 5-indicator 2-class model were drastically improved relative to the initial 16-indicator model (chi-squared = 2.9E+ 13 and BIC = 4675), while the entropy estimate (0.86) and predicted class memberships remained the same (0.78 and 0.22). The two-class model with five indicators was used in further analyses.

Overall, the final two-class model presents evidence of high conditional probabilities for four of the five indicators in the consistent latrine use class, indicating robust classification (Table 4). This is also reflected in the high value for the entropy estimate. The indicator for within-household latrine sharing (“There are too many people in this household for just one latrine”), however, has a low conditional item probability among those with assigned consistent latrine use membership. Only 52% of consistent latrine users responded no to this question, revealing that this indicator may not appropriately distinguish between classes.

Table 4 The parameter estimates for the final 2-class model that included 5 indicators: item conditional probability, standard error (SE), and response to the survey question

Latent class membership assignment

Based on the final 2-class model, consistent latrine use was lowest among young adults (age 18–21), and also increased with higher educational attainment (Table 5). Consistent latrine use was also lowest among those that were unemployed in the prior year, while access to resources at the household-level showed no differences in latrine use. Accounting for individual factors and household resources, the fully adjusted regression model results showed no evidence of an association between household access to a basic sanitation facility and the probability of latrine use (Table 6; Basic Sanitation OR = 1.1, 95% CI = 0.6–2.1).

Table 5 The proportion of the population that are classified as consistent latrine users, by background characteristics
Table 6 Overall association between access to basic sanitation and latrine use (combined from each model using imputed latrine use as the outcome)


For latrines to reduce pathogen transmission they must remain clean and be used. Thus, research and programmatic evaluation need to have accurate defecation behavior measurements. To this end, our findings reveal three important insights into the complicated relationship between latrine presence and latrine use. First, our model predicts that less than a quarter of this population practices inconsistent latrine defecation (22%, Table 2), even when access to improved sanitation is high (86%, Table 1). Although we do not present self-reported latrine use data, we suspect that misclassification would have been high in this variable given that less than 10% of respondents disagreed with the statement “I use a latrine daily” (see Additional file 7 for further comparisons). Additionally, we can also find insight from prior qualitative research on this topic. From qualitative work, we know that defecation behaviors besides latrine use are not uncommon and that self-reported defecation behaviors in a quantitative survey context are likely misclassified. The triangulation of misclassification in self-report latrine use for defecation and prior research in the study site suggests that psychosocial indicator variables are a potential surrogate for latrine defecation that minimizes misclassification bias. Second, we found no evidence that the presence of basic sanitation within the home was associated with latrine use. This result suggests that private, within-home sanitation does not ensure the use of said facility for every family member each time one needs to defecate. Our model result is consistent with data from India, which present variable sanitation practices among individuals living in a household with a sanitation facility [16, 50]. Some individuals will defecate in the household’s latrine, while others practice open defecation. Our null finding also provides an explanation for the null relationship between latrine access and child health observed in a number of settings [10,11,12]. And third, we found that five simple indicators could be used as reasonable indicators of the two latent classes of individuals in this population, as evidenced by a high entropy value for the final model that approached one (0.86). This high entropy value indicates limited misclassification in group assignment and is also an overall reflection of the indicators’ high conditional item probabilities in the consistent latrine use class [51, 52]. The strength of these indicators suggests that they are plausible underlying manifestations of latrine defecation. In further examining which indicators were informative for group classification, social theories of behavior provide additional rationale for these model results.

Of the 16 indicators examined, those that distinguished between class memberships asked individuals about community defecation norms (i.e., other’s latrine use for defecation) and latrine sharing between households. We also conducted a sensitivity analysis with these five societal-level indicators to determine whether they were useful in distinguishing between three classes of latrine users; the three-class model results were reasonable, however, the sample size of the groups was too small to permit well-powered comparisons. The strength of community level norm indicators to classify latent class membership, coupled with social theories of behavior, suggests potential of these indicators to serve as a proxy measure for individual behavior. Social theories of behavior view individual factors alone as limited metrics of behavior [53]. Sociocultural factors shaping daily habits in invisible ways [54]; humans often exhibit the behavior of those around them, where normative expectations playing a key role in one’s own actions [55]. For example, the LCA suggests that agreement with indicators asking whether others use latrines presents a very high probability that one is a consistent latrine user. Moreover, agreement with statements regarding latrine sharing between households may reflect a strong commitment to latrine use; that is, an individual implies that they would willingly face a variety of barriers to use someone else’s latrine (inconvenience, distance, dirty environment, etc.) if within-household access to a latrine did not exist. The psychological literature supports commitment as one important driver to behavior [56], again providing evidence of a plausible mechanism driving latrine defecation. If we apply traditional metrics of Hill’s exposure-outcome causality criteria to the study findings, our results exhibit strength, consistency (with the body of research), plausibility, and coherence [57]. Use of the Hill criteria provides evidence that these indicators are valid proxies of latrine defecation behavior.

Although the indicators used in the model were plausible, consistent, and coherent, our approach does not validate that the selected psychosocial variables are robust indicators of latrine use. Given the challenges of collecting unbiased measurements of latrine use, validating the predictive power of these indicators is difficult. Nevertheless, to address this limitation, we conducted extensive fieldwork to assess drivers of latrine defecation [21]. Beyond this initial work to define the 16 indicator variables that have potential to be surrogate indicators of latrine use, our LCA analysis suggests that community-level norms of latrine defecation are the strongest indicators. Thus, there is good potential for these indicators to generalize to other populations. Indeed the social environment tends to influence health behaviors [58]. Related to defecation, social norms and the behaviors of others in our social networks have been identified as determinants of latrine use behavior in India and Ethiopia [59,60,61]. The Community-Led Total Sanitation Campaign, a sanitation program targeting LMICs communities that seeks to end open defecation and has received lots of attention throughout the globe, operates through levels of behavior change that include community-pressure to promote latrine use [62, 63]. Our research was conducted in a very specific population; and it plausible that the latrine use and its drivers differ across the 20 communities included in our study as well as from drivers in other cultural contexts. Prior comparisons of the determinants of latrine defecation across cultures and geographical location highlight the role of shared community values, as well as perceptions of latrine cleanliness, on influencing behavior [21]. However, because prior studies are not void of self-report defecation behavior, it is plausible that identified determinants may be biased. Thus, determining which social norms of latrine defecation are applicable indicators across cultures requires further research. Furthermore, because social norms surrounding defecation behavior may also be influenced by the maintenance or cleanliness of a latrine, incorporating additional information on the latrine would add to this exploration and further the generalizability of this research.


As an often neglected side of the epidemiologic triad, we focus on human behavior at the interface of the environment and biological agent of disease. Our approach, illustrated here for latrine defecation, follows methods commonly used to measure socially stigmatized actions, such as intimate partner violence [64], aggression and bullying [65], or substance abuse [66]. Our work within the WASH sector, which allows us to test common assumptions of how people defecate, a critical component to disease transmission. More research is needed, however, to refine the selection of latrine use indicators. First and foremost, gender specific indicators, which may be different by life course stage [39], will likely provide better insight into population-level drivers of behavior and more accurate classification of latrine users (see our sensitivity analysis in Additional file 4). Second, inconsistent latrine use may have a different set of determinants than consistent latrine use, as these behaviors are not strictly opposites; thus, additional work is required to determine indicators of inconsistent latrine use and whether said indicators can distinguish groups of people. Third, because psychosocial norms, attitudes, and beliefs may change over time, longitudinal analysis are required to determine if these indicators are temporally consistent. Finally, future research is needed to test hypotheses assessing whether latrine use measures are adequate indicators of reduced exposure to enteric pathogens.

Regardless of the disease system of study, epidemiologists interested in population dynamics that lead to health and disease should include metrics of behavior and social process into their research. Development of psychosocial indicators of sensitive behaviors and application of LCA is a useful tool for measurement of sensitive behavioral risk factors, such as hygiene or sexual activity, where self-report data suffer from misclassification and no gold standard measurement exists.



Bayesian Information Criterion


Confidence Interval


Full Information Maximum Likelihood


Integrated Behavior Model of Water Sanitation and Hygiene


Latent Class Analysis


Low and Middle-Income Countries


Odds Ratio


Sustainable Development Goal


Standard Error


Water, sanitation, and hygiene


  1. Troeger C, Forouzanfar M, Rao PC, Khalil I, Brown A, Reiner RC, et al. Estimates of global, regional, and national morbidity, mortality, and aetiologies of diarrhoeal diseases: a systematic analysis for the Global Burden of Disease Study 2015. Lancet Infect Dis [Internet]. 2017 [cited 2018 Jan 21];17(9):909–948. Available from:

  2. Black RE, Victora CG, Walker SP, Qar Z, Bhutta A, Christian P, et al. Maternal and Child Nutrition 1 Maternal and child undernutrition and overweight in low-income and middle-income countries. Lancet [Internet]. 2013 [cited 2017 Oct 5];382(3):427–451. Available from:

  3. Scrimshaw NS. Historical concepts of interactions, synergism and antagonism between nutrition and infection. J Nutr. 2003;133:316S–21S.

    Article  Google Scholar 

  4. Guerrant RL, DeBoer MD, Moore SR, Scharf RJ, Lima AAM. The impoverished gut—a triple burden of diarrhoea, stunting and chronic disease. Nat Rev Gastroenterol Hepatol [Internet]. 2012 [cited 2017 Dec 7];10(4):220–229. Available from:

  5. Bartram J, Cairncross S. Hygiene, Sanitation, and Water: Forgotten Foundations of Health. PLoS Med [Internet]. 2010 [cited 2017 Dec 7];7(11):e1000367. Available from:

  6. Bhutta ZA, Das JK, Walker N, Rizvi A, Campbell H, Rudan I, et al. Interventions to address deaths from childhood pneumonia and diarrhoea equitably: what works and at what cost? Lancet [Internet]. 2013 [cited 2017 Dec 7];381(9875):1417–1429. Available from:

  7. Mara D, Lane J, Scott B, Trouba D. Sanitation and health. PLoS Med [Internet]. 2010 [cited 2017 Dec 7];7(11):e1000363. Available from:

  8. Fuller JA, Eisenberg JNS. Herd Protection from Drinking Water, Sanitation, and Hygiene Interventions. Am J Trop Med Hyg [Internet]. 2016 [cited 2017 Dec 7];95(5):1201–1210. Available from:

  9. United Nations Secretary-General. Progress towards the Sustainable Development Goals: Rport E/2017/66. 2017.

  10. Clasen T, Boisson S, Routray P, Torondel B, Bell M, Cumming O, et al. Effectiveness of a rural sanitation programme on diarrhoea, soil-transmitted helminth infection, and child malnutrition in Odisha, India: a cluster-randomised trial. Lancet Glob Heal. 2014;2(11):e645–53.

    Article  Google Scholar 

  11. Patil SR, Arnold BF, Salvatore AL, Briceno B, Ganguly S, Colford JM, et al. The effect of India’s total sanitation campaign on defecation behaviors and child health in rural Madhya Pradesh: A cluster randomized controlled trial. PLoS Med. 2015;11(8):e1001709.

  12. Fuller JA, Westphal JA, Kenney B, Eisenberg JNS. The joint effects of water and sanitation on diarrhoeal disease: a multicountry analysis of the Demographic and Health Surveys. Trop Med Int Heal [Internet]. 2015 [cited 2017 Dec 11];20(3):284–292. Available from:

  13. Luby S. Is targeting access to sanitation enough? Lancet Glob Heal [Internet]. 2014 [cited 2018 Jan 22];2(11):e619–e620. Available from:

  14. Bendahmane Diane PB and AS. SER I ES Water and Sanitation Indicators Measurement Guide. Assist Foodnutrition Tech. 1999;1.

  15. Garn J, Sclar GD, Freeman MC, Alexander K, Penakalapati G, Clasen T. The impact of sanitation interventions on latrine coverage and latrine use: systematic review and meta analysis. Int J Hyg Environ Health [Internet]. 2016; Available from:

  16. Banda K, Sarkar R, Gopal S, Govindarajan J, Harijan BB, Jeyakumar MB, et al. Water handling, sanitation and defecation practices in rural southern India: a knowledge, attitudes and practices study. Trans R Soc Trop Med Hyg. 2007;101(11):1124–30.

    Article  Google Scholar 

  17. Manun’Ebo M, Cousens S, Haggerty P, Kalengaie M, Ashworth A, Kirkwood B. Measuring hygiene practices: a comparison of questionnaires with direct observations in rural Zaïre. Trop Med Int Health [Internet]. 1997;2(11):1015–21 Available from:

    Article  Google Scholar 

  18. Nanan D, White F, Azam I, Afsar H, Hozhabri S. Evaluation of a water, sanitation, and hygiene education intervention on diarrhoea in northern Pakistan. Bull World Heal Organ. 2003;81(3):160–5.

    CAS  Google Scholar 

  19. Jenkins MW, Freeman MC, Routray P. Measuring the safety of excreta disposal behavior in India with the new safe san index: reliability, validity and utility. Int J Environ Res Public Health. 2014;11(8):8319–46.

    Article  Google Scholar 

  20. Dreibelbis R, Jenkins M, Chase RP, Torondel B, Routray P, Boisson S, et al. Development of a multidimensional scale to assess attitudinal determinants of sanitation uptake and use. Environ Sci Technol. 2015;49(22):13613–21.

    Article  CAS  Google Scholar 

  21. Lopez V, Berrocal VJ, Corozo Angulo B, Ram PK, Trostle J, Eisenberg JN. Determinants of latrine use behavior: the psychosocial proxies of individual-level defecation practices in rural coastal Ecuador. Am J Trop Med Hyg. 2019.

  22. Bentley ME, Boot MT, Gittelsohn J, Stallings RY. The use of structured observations in the study of health behaviour. Occas Pap [Internet]. 1994; Available from:[1].pdf. Accessed 9 Jan 2018.

  23. Curtis V, Cousens S, Mertens T, Traore E, Kanki B, Diallo I. Structured observations of hygiene behaviours in Burkina Faso: validity, variability, and utility. Bull World Health Organ [Internet] 1993;71(1):23–32. Available from:

  24. Ram PK, Halder AK, Granger SP, Jones T, Hall P, Hitchcock D, et al. Is structured observation a valid technique to measure handwashing behavior? Use of acceleration sensors embedded in soap to assess reactivity to structured observation. Am J Trop Med Hyg. 2010;83(5):1070–6.

    Article  Google Scholar 

  25. Clasen T, Fabini D, Boisson S, Taneja J, Song J, Aichinger E, et al. Making sanitation count: developing and testing a device for assessing latrine use in low-income settings. Environ Sci Technol. 2012;46(6):3295–303.

    Article  CAS  Google Scholar 

  26. Arnold BF, Khush RS, Ramaswamy P, Rajkumar P, Durairaj N, Ramaprabha P, et al. Short report: reactivity in rapidly collected hygiene and toilet spot check measurements: a cautionary note for longitudinal studies. Am J Trop Med Hyg. 2015;92(1):159–62.

    Article  Google Scholar 

  27. Sinha A, Nagel CL, Thomas E, Schmidt WP, Torondel B, Boisson S, et al. Assessing latrine use in rural India: a cross-sectional study comparing reported use and passive latrine use monitors. Am J Trop Med Hyg. 2016;95(3):720–7.

    Article  Google Scholar 

  28. Fenton KA, Johnson AM, McManus S, Erens B. Measuring sexual behaviour: methodological challenges in survey research. Sex Transm Infect [Internet]. 2001 [cited 2017 Dec 11];77(2):84–92. Available from:

  29. Zenilman JM, Weisman CS, Rompalo AM, Ellish N, Upchurch DM, Hook EWI, et al. Condom Use to Prevent Incident STDs: The Validity of Self-Re... : Sexually Transmitted Diseases. Sex Transm Dis [Internet]. 1995 [cited 2017 Dec 11]; Available from:

  30. Rose E, DiClemente RJ, Wingood GM, Sales JM, Latham TP, Crosby RA, et al. The Validity of Teens’ and Young Adults’ Self-reported Condom Use. Arch Pediatr Adolesc Med [Internet]. 2009 [cited 2017 Dec 11];163(1):61. Available from:

  31. Burke NJ, Joseph G, Pasick RJ, Barker JC. Theorizing social context: rethinking behavioral theory. Heal Educ Behav [Internet] 2009;36(5 suppl):55S–70S. Available from:

  32. Lazarsfeld P, Henry N. Latent structure analysis. Boston: Houghton Mifflin; 1968.

    Google Scholar 

  33. Eisenberg JNS, Cevallos W, Ponce K, Levy K, Bates SJ, Scott JC, et al. Environmental change and infectious disease: how new roads affect the transmission of diarrheal pathogens in rural Ecuador. Proc Natl Acad Sci U S A [Internet] 2006;103(51):19460–19465. Available from:

  34. Dreibelbis R, Winch PJ, Leontsini E, Hulland KR, Ram PK, Unicomb L, et al. The integrated behavioural model for water, sanitation, and hygiene: a systematic review of behavioural models and a framework for designing and evaluating behaviour change interventions in infrastructure-restricted settings. BMC Public Health [Internet]. 2013;13(1):1015. Available from:,

  35. WHO/UNICEF, Joint Monitoring Programme for Water Supply and Sanitation (JMP). [Internet]. Available from: Accessed 9 Jan 2018.

  36. Galobardes B, Shaw M, Lawlor DA, Lynch JW, Davey Smith G. Indicators of socioeconomic position (part 1). J Epidemiol Community Health [Internet]. 2006 [cited 2017 Dec 11];60(1):7–12. Available from:

  37. Jenkins MW, Curtis V. Achieving the ‘good life’: Why some people want latrines in rural Benin. Soc Sci Med [Internet]. 2005 [cited 2017 Dec 7];61(11):2446–2459. Available from:

  38. Ghosh A, Cairncross S. The uneven progress of sanitation in India. J Water, Sanit Hyg Dev [Internet]. 2014;4(1):15. Available from: [cited 2017 Dec 7].

  39. Caruso BA, Clasen T, Yount KM, Cooper HLF, Hadley C, Haardörfer R. Assessing women’s negative sanitation experiences and concerns: the development of a novel sanitation insecurity measure. Int J Environ Res Public Health. 2017;14:755-77.

  40. Sinha A, Nagel CL, Schmidt WP, Torondel B, Boisson S, Routray P, et al. Assessing patterns and determinants of latrine use in rural settings: A longitudinal study in Odisha, India. Int J Hyg Environ Health. 2017;220(5):906-15.

  41. Harttgen K, Vollmer S. Using an asset index to simulate household income. Econ Lett [Internet]. 2013 [cited 2017 Dec 7];121(2):257–262. Available from:

  42. Alkire S, Santos ME. Acute Multidimensional Poverty: A New Index for Developing Countries. SSRN Electron J [Internet]. 2010 [cited 2017 Dec 11]; Available from:

  43. Biemer PP. Latent class analysis of survey error. Hoboken, NJ: John Wiley & Sons, Inc.; 2010 [cited 2018 Jan 9]. Available from:

  44. Lin TH, Dayton CM. Model Selection Information Criteria for Non-Nested Latent Class Models. J Educ Behav Stat [Internet]. 1997 [cited 2018 Jan 9];22(3):249–264. Available from:

  45. Forster MR. Key Concepts in Model Selection: Performance and Generalizability. J Math Psychol [Internet]. 2000 [cited 2018 Jan 9];44(1):205–231. Available from:

  46. Li T, Ma S, Ogihara M. Entropy-based criterion in categorical clustering. In: Twenty-first international conference on Machine learning - ICML ‘04. New York, New York: ACM Press; 2004 [cited 2018 Jan 9]. p. 68. Available from:

  47. Muthén L, Muthén B. Mplus Discussion: What is a good value of entropy? [Online Commentary] [Internet]. 2007 [cited 2018 Jan 10]. Available from:

  48. Wang C-P, Hendricks Brown C, Bandeen-Roche K. Residual Diagnostics for Growth Mixture Models. J Am Stat Assoc [Internet]. 2005 [cited 2017 Dec 11];100(471):1054–1076. Available from:

  49. Rubin DB. Multiple Imputation for Nonresponse in Surveys [Internet]. Wiley-Interscience; 1987;287. Available from: [cited 2018 Jan 16].

  50. Arnold BF, Khush RS, Ramaswamy P, London AG, Rajkumar P, Ramaprabha P, et al. Causal inference methods to study nonrandomized, preexisting development interventions. Proc Natl Acad Sci U S A. 2010;107(52):22605–10.

    Article  CAS  Google Scholar 

  51. Celeux G, Soromenho G. An entropy criterion for assessing the number of clusters in a mixture model. J Classif [Internet]. 1996 [cited 2017 Dec 11];13(2):195–212. Available from:

  52. Kaplan D, Keller B. A Note on Cluster Effects in Latent Class Analysis. Struct Equ Model A Multidiscip J [Internet]. 2011 [cited 2017 Dec 11];18(4):525–536. Available from:

  53. Glass TA, McAtee MJ. Behavioral science at the crossroads in public health: Extending horizons, envisioning the future. Soc Sci Med [Internet]. 2006 [cited 2017 Dec 11];62(7):1650–1671. Available from:

  54. Bourdie P. The logic of practice [Internet]. 1980 [cited 2017 Oct 5]. Available from:

  55. Bicchieri C. The grammar of society: the nature and dynamics of social norms. UK: Cambridge University Press; 2006. 260 p.

  56. Lokhorst AM, Werner C, Staats H, van Dijk E, Gale JL. Commitment and Behavior Change. Environ Behav [Internet]. 2013 [cited 2018 Jan 8];45(1):3–34. Available from:

  57. Hill AB. The environment and disease: association or causation? J R Soc Med [Internet]. 2015 [cited 2017 Dec 11];108(1):32–37. Available from:

  58. Bandura A. Human agency in social cognitive theory. Am Psychol [Internet]. 1989;44(9):1175–1184. Available from:

  59. Aiemjoy K, Stoller N GS. Is Using a Latrine “A Strange Thing To Do”? A Mixed-Methods Study of Sanitation Preference and Behaviors in Rural Ethiopia. Am J Trop Med Hyg [Internet]. 2017;96(1):65–73. Available from:

  60. Shakya HB, Christakis NA, Fowler JH. Social network predictors of latrine ownership. Soc Sci Med. 2015;125:129–38.

    Article  Google Scholar 

  61. O’Reilly K, Louiss’ E. The toilet tripod: Understanding successful sanitation in rural India. Heal Place [Internet]. 2014;29:43–51. Available from:

  62. Kar K, Chambers R. Handbook on Community-Led Total Sanitation [Internet]. Vol. 44, Children. 2008. 96 p. Available from: Accessed 9 Jan 2018.

  63. Sigler R, Mahmoudi L, Graham JP. Analysis of behavioral change techniques in community-led total sanitation programs. Health Promot Int [Internet]. 2015 [cited 2017 Dec 7];30(1):16–28. Available from:

  64. Ansara DL, Hindin MJ. Exploring gender differences in the patterns of intimate partner violence in Canada: a latent class approach. J Epidemiol Community Health [Internet]. 2010 [cited 2017 Dec 11];64(10):849–854. Available from:

  65. Williford AP, Brisson D, Bender KA, Jenson JM, Forrest-Bank S. Patterns of Aggressive Behavior and Peer Victimization from Childhood to Early Adolescence: A Latent Class Analysis. J Youth Adolesc [Internet]. 2011 [cited 2017 Dec 11];40(6):644–655. Available from:

  66. Patrick ME, Bray BC, Berglund PA. Reasons for Marijuana Use Among Young Adults and Long-Term Associations With Marijuana Use and Problems. J Stud Alcohol Drugs [Internet]. 2016 [cited 2017 Dec 11];77(6):881–888. Available from:

Download references


We would like to thank the study respondents for their time and participation, as well as the field staff for their dedication. We also thank Betty Corozo Angulo for her substantive input into survey design during the study piloting, which resulted in an improved questionnaire.


Data collection was funded through the University of Michigan Graham Institute’s Dow Sustainability Fellows Program. The National Institute of General Medical Sciences under the Modeling of Infectious Disease Agents Study (U01-GM110712) of the National Institutes of Health funded the analyses.

Availability of data and materials

The datasets used and analyzed during the current study are available from the corresponding author on reasonable request.

Author information

Authors and Affiliations



VKL designed the research question and primary data collection, funded and managed data collection, cleaned and analyzed data, wrote the paper, and had primary responsibility for final content. BTW guided data analysis and contributed to data interpretation. PJC and EQ contributed to data interpretation and initial drafts of the manuscript. JNSE contributed to data interpretation, had primary responsibility for final content, and provided financial support for research activities. All authors read and approved the final manuscript.

Corresponding author

Correspondence to V. K. Lopez.

Ethics declarations

Ethics approval and consent to participate

This study received ethical approval in the United States, via the Institutional Review Board at the University of Michigan (study number HUM00109763), and in Ecuador, via the Universidad San Francisco de Quito Bioethics Committee. Per the ethical approval, all individuals approached for enrollment in the study provided written consent for participation.

Consent for publication

The authors declare that they have no competing interests.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Polychoric correlations. (DOCX 79 kb)

Additional file 2:

Class assignment in each model. (DOCX 78.5 kb)

Additional file 3:

Conditional item probabilities for all 16 indicators. (DOCX 28 kb)

Additional file 4:

Gender-stratified LCA models. (DOCX 141 kb)

Additional file 5:

Imputed latrine use assignment. (DOCX 151 kb)

Additional file 6:

R Code for Primary Analysis. (DOCX 92.1 kb)

Additional file 7:

Comparison of latrine use metrics and LCA model consistency. (DOCX 150 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lopez, V.K., West, B.T., Clarke, P.J. et al. Latent variable modeling to develop a robust proxy for sensitive behaviors: application to latrine use behavior and its association with sanitation access in a middle-income country. BMC Public Health 19, 90 (2019).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: