Measuring 8 to 12 year old children’s self-report of power imbalance in relation to bullying: development of the Scale of Perceived Power Imbalance
BMC Public Health volume 19, Article number: 1046 (2019)
While power imbalance is now recognized as a key component of bullying, reliable and valid measurement instruments have yet to be developed. This research aimed to develop a self-report instrument that measures power imbalance as perceived by the victim of frequent aggressive behavior.
A mixed methods approach was used (468 participants, Grade 4 to 6). This paper describes the exploratory (n = 111) and confirmatory factor analysis of the new instrument (n = 337), and assessment of reliablity and construct validity.
A 2-factor model represented physical and social aspects of power imbalance (n = 127: normed chi-square = 1.2, RMSEA = .04, CF1 = .993). The social factor included constructs of group and peer valued characteristics.
This research will enhance health and education professionals understanding of power imbalance in bullying and will inform the design and evaluation of interventions to address bullying in children.
This paper discusses the development and validation of the Scale of Perceived Power Imbalance (SPPI), an instrument designed to measure children’s experience of power imbalance associated with bullying. The definition of bullying provides a basis for the development of the measurement tool . For the purpose of this study, school bullying is defined as a form of aggression that is distinguished by repeated physical or emotional harm within a relationship of power imbalance . This differs to the definition provided by Olweus , who included intent to harm in the definition of bullying. The criteria of intent differentiates purposeful acts of aggression from accidental harm . It has been proposed that intentionality is understood within the context and goals of bullying: that harm is intended by the perpetrator, is perpetrated within the social dynamic of the peer group, and is perceived by the victim of bullying . Based on a legal framework, it has been proposed that the judgement of intent rests on the likelihood that a reasonable person would foresee that aggressive behavior would result in harm [1, 5]. This criteria is difficult to apply as intent is not easily observed [1, 6]. For these combined reasons, the criteria of intent is not included in the definition of bullying for this paper.
The concept of repetition as an essential criteria of bullying has also been questioned by researchers, because single acts of aggression can provide an ongoing threat resulting in long term physical or emotional harm [5, 7]. However, the uniform definition of school bullying as unwanted aggression that is repeated and involves a power imbalance is accepted internationally [8, 9]. Consistent with this, repetition of aggression has been empirically associated with an experience of significantly greater threat and harm by targeted children than those who experienced aggression without repetition [10,11,12]. Children who are bullied perceive that they do not have the power to stop repeated aggression, contributing to an increase in harm . The repeated acts of aggression that occur in a power imbalanced relationship place a load on neuroendocrine pathways that respond to stress, increasing the risk of poor health, learning and developmental outcomes . Outcomes associated with bully victimization include fear, loss of hope, anxiety, depression and suicidal ideation . The core concept that differentiates bullying from aggression is, therefore, the abuse of power by the perpetrator, and the experience of a power imbalance by the victim . Despite the importance of the concept of power imbalance in bullying research, it is reported in recent literature that power imbalance has not been measured effectively, resulting in the inaccurate reporting of aggression as bullying .
Reports of bullying begin to increase in grade 4 to 6 (age 8 to 12) as children develop the cognitive capacity for self-reflection and place increasing value on social hierarchies [16, 17]. The concept of power imbalance as central to the definition of bullying has been driven by researchers, and children themselves may not consider repetition or power imbalance when talking about bullying . They are, therefore, likely to report all acts of aggression as bullying . With this in mind, researchers have sought to include the constructs of repetition and power imbalance in their self-report bullying instruments. Some researchers include a definition and ask children to consider the definition while answering questions . There is evidence that many children fail to apply the definition of bullying correctly when answering the questions that follow and are prone to report aggressive behavior as bullying . Other children may avoid answering questions truthfully when they read the word bully within the definition because of the stigma and shame associated with being a victim of bullying . In this way the definition-based approach is contrary to conventional self-report measurement in psychology where questions ask about very specific behaviors and experience to discourage bias associated with respondents giving socially-desirable answers .
A second method of assessment seeks to increase the accuracy of measurement by asking children who report frequent victimization direct questions to assess perceived power imbalance. This is referred to as the behavioral-based method of assessment . For example, Hunter et al.  aimed to differentiate bullying from victimization using three individual items to determine if the aggressor was physically stronger, in a bigger group, or more popular than the child completing the survey (ages 8–13). Concurrent and discriminant validity of the instrument were supported; students who experienced power imbalance perceived more threat and loss of control, and were at greater risk of depression. Similarly, the Californian Bully Victim Scale (CBVS) included three items to measure power imbalance from the perspective of the victim, asking “how popular, smart in schoolwork, and physically strong” the aggressor was . Predictive validity was established; students who experienced power imbalance reported lower connectedness to school, life satisfaction, and hope. Five items were added to the CBVS: “how likeable, good looking, athletic, old, and how much money” the aggressor had in comparison to the victim . These authors assessed the validity of a definition versus behavioral approach (grade 5 to 9) by comparing responses to one definition-based item from the Olweus Bullying Questionnaire (OBQ) . Power imbalance was not significantly associated between the two measures. The authors concluded that the definition-based method might not detect some forms of power imbalance . Notwithstanding, the accuracy of each item to detect the power differential using the behavioral approach remains unclear . Cornell and Limber  claim that a satisfactory method to measure the power differential is yet to be identified.
Volk et al.  recommended that new instruments intended to measure bullying are based on insights gained from qualitative studies, and that specific forms of power are presented and validated. We have designed an instrument that uses this approach to measure the individual perception of power imbalance associated with repeated victimization at the level of the dyad. Our approach was innovative in that we worked with children aged 8 to 12 years to design an instrument and used factor analysis to explore the psychometric fit of items designed to measure the power imbalance component of bullying. The new instrument was implemented in an online survey.
The aim of this research was to establish the reliability and validity of the new instrument, named the Scale of Perceived Power Imbalance (SPPI). On the basis of previous work  we anticipated a negative association between perceived peer support and perceived experience of power imbalance by children who reported frequent victimization. We expected to find a moderate correlation of the SPPI with an existing measure of bully victimization, and that the SPPI would demonstrate reliability over time and invariance by gender, grade, and school.
Quantitative data collection occurred for Phase 1 in November 2015 and for Phase 2 in May and June 2016. Participants for each phase of the research were children in Grades 4 to 6 (aged 8 to 12 years) who were purposively sampled from low fee-paying private, independent schools in metropolitan Perth, Western Australia . In Phase 1 participants were recruited from all eligible students (N = 174) via an information letter sent to parents from one school. Active consent was received for 121 students, of these 111 (64%) students were at school on the day of data collection and completed the online survey (59% female, 41% male). Thirteen percent of participants spoke a language other than English at home.
In the second phase of the research letters were sent to the principals of 10 primary schools inviting participation in the research. Of these, four principals agreed for their school to participate . Participants were recruited from all eligible students (N = 642) via an information letter sent home by the school. Active consent was received for 351 participants, 14 students were absent on the day of data collection and one student did not assent to participate, therefore, a total of 337 (52%) students completed the online survey (51% male, 49% female). 24% of participants spoke a language other than English at home. The principal from one participating school (N = 174) was asked and agreed for data collection to include a retest after 2 weeks, active parental consent was received for 34% of students (n = 58), retesting was undertaken after 2 weeks, 50 of 58 participants (86%) completed the instrument a second time.
Ethics approval was obtained from the Curtin University Human Research Ethics Committee (RDHS-38-15) and the Principal of each participating school. Written informed consent was gained from the parents/guardians of the minors included in this study. Each school was informed in writing of the process, and was asked to arrange for immediate care by the school psychologist or chaplain should any child become upset as a result of answering the online questionnaire.
An online questionnaire was administered to participants in each school on one occasion in a classroom setting. On the day of data collection, the first author explained the research and the procedure to each group of students and answered any questions. Following this, students simultaneously completed the questionnaire. A research assistant was present to help children who had difficulty with reading and comprehension. A record of student questions was recorded in a research diary. A description of the measurement instruments included in the questionnaire is presented in the following section.
Design of the new Scale of Perceived Power Imbalance (SPPI)
The scale was developed for the local context using qualitative methodology . The scale was displayed in response to children’ s report of frequent victimization and followed the stem question, “When these things happened to you was the mean student …” Six items were assessed for face validity by children, and for content validity by reviewers with expertise in education, psychology, social work, and public health (see Fig. 1 and Table 1). Items included “good looking” and “in the most popular group.” Universal agreement of scale-level content validity was given by the expert reviewers using the method recommended by Polit and Beck . Two additional items were recommended by children who assessed face validity; “much stronger than you,” and “bigger than you,” and one additional item “with a group of students” was recommended at expert review (see Table 1). In Phase 1 the 9-item SPPI was displayed in response to children’s report of frequent aggression by the Adolescent Peer Relations Instrument (APRI)  or the Personal Experiences Checklist (PECK)  through the display logic function of Qualtrics™ online survey software. The binary response was 0 (no) or 1 (yes).
Exploratory factor analysis (EFA) (N = 111), revealed a 2-factor solution, 3-items represented physical power and 5-items represented social power. The data analytic approach and results of EFA are documented in the statistical analysis and results section. The item “really smart” was reworded to “really clever” because the factor loading was less than 0.5. An additional item “tougher than you” was added to form a 4-item subscale representing physical power (see Table 1). Expert reviewers (2 psychologists, 7 teachers) assessed the item content validity index (I-CVI) of the new instrument. The I-CVI of the four-item physical power factor was within the recommended range of .78 and 1.0 . The I-CVI of three items of the social power factors were below the recommended minimum of .78: “good at sport” (.55) and “really clever,” “good looking” (.67). Items were based on theory and were designed with children, who are considered experiential experts and have a right to be heard . This gave a priori evidence to keep the items unless repeated empirical testing found a statistically weak relation of the item to the factor .
In Phase 2 the SPPI was positioned in the online questionnaire following the APRI . The SPPI was included twice in the questionnaire and followed a frequency of victimization of 2 or 3 times a month as reported by the APRI; specifically the combined APRI verbal and physical victim scales (representing overt victimization), and the APRI social victim scale. The items of the APRI and the stem question are worded to capture the perceived goal of intent to harm, or feeling of hurt. This differentiates aggression from playful teasing.
Adolescent Peer Relations Instrument (APRI)
The APRI  was developed in Australia. Each six-item scale measures verbal, physical, and social victimization respectively and has demonstrated a clear factor structure and validity . The APRI was reliable with primary school aged children (Grade 5, 6; α =.81 to .90) . Following review of the instrument by children and an expert panel the wording of two items were adapted: “I was ridiculed” was changed to “students teased and made fun of me”, and “my property was damaged on purpose” was changed to “my property was hidden, taken or damaged on purpose.” A seventh item “a student said mean things behind my back” was added to the social victim scale to reflect language used by children in focus groups. The adapted 19-item APRI victimization instrument was answered using a six-point scale from 0 (never) to 5 (every day), responses of 4 and 5 were coded together (several times a week/everyday) to match the OBQ. Cronbach’s alpha scores were acceptable for each scale (.86 to .91) in the EFA research phase.
Personal Experiences Checklist (PECK)
The PECK was developed in Australia to measure children’s experience of being bullied by self-report (n = 647, age 8 to 15) . The PECK did not define bullying and did not include a measure of repetition, intent, or power imbalance. The authors observed “it is arguable whether these elements are adequately assessed in any current measures of bullying” . Items were simple to read and relevant to children of 8 years of age. Items included, “other students said mean things behind my back” and “other students teased me about things that aren’t true” . Children who responded 0 (sometimes) to 4 (most days) to any item on the PECK verbal-relational scale answered the new power imbalance instrument in Phase 2. Cronbach’s alpha of the relational-verbal scale in this phase of the research was good (α = .94).
Olweus Bullying Questionnaire (OBQ)
The OBQ  is used to report prevalence rates of bullying victimization in schools. A definition of bullying precedes the item, “How often have you been bullied at school in the last few months?” The response is coded on a five-point scale: 0 (I haven’t been bullied at school in the past couple of months) to 4 (Several times a week). The screening question classifies non-involved victims of bullying quite accurately (specificity of 94.3%) but is not good at identifying true victims (sensitivity of 56.3%) . Convergent validity of the new instrument was assessed with the OBQ screening question. The rates of bullying reported by multiple items are approximately double that of the single OBQ question . The OBQ was placed last in the questionnaire so that children were not primed to think about bullying when answering behavior-based questions .
Perceptions of Peer Support Scale (PPSS)
The PPSS  measures children’s perception of friendships at school, and was included to measure the discriminant validity of the SPPI. Items are scored on a 3-point scale, 0 (No) to 2 (A lot of the time). A higher score indicates higher perceived peer support. The reliability of the PPSS with Grade 6 children in Western Australia was high (α = .92, n = 1163) . Perceived peer support predicted the regular bullying (p < .05) and occasional bullying (p < .001) of other students. For example, children who reported that they bully others were more likely to answer that someone would chose them on their team “lots of times” than “sometimes” or “never” . We therefore expected that children who report victimization with power imbalance would experience low levels of peer support. The PPSS was placed before the APRI in the online questionnaire.
This paper reports the EFA and CFA that followed an extensive process of item development for the SPPI that emerged through qualitative research with children ages 9 to 11. The rationale for Phase 1 was to explore the factor structure of items identified through qualitative analysis . The rationale for Phase 2 was that CFA would confirm a two-factor structure of perceived power imbalance, measuring physical and social characteristics of power. A minimum of five cases per variable was maintained throughout factor analysis .
Data analytic approach for exploratory factor analysis (EFA)
Data were assessed for frequency of responses to each item and missing data. EFA of the SPPI was run in SPSS using Principal Axis Factoring (PAF) with promax rotation. A Kaiser-Meyer-Olkin (KMO) value of .6 represented a minimum value for sampling adequacy . Parallel analysis  was conducted, in conjunction with a scree plot, to determine the number of underlying dimensions . Missing data were excluded listwise, and factor loadings less than .30 were suppressed .
EFA was continued in MPlus to confirm the factor structure identified in SPSS. EFA was based on the weighted least mean squares (WLSMV) estimator to account for the binary data and small sample sizes with non-normally distributed data . Goodness of fit was reported with: Normed chi-square values < 3 ; comparative fit index (CFI > .90) ; Root-mean-square error of approximation (RMSEA < .08 [or a 90% CI that captures .08] indicates a reasonable fit; < .05 [or a 90% CI that captures .05] indicates a good fit); Standardised Root Mean Square Residual (SRMR ≤ .08) . A minimum factor loading of .32 was accepted, .55 or higher was considered good . The fit of items to relevant subscales was assessed and decisions made on initial items to be included . Results were interpreted to give names to each subscale .
Data analytic approach of CFA
Data analysis began in SPSS; data were assessed for frequency of responses to each item and tolerance (values < .10 indicate multicollinearity) . Analysis continued in MPlus Version 7 . Missing data were deleted listwise, affecting 4.5 to 5.2% of responses for any one analysis. Fit indices as reported for Phase 1, and communality (R2 > .50) suggesting that 50% of the variance of the item can be explained by the factor to which it is linked . Composite reliability of each factor was calculated . Consistent with the method used by Marsh et al.  items were free to cross-load onto factors in the data set.
Second, we used multi-group analysis to determine how consistent the factor structure was over gender and grade. The maximum likelihood robust estimator (MLR) was used to account for incomplete and non-normal distribution of data . Invariance of mean and covariance structures were assessed using the method detailed by Byrne . Taking into account missing data, group sizes for the analysis were for gender: girl, n = 72; boy, n = 54, and for grade: grade 4, n = 50; grades 5–6 (combined to account for smaller classroom sizes in some participating schools), n = 77. Invariance testing began with a configural model incorporating the baseline model for each group. There were no residual covariance’s to constrain equal in the configural model. Invariance analysis then tested for equivalence of factor loadings, covariance structures, intercepts, and latent factor means. The Satorra-Bentler Scaled Chi-Square  was used in Chi-square difference tests. Invariance was suggested by a non-significant corrected MLR chi-square value (p > .05) .
Third, test-retest reliability was assessed using Spearman’s rho (rs) to account for non-normal distribution of data. Test-retest reliability was calculated for the variable bullied total at Time 1 and Time 2. Bullied total represented self-report of frequent victimization and perceived power imbalance and was calculated by the combined ranked score of the 19-item APRI (0–4) and the SPPI (0 or 1): 0 (Not bullied) (APRI =0 or 1, Power = 0); 1 (Frequent victimization without power imbalance) (APRI = 2, 3, or 4, Power = 0); 2 (2 or 3 times a month with power imbalance) (APRI =2, Power = 1); 3 (Once a week with power imbalance) (APRI =3, Power = 1); 4 = (Several times a week/Every day with power imbalance) (APRI =4, Power = 1). Total score scales were whole numbers between 0 and 4, and matched the OBQ .
The final stage of analysis assessed the construct validity of the SPPI. Analysis began in MPlus allowing control for measurement error by assessing validity at the latent construct level. Discriminant validity was assessed by low correlations of a 3-factor model of social victimization (APRI), social power, and physical power (SPPI) in MPlus; and by negative correlations of a 3-factor model of children’s perception of peer support (PPSS; Ladd et al., 1996), social power, and physical power (SPPI) in MPlus. Convergent validity of the SPPI with the OBQ prevalence item was assessed in SPSS; Spearman’s rho (rs) was used to account for non-normal distribution of data.
Exploratory factor analysis
The following number of children reported frequent victimization by each scale: APRI verbal (n = 48), APRI physical (n = 28), APRI social (n = 36), and PECK verbal-relational (n = 55). When missing data were deleted listwise, the number of participants who answered the power imbalance scale in response to the APRI verbal (n = 44), physical (n = 28), and social scales (n = 34) was insufficient to run EFA, based on a minimum of five cases per item  or the minimum Kaiser criterion for sampling adequacy of .60 . Results of the EFA are therefore reported on the power imbalance items that followed the PECK Verbal-relational scale (n = 51). Data were normally distributed. EFA in SPSS resulted in a KMO of .576 suggesting a “mediocre” fit of the data  and resulted in three eigenvalues ≥ 1. Because the eigenvalue criterion can lead to the extraction of too many factors  parallel analysis was calculated, resulting in a 2-factor structure. EFA of the 2-factor model in MPlus resulted in acceptable fit (normed χ2 = 1.25, RMSEA = .070 [90%CI = .00 to 0.148], CF1 = .942, SRMR = .119). The 90% confidence interval was wide; it included the value of .05 suggesting that the result reflected a small sample size. Factor loadings are shown in Table 2, the correlation between factors was −.039. Item 5 “trying to be more popular” did not load onto either factor.
Confirmatory factor analysis
CFA of the hypothesized 9-item 2-factor model of the SPPI did not provide a good fit to the data. On examination, the item “good at sport” had a low communality when answered in response to either physical/verbal or social victimization (R2 = .183, and .034 respectively), and did not load onto the factor when answered in response to the APRI social victim scale (standardized loading = .184). This was consistent with a high tolerance of the item (.919) indicating the uniqueness of the item. “Good at sport” was excluded from further analysis and an 8-item model of the SPPI was tested for fit.
Baseline model fit of the 8-item SPPI in response to verbal/physical victimization was acceptable (n = 146: normed χ2 =1.96, RMSEA = .081 [90%CI = .041 to .119], CF1 = .906). Standardized factor loadings ranged from .353 “really clever” to .941 “tougher than you”. The R2 for “really clever”, and “good looking” were low (see Table 3). In response to social victimization factor loadings of the baseline model of the 8-item SPPI were adequate (n = 127: normed χ2 =1.2, RMSEA = .04 [90% CI = .00 to .091], CF1 = .993). The communality of Items “really clever”, and “good looking” remained low (see Table 3). This is addressed in the discussion.
Multi-group analysis of the 8-item SPPI was conducted with the MLR estimator using the baseline models of best fit to assess invariance of the instrument. In each data set, the configural model of the SPPI answered in response to children’s self-report of verbal/physical victimization was inadmissible because of a negative residual variance for the item “in the most popular group” (gender,-.257; grade at school, −.043). The unstandardized factor loading for gender was high for boy (39.522, standardized =1.451), and for Grade 5–6 (9.647, standardized =1.085).
Invariance was therefore assessed on the 8-item SPPI that was answered in response to self-report of social victimization. Invariance of the mean and covariance structures was demonstrated across gender as evidenced by a non-significant corrected χ2 between the nested model and the comparison model at each specified level of the model (p > .05). Invariance of the mean and covariance structures was likewise demonstrated between grade at school (Grade 4, and Grades 5–6) as evidenced by a non-significant corrected χ2 between the nested model and the comparison model at each specified level of the model (p > .05) (reported in Table 4).
Test-retest reliability (2-week interval) of the combined ranked score (n = 50) from the APRI and SPPI was indicated by a moderately strong correlation when answered in response to self-report of verbal/physical victimization rs =.773 (p < .001), and a strong correlation in response to social victimization rs =.841 (p < .001).
Construct validity of the SPPI was assessed in relation to the preceding APRI social victim scale. CFA of the 3-factor model of social victimization, social power, and physical power resulted in a good fit (n = 124: normed χ2 =1.09, RMSEA = .027 [90%CI = .00 to .057], CF1 = .983). Correlations between factors were low suggesting discriminant validity of the two latent factors of power imbalance (.294) and between the APRI social victim scale and power imbalance (social power = .325, physical power = .357).
Discriminant validity was supported by negative correlations between children’s perception of peer support (PPSS; Ladd et al., 1996) and each latent factor of the SPPI: social victimization (physical power = −.292, social power = −.175), CFA of the 3-factor model (n = 120: normed χ2 =1.0, RMSEA = .009 [90%CI = .00 to .041], CF1 = .997); and verbal/physical victimization (physical power = −.138, social power = .035), CFA (n = 140: normed χ2 =1.09, RMSEA = .025 [90%CI = .00 to .046], CF1 = .957).
Convergent validity was supported by a moderate correlation between the screening item of the OBQ  and the two latent factors of the eight-item SPPI (overt and social victimization) (n = 331, rs =.533). Fifty five per cent of children reported victimization in a relationship of power imbalance by either the SPPI or the OBQ. The overlap in self-report of power imbalance between instruments was 20.2% (see Table 5). Fewer children reported being bullied by the OBQ screening item (23.8%) in comparison to those who reported frequent victimization by the combined ranked score of the 19-item APRI and the eight-item SPPI (50.8%) (see Table 5).
An additional finding was that 11% of children who reported frequent social victimization and 8% of those who reported frequent overt victimization did not report an experience of power imbalance by the 8-item SPPI (see Table 6).
Analyses of the items in the SPPI resulted in a two-factor structure. Social power measured the peer-valued characteristics of clever, appearance, athleticism, belonging to the popular group, and being with a group. Physical power measured the physical characteristics of age, size, strength, and toughness. Physical measures of power imbalance are more obvious, thus easier to quantify, than peer-valued characteristics, as evidenced by consistent fit to the data. Likewise, the group aspects of social power displayed consistently high factor loadings in response to children’s report of overt and physical victimization. The peer-valued characteristics of social power, “good at sport,” “clever,” and “good looking,” however, displayed inconsistent factor loadings and low communalities.
The factor social power measured characteristics that are valued by peers and associated with peer acceptance or belonging. The a priori factor structure was supported by the extant literature. The item “good at sport” did not load onto the social power factor in CFA. This is in contrast to a high response rate of yes for children answering this item in the SPPI (30% to verbal/physical victimization and 22% to social victimization). When Green et al.  previously measured power imbalance by individual items, children also responded most frequently to the item “more athletic” (39.3%). The reliability of the measure of power imbalance was however, not reported by the authors. The low factor loading for “good at sport” in our research does not mean that athleticism is not associated with power imbalance, the low communality and high tolerance (0.919) of the item suggested, however, that it was not related to the other items of social power .
Felix et al.  found that the item “smart in school work” was possibly not a good question to address power imbalance based on a small response rate to the item. Despite this the item was retained in the CBVS . Similarly, in our research less than 20% of the variance of the item “really clever” was explained by the social power factor to which it was linked. Items such as “good at sport” and “clever”, and their opposites, might reflect an academic or ability-related form of power imbalance. This warrants further investigation. Moreover, in focus groups, children referred to smart as “getting their way out of trouble” by hiding the behavior from adults . This social manipulation resulted in feelings of hopelessness and the inability to escape the situation by the victim, increasing the power of the perpetrator over the victim. This is consistent with recent qualitative research. Teachers may not recognize perpetration of aggression by popular students and may even place the responsibility for victimization on the targeted student . Resilience is fostered when children receive social support. It is, however, possible that some children perceive that the teacher is supporting the child who is doing the bullying and is therefore not available as a source of social support . This form of power is difficult to assess, and has been difficult to quantify, but must not be ignored because it is associated with poor health outcomes . For this reason, we propose that further investigation into “smart” or “clever” as a form of power imbalance might inform the development of prevention strategies, including the promotion of resilience.
The item “good looking” was used as a measure of power in the revised CBVS  and was found to be associated with power in focus group analysis . However, we found that only 22% of the variance of the item was explained by the social power factor to which it was linked. In focus groups, children referred to “looks”, but beyond appearance, looks also related to clothes, shoes, smart phones, and possessions. These are all consistent with the attribution of appearance to social power . These items were kept because: a) they may be providing important information regarding the latent construct; and b) the amount of variance they contribute may be important, even if their factor loading was consistently low .
Cascardi et al.  have suggested that popularity is potentially a superficial feature of power imbalance. Our initial invariance analysis models were not identified for the SPPI items answered in response to the verbal/ physical victimization sub-scale of the APRI due to very high factor loadings on the item “in the most popular group”. These results suggest that boys were likely to experience a very high experience of power imbalance when the aggressor belonged to the popular group. Girls in Grades 5 and 6 were similarly more likely to experience a high power imbalance if the aggressor belonged to the most popular group. This is consistent with qualitative findings that popularity is a key influence on bullying within the group .
Preliminary support was found for the construct validity of the SPPI. Discriminant validity was supported by the strong 3-factor structure of the PPSS, SPPI physical power sub-scale, and social power sub-scale. As expected the correlation between perceived peer support and perceived power imbalance was very low, suggesting that children who felt supported by peers experienced lower rates of victimization with power imbalance. Consistent with previous research we found that double the number of children reported being bullied when they answered questions about individual types of bullying compared to the screening item of the OBQ .
This study used a single method of anonymous self-report and there was no verification of victimization by a different source, for example peers or teachers. There are, however, ethical considerations with peer nomination . Agreement between the multiple approaches of quantitative analysis did support the 2-factor solution of power related to peer-valued characteristics and physical characteristics. However, thematic analysis of focus group discussion resulted in three forms of power imbalance reflecting physical characteristics (age), peer valued characteristics, and group membership and position . Future research might benefit from exploring group and peer-valued characteristics of power as different constructs.
The SPPI was developed for the local context, however the qualitative phase of the study was reported in the context of prior international research . There is a great need to reduce the harm of bullying in schools . Victims of bullying experience a power imbalance that hinders their perceived ability to stop the repeated aggression. For this reason it is necessary to continue investigating the nature and measurement of power imbalance.
A limitation of this study is the focus on power imbalance associated with traditional forms of bullying and not cyberbullying. The form of power imbalance is likely to differ between traditional and cyberbullying . Bauman et al. proposed that the imbalance of power associated with cyberbullying should not be assessed by self-report where possible because of the difficulty in inferring power imbalance from a subjective response to online communication . In contrast, it is widely recommended that self-report will provide the most accurate measure of the power imbalance in traditional bullying [9, 20, 38]. This research focused on providing evidence on the reliability and validity of a specific measurement technique to attain children’s self-reported experience of power imbalance associated with traditional forms of bullying .
There is reasonable agreement among researchers concerning the influence of physical factors, such as age, size, and toughness, on power imbalance. These have long been acknowledged in the literature and were found to have a strong factor structure in the analysis undertaken in this study. The influence of social factors, such as being good looking, smart, or popular, on power imbalance is much less clear for a number of reasons. Firstly, social forms of power are subtle in nature and not easily recognized by authority figures. Secondly, social power draws strength through peer group dynamics. This is supported in this study by the strong factor loadings of items that were specific to the peer-group. Thirdly, the factors that comprise social power are often highly valued by peers. The characteristics, themselves, are neutral; it is the status that peers attribute to these characteristics that confers power on those who possess them. Further research is required to better understand the influence of social factors on power imbalance. Specifically, we need to understand how social power reflects: 1) the dominance goals of the perpetrator; 2) group functioning; and 3) the perceived inability of the targeted child to overcome repeated victimization.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the first author on reasonable request.
Adolescent Peer Relations Instrument
Confirmatory factor analysis
Californian Bully Victim Scale
Exploratory factor analysis
Item content validity index
Olweus Bullying Questionnaire
Personal Experiences Checklist
Perceptions of Peer Support Scale
Scale of Perceived Power Imbalance
Bauman S, Underwood MK, Card NA. Definitions: another perspective and a proposal for beginning with cyberaggression. In: Bauman S, Cross D, Walker J, editors. Principles of cyberbullying research: definitions, measures, and methodology. New York, Hove: Routledge, Taylor & Francis; 2013. p. 41–5.
Cascardi M, Brown C, Iannarone M, Cardona N. The problem with overly broad definitions of bullying: implications for the schoolhouse, the statehouse, and the ivory tower. J Sch Violence. 2014;13(3):253–76.
Olweus D. The revised Olweus bullying questionnaire. Bergen, Norway: Research Center for Health Promotion (HEMIL), University of Bergen, N−5020 Bergen, Norway: Mimeo; 1996.
Volk AA, Veenstra R, Espelage DL. So you want to study bullying? Recommendations to enhance the validity, transparency, and compatibility of bullying research. Aggress Violent Behav. 2017;36:34–43.
Smith PK, del Barrio C, Tokunaga RS. Definitions of bullying and cyberbullying: how useful are the terms? In: Bauman S, Cross D, Walker J, editors. Principles of cyberbullying research: definitions, measures, and methodology. New York, Hove: Routledge, Taylor & Francis; 2013. p. 26–40.
Nelson HJ, Burns SK, Kendall GE, Schonert-Reichl KA. The factors that influence and protect against power imbalance in covert aggression and bullying among preadolescent children: a thematic analysis. J Sch Nurs. 2018;34(4):281–91.
Finkelhor D, Turner HA, Hamby S. Let’s prevent peer victimization, not just bullying. Child Abuse Negl. 2012 Apr;36(4):271–4.
United Nations Educational, Scientific and cultural organization. Behind the numbers: ending school violence and bullying. France: UNESCO Education Sector; 2019. p. 70.
Gladden RM, Vivolo-Kantor AM, Hamburger ME, Lumpkin CD. Bullying surveillance among youths: uniform definitions for public health and recommended data elements, version 1.0. Atlanta: National Center for Injury Prevention and Control, Centers for Disease Control and Prevention and U.S. Department of Education; 2014. Available from: http://www.cdc.gov/violenceprevention/pdf/bullying-definitions-final-a.pdf. Cited 2015 Sep 7
Hunter SC, JME B, David W. Perceptions and correlates of peer-victimization and bullying. Br J Educ Psychol. 2007;77(4):797–810.
Esbensen F-A, Carson DC. Consequences of being bullied: results from a longitudinal assessment of bullying victimization in a multisite sample of American students. Youth Soc. 2009;41(2):209–33.
Ybarra ML, Espelage DL, Mitchell KJ. Differentiating youth who are bullied from other victims of peer-aggression: the importance of differential power and repetition. J Adolesc Health. 2014;55(2):293–300.
Swearer SM, Hymel S. Understanding the psychology of bullying: moving toward a social-ecological diathesis–stress model. Am Psychol. 2015;70(4):344–53.
Bonanno RA, Hymel S. Beyond hurt feelings: investigating why some victims of bullying are at greater risk for suicidal ideation. Merrill-Palmer Q. 2010;56(3):420–40.
Rodkin PC, Espelage DL, Hanish LD. A relational framework for understanding bullying: developmental antecedents and outcomes. Am Psychol. 2015;70(4):311–21.
Andreou E. Social preference, perceived popularity and social intelligence: relations to overt and relational aggression. Sch Psychol Int. 2006;27(3):339–51.
Eccles JS. The development of children ages 6 to 14. Futur Child. 1999;9:30–44.
Cuadrado-Gordillo I. Repetition, power imbalance, and intentionality: do these criteria conform to teenagers’ perception of bullying? A role-based analysis. J Interpers Violence. 2012;27(10):1889–910.
Vaillancourt T, McDougall P, Hymel S, Krygsman A, Miller J, Stiver K, et al. Bullying: are researchers and children/youth talking about the same thing? Int J Behav Dev. 2008;32(6):486–95.
Olweus D. School bullying: development and some important challenges. Annu Rev Clin Psychol. 2013;9(1):751–80.
Kert AS, Codding RS, Tryon GS, Shiyko M. Impact of the word “bully” on the reported rate of bullying behavior. Psychol Sch. 2010;47(2):193–204.
Achenbach TM. Checklists and rating scales. In: McLeod BD, Jensen-Doss AE, Ollendick TH, editors. Diagnostic and behavioral assessment in children and adolescents, a clinical guide. New York: The Guilford Press; 2013. p. 133–63.
Felix ED, Sharkey JD, Green JG, Furlong MJ, Tanigawa D. Getting precise and pragmatic about the assessment of bullying: the development of the California bullying victimization scale. Aggress Behav. 2011;37(3):234–47.
Green JG, Felix ED, Sharkey JD, Furlong MJ, Kras JE. Identifying bully victims: definitional versus behavioral approaches. Psychol Assess. 2013;25(2):651–7.
Malecki CK, Demaray MK, Coyle S, Geosling R, Rueger SY, Becker LD. Frequency, power differential, and intentionality and the relationship to anxiety, depression, and self-esteem for victims of bullying. Child Youth Care Forum. 2015;44(1):115–31.
Cornell DG, Limber SP. Law and policy on the concept of bullying at school. Am Psychol. 2015;70(4):333–43.
Burns S. An analysis of upper primary school children who bully others [Ph. D]. Perth: Curtin University of Technology; 2007.
ACARA. Guide to understanding ICSEA (Index of Community Socio-educational Advantage) values. 2015. Available from: http://www.acara.edu.au/_resources/Guide_to_understanding_icsea_values.pdf. Cited 2016 Sep 21.
Nelson HJ, Kendall GE, Burns SK, Schonert-Reichl KA, Kane RT. Development of the student experience of teacher support scale: measuring the experience of students who report bullying. Int J Bullying Prev. 2019;1(2):99–110.
Nelson HJ, Burns SK, Kendall GE, Schonert-Reichl KA. Preadolescent children’s perception of power imbalance in bullying: a thematic analysis. PLoS One. 2019;14(3):e0211124.
Polit DF, Beck CT. The content validity index: are you sure you know what’s being reported? Critique and recommendations. Res Nurs Health. 2006;29(5):489–97.
Parada RH. Adolescent peer relations instrument: a theoretical and empirical basis for the measurement of participant roles in bullying and victimisation of adolescence: an interim test manual and a research monograph: a test manual. Publication unit, self-concept enhancement and learning facilitation (SELF) research Centre. Sydney: University of Western Sydney; 2000.
Hunt C, Peters L, Rapee RM. Development of a measure of the experience of being bullied in youth. Psychol Assess. 2012;24(1):156–65.
UNICEF. Convention on the Rights of the Child. 1989. Available from: https://www.unicef.org/crc/. Cited 2017 Apr 6.
Bollen KA. Evaluating effect, composite, and causal indicators in structural equation models. MIS Q. 2011;35(2):359–72.
Marsh HW, Nagengast B, Morin AJS, Parada RH, Craven RG, Hamilton LR. Construct validity of the multidimensional structure of bullying and victimization: an application of exploratory structural equation modeling. J Educ Psychol. 2011;103(3):701–32.
Finger LR, Yeung AS, Craven RG, Parada RH, Newey K. Adolescent peer relations instrument: assessment of its reliability and construct validity when used with upper primary students. Brisbane: Australian Association for Research in Education; 2008. p. 9.
Vaillancourt T, Trinh V, McDougall P, Duku E, Cunningham L, Cunningham C, et al. Optimizing population screening of bullying in school-aged children. J Sch Violence. 2010;9(3):233–50.
Huang FL, Cornell DG. The impact of definition and question order on the prevalence of bullying victimization using student self-reports. Psychol Assess. 2015;27(4):1484–93.
Ladd GW, Kochenderfer BJ, Coleman CC. Friendship quality as a predictor of young children’s early school adjustment. Child Dev. 1996;67:1103–18.
Russell DW. In search of underlying dimensions: the use (and abuse) of factor analysis. Personal Soc Psychol Bull. 2002;28(12):1629–46.
Beavers AS, Lounsbury JW, Richards JK, Huck SW, Skolits GJ, Esquivel SL. Practical considerations for using exploratory factor analysis in educational research. Pract Assess Res Eval. 2013;18(6):1–13.
O’Connor BP. SPSS and BAS programs for determining the number of components using parallel analysis and Velicer’s MAP test. Behav Res Methods Instrum Comput. 2000;32(3):396–402.
Costello AB, Osborne JW. Best practices in exploratory factor analysis: four recommendations for getting the most from your analysis. Pract Assess Res Eval. 2005;10(7):9.
Byrne BM. Structural equation modeling with MPlus: Basic concepts, application, and programming. Great Britain: Routledge; 2012. p. 412.
Kline RB. Principles and practice of structural equation modeling. New York: The Guilford Press; 2005.
Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model Multidiscip J. 1999;6(1):1–55.
Tabachnick BG, Fidell LS. Using multivariate statistics. 6th. Harlow: Pearson; 2013. p. 1072.
Brown TA, Moore MT. Confirmatory factor analysis. In: Hoyle RH, editor. Handbook of structural equation modeling. New York: Guilford Press; 2012. p. 361–79.
Colwell S. The composite reliability calculator; 2016. https://doi.org/10.13140/RG.2.1.4298.088.
Kline RB. Principles and practice of structural equation modeling. 3rd ed. New York: Guilford Press; 2011. p. 427.
Muthén LK, Muthén BO. MPlus user’s guide: statistical analysis with latent variables. 7th ed. Los Angeles: Muthén & Muthén; 2015. p. 870.
Raykov T. Estimation of composite reliability for congeneric measures. Appl Psychol Meas. 1997;21(2):173–84.
Satorra A, Bentler PM. Ensuring positiveness of the scaled difference chi-square test statistic. Psychometrika. 2010;75(2):243–8.
Kaiser HF. An index of factorial simplicity. Psychometrika. 1974;39(1):31–6.
Rosen LH, Scott SR, DeOrnellas K. Teachers’ perceptions of bullying: a focus group approach. J Sch Violence. 2017;16(1):119–39.
Kiefer SM, Wang JH. Associations of coolness and social goals with aggression and engagement during adolescence. J Appl Dev Psychol. 2016;44:52–62.
Little TD, Lindenberger U, Nesselroade JR. On selecting indicators for multivariate measurement and modeling with latent variables: when “good” indicators and bad and “bad” indicators are good. Psychol Methods. 1999;4(2):192–211.
Burns S, Maycock B, Cross D, Brown G. The power of peers: why some students bully others to conform. Qual Health Res. 2008;18(12):1704–16.
Lee S, Kim C-J, Kim DH. A meta-analysis of the effect of school-based anti-bullying programs. J Child Health Care. 2015;19(2):136–53.
The authors are grateful to Roberto Parada, who has given permission to use and modify the Adolescent Peer Relations Instrument, to Caroline Hunt who has given permission to use the Personal Experiences Checklist, and to Dan Olweus who has given permission to use the Olweus Bullying Questionnaire in this research.
HN would like to acknowledge the contribution of an Australian Government Research Training Program Scholarship in supporting this research. The stipend scholarship provided a living allowance for HN while completing post-graduate study.
Ethics approval and consent to participate
Ethics approval was obtained from the Curtin University Human Research Ethics Committee (RDHS-38-15) and the Principal of each participating school. Written informed consent was obtained from the parents/guardians of the minors included in this study.
Consent for publication
Sharyn Burns is a member of the editorial board of BMC Public Health.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Nelson, H.J., Kendall, G.E., Burns, S.K. et al. Measuring 8 to 12 year old children’s self-report of power imbalance in relation to bullying: development of the Scale of Perceived Power Imbalance. BMC Public Health 19, 1046 (2019). https://doi.org/10.1186/s12889-019-7375-z