Agreeableness reflects a person's orientation toward cooperation, empathy, and social harmony. High scorers tend to be warm, trusting, helpful, and conflict-averse. They prioritize others' needs and are skilled at maintaining interpersonal relationships. Low scorers tend to be more competitive, skeptical, and blunt — prioritizing their own goals over social harmony. This does not make them malicious; it makes them better suited to adversarial negotiations, critical evaluation, and situations where uncomfortable truths must be spoken.
Agreeableness has interesting interactions with gender norms. On average, women score somewhat higher than men on Agreeableness across most cultures — though there is enormous overlap between the distributions. Highly agreeable individuals earn less money on average, partly because they negotiate less assertively. A study by Judge, Livingston, and Hurst (2012) found that disagreeable men earned substantially more than agreeable men, while disagreeable women paid a smaller premium — likely reflecting different social expectations.
In team settings, Agreeableness is associated with better group cohesion and lower conflict, but teams composed entirely of highly agreeable people may engage in groupthink. The most effective teams often combine high-Agreeableness members (who maintain relationships) with lower-Agreeableness members (who challenge assumptions).
Neuroticism — sometimes reversed and labeled "Emotional Stability" — measures the tendency to experience negative emotions like anxiety, sadness, anger, and self-consciousness. High-Neuroticism individuals experience these states more frequently and intensely, and they take longer to return to baseline after emotional arousal. This is not a character flaw — it is a dimension of emotional reactivity that has costs and benefits depending on context.
High Neuroticism is one of the strongest predictors of psychological distress and is elevated in most anxiety and mood disorders. However, moderate Neuroticism also correlates with conscientiousness about risks, alertness to problems, and empathy — a highly neurotic person may be a better safety officer, a more vigilant caregiver, or a more thorough editor precisely because their brain is tuned to detect and respond to problems.
Facets of Neuroticism include anxiety, angry hostility, depression, self-consciousness, impulsiveness, and vulnerability to stress. These facets can dissociate — someone can be highly anxious (prone to worry) while rarely experiencing angry hostility, or vice versa. Understanding which specific facets are elevated is more clinically and practically useful than a single Neuroticism score.
Unlike the Myers-Briggs Type Indicator, which assigns you to one of 16 discrete types, Big Five assessments produce continuous percentile scores. If you score in the 72nd percentile on Conscientiousness, it means you scored higher than approximately 72% of the reference population. There is no threshold that makes you "conscientious" or "not conscientious" — it is a spectrum, and most people cluster around the middle.
Standard Big Five assessments typically include 44 to 300 items (depending on the version) rated on a 5-point Likert scale from "strongly disagree" to "strongly agree." Shorter instruments like the BFI-10 (10 items) or BFI-44 (44 items) are used in research contexts where time is limited. The NEO-PI-3, with 240 items, measures not only the five broad factors but also 30 specific facets (6 per factor), giving a much richer picture.
Test-retest reliability for the Big Five is strong — typically in the 0.70–0.85 range over periods of weeks to months, meaning most people score very similarly when retested. Over years and decades, scores do shift gradually (more on this in the misconceptions section), but within a measurement window, the Big Five is considerably more stable than many situational assessments.
The Myers-Briggs Type Indicator (MBTI) dominates popular culture and corporate HR departments. The Big Five dominates academic research. This disparity exists for several clear reasons.
| Dimension | Big Five | MBTI |
|---|---|---|
| Score format | Continuous percentile scores | Discrete binary categories (E/I, S/N, T/F, J/P) |
| Test-retest reliability | High (0.70–0.85 over months) | Moderate–low (up to 50% get different types on retest) |
| Predictive validity | Strong for job performance, health, relationships | Limited validated predictive power in research literature |
| Cross-cultural validity | Replicated across 50+ languages/cultures | Less systematically validated cross-culturally |
| Factor basis | Derived from empirical factor analysis | Based on Jungian theory, not factor analysis |
| Granularity | Five dimensions + 30 facets | Four dimensions |
The MBTI's central problem is its use of forced-choice binary categories where continuous spectra clearly exist. Someone who scores 51% "Thinking" and 49% "Feeling" gets typed as a T, while someone who scores 95% Thinking also gets typed T — despite being radically different. Research has repeatedly shown that most people cluster near the midpoints on MBTI dimensions, which means the categorization discards the most statistically common cases. A 2003 review in the Journal of Research in Personality concluded that MBTI type assignments are not consistent over time and that the instrument has limited predictive validity for job performance.
The Big Five is not flawless. Critics note that five factors may not be the optimal number (some researchers argue for six factors, including Honesty-Humility, as in the HEXACO model), and the model is descriptive rather than explanatory — it tells you what personality differences exist, but not why they exist or how they can change. But as a practical measurement tool, it is substantially more reliable and valid than type-based alternatives.
Big Five scores predict occupational fit better than any other personality measure. Here are some empirically-supported patterns:
In romantic relationships, research finds that similarity on Agreeableness, Conscientiousness, and Emotional Stability (low Neuroticism) predicts relationship satisfaction. Neuroticism is the single strongest personality predictor of relationship dissatisfaction — particularly the combination of one high-Neuroticism partner with one low-Neuroticism partner, who often interpret the same situations very differently. For friendships and social networks, Extraversion matters more for how many relationships you form, while Agreeableness matters more for the quality and depth of those relationships.
High Neuroticism paired with low Conscientiousness is associated with the worst health outcomes — more frequent negative emotional experiences and fewer compensating health behaviors. High Conscientiousness is one of the most robust predictors of longevity and physical health maintenance. Openness predicts engagement with preventive health information and flexibility in adapting to health challenges.
A personality test is a hypothesis generator, not a verdict. Here is how to extract genuine value from your Big Five scores:
A low Conscientiousness score does not mean you are lazy — it means that environments relying on rigid schedules and meticulous documentation will require more deliberate effort from you than from naturally high-Conscientiousness people. The actionable question is: what systems or structures can compensate for naturally lower organization, and what environments will demand less from this trait? Similarly, a high Neuroticism score is a prompt to build in more stress-regulation practices — exercise, sleep, therapy, structured problem-solving — not a sentence to a life of anxiety.
If you are using an instrument that measures facets (sub-dimensions within each factor), those details are often more actionable than the broad factor scores. Knowing you score high on "anxiety" but low on "angry hostility" within Neuroticism tells you something different than a single Neuroticism number. Knowing your Conscientiousness is high on "achievement striving" but lower on "order" points toward a specific strategy (pair yourself with an organized collaborator; use external systems rather than relying on internal neatness).
In professional settings, knowing your team members' broad Big Five profiles can help anticipate communication friction. A high-Conscientiousness manager supervising a low-Conscientiousness creative professional will experience real friction — not because either is deficient, but because their natural rhythms differ. Awareness creates space for explicit agreements rather than attributions of bad faith.
Because personality does shift across the lifespan and in response to major life changes, taking the Big Five periodically (every few years, or after significant life transitions) gives you a longitudinal view of your own development. Research suggests that intentional behavioral practice in a direction opposite to your natural tendencies can — over time — shift trait-level scores modestly. This is not about fighting who you are; it is about targeted practice in domains where your natural profile creates recurring problems.
Self-report personality measures capture how you see yourself. Research by Vazire and Mehl (2008) and others shows that for some traits — especially Extraversion and Conscientiousness — observer ratings (how others see you) are equally or more predictive of objective behaviors than self-reports. Asking a trusted colleague or partner to rate you on the same items you rated yourself often surfaces blind spots worth examining.
The Big Five is a tool for self-knowledge and interpersonal understanding — not a fixed label or a prediction engine. Used well, it is one of the most evidence-backed frameworks available for understanding the patterns in how you think, feel, and behave across the situations of your life.
Get your OCEAN percentile scores with detailed trait breakdowns and practical insights.
Start the Big Five Test →