There’s a popular belief in team building that the right answer is to hire the best individuals. Find the highest performers, put them together, and a great team follows naturally. The research on team composition doesn’t support this. What happens at the team level is not simply an aggregate of individual traits — it’s a function of how those traits interact, which creates dynamics that can’t be read from a set of individual profiles.
Understanding this doesn’t mean personality is the only variable that matters in team performance. It means it’s a variable that can be understood and used deliberately, rather than left to chance.
The meta-analytic picture
The most comprehensive review of personality and team performance came from Bell (2007), a meta-analysis covering 58 independent samples with over 4,700 teams. Bell examined how both the mean level and variability of Big Five traits across team members predicted team performance, viability, and cohesion.
The headline finding: team-level conscientiousness mean was the strongest and most consistent predictor of team performance across task types. When the average conscientiousness of a team is high, teams perform better — they’re more organised, more reliable about commitments, and more likely to follow through on shared tasks.
Team-level agreeableness mean was the second strongest predictor, particularly for team cohesion and viability (whether the team stays together and functions over time). Agreeable teams communicate more constructively, handle friction better, and maintain collaboration under pressure.
Extraversion and openness effects depended heavily on task type. For creative tasks, higher openness means predicted better outcomes. For coordination-heavy tasks with clear roles, openness variability sometimes introduced friction. Extraversion helped in tasks requiring communication and outward-facing work; it mattered less in independent-contribution settings.
The mean-versus-variance question
Bell’s analysis introduced a distinction that’s particularly important for practitioners: mean level (the average trait score of the team) versus variance (how much spread there is across members).
For conscientiousness and agreeableness, both the mean and the minimum matter. One low-conscientiousness team member can meaningfully undermine team performance even when the rest of the team is high — because reliability is a chain. A team waiting on a low-conscientiousness member for a shared deliverable absorbs that person’s risk.
Researchers have called this the “weakest link” effect: for traits where the task requires everyone to contribute, the lowest individual score can constrain overall team output more than the average suggests.
For openness and extraversion, variance can be a positive feature. Teams with mixed levels of openness — some highly curious and exploratory members, some more pragmatic and execution-focused — tend to outperform homogeneous teams on complex problems. The exploratory members generate a wider solution space; the more structured members filter and implement.

Too much of a good thing
One of the more counterintuitive findings in team composition research is that high mean levels of some traits can become liabilities past a certain threshold.
Too much conscientiousness in a team can produce over-planning and under-experimentation. Teams where everyone is highly organised and rule-following can struggle with ambiguity, novel problems, and the creative improvisation that complex work increasingly requires. They perform extremely well under clear conditions; they underperform when the problem is genuinely uncertain.
Too much extraversion creates competition for airtime. Teams with uniformly high extraversion tend to have louder conversations but lower-quality decisions — because the social dynamics favour confident assertion over careful deliberation, and disagreement is expressed as competition rather than inquiry.
Too much agreeableness can suppress necessary conflict. Teams that are uniformly high in agreeableness risk groupthink — the tendency to converge prematurely on consensus, avoid challenge, and not surface the uncomfortable truths that good decisions require. A study by De Dreu and Weingart (2003) found that some task conflict (disagreement about how work should be done) was positively associated with performance — agreeable teams are less likely to generate or sustain it.
The practical implication is that personality diversity isn’t just about fairness or representation. It’s about creating the right functional balance for the task the team is actually doing.
What balance actually looks like
Rather than prescribing a single optimal configuration, the research suggests task-dependent design:
For creative and strategy work: Benefit from high openness variance and moderate agreeableness mean. You want diverse ideas to emerge (openness spread) but constructive-enough dynamics to develop them (moderate agreeableness). High conscientiousness is less important here — structure comes second.
For execution-heavy work: Benefit from high conscientiousness mean and high agreeableness mean. Reliability, coordination, and low friction are the key determinants of performance. Openness variance matters less; you want everyone doing the agreed thing well.
For cross-functional or client-facing teams: Extraversion mean and agreeableness mean both predict outcomes. The work requires outward engagement and smooth interpersonal function. Neuroticism mean should be low — reactive team members create instability at exactly the moments external relationships are under pressure.

The neuroticism floor problem
One consistent finding across team composition research deserves specific attention: high neuroticism at the team level — particularly concentrated in one or two members — is reliably damaging.
A single high-N team member who is reactive under pressure, prone to catastrophising, or who generates social tension tends to have an outsized negative effect on team climate, communication quality, and decision-making. This isn’t a mean effect — it’s disproportionately driven by the highest-N individual in the team.
Neuman and Wright (1999) found that a team’s minimum agreeableness score and minimum conscientiousness score were strong predictors of performance — consistent with the weakest-link model. The same logic applies to the maximum neuroticism score: when one person’s reactivity sets the emotional register for the team, it can override the stabilising effect of everyone else.
This makes neuroticism particularly important to consider not just at the individual level but at the team composition level. High individual performance and high team neuroticism can coexist — and the team effect can dominate.
How team dynamics shift under pressure
One of the most important and underappreciated findings in team composition research is that personality effects are not static — they become more pronounced under stress. A team that appears to function well under normal conditions may behave very differently when under a hard deadline, navigating organisational conflict, or absorbing unexpected failure.
The research on team resilience points to two traits as particularly load-bearing when pressure rises: emotional stability (low neuroticism) and agreeableness. Under stress, disagreeable team members tend to become more competitive and less cooperative; neurotic team members become more reactive and generate more interpersonal noise. Both effects amplify exactly when the team needs to coordinate most efficiently.
LePine, Buckman, Crawford, and Methot (2011) examined how team personality profiles moderated responses to sudden increases in workload. Teams with high mean conscientiousness and low mean neuroticism adapted faster and maintained higher performance quality through the transition. Teams with high neuroticism variance — some members stable, some reactive — showed the most disruption: the reactive members created coordination overhead that consumed the capacity of the stable ones.
This has practical implications for how you think about team composition for high-stakes or volatile contexts. A team built around high individual performance but moderate stability may outperform a more stable team in normal conditions while dramatically underperforming when things get hard. The personality profile that looks good in a performance review may not be the profile you want when a major project derails.
The takeaway: if your team routinely operates under pressure — tight timelines, external scrutiny, frequent pivots — the neuroticism dimension of team composition deserves more weight in design than most hiring frameworks give it.
Personality versus skill in team selection
A persistent question in team composition is how much personality should be weighted against capability in selection decisions. The research offers a specific answer that most hiring processes ignore.
Neuman, Wagner, and Christiansen (1999) found that team-level personality explained variance in team performance even after controlling for individual ability scores. Personality added predictive power beyond cognitive ability and technical skill — it wasn’t just a proxy for competence.
The practical implication is not that personality matters more than skill. A highly agreeable person who can’t do the job is not a good hire. It’s that at comparable skill levels, personality composition at the team level is a meaningful predictor of collective output that point-in-time performance interviews and skills assessments don’t capture.
The common failure mode is this: organisations optimise individual selection for the highest-performing candidate in isolation, without considering how that person’s traits interact with the existing team profile. A team of five high-extraversion, high-openness individuals hiring a sixth doesn’t benefit from another high-extraversion, high-openness hire — they benefit from someone who provides what the team currently lacks. But most hiring processes don’t ask that question.

A more useful framing is compositional hiring: rather than asking “who is the best candidate?”, ask “what does this team’s personality profile need, and who fills that gap?” This requires knowing the current team’s trait distribution — something that personality data, rather than subjective manager impressions, can actually provide.
There is also a temporal dimension. Barry and Stewart (1997) found that the effects of team extraversion on performance were moderated by task type and team size. For small teams on interactive tasks, adding an extraverted member had diminishing returns beyond a moderate level; for larger teams with coordination requirements, extraversion predicted performance more linearly. The right compositional decision depends on what the team is actually doing and how many people are already doing it.
Practical design implications
For teams being built or restructured, the research points to a few practical principles:
Audit conscientiousness more carefully than extraversion. Extraversion is visible and gets overweighted in selection. Conscientiousness is quieter and underlies execution quality in almost every task type. A team with moderate extraversion and high conscientiousness tends to outperform the reverse.
Check for the weakest link, not just the mean. A team where seven people are high-conscientiousness and one person isn’t is more vulnerable than the mean suggests. The same applies to agreeableness when the work requires sustained collaboration.
Design for cognitive diversity, not trait homogeneity. Teams that score identically on openness or conscientiousness are not optimal teams — they’re homogeneous ones. Functional balance across dimensions creates more adaptive capacity than having everyone resemble the best individual performer.
Take the interpersonal climate seriously. Team viability — whether teams stay together and stay functional over time — is more strongly predicted by agreeableness and neuroticism than by any performance-adjacent trait. A team that can’t work together consistently will underperform its individual talent, regardless of other factors.
The team level is where personality research becomes most actionable. Individual trait profiles tell you how someone is likely to operate; the composition picture tells you what dynamics the team is likely to create — and that’s where the real performance leverage lives.