How the
Scoring Works
The framework, the dual-metric system, the scoring rules, and why each design decision was made.
Methodology Version: V6.0 (June 2026) — Dual-track scoring combining Young-Reed behavioral analysis (C1-C10) with Lifton’s psychological totalism framework (C11). Evidence-based jury methodology with formal validation metrics.
Criteria C1–C10 are derived verbatim from Daniella Mestyanek Young and Amy Reed’s The Culting of America: What Makes a Cult and Why We Love Them (Otterpine, 2026). Young and Reed define a cult as a group that meets these ten behavioral conditions. The composite scoring system was developed independently to extend analytical range for dataset-scale application — but the criteria themselves are Young and Reed’s.
Criterion C11 (Psychological Totalism) is derived from Robert Jay Lifton’s clinical and historical research on ideological systems. Lifton’s framework examines the structural mechanisms that create totalizing control over thought, emotion, and identity — independent of the specific ideology.
The three scoring outputs each measure a distinct dimension. Young’s Cultiness Score is a presence checklist (0–10): each of the ten behavioral criteria counts as “present” when the evidence shows it operating at a documented, recurring level — an intensity of 5 or higher on the criterion scale — and the score is simply the number of criteria present. The Composite Score reads those same per-criterion intensities but weights both how strongly each criterion applies (intensity) and how many apply (breadth) — producing a 0–100% scale that captures not just which criteria fire, but how powerfully and broadly. Lifton’s Totalism Score (C11) is an independent third axis measuring system-level psychological architecture: whether the organization structurally prevents exit from thought itself, regardless of its behavioral profile. The Cultiness and Composite scores are two different reductions of the same C1–C10 evidence (a presence count versus an intensity-weighted index); the Lifton totalism score is generated independently.
Young and Reed define a cult as a group that exhibits these ten behavioral patterns. Lifton’s eleventh criterion examines the system-level psychological totalism that enables and sustains those patterns.
Charismatic Leadership
A defined authority figure — or a central idea functioning in place of a person — whose directives are taken as truth and whose challengers are discredited. Does not require a living human leader; posthumous authority that is institutionally ratified and structurally unrevisable scores as high as or higher than a living leader.
Sacred Assumptions
Certain beliefs maintained against counter-evidence, with mantras repeated and alternatives dismissed. The key test is not whether beliefs exist but whether they are enforced against documented contradicting evidence. The most extreme form is architectural prevention of counter-evidence from being generated at all.
Transcendent Mission
A mission so large it justifies sacrifice, treats doubts as betrayal, and provides meaning in ways that override individual judgment. Organizations that structurally encourage internal dissent as an improvement mechanism score N/A — the structural opposite of this criterion.
Sublimation of Individuality
Identity demands, appearance and lifestyle conformity, and rest-as-weakness culture. Includes institutional statements that inflate membership value, creating a reputational double-bind where departure signals either rejection or poor judgment.
Isolation
Information environment narrows, outside perspectives are dismissed, the world shrinks. Isolation through institutional ecosystem completeness — parallel schools, hospitals, media, employment — scores equivalently to geographic compound isolation. Mechanism specificity: institutional enforcement of isolation scores 8–10; algorithmic or voluntarily mediated isolation scores 5–7.
Private Vernacular
Specialized vocabulary that marks membership identity, encodes a way of understanding reality difficult to access from outside, and terminates inquiry rather than enabling it. Standard professional field vocabulary does not check this criterion — the test is whether vocabulary operates as epistemological closure, not merely institutional naming.
Us-Versus-Them
More-enlightened-than-outsiders framing, defectors characterized as broken or corrupt, disagreement framed as bigotry or betrayal. Appropriate labor-management framing in unions and symmetrical partisan framing in political parties are distinguished from pathological enemy-construction.
Exploitation of Labor
Sacrifice extracted as virtue, labor monetized through institutional control. Financial extraction coerced through doctrinal framing with salvific or mission stakes is labor extraction. Systematic sexual harassment maintained through institutional cover-up and NDA architecture also checks this criterion. The delivery mechanism — financial, physical, sexual, or psychological — does not moderate intensity scoring. Compensation engineered to create exit barriers rather than fairly reward labor also checks this criterion.
High Exit Costs
Departure produces social, economic, or identity costs; exit is framed as betrayal. Spiritual absolutism — where departure means eternal damnation, complete family rupture, and total social network dissolution — scores at the same level as physical confinement. Mechanism specificity: institutional enforcement of exit costs (asset forfeiture, career destruction, legal consequences) scores 7–10; reputational or social costs without enforcement infrastructure scores 3–6.
Ends Justify the Means
Institutional harm tolerated in pursuit of mission, cover-ups occur, perpetrators are protected. Multi-generational non-correcting harm patterns score at the ceiling regardless of mechanism. The existence of internal dissenters who made the courageous choice within the same constraints establishes that compliant choices were genuine institutional choices, not forced outcomes.
Psychological Totalism
Robert Jay Lifton's framework describes a system where ideology becomes totalizing: it extends into every domain of life (totality), is self-sealing against critique (closed system), demands intellectual conformity, uses thought-terminating clichés, enforces emotional regulation, splits reality into absolute categories, uses sacred science claims to override evidence, maintains control through identity fusion with the organization, creates dependency on the system for meaning and morality, and is intended to be permanent/irreversible. The degree to which an organization instantiates these characteristics measures the degree of psychological totalism — not ideology per se, but the structural mechanisms that prevent exit from thought itself.
Every organization receives three scores capturing different dimensions of institutional coercion. The Cultiness and Composite scores are two reductions of the same C1–C10 criterion intensities (a presence count versus an intensity-weighted index); the Lifton totalism score is generated independently. All three are provided.
Young's Cultiness Score
0–10 binary checklist
Each of the ten criteria counts as present when its evidence reaches a documented, recurring intensity (5 or higher on the 1–10 criterion scale); the score is the count of present criteria. Produces three bands matching Young's Group Exit Checklist: Not Culty (0–2), Kinda Culty (3–5), Super Culty (6+). The presence threshold is essential — counting every criterion that merely has some evidence, rather than genuine documented presence, systematically inflates the score (an organization absent on every dynamic would otherwise read as a perfect 10).
Composite Score
Formula-based 0–100%
Formula: (Breadth ÷ 10) × (Mean Intensity ÷ 10) × 100 (C1-C10 only). Breadth = criteria with non-N/A scores. Mean Intensity = average of those scores. Adds two dimensions the binary instrument cannot capture: intensity variance and breadth-intensity interaction. Produces three tiers: Low-Control (0–29%), Moderate-Control (30–59%), High-Control (≥60%).
Totalism Score
0–10 intensity scale
Scores C11 (Psychological Totalism) on a 0–10 intensity scale, measuring the degree to which the organization exhibits system-level mechanisms that create totalizing control over thought and identity. Produces three classifications: Non-Totalizing (0–2.9), Moderately Totalizing (3–5.9), Psychologically Totalizing (6–10). Independent of C1-C10 scores. A behavioral cult (high C1-C10) may have low totalism (temporary structures); a totalizing system (high C11) may lack some behavioral hallmarks.
These three outputs are independent variables, not conversions of each other. Divergence between them is analytically meaningful. For example, a military institution might score high on C1-C10 (charismatic command, obedience structures) but lower on C11 (totalism) if it maintains explicit time-limits and member exit pathways. A new religious movement might score moderate on C1-C10 but extreme on C11 if it structures every domain of life and uses identity fusion.
The Composite Score (0–100%) maps to three tiers. These are the labels shown in the Dataset Explorer under the “Control Tier” column.
Young’s binary checklist (0–10 criteria met) maps to three bands, shown in the Dataset Explorer under the “Cultiness Tier” column.
The Totalism Score (0–10) maps to three classifications, shown in the Dataset Explorer under the “Totalism Tier” column.
Each criterion receives either a score of 1–10 or N/A. N/A is not a floor score. N/A designates structural absence — the criterion describes a dynamic that is inapplicable because the organization lacks the architecture for it, or because the organization’s documented behavior is the structural opposite of what the criterion describes.
Never assign N/A to soften a low score. Never assign a floor number when the evidence says the dynamic is absent. When body text explicitly states a dynamic is not operative, the score must be N/A — not 1 or 2. The distinction matters because N/A criteria are excluded from both the breadth count and the mean intensity calculation. Phantom scoring — floor numbers on structurally absent criteria — artificially inflates composite scores and produces misleading results.
Criterion structurally inapplicable — dynamic not operative and organization lacks the architecture for it
Essentially no evidence of this dynamic
Mild, occasional, or incidental presence
Moderate, recurring presence with documented examples
Strong, systematic presence across multiple documented behaviors
Extreme, defining feature with documented evidence of harm
Scores are anchored to publicly documented, verifiable behaviors — not reputation, not impression, not general public perception. Acceptable sources include court records and regulatory findings, investigative journalism from publications meeting T1–T2 credibility standards, peer-reviewed academic scholarship, government reports, and institutional self-documentation. Each criterion score includes at least one specific, verifiable example with citation.
Where evidence is limited, contested, or primarily indirect, confidence is rated Low and noted explicitly. The methodology does not paper over uncertainty.
All scores — across all three output tracks (Young’s, Composite, and Totalism) — are generated by an evidence-based jury protocol that eliminates calibration drift:
- An evidence brief is compiled for each criterion — documented behaviors from court records, investigative journalism, academic scholarship, and institutional documentation.
- The evidence brief is scored independently by four AI models (Claude, GPT-4o, Gemini, Llama) — no general knowledge, only the provided evidence. Models never see each other’s scores.
- The jury mean becomes the proposed score; a score spread greater than 5 points triggers additional review and evidence re-evaluation.
- For Young’s Cultiness Score: the binary checklist is derived from the jury intensity scores (≥5 = checks; <5 = does not check) for consistency and auditability.
- For Composite: formula applied to the jury-derived intensity scores, using the four-model consensus as the basis.
- For Totalism (C11): jury intensity scores for psychological totalism mechanisms, independent of C1-C10.
- A four-model consensus at 0–2 point spread indicates high reliability. Results include Krippendorff’s alpha and pairwise agreement metrics.
This dual-track methodology corrected systematic upward bias that existed in previous anchor-calibrated versions, particularly for C5 (Isolation), C8 (Labor Exploitation), and C9 (Exit Costs) when applied to political and media organizations. The addition of Lifton’s C11 adds system-level totalism measurement orthogonal to behavioral indicators.
AI-assisted scoring is used to generate proposed assessments at scale. Every proposed score — jury mean or solo assessment — passes through human review before entering the dataset. The reviewer verifies that each score is consistent with the body text, that N/A designations have structural rationale, that cited sources support the claims made, and that the assessment reflects consistent application of the methodology across the ideological and cultural spectrum.
The AI proposes. The human decides. No score enters the published dataset without that review.