Overview
LumiKin measures two things about every game and reports them independently. The first is what a child stands to gain from playing — measured as a Benefits Development Score (BDS) across cognitive, social-emotional, and motor dimensions. The second is the design pressure the game exerts on a developing brain — measured as a Risk Influence Score (RIS) across dopamine manipulation, monetization pressure, and social risk. From these two scores, LumiKin derives two outputs: a LumiScore (0–100), which is a harmonic mean of benefit and safety that penalises games that are high-risk or low-benefit on either axis; and a daily time recommendation in five tiers, derived from the RIS with adjustments for unusually high or low benefit scores. Many games score meaningfully on both axes simultaneously, and LumiKin is designed to make that visible rather than collapse it into a single judgement.
LumiKin does not measure how much a child enjoys a game, how skilled a player is, or whether a particular title is "good" or "bad" in any general sense. It measures structural design choices — what the game asks of a player and what mechanisms it uses to keep them playing. A beloved family favourite can carry meaningful design risks; a forgettable puzzle game can quietly deliver real cognitive benefit. The framework is deliberately separated from taste, popularity, and critical reception, all of which are well-served by other rating systems.
The framework is designed to be auditable. Every dimension has a fixed scoring scale, every category has a documented weight, and every score carries the methodology version that produced it. When the rubric changes, a new version is published and prior versions remain queryable. Reviewers — whether human or automated — score against the same rubric, with reference titles used to calibrate against drift. Limitations and edge cases (UGC platforms, live-service content, automated scoring) are documented openly in the Limitations section below rather than buried.
The Scoring Model
Every LumiKin game profile produces two independent composite scores and a time recommendation derived from them.
Benefits Development Score (BDS)
The BDS combines three benefit categories using fixed weights:
| Category | Weight |
|---|---|
| B1 — Cognitive development | 50% |
| B2 — Social-emotional development | 30% |
| B3 — Physical and motor development | 20% |
Each category score is normalised to 0–1 before weighting.
Formula: BDS = (B1_norm × 0.50) + (B2_norm × 0.30) + (B3_norm × 0.20)
Risk Influence Score (RIS)
The RIS combines three risk categories. Content risk (R4) is reported separately and does not feed into the time recommendation — this preserves compatibility with ESRB and PEGI ratings.
| Category | Weight |
|---|---|
| R1 — Dopamine manipulation design | 45% |
| R2 — Monetization pressure | 30% |
| R3 — Social and emotional risk | 25% |
Formula: RIS = (R1_norm × 0.45) + (R2_norm × 0.30) + (R3_norm × 0.25)
Time recommendation tiers
The primary output is a recommended daily time limit derived from the RIS, with adjustments for high-benefit and low-benefit games.
| RIS range | Base recommendation |
|---|---|
| 0.00 – 0.15 | 120 min/day |
| 0.16 – 0.30 | 90 min/day |
| 0.31 – 0.50 | 60 min/day |
| 0.51 – 0.70 | 30 min/day |
| 0.71 + | 15 min/day (not recommended for daily play) |
Tier adjustments:
- If
BDS ≥ 0.60andRIS ≤ 0.70: recommendation extends one tier (e.g. 60 → 90 min) - If
BDS < 0.20andRIS > 0.30: recommendation drops one tier (e.g. 60 → 30 min)
These adjustments do not stack.
B1 Cognitive
Rationale: Cognitive development is the largest single weight in the BDS (50%) because the age range LumiKin targets — roughly six to sixteen — overlaps with significant maturation of executive function, working memory, spatial reasoning, and metacognition. Games occupy a meaningful share of discretionary time in this window. The mechanics a child engages with repeatedly during this period are not neutral: they constitute a form of practice. Some games offer rich, varied cognitive challenges; others offer narrow, repetitive ones. The B1 dimensions attempt to characterise that landscape so parents and partners can distinguish between superficially similar titles that differ substantially in what they actually train.
What's measured: Ten dimensions, each scored 0–5 by a trained reviewer:
| Dimension | What is assessed |
|---|---|
| Problem solving | Novel problems requiring reasoning, experimentation, or logic |
| Spatial awareness | Mental rotation, 3D navigation, map reading, spatial planning |
| Strategic thinking | Planning ahead, resource management, evaluating trade-offs |
| Critical thinking | Evaluating information, questioning assumptions, evidence-based decisions |
| Memory and attention | Working memory, sustained attention, pattern recognition |
| Creativity and expression | Open-ended tools for building, designing, composing, or storytelling |
| Reading and language | Vocabulary, reading comprehension, narrative understanding |
| Math and systems thinking | Numerical reasoning, economic systems, statistical thinking |
| Learning transfer | Knowledge or skills that apply outside the game |
| Adaptive challenge | Difficulty that scales with player skill (flow state design) |
Scoring: Sum of 10 dimensions (0–5 each). Max 50. Normalised to 0–1 for weighting.
Weighting: B1 contributes 50% of the BDS — the largest single weight, reflecting the centrality of cognitive development in the age ranges LumiKin targets.
Research basis:
- Bavelier, D. & Green, C.S. (2019). Enhancing attentional control: Lessons from action video games. Neuron, 104(1), 147–163. [verify]
- Granic, I., Lobel, A., & Engels, R.C.M.E. (2014). The benefits of playing video games. American Psychologist, 69(1), 66–78. [verify]
- Sala, G., Tatlidil, K.S., & Gobet, F. (2018). Video game training does not enhance cognitive ability: A comprehensive meta-analytic investigation. Psychological Bulletin, 144(2), 111–139. [verify] (Included deliberately as a critical counterweight — LumiKin does not claim broad transfer effects, and the B1 framework distinguishes specific trained skills from general cognitive enhancement.)
B2 Social-Emotional
Rationale: Cooperative play, narrative-driven games, and well-moderated multiplayer environments can develop genuine social and emotional skills — perspective-taking, emotional regulation under pressure, ethical reasoning in low-stakes contexts. The same structural feature that makes games effective at training these skills (sustained social interaction with consequence) also makes them vulnerable to exploitation, which is why the social-emotional dimensions are scored separately from social-emotional risks (R3). A game can simultaneously develop authentic teamwork (high B2) and expose a child to identity pressure or unmoderated strangers (high R3); these are independent axes and the framework treats them that way.
What's measured: Six dimensions, each scored 0–5:
| Dimension | What is assessed |
|---|---|
| Teamwork and cooperation | Genuine collaboration where players depend on each other |
| Communication skills | Meaningful communication between players |
| Empathy and perspective-taking | Understanding other viewpoints, cultures, or emotional experiences |
| Emotional regulation | Persistence, managing frustration, coping with loss — through design, not punishment |
| Ethical reasoning | Moral dilemmas, consequences for choices, opportunities to consider fairness |
| Positive social interaction | Moderated multiplayer, prosocial incentives, constructive community design |
Scoring: Sum of 6 dimensions (0–5 each). Max 30. Normalised to 0–1 for weighting.
Weighting: B2 contributes 30% of the BDS.
Research basis:
- Greitemeyer, T. & Mügge, D.O. (2014). Video games do affect social outcomes: A meta-analytic review of the effects of violent and prosocial video game play. Personality and Social Psychology Bulletin, 40(5), 578–589. [verify]
- Harrington, B. & O'Connell, M. (2016). Video games as virtual teachers: Prosocial video game use by children and adolescents from different socioeconomic groups is associated with increased empathy and prosocial behaviour. Computers in Human Behavior, 63, 650–658. [verify]
B3 Motor
Rationale: Motor development is the smallest weight in the BDS (20%) because the dominant concern for screen time in this age range is sedentary behaviour rather than fine-motor deficit. The dimension is retained because some games — particularly motion-controlled, VR, and AR titles — meaningfully extend into physical activity, and because hand-eye coordination training has documented co-benefits in attention and reaction tasks. Games that contribute negligibly to motor development are scored low here without penalty to the overall BDS provided they perform well on B1 and B2.
What's measured: Four dimensions, each scored 0–5:
| Dimension | What is assessed |
|---|---|
| Hand-eye coordination | Precise timing, aiming, coordination between visual input and motor response |
| Fine motor skills | Precise small-muscle movements, dexterity, touch precision |
| Reaction time | Quick reflexes and rapid decision-making |
| Physical activity | Whole-body movement (VR, motion controls, AR) |
Scoring: Sum of 4 dimensions (0–5 each). Max 20. Normalised to 0–1 for weighting.
Weighting: B3 contributes 20% of the BDS.
Research basis:
- Polechoński, J., Dębska, M., & Dębski, P.G. (2020). Exergaming can be a health-related aerobic physical activity. BioMed Research International, 2020. [verify]
- Li, R., Polat, U., Makous, W., & Bavelier, D. (2009). Enhancing the contrast sensitivity function through action video game training. Nature Neuroscience, 12(5), 549–551. [verify]
R1 Dopamine Design
Rationale: Dopamine manipulation design is weighted highest in the RIS (45%) because the mechanisms involved — variable ratio reward schedules, near-miss feedback, loss aversion triggers, time-limited urgency — are the most thoroughly documented in behavioural psychology and the most aggressively engineered into modern free-to-play games. The same techniques are used in regulated gambling products. The developmental concern is acute: the prefrontal regulation that helps adults resist these mechanisms is still maturing through adolescence. R1 attempts to characterise the intensity and variety of these mechanisms in a given title, on the principle that a game with one mild streak feature and a game built end-to-end around variable rewards present qualitatively different risks even if both technically contain "engagement design".
What's measured: Ten risk factors, each scored 0–3:
| Factor | Description |
|---|---|
| Variable ratio rewards | Random reward schedules (loot boxes, gacha) |
| Streak mechanics | Daily login rewards with penalties for missing |
| Loss aversion triggers | Resources that decay or opponents that advance while absent |
| FOMO / time-limited events | Content that disappears, creating urgency |
| Artificial stopping barriers | Energy/lives systems that create frustration pressure |
| Notification and re-engagement | Notifications designed to create anxiety |
| Near-miss mechanics | Deliberate "almost won" feedback mimicking gambling |
| Infinite scroll / endless play | Auto-play, no natural endpoints |
| Escalating commitment | Sunk-cost design ("I've invested too much to stop") |
| Variable reward frequency | Precisely calibrated reward timing (slot machine pattern) |
Scoring: Sum of 10 factors (0–3 each). Max 30. Normalised to 0–1 for weighting.
Weighting: R1 contributes 45% of the RIS — the highest weight, reflecting the documented developmental harm of these mechanics in children's products.
Research basis:
- Zendle, D. & Cairns, P. (2018). Video game loot boxes are linked to problem gambling: Results of a large-scale survey. PLOS ONE, 13(11). [verify]
- King, D.L. & Delfabbro, P.H. (2019). Predatory monetization features in video games (e.g., 'loot boxes') and Internet gaming disorder. Addiction, 114(11), 1967–1969. [verify]
- UK Competition and Markets Authority (2023). Online choice architecture: How digital design can harm competition and consumers. Discussion paper. [verify]
- The R1 dimension list draws on the SHARP-G framework (Saini & Hodgins, 2024) [verify] adapted for children's contexts.
R2 Monetization
Rationale: Monetization pressure (R2, 30% of RIS) is scored separately from dopamine design because the mechanisms are distinct even when they co-occur. R1 measures psychological extraction — how strongly the game pulls a child into continuing to play. R2 measures financial extraction — how strongly the game pulls a child or parent into continuing to spend. A game can be high on one and low on the other: a free advertising-funded game may have minimal monetization pressure but heavy dopamine design; a flat-fee premium title may have low dopamine design but in-game purchase prompts targeting children. R2 is particularly load-bearing in regulatory contexts (UK Children's Code, FTC guidance on dark patterns in children's apps), and partner products typically need to filter on it independently of overall game quality.
What's measured: Eight risk factors, each scored 0–3:
| Factor | Description |
|---|---|
| Spending ceiling | Whether unlimited spending is possible |
| Pay-to-win mechanics | Gameplay advantage from spending |
| Currency obfuscation | Multiple virtual currencies obscuring real cost |
| Spending prompts | Prompts triggered by failure or frustration |
| Child-targeting design | Purchase UI explicitly targeting younger users |
| Ad pressure | Frequency and intrusiveness of advertising |
| Subscription pressure | Auto-renewal, locked features |
| Social spending pressure | Gifting mechanics, peer pressure to spend |
Scoring: Sum of 8 factors (0–3 each). Max 24. Normalised to 0–1 for weighting.
Weighting: R2 contributes 30% of the RIS.
Research basis:
- Federal Trade Commission (2022). Bringing dark patterns to light. FTC Staff Report. [verify]
- Information Commissioner's Office, UK (2020/2025). Age Appropriate Design Code (Children's Code). [verify]
- Fitton, D. & Read, J.C. (2019). Creating a framework to support the critical consideration of dark design aspects in free-to-play apps. Proceedings of IDC '19. [verify]
R3 Social Risk
Rationale: Social risk is scored independently of the positive social dimensions in B2 because they describe different phenomena even when they share a surface — multiplayer interaction. A game with strong cooperative design and active community moderation can develop genuine teamwork (high B2) while remaining low-risk (low R3). A game with the same multiplayer surface but without moderation, with public ranking, and with unmoderated voice chat available to strangers can develop similar surface skills while exposing a child to qualitatively different risks. R3 attempts to characterise those risks distinctly: the chance of toxic competitive environments, the presence of unknown adults in voice or chat, social comparison pressure tied to spending or in-game performance, and privacy practices. The most acute concern in this category — particularly for children under 13 — is unmoderated stranger contact, which is scored as its own factor and weighted heavily within R3.
What's measured: Six risk factors, each scored 0–3:
| Factor | Description |
|---|---|
| Social obligation | Pressure to play from group expectations (guilds, team events) |
| Competitive toxicity | Rank anxiety, public shaming, limited moderation |
| Stranger interaction risk | Unmoderated communication with unknown adults |
| Social comparison | Prominent leaderboards, spending-visible cosmetics |
| Identity and self-worth | Self-worth linked to in-game performance or possessions |
| Privacy risk | Data collected on minors without transparent consent |
Scoring: Sum of 6 factors (0–3 each). Max 18. Normalised to 0–1 for weighting.
Weighting: R3 contributes 25% of the RIS.
Research basis:
- Kowert, R. (2020). Dark Participation in Games. Frontiers in Psychology, 11. [verify]
- Internet Watch Foundation annual reports — relevant sections on grooming via gaming platforms. [verify]
- 5Rights Foundation (2021). Pathways: How digital design puts children at risk. [verify]
R4 Content Risk
Important: R4 is reported as a standalone content flag and does not feed into the RIS or the time recommendation. This is a deliberate design decision: content risk largely aligns with existing ESRB and PEGI ratings, and LumiKin's primary contribution is in areas those ratings don't cover (manipulation design, monetization, social risk). Including R4 in the time recommendation would create overlap with regulatory systems and reduce the distinctiveness of the LumiScore.
What's assessed: Five factors, each scored 0–3:
| Factor | Examples |
|---|---|
| Violence level | None → mild cartoon → moderate realistic → graphic |
| Sexual content | None → mild suggestive → moderate themes → explicit |
| Language | None → mild → moderate profanity → extreme/frequent |
| Substance references | None → mild references → depicted without consequence → depicted positively |
| Fear and horror | None → mild tension → moderate scares → intense horror |
Display: R4 scores are shown as a content advisory alongside the ESRB/PEGI rating on game pages, not as a component of the overall LumiScore.
Update and Versioning Policy
Games are rescored under three triggers. First, automated rescore is initiated when the upstream metadata source (RAWG) reports a substantive update — DLC, platform expansion, or content patches that materially alter mechanics. Second, manual rescore is initiated for live-service titles where mechanics have changed in ways the automated pipeline did not detect. Third, ad-hoc rescore is initiated when LumiKin receives credible information about changes via community reports, regulatory action, or publisher disclosure.
The scoring weights, dimension definitions, scoring scales, and tier thresholds are locked within a methodology version. Any change to these — adding a dimension, adjusting a weight, redefining a tier boundary — requires publishing a new methodology version. Version numbers follow semantic versioning: minor version increments (1.0 → 1.1) for additive changes that do not alter existing scores, major version increments (1.0 → 2.0) for changes that require all games to be rescored. Every score carries the methodology version that produced it. Historical scores remain queryable under their original version via the ?version=X parameter on this page and via the methodology_version field in the API. Partners consuming the API receive methodology_version and calculated_at fields on every score response.
Limitations and Edge Cases
UGC platforms (Roblox, Fortnite Creative). LumiKin scores individual experiences on UGC platforms as well as the platform itself. The platform score reflects the host environment (moderation quality, monetization model, stranger interaction risk). Individual experience scores reflect the specific content. Consumers of the API should be aware that a "Roblox" platform score is not a score for Roblox experiences as a category — individual experience scores vary widely and should be queried separately.
Live-service games with evolving content. Games with weekly content updates, seasonal events, or ongoing battle passes are scored on their current state at review time plus any persistent structural mechanics. Time-limited events are scored under R1 (FOMO) but the content of those events may not be individually reviewed. Live-service titles are prioritised for periodic rescore when mechanics change materially.
Early access games. Early access titles are scored at the state available at review time. Scores may change substantially on full release. LumiKin does not guarantee rescore coverage for all early access titles on release.
Regional content differences. Some games ship with different content ratings by region (e.g., a game rated M in North America may be rated 16+ in Europe with modified content). LumiKin scores the international/default release where possible. Regional variants are noted where known but are not systematically tracked.
AI-generated scores. Some scores are produced by an automated review pipeline using the LumiKin rubric as a prompt. These are marked review_tier: automated in the API. Expert and community reviews are marked accordingly. Automated scores undergo periodic spot-check auditing but have not been individually verified by a human reviewer.
Changelog
| Version | Date | Summary |
|---|---|---|
| 1.0 | 2026-04-26 | Initial published methodology. All weights, dimensions, and tier thresholds as described in this document. |