Plate 01 · 2024

ASA 400

Subject

THE FOUNDER

The screening room, 3:47 AM. Where the platform was assembled, mostly, between three and seven in the morning.

The Person Behind the Numbers

LUC J. RIBEIRO

FOUNDER · DATA ARCHAEOLOGIST · WRITER

Solo founder working from a converted screening room. Background in computational data science, music engineering, and screenwriting workshops. Built Hollywood Metrics because the most important industry on Earth was still running on hunch, agent rolodex, and three-martini lunches. The platform you're about to read about was assembled, mostly, between three and seven in the morning.

The film industry runs on opinions disguised as expertise. We run on numbers.

Founded: 2024
Based: Los Angeles
Team: Solo + Agents

2024

Founded

74,571

Films Analyzed

4,800+

Screenplays Parsed

6

Quality Tiers

The Manifesto

WHY WE BUILT IT

One essay, plainly told.

Cinema is the only trillion-dollar industry where the dominant decision-making protocol is still a feeling. Studios greenlight nine-figure budgets on the strength of a pitch, a poster mood-board, and a phone call from someone's agent. Audiences are talked about in focus-group adjectives. Films are described as having 'heat,' or 'noise,' or 'energy.' These are weather words, used to make purchasing decisions that move billions of dollars.

Every other industry of comparable size has been pried open by data. Finance has Bloomberg. Sports has Statcast. Music has streaming-derived analytics that tell the label exactly which 12 seconds of a song to optimise. Hollywood, alone, has held the line. Not because the math doesn't work — the math works fine — but because the math wasn't doing the politics. The hunch was doing the politics.

We built Hollywood Metrics to do the math anyway.

The platform you're reading about indexes 107 years of cinema across 74,571 films. It ingests ratings from IMDb, critic consensus from Rotten Tomatoes, audience reception from CinemaScore where the data exists, lifetime box office, Academy and Cannes laurels, and the underlying metadata for every credit on every picture. It runs all of it through a normalisation layer that produces a single composite quality score on a 0–100 scale. That score is comparable across decades — a 1947 noir and a 2024 superhero film can sit on the same axis and be measured against each other without anyone having to argue about taste.

On the screenplay side, we've parsed more than 4,800 produced and unproduced scripts into a quantitative feature set of twenty measurable dimensions: scene cadence, dialogue density, sentiment arc, vocabulary entropy, transition explicitness, character intro rate. Then a Random Forest, trained on those features and their commercial outcomes, lands every script in one of six tiers from S (masterwork) to E (needs work). The model is honest about its accuracy: 49% overall hit rate against historical box-office bands, climbing to 69% on action films, falling to 24% on comedy. We publish those numbers because comedy is genuinely hard and we're not going to pretend otherwise.

On top of the analytical core sits a layer of bespoke agents — purpose-built autonomous agentic workflows, not a chatbot wearing different hats. One agent dissects a script's structural metrics. Another rewrites scenes in the voice of any of ninety screenwriters whose techniques we've codified into style guides. A third forecasts opening weekend with a thousand simulated audience personas. A fourth assembles investor-ready pitch decks from the analysis output. Each one is narrow, focused, and good at exactly one thing.

None of this is intended to replace the writer. The 2023 WGA contract made the position explicit: writers are creators, models are tools. We agree, in writing, in our terms of service, and in every product decision. The platform never stores your screenplay server-side after analysis completes. Originality scoring exists to prove your work is yours. Optional Solana-blockchain timestamping exists to prove when it was yours. The math, in this house, works for the human.

It is also not intended to be neutral. We have opinions. We think the auteur theory deserves more empirical respect than it gets in the era of brand-led franchise machines. We think the modern comedy script is undervalued by Random Forests because comedy depends on timing and timing doesn't tokenise. We think the average studio note would be improved by being replaced with a histogram. We think a 90-page screenplay with 64% dialogue and a positive sentiment arc is more likely to test well than a 130-page screenplay with 38% dialogue and a flat arc, and we have the data to back it up.

What this site is for, in the end, is to let you check those opinions against the data yourself. Browse the catalogue. Read the methodology. Upload a script and watch the engine return numbers instead of vibes. The dashboard is dense on purpose — it is designed for the writer who wants to know exactly why their second act isn't landing, the executive who needs a defensible read on a property before a Monday meeting, and the producer who has been told by three different agents that the script is 'great' and would like a fourth, quieter, more numerical opinion.

That's the project. Numbers. Honestly reported. Made beautiful enough to be worth looking at.

Hollywood Metrics was conceived in a converted screening room, assembled at three in the morning, and is operated as a solo project with help from a federation of autonomous agents. If you find a thing that's wrong, write in. We will fix it and credit you in the changelog.

End · The Founder

Reel Two · The Record

Seventy-four thousand films. A century of frames. The signal under the footnotes.

On the Wire

COVERAGE WISHLIST

We have not been covered yet. We have opinions about who should cover us.

VARIETY

THE HOLLYWOOD REPORTER

DEADLINE

INDIEWIRE

THE INFORMATION

STRATECHERY

WIRED

THE RINGER

Awaiting first byline

Press Contact

The Foundation

DATA SOURCES

Two massive datasets power every visualization, prediction, and insight in the dashboard.

THE FILM DATABASE

A comprehensive collection of over 74,000 films spanning more than a century of cinema, aggregated from multiple authoritative sources.

Each film record combines ratings from IMDb and Rotten Tomatoes, box-office data, award histories, and rich metadata from TMDB.

74K+

Films

Genres

100+

Years

Sources

THE SCRIPT DATABASE

4,800+ screenplays analyzed with our quantitative engine, which extracts 20 measurable features capturing structure, pacing, dialogue, character dynamics, and emotional tone.

Each script is classified into one of six quality tiers (S through E) using a Random Forest model.

4.8K+

Scripts

20

Metrics

6

Tiers

90+

Guides

How It Works

METHODOLOGY

Three systems, working in concert.

MASTER SCORE

A composite quality metric combining IMDb ratings, Rotten Tomatoes scores, box-office performance, and Academy Award recognition into a single 0–100 scale.

Weighting

IMDB

B.O.

AWARDS
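
As a rough illustration of how a weighted composite on a 0–100 scale can be put together, here is a minimal sketch. The weights, the field names, and the assumption that every input has already been rescaled to 0–100 are hypothetical; the actual weighting is not stated on this page.

```python
from dataclasses import dataclass

# Hypothetical weights for illustration only; they sum to 1.0.
WEIGHTS = {"imdb": 0.4, "rotten_tomatoes": 0.3, "box_office": 0.2, "awards": 0.1}


@dataclass
class FilmSignals:
    """Inputs already rescaled to 0-100 (an assumption of this sketch)."""
    imdb: float
    rotten_tomatoes: float
    box_office: float
    awards: float


def master_score(film: FilmSignals) -> float:
    """Weighted average of the four signals, clamped to the 0-100 range."""
    total = sum(weight * getattr(film, name) for name, weight in WEIGHTS.items())
    return max(0.0, min(100.0, total))


example = FilmSignals(imdb=80.0, rotten_tomatoes=90.0, box_office=50.0, awards=100.0)
print(round(master_score(example), 2))
```

Because the weights sum to one and each input stays within 0–100, the result stays on the same scale, which is what lets films from different decades sit on one axis.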

TIER CLASSIFICATION

A Random Forest classifier trained on 4,800+ labeled scripts assigns each screenplay to one of six quality tiers.

Distribution

~5%

~15%

~30%

~25%

~15%

~10%

AI REWRITE PIPELINE

The rewrite system detects the screenplay's primary genre, matches it to one of 90+ master style guides, and feeds both to our bespoke rewrite engine.

Flow: Genre Detect → Style Match → Bespoke AI Rewrite → Stream Output

The Engine Room

ALL 20 SCREENPLAY METRICS

Every screenplay is analyzed across these quantitative features, grouped into six analytical categories.

Structure

Pacing

Visual

Characters

Dialogue

Tone

#	Metric	Category	Description
01	Scene Count	Structure	Number of distinct scenes; a proxy for the rate of visual shifts and pacing energy.
02	Total Pages	Structure	Screenplay length. Industry standard: 90–125 pages.
03	Avg Scene Length	Pacing	Average length of scenes in pages. Longer scenes allow dramatic development; shorter scenes create urgency.
04	Scene Length Variance	Pacing	Standard deviation of scene lengths. High variance creates rhythmic pacing.
05	Transition Density	Pacing	Explicit transitions (CUT TO, DISSOLVE, etc.) per page.
06	Int/Ext Ratio	Visual	Ratio of interior to exterior scenes. Affects visual claustrophobia versus cinematic openness.
07	Action Ratio	Visual	Percentage of action/description versus dialogue.
08	Caps Density	Visual	Frequency of capitalized emphasis words in action lines.
09	Unique Character Count	Characters	Number of distinct named characters.
10	Top 3 Character Dominance	Characters	Percentage of all dialogue spoken by the top 3 characters.
11	Character Intro Rate	Characters	Characters introduced per page.
12	Dialogue Ratio	Dialogue	Percentage of script that is dialogue versus action/description.
13	Avg Dialogue Length	Dialogue	Average number of words per dialogue block.
14	Vocabulary Richness	Dialogue	Type-token ratio measuring word diversity.
15	Avg Word Length	Dialogue	Average syllable count per word.
16	Question Density	Dialogue	Frequency of questions in dialogue.
17	Sentiment Mean	Tone	Overall emotional tone on a positive-to-negative scale.
18	Sentiment Variance	Tone	Emotional fluctuation from scene to scene.
19	Sentiment Arc Slope	Tone	Trend of sentiment from beginning to end.
20	Exclamation Density	Tone	Frequency of exclamation marks. Signals urgency and emotion.
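
A few of the dialogue metrics above can be sketched in a handful of lines. This is a minimal illustration, not the parser itself: the tokenisation rule, the function names, and the choice to count question marks per dialogue block are assumptions made for the example.

```python
import re


def tokenize(text: str) -> list[str]:
    """Lowercase word tokens; apostrophes are kept inside words."""
    return re.findall(r"[a-z']+", text.lower())


def vocabulary_richness(text: str) -> float:
    """Type-token ratio: distinct words divided by total words."""
    tokens = tokenize(text)
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def avg_dialogue_length(blocks: list[str]) -> float:
    """Mean number of words per dialogue block."""
    if not blocks:
        return 0.0
    return sum(len(tokenize(block)) for block in blocks) / len(blocks)


def question_density(blocks: list[str]) -> float:
    """Question marks per dialogue block (one possible definition)."""
    if not blocks:
        return 0.0
    return sum(block.count("?") for block in blocks) / len(blocks)


# Made-up dialogue blocks for illustration.
blocks = ["Where were you?", "Out. Where were you?", "Here. Waiting."]
print(vocabulary_richness(" ".join(blocks)))
print(avg_dialogue_length(blocks))
print(question_density(blocks))
```

Type-token ratio falls as a text gets longer, so comparing scripts of very different lengths would likely need a fixed-size sample or a length-corrected variant.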

Technical Architecture

THE ORACLE ENGINE

A bespoke ensemble of custom-trained models and high-performance algorithms, engineered for cinematic analysis.

VIZ

Cinematic Vector Engine

Multi-Dimensional Mapping

Projects every film into a 20-dimensional space where similarity is geometric, not anecdotal.
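
To make "similarity is geometric" concrete, here is a small sketch using cosine similarity between feature vectors. Cosine similarity is one common choice and is an assumption here; the page does not say which distance the engine uses. The three-dimensional vectors and film titles are placeholders standing in for the full 20 dimensions.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of dimensions")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention for an all-zero vector in this sketch
    return dot / (norm_a * norm_b)


def most_similar(query: list[float], catalogue: dict[str, list[float]]) -> str:
    """Title of the catalogue entry closest to the query by cosine similarity."""
    return max(catalogue, key=lambda title: cosine_similarity(query, catalogue[title]))


# Placeholder vectors; real entries would carry 20 values each.
catalogue = {"Film A": [1.0, 0.0, 0.0], "Film B": [0.0, 1.0, 1.0]}
query = [0.9, 0.1, 0.0]
print(most_similar(query, catalogue))
```

Features on very different scales (page counts versus ratios) would normally be standardised first, otherwise the largest-valued dimension dominates the angle.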

CORE

Script-Quant Parser

High-Fidelity Extraction

Reads PDF, Fountain, and Final Draft. Returns twenty measurable dimensions in under a second.

HEX

Structural Decomposition

Pattern Recognition

Identifies the three-act spine, midpoint reversals, and structural anomalies across the corpus.

EVA

Emotional Valence Array

Narrative Sentiment Processing

Per-scene sentiment vectors fed through a learned model of dramatic momentum.
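
The three tone metrics (sentiment mean, variance, and arc slope) can be derived from a list of per-scene sentiment scores. The sketch below assumes one score per scene on a negative-to-positive scale and fits an ordinary least-squares line against scene index; it covers only these summary statistics, not the learned momentum model, and the function names are hypothetical.

```python
from statistics import fmean, pvariance


def arc_slope(scene_sentiment: list[float]) -> float:
    """Least-squares slope of sentiment against scene index (0, 1, 2, ...)."""
    n = len(scene_sentiment)
    if n < 2:
        return 0.0
    xs = range(n)
    x_mean = fmean(xs)
    y_mean = fmean(scene_sentiment)
    covariance = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, scene_sentiment))
    x_spread = sum((x - x_mean) ** 2 for x in xs)
    return covariance / x_spread


def sentiment_summary(scene_sentiment: list[float]) -> dict[str, float]:
    """Mean, population variance, and arc slope for a non-empty score list."""
    return {
        "mean": fmean(scene_sentiment),
        "variance": pvariance(scene_sentiment),
        "arc_slope": arc_slope(scene_sentiment),
    }


# Made-up scores for a four-scene script that brightens steadily.
scores = [-0.5, 0.0, 0.5, 1.0]
print(sentiment_summary(scores))
```

A positive slope means the script trends brighter from first scene to last; a slope near zero with high variance would describe a script that swings between moods without an overall direction.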

PNE

Neural Success Ensemble

Predictive Tier Classification

A Random Forest with a thousand decision trees. The model that places your script in S through E.

Oracle Generative Synthesis

Bespoke Story Transformation

The rewrite layer. Bespoke agents that speak in the voice of 90+ codified screenwriter styles.

RAW

Bare-Metal Implementation

Ultra-Low Latency Execution

Vector ops written close to the metal. Analysis runs in the time it takes to pour a coffee.

The Library

90+ MASTER STYLE GUIDES

Genre-specific screenwriting guides averaging 3,000+ words each, covering structure, dialogue, tone, pacing, and visual storytelling.

Sample Guide

PSYCHOLOGICAL-THRILLER

3,247 words

Structure & Pacing

Dialogue Techniques

Tone & Atmosphere

Browse the Catalog

82 guides

▸Absurdist / Surreal

▸Addiction & Recovery

▸Adult Animation

▸Anthology Series

▸Anti-Romance

▸Biopic

▸Body Horror / Cosmic

▸Buddy Comedy

▸Coming-of-Age Comedy

▸Coming-of-Age Drama

▸Conspiracy Thriller

▸Contemporary Romance

▸Creature Feature

▸Crime Drama / Gangster

▸Disaster Film

▸Docudrama / Docuseries

▸Documentary

▸Drug Trafficking

▸Ensemble Comedy

▸Epic Fantasy

▸Experimental / Arthouse

▸Fairy Tale Adaptation

▸Family Drama

▸Fish Out of Water

▸Folk Horror

▸Forbidden Love

▸Found Footage

▸Franchise / Sequel

▸Gothic Horror

▸Gothic Romance

▸Home Invasion / Survival

▸Horror Comedy

▸Immigrant / Diaspora

▸Indie / Mumblecore

▸Legal / Courtroom

▸LGBTQ+ Narrative

▸Limited Series

▸Love Triangle

▸Magical Realism

▸Martial Arts / Wuxia

▸Mockumentary

▸Musical

▸Neo-Noir

▸Novel to Screen

▸Parody / Spoof

▸Period / Historical Romance

▸Pilot Episode

▸Prison Drama

▸Procedural

▸Psychological Horror

▸Psychological Thriller

▸Revenge Thriller

▸Road Movie

▸Romantic Drama

▸Satirical / Social

▸Second Chance / Reunion

▸Serialized Prestige

▸Short Film

▸Single-Cam Sitcom

▸Sketch Comedy

▸Slapstick / Physical

▸Slasher

▸Soap Opera / Telenovela

▸Social Realism

▸Space Opera

▸Sports Drama

▸Spy / Espionage

▸Stage to Screen

▸Stoner / Hangover

▸Superhero

▸Supernatural Horror

▸Survival Drama

▸Sword & Sorcery

▸Techno-Thriller

▸Teen / YA Series

▸True Crime

▸Video Game Narrative

▸War Film

▸Wedding Ensemble

▸Western

▸Workplace Comedy

▸Zombie / Post-Apocalyptic

The Page
Screenwriters: editorial style guides for the people who put the rooms on the page.

The Set
Directors: the signatures behind the shot list, one hundred profiles deep.

The Lens
Cinematographers: the eye that decided what the frame would hold.

Writer Protection

ORIGINALITY VERIFIED

The WGA's 2023 contract made it clear: writers are the creators. Our algorithms help prove it.

ORIGINALITY SCORING

Compare your screenplay against 74,000+ films and 4,800+ analyzed scripts. Get a quantitative originality score measuring how unique your premise, structure, and dialogue patterns are across every dimension we track.

SIMILARITY DETECTION

Our system identifies structural and thematic similarities with existing works in the database. Catch potential overlap before a studio notes session or a legal review does. Protect yourself before you pitch.

IP DOCUMENTATION

Timestamp your screenplay on the Solana blockchain to create tamper-evident support for your authorship claim. Combined with our originality analysis, you get a verifiable record that your work existed in its current form at a specific moment in time.
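
Timestamping of this kind usually anchors a cryptographic fingerprint of the file rather than the text itself, which would fit with not keeping the screenplay server-side. The sketch below shows only that fingerprinting step with SHA-256; the record layout and function names are hypothetical, and the on-chain submission is deliberately left out because its details are not described here.

```python
import hashlib
from datetime import datetime, timezone


def screenplay_fingerprint(data: bytes) -> str:
    """Hex SHA-256 digest of the exact screenplay bytes."""
    return hashlib.sha256(data).hexdigest()


def timestamp_record(data: bytes) -> dict[str, str]:
    """Hypothetical record that a timestamping service could anchor on-chain."""
    return {
        "sha256": screenplay_fingerprint(data),
        "created_utc": datetime.now(timezone.utc).isoformat(),
    }


draft = b"FADE IN:\n\nINT. SCREENING ROOM - NIGHT\n"
record = timestamp_record(draft)
print(record["sha256"])
```

Changing a single byte of the draft produces a different digest, so a later copy can be checked against the anchored value without the original text ever being published.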

Read the WGA's position on AI and writers' rights

Explore the Data

METHODOLOGY MEETS MACHINE LEARNING

74,000 films. 4,800 scripts. 20 metrics. One dashboard.
