Organic Score: Calibration Data

Experimental

Empirical validation of the 5 signals used to compute the Organic Score. Corpus of 17 repos (11 healthy, 4 suspicious, 2 controls). Rebalanced twice: 2026-05-06 to add releases cadence (fork 40%→30%, ZF 55%→45%, releases 0%→20%); 2026-06-10 to add contributors breadth (fork 30%→25%, releases 20%→15%, contributors 0%→10%).

Last updated: June 2026

Signal Normalisation (0 → 100)

Signal	Gate	Thresholds	Weight
Fork / Star ratio	stars ≥ 5 000	≥ 10% → 100 · 7% → 50 · ≤ 2% → 0	25%
Watcher / Star ratio	always	≥ 0.5% → 100 · 0.1% → 50 · ≤ 0.01% → 0	5%
% zero-follower stargazers	sample ≥ 30	≤ 10% → 100 · 30% → 50 · ≥ 60% → 0	45%
Releases cadence	always	≥ 100 → 100 · 20 → 60 · 5 → 30 · 0 → 0	15%
Contributors / 1k stars	stars ≥ 5 000	≥ 3/1k → 100 · 2/1k → 80 · 1/1k → 50 · 0.5/1k → 25 · ≤ 0.2/1k → 0	10%

Best fit: 92% of repos correctly classified (healthy ≥ 70, suspicious ≤ 45). Calibrated 2026-05-06.

Corpus Results

Repo	Expected	Stars	Fork/★	Watch/★	Zero-fol.	Releases	Score	Tier	✓
pallets/flask	healthy	71 432	23.5%	2.9%	—	~140	100	Healthy	✅
langchain-ai/langchain	healthy	134 373	16.5%	0.6%	3.4%	~210	93	Healthy	✅
Significant-Gravitas/AutoGPT	healthy	183 636	25.2%	0.8%	—	~70	92	Healthy	✅
crewAIInc/crewAI	healthy	49 425	13.7%	0.7%	—	~55	92	Healthy	✅
langgenius/dify	healthy	138 645	15.7%	0.6%	3.9%	~320	92	Healthy	✅
agno-agi/agno	healthy	39 573	13.4%	0.6%	—	~100	91	Healthy	✅
mem0ai/mem0	healthy	53 711	11.2%	0.4%	—	~85	90	Healthy	✅
browser-use/browser-use	healthy	89 197	11.4%	0.5%	3.7%	~30	92	Healthy	✅
rtk-ai/rtk	healthy	32 308	5.8%	0.26%	7.4%	147	74	Healthy	✅
NousResearch/hermes-function-calling	healthy	1 292	—	1.4%	—	~5	68	Moderate	❌
yargs/yargs	healthy	11 471	8.9%	0.7%	—	~110	75	Moderate	✅
unionlabs/union	suspicious	74 134	5.2%	2.2%	—	~30	41	Suspicious	✅
shardeum/shardeum	suspicious	31 497	2.2%	0.9%	—	~10	8	Suspicious	✅
Anoma/anoma	suspicious	33 916	12.1%	0.6%	—	~20	91	Healthy	❌
langflow-ai/langflow	suspicious	147 213	6.0%	0.3%	—	~340	44	Suspicious	✅
sindresorhus/awesome	control	457 552	7.5%	1.8%	3.5%	—	70	Moderate	—
facebook/react	control	244 629	20.8%	2.7%	3.5%	~50	100	Healthy	—

Controls (awesome, react) are excluded from fit calculation. Anoma/anoma is an anomaly: fork/star looks healthy (12%) but known fraudulent by external sources.

Methodology vs. StarScout

StarMapper uses the 4 most accessible public signals. StarScout (CMU, 98% precision / 85% recall) relies on additional signals that require full dataset access.

Signal	StarMapper	StarScout	Notes
Fork / star ratio	✓ 25%	✓	Reduced 40%→30%→25%; fork/star penalises CLI tools with low fork rates by nature
% zero-follower stargazers	✓ 45%	partial	Strongest discriminator when sample size ≥ 30. Reduced slightly to make room for releases signal
Watcher / star ratio	✓ 5%	—	Weakly discriminating in practice, weight kept low
Releases cadence	✓ 15%	—	Total GitHub releases as proxy for active, maintained project. Reduced 20%→15% to make room for contributors
Contributors / 1k stars	✓ 10%	—	New signal (2026-06-10): community breadth proxy. Gated at ≥ 5 000 stars. Low ratio on large repos = engagement without real contributors
Clustering (account overlap across repos)	—	✓	Key signal in StarScout, requires full graph analysis
Temporal burst (stars in short window)	—	✓	Requires star timestamp history at scale
Account age + activity pattern	—	✓	Detects sophisticated fakes, not available from public API alone

StarMapper reaches ~92% accuracy on labelled corpus (weights: fork 25%, ZF 45%, watcher 5%, releases 15%, contributors 10%, 2026-06-10). StarScout reaches 98% precision using the full signal set. The gap is structural, not a calibration issue.

Caveats

Fork/star signal is gated at ≥ 5 000 stars. Below this threshold, the ratio is noisy on small repos.
CLI and developer tools (install via package manager, few forks) may have a lower fork/star ratio despite being organic. The fork signal is gated at ≥ 5 000 stars and its weight was reduced (70%→40%) to account for this.
Zero-follower signal requires ≥ 30 enriched users (users StarMapper has seen as stargazers). It is unavailable for repos not scanned on StarMapper.
Viral repos or niche communities (CLI tools, curated lists) may score lower despite being organic. The score reflects signals, not intent.
The score is not an accusation. Repos can score poorly due to community structure (e.g., crypto projects have high watcher counts but also many bot accounts).

Organic Score: Calibration Data

Signal Normalisation (0 → 100)

Corpus Results

Methodology vs. StarScout

Caveats

References