Subject contains Unicode lookalike character (homograph attack)
subject-lookalike-char-substitution
What this tier means
High-confidence threat indicator — phishing, impersonation, BEC, or scam pattern. Strong contributor to the trash decision.
How Gorganizer detects this
The subject line contains at least one character from a curated set of 21 Unicode codepoints that are visually indistinguishable from ASCII Latin letters: 14 Greek uppercase letters (Α Β Ε Ζ Η Ι Κ Μ Ν Ο Ρ Τ Υ Χ — U+0391–U+03A7) and 7 Cyrillic lowercase letters (а е о р с у х — U+0430–U+0445). This is a **single-character homograph substitution attack**: a spammer replaces exactly one character in a brand name — e.g. `Ρaypal` (Greek Rho, U+03A1, for P) or `Αmazon` (Greek Alpha, U+0391, for A) or `аccount` (Cyrillic а, U+0430, for a) — so the word passes the `subject-mixed-script-word` per-word check (which requires multiple script-mixing characters in the same token) while still defeating exact-string keyword filters that look for literal "paypal", "amazon", or "account". The subject is normalized to NFC before scanning so composed/decomposed variants produce identical results. Detection fires on ANY single occurrence of a lookalike codepoint. Guard: the signal is suppressed for emails from known-trusted sender types (`strict_invoice`, `strict_work`, `strict_shopping`) so that legitimate business notifications are not penalised for localised display names or Cyrillic characters in their own-brand content. Weight: +4 — matches `subject-mixed-script-word` because single-char substitution is equally attack-exclusive; no legitimate sender intentionally places Greek Rho where Latin P belongs.
False-positive guard
Every signal in Gorganizer feeds a multi-module score — never a sole verdict. This is a threat-tier signal — it adds a strong contribution to the trash score. The full pipeline still requires convergence across multiple modules + a margin over the safety floor before deletion happens, and Gmail's trash (30-day recovery) is always used — never permanent delete.
About the scoring engine
Gorganizer's scoring engine emits over 1,800 signals across six modules — headers, sender, subject, body, attachments, and structural metadata. Every email is scored by every module independently; the final verdict requires multiple modules to agree and the trash score to beat the safety floor by a margin.
Sacred safety guards — never delete starred emails, replies, calendar invites, receipts/invoices, or attachments — apply unconditionally regardless of any signal.
Ready to clean your inbox?
Gorganizer scans your Gmail with this signal and 1,800+ others, then cleans everything in one click. $4.99 one-time, no subscription.
Get started