Descriptions_staging
- Last updated
- Save as PDF
Security
Abnormal Account Behavior
Criminal & Illegal
- Child Abuse Images Sites
- Criminal Activity Sites
- Hacking Sites
- Illegal Drug Sites
- Piracy & Copyright Theft Sites
- Terrorism Sites
Cyber-Security
- Any DarkNet
- Botnets Sites
- Brand App Credential Leak
- Confidential Information
- Credential Leak
- Credentials for Sale
- Credit Card Leak
- Cryptocurrency Mining Sites
- Cyber-Threat Language
- DarkNet Forum
- DarkNet Market
- DarkNet Ransomware
- Email Leak
- Fraud Report Language
- Known Data Leak
- Malware & Compromised Links
- Non-Ascii Character Classifier
- Phishing & Fraud Links
- Potential Doxing
- Spam
- Spam Commenter
- Spam Sites
- Spyware & Questionable Software Sites
- Suspicious Links
Physical Security
- General Hazard Language
- Gun Threat Language
- Physical Threats
- Protest Language
- Public Safety Language
- Self-Harm Language
- Violence Sites
- Weapons Images
Compliance
Corporate Compliance & Confidential Information
Cross-Industry Compliance Standards
Financial Services Compliance Standards
- Financial Promotions
- Financial Related Complaints
- Financial Testimonials
- Housing Discrimination
- IIROC
- Lending Descrimination
- Lending Risks
- Misleading Financial Communications
- Potentially Missing Link
- Promissory Financial Statements
- Promissory Images
- RESPA
- Truth in Lending
- Truth in Savings
- Unfair or Deceptive Advertising
Insurance Services Compliance Standards
- General Investment Terms
- Health Insurance Terms
- Insurance Legal Matters
- Insurance Terms
- Lending Disclosures Risk
- Life Insurance Terms
- Property and Casualty Terms
Legal Services Compliance Standards
Life Sciences Compliance Standards
Other Noteworthy Activity
Regulated Data
- Address
- CUSIP Numbers
- Canadian SIN
- Credit Card Numbers
- EMEA Passport Numbers
- Individual Taxpayer Numbers
- International Bank Account Numbers
- PHI
- PII
- Phone Numbers
- SSN
- SWIFT-BIC Codes
- US Driver Licenses
- US Passport Numbers
- Usernames & Passwords
Trademark Infringement
- Common Trademark Violations
- MLB Trademark Violations
- MLS Trademark Violations
- NBA Trademark Violations
- NFL Trademark Violations
- NHL Trademark Violations
Acceptable Use
Abuse & Hate
Adult
- Adult Language
- Alcohol & Tobacco Sites
- Gambling Sites
- Lingerie, Suggestive & Pinup Sites
- Nudity Sites
- Potential Nudity Images
- R-Rated Sites
- Strong Profanity
Controversial Topics
- Cult Sites
- Fake News Sites
- Political Sites
- Political Terms
- Religious Sites
- Reported Bullying
- Sex Education Sites
- Weapons Sites
Institutional Compliance
Pornography
Social Activity
LinkedIn Activity
Other
Entertainment
- Arts Sites
- Dating & Personal Sites
- Entertainment Sites
- Fashion & Beauty Sites
- Games Sites
- Greeting Card Sites
- Leisure & Recreation Sites
- Restaurants & Dining Sites
- Shopping Sites
- Sports Sites
- Travel Sites
Finance
General
- Any URL
- Business Sites
- Dead Link
- Domain Fraud
- General Interest Sites
- Job Search Sites
- News Sites
- No URL
- Personal Sites
- Product Support Terms
- Promotions
- Real Estate Sites
- Transportation Sites
Health
Images, Videos, & Documents
- Image Sharing Sites
- Natively Uploaded Document
- Streaming Media & Download Sites
- Streaming and Download Language
- Uploaded Video
Internet Communication
- Chat Sites
- Instant Messaging Sites
- Peer-to-Peer Sites
- Social Networking Sites
- Torrent Repository Sites
- Web-based Email Sites
Internet Infrastructure
Language Identification
- Chinese Language Identified
- French Language Identified
- Japanese Language Identified
- Korean Language Identified
- Non-English Language Identified
- Portugese Language Identified
- Spanish Language Identified
- Vietnamese Language Identified
Public Sector
Technology
Brand App Credential Leak
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Inbound
- Entities: Brand
- Data Source(s): Twitter, Darknet
- UI Element(s): None
Description
This classifier triggers on credentials that belong to a Brand's app or website.
Use Cases and Positive Examples
(All Languages)
Url With Credentials To Brand App
http://username:p&ssw0rd@login.bizco.com
Credit Card Leak
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Inbound
- Entities: People, Brand
- Data Source(s): Twitter, Darknet
- UI Element(s): None
Description
This classifier detects valid credit card numbers.
Use Cases and Positive Examples
(All Languages)
Luhn-Validated Number With Context Words
How about the amex linked to my BizCo account? 378282246310005
378282246310005 ccv 393
Luhn-Validated Number With Spacings
Hacked this one yesterday: 4111 1111 1111 1111
Credentials for Sale
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: People, Brand
- Data Source(s): Twitter, Darknet
- UI Element(s): None
Description
This classifier detects the actual sale of credentials or the promotion of sale of credentials.
Use Cases and Positive Examples
(All Languages)
Credentials For Sale Phrases
Hacked BizCo accounts 4 sale here.
Potential Doxing
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: People
- Data Source(s): Twitter, Darknet
- UI Element(s): None
Description
This classifier detects the potential practice of researching and broadcasting private or identifying information (especially personally identifying information) about an individual.
Use Cases and Positive Examples
(All Languages)
Doxxing Phrase With Canadian Sin
Use this #dox Simba, his SIN: 130 692 544
Doxxing Phrase With Credit Card
Swipe these card digits to get back at Simba: 378282246310005 #dox
Doxxing Phrase With Date Of Birth
Doxx him, Simba, dob: 10-01-99
Doxxing Phrase With Email Address
#doxxing Simba's email address is cantwaittobeking@priderock.com
Doxxing Phrase With Emea Passport
Want to dox Simba and travel the world? Try: GBR 107192637
Doxxing Phrase With Iban
Try doxxing Simba with this info: MD5582438851912756964355
Doxxing Phrase With Itin
Looking to dox Simba during tax season? Try 998-80-3984
Doxxing Phrase With Mailing Address
I'll pay someone to dox the 'great' Simba: 509 5th Ave
Doxxing Phrase With Ssn
535-34-1493, use it for doxxing Simba
Doxxing Phrase With Url
Click here to dox Simba: http://scar.hyena.headquarters.com
Doxxing Phrase With Us Driver License
Dox Simba with CA license: WKF135791
Doxxing Phrase With Us Passport
Use passport 192483953 to #dox Simba!
Doxxing Phrase With Us Phone
Dox for Simba! Phone: 9093842243
Non-Ascii Character Classifier
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
This classifier detects suspicious mixed script tokens, with English as a reference language
Use Cases and Positive Examples
(All Languages)
Suspicious Non Ascii Language
????l1ck h3r3 for a ?????lls ???oyc3
I ????????????'???? ???????????????????????????? ???????????? ???? ???????????????? ???????????????? ???????????????? ???? ???????????????????????????? ???????? ???????????????????????????? ???????????????? ???????????????? ????????????????????????????????
?????s Suha ??s ??? ?????????? ???????? ?????????????????????????? ?????????????????? ???????? ????????? ??????
???? ???????????????????????????????? $300 ???????????? ???????????? ???? ???????????????????????? ???????? $3,150 ???????? ???????????????????????????? ???????????? 3????????????????.
???? ???????????????????????????? ???????????????? ???????? ???????????????????????????????????????? ???????????????????????????? ???????? ???????????????? $???????????? ???????????? ???????????? ???? ???????????????? ???????????????????????? ???????????????? ???? ???????????
Cyber-Threat Language
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Account, Brand
- Data Source(s): Linkedin, Facebook, Youtube, Twitter, Darknet
- UI Element(s): None
Description
Security teams at large corporations do not have the bandwidth or resources to keep up-to-date with the latest threats to their organizations. Sometimes, information related to such threats are posted on social media by whitehats and very rarely blackhats. The goal of the cyber threat chatter classifier is to curate a brand-specific cybersecurity-related social media feed for security teams to consume.
Use Cases and Positive Examples
(All Languages)
Cyber Attack Phrases
New ransomware discovered to infiltrate BizCo networks.
Cyber Breach Phrases
BizCo api keys found in several chat rooms last month.
Cyber Malware Phrases
Emotet poised to exploit zerodays on certain BizCo endpoints.
Cyber Vulnerabilities Phrases
AMD chips released last year have critical vulnerability that affects BizCo servers.
Confidential Information
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Inbound
- Entities: Brand, People
- Data Source(s): Twitter, Darknet
- UI Element(s): None
Description
This classifier detects private and personally identifying information for People Entities.
Use Cases and Positive Examples
(All Languages)
Canadian Sin
Here is the sin number: 130 692 544
Credit Card
378282246310005 | CCV 555 | 90429
Emea Passport
GBR 107192637
Iban
MD5582438851912756964355
Itin
Please file it under tax number 998803984
It is 998-80-3984.
Ssn
My social is 535341493.
It is 535-34-1493.
Us Driver License
drivers license: WKF135791
Us Passport
The passport#: 192483953
The passport card reads C92483953
Credential Leak
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis + Phrase Analysis
- Directionality: Inbound
- Entities: Brand, People
- Data Source(s): Twitter, Darknet
- UI Element(s): None
Description
This classifier detects content which exposes email and password of an individual or organization. Only credentials where the username is an email address will be detected.
Use Cases and Positive Examples
(All Languages)
Credential Dump With Email Address
okaygo@gmail.com:passw0rd user@bizco.com:p@ssword peter.parker@sony.com:sp1dey
Nearby Password With Email Address
Email: user@bizco.com Password: p@ssword For more click below.
Url Credential With Email Address
... https://highflyer@bizco.com:go@lltheway@accesscash.com ...
Email Leak
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis + Phrase Analysis
- Directionality: Inbound
- Entities: Brand, People
- Data Source(s): Twitter, Darknet
- UI Element(s): None
Description
This classifier detects content which exposes email (that is not part of a credential pair) of an individual or organization.
Use Cases and Positive Examples
(All Languages)
Email Address
... high.flyer@bizco.com ...
Fraud Report Language
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
This classifier detects language related to fraud, scams, and identity theft
Use Cases and Positive Examples
(All Languages)
Fraud Report Language
I totally got scammed by that guy over Venmo
Ignore that impersonator account - it's not me!
Spam
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Behavior Analysis, Content Pattern Analysis, Phrase Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The classifier then extracts possible spam signatures from posts. When a post appears that has some suspicious characteristics (see below) we extract its spam signatures, and then test if any of them exist in our spam signatures database. If a signature already exists in the database, we return that the post is spam, and the policy will detect it. If not, we insert the signature into the database, but return that the post is not spam. The signature insertion process happpened in in real-time historically, but now is run periodically.
Use Cases and Positive Examples
(All Languages)
General
Click here to buy 1K followers!
Gun Threat Language
Categorization
- Category: Security
- Sub-Category: Physical Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Brand, Account, Location
- Data Source(s): Linkedin, Twitter, Facebook, Youtube, Darknet
- UI Element(s): None
Description
This classifier detects reports of past or current gun violence threats.
Use Cases and Positive Examples
(English)
Incident Language And Strong Gun Reference
Reported shooter last night near Sam's.
Strong Gun Threats
Shooter at large, currently on campus.
Strong Incident Language And Gun Reference
Atlanta police are investigating gunshots heard at the corner of 5th and 6th.
General Hazard Language
Categorization
- Category: Security
- Sub-Category: Physical Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Brand, Account, Location
- Data Source(s): Linkedin, Twitter, Facebook, Youtube, Darknet
- UI Element(s): None
Description
This classifier detects language that indicates a real-time report of a general hazard.
Use Cases and Positive Examples
(English)
Flooding
Floodwaters are reaching parts of Upper West End.
Gas Leak
The gas leak led to a small explosion that charred the length of Broadstone Ave.
General Emergency Language
Reported paramedic activity near the Arby's.
Power Outage
Downed power lines across much of North Hyde Drive.
Public Safety Language
Categorization
- Category: Security
- Sub-Category: Physical Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Brand, Account, Location
- Data Source(s): Linkedin, Twitter, Facebook, Youtube, Darknet
- UI Element(s): None
Description
This classifier detects language indicating a report of a public safety issue in a location.
Use Cases and Positive Examples
(English)
Earthquakes
There was an 6.0 M earthquake today in San Jose!
Evacuation Alerts
At 6pm this evening, the Governor issued an evacuation order for all costal counties.
Incident Language And Crime Reference
Atlanta police are investigating a break-in at the corner of 5th and 6th.
Riots Or Looting
Rioters are moving slowly downtown towards the courthouse.
Severe Storm
A NOAA warning has been issued for several counties.
Shelter In Place Alerts
Local shops and storefronts are sheltering in place until the all clear.
Terrorism
A bombing has been reported as an act of terrorism.
Wildfires And Forestfires
Firefighters attempt to contain the wildfire near San Francisco.
Urban Fires
The I-40 highway between exits 140 and 141 is shut down due to a vehicle fire.
Physical Threats
Categorization
- Category: Security
- Sub-Category: Physical Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Bidirectional
- Entities: Account, Brand, Location, People
- Data Source(s): Twitter, Darknet, Facebook, Linkedin, Youtube
- UI Element(s): None
Description
This classifier detects language that indicates intent to physically harm or commit violence such as death threat, bomb threat and arson
Use Cases and Positive Examples
(English)
General Threat Language
I know where you live.
Hope Of Future Violence
I really hope someone kills you.
I dream about someone kidnapping and stabbing you.
Threat Of Future Violence
Imma stab you.
Im gonna punch Harry in the mouth.
Threat Of Non-Personal Violence
We gunna bomb the school.
Self-Harm Language
Categorization
- Category: Security
- Sub-Category: Physical Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Twitter, Linkedin, Facebook
- UI Element(s): None
Description
This classifier detects language indicating harm to self.
Use Cases and Positive Examples
(All Languages)
Hopeless And Distress
I wish I was never born.
General Intent And Urgency
I want to die.
Intent And Urgency With Mention Of Means
How do I commit suicide?
Firearms
I want to blow my brains out.
Suffocation
Imma put a bag over my head.
Overdose
I could try to overdose on xanax.
Cutting
Watch me cut myself.
Jumping
I think I'll jump off a bridge.
Protest Language
Categorization
- Category: Security
- Sub-Category: Physical Security
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: People, Brand, Account, Location
- Data Source(s): Twitter, Darknet
- UI Element(s): None
Description
This classifier detects language indicating current or planned protest activity.
Use Cases and Positive Examples
(English)
Protest Language
Let's march on the town square and demand change.
(French)
Protest Language
Madame, regarde ??a! piquet de gr??ve!
Account Hack Language
Categorization
- Category: Security
- Sub-Category: Abnormal Account Behavior
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
Brand content using language commonly used by popular hacker groups after an account hack.
Use Cases and Positive Examples
(All Languages)
Common Language Post-Hack
Syrian electronic army was here.
Product Support Terms
Categorization
- Category: Other
- Sub-Category: General
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): None
Description
This classifier triggers when content contains common product support terms and phrases.
Use Cases and Positive Examples
(All Languages)
Product Support Terms
I understand your frustration. I'd like to look into this further for you.
I hope to resolve this for you as soon as possible.
Please give us a call to let us know your feedback.
Spam Commenter
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Text Analysis
Details
- Methodology: Time Series Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The Spam Commenter Classifier detects Commenter accounts that repeatedly post spam (i.e. "spammy" Commenters). While a Commenter is Classified as "spammy", all content posted by that Commenter meeting minimum length requirements will trigger the Classifier and associated Content Policy Actions (e.g. Notify, Delete, etc.).< A Commenter is considered "spammy" if, in the past year ...< - The account posted spam at least 5 times and at least 30% of their posts are classified as spam, or< - The account posted spam content at least 2 times and at least 50% of their total posts are classified as spam< The Spammy Commenter list is automatically updated once a week.< A content item must meet the following two requirements to be considered in the calculations above: it must have at least 6 words AND at least 5 of the words have at least 4 characters.
Use Cases and Positive Examples
Spanish Language Identified
Categorization
- Category: Other
- Sub-Category: Language Identification
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The main language of the content has been identified as Spanish.
Use Cases and Positive Examples
(All Languages)
Spanish
A los tontos no les dura el dinero.
French Language Identified
Categorization
- Category: Other
- Sub-Category: Language Identification
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The main language of the content has been identified as French.
Use Cases and Positive Examples
(All Languages)
French
Chacun voit midi ?? sa porte.
Japanese Language Identified
Categorization
- Category: Other
- Sub-Category: Language Identification
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The main language of the content has been identified as Japanese.
Use Cases and Positive Examples
(All Languages)
Japanese
??????????????????????????????????????????
Korean Language Identified
Categorization
- Category: Other
- Sub-Category: Language Identification
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The main language of the content has been identified as Korean.
Use Cases and Positive Examples
(All Languages)
Korean
????????? ??? ?????????.
Non-English Language Identified
Categorization
- Category: Other
- Sub-Category: Language Identification
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The main language of the content has been identified as not English.
Use Cases and Positive Examples
(All Languages)
Chinese
???????????????????????????
French
Chacun voit midi ?? sa porte.
Japanese
??????????????????????????????????????????
Korean
????????? ??? ?????????.
Portuguese
Cada macaco no seu galho.
Spanish
A los tontos no les dura el dinero.
Vietnamese
?????t nh?? chu???t l???i.
Portugese Language Identified
Categorization
- Category: Other
- Sub-Category: Language Identification
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The main language of the content has been identified as Portuguese.
Use Cases and Positive Examples
(All Languages)
Portuguese
Cada macaco no seu galho.
Vietnamese Language Identified
Categorization
- Category: Other
- Sub-Category: Language Identification
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The main language of the content has been identified as Vietnamese.
Use Cases and Positive Examples
(All Languages)
Vietnamese
?????t nh?? chu???t l???i.
Chinese Language Identified
Categorization
- Category: Other
- Sub-Category: Language Identification
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The main language of the content has been identified as Chinese.
Use Cases and Positive Examples
(All Languages)
Chinese
???????????????????????????
Stock Tickers
Categorization
- Category: Other
- Sub-Category: Finance
- Type: Text Analysis
Details
- Methodology: Lexical + Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier detects the presence of stock tickers in content. Current exchanges covered are:
- NYSE
- NASDAQ
- CSE
- TSX
- OTC (OTCQX, OTCQB, Pink Open Market) for USA and Canada stocks
In addition to crypto symbols and exchange traded funds (ETF's).
Use Cases and Positive Examples
(All Languages)
Decorated Ticker Reference Lowercase
Better buy the $gme dip.
Decorated Ticker Reference Uppercase
Sky's the limit with $GME!
Ticker Reference With Context Phrase
Sky's the limit with GME stock!
Promotions
Categorization
- Category: Other
- Sub-Category: General
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier matches on text that indicates sales promotions.
Use Cases and Positive Examples
(All Languages)
Promotional Language
25% off clearance sale today on all inventory.
Streaming and Download Language
Categorization
- Category: Other
- Sub-Category: Images, Videos, & Documents
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier matches on text containing streaming or download language.
Use Cases and Positive Examples
(All Languages)
Streaming Download Phrase With Link
Here's the full movie: http://torrent.gladiator.com
Mergers & Acquisitions
Categorization
- Category: Compliance
- Sub-Category: Corporate Compliance & Confidential Information
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This policy attempts to detect dissemination of confidential information about mergers and acquisitions. It only triggers if there is no URL present.
Use Cases and Positive Examples
(English)
Mergers And Acquisitions Language
Company is acquired by another company with a cash for stock deal
Mergers And Acquisitions Rumors
There have been rumors of an acquisition.
Named Company Phrases
Amazon to acquire Target!
Earnings & Financial Updates
Categorization
- Category: Compliance
- Sub-Category: Corporate Compliance & Confidential Information
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier looks for detections of announcements phrases about record-breaking sales, subscribers, purchases, or similar.
Use Cases and Positive Examples
(English)
Bookings Announcements
Our bookings have an all-time high!
Confidential Earnings Phrase
Some turbulence ahead of this initial public offering.
Check out the inside scoop on earnings before anybody else does! DM us for more information.
Direct Earnings Phrase
First quarter results are higher than expected.
Investment Earnings Phrase
The revenue per share is gonna jump.
Full Disclosure Risk
Categorization
- Category: Compliance
- Sub-Category: Cross-Industry Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier detects inside information by publicly traded companies and other issuers.
Use Cases and Positive Examples
(English)
Bookings Announcements
Our bookings have an all-time high!
Confidential Earnings Phrase
Some turbulence ahead of this initial public offering.
Check out the inside scoop on earnings before anybody else does! DM us for more information.
Direct Earnings Phrase
First quarter results are higher than expected.
Investment Earnings Phrase
The revenue per share is gonna jump.
Layoffs & Restructuring
Categorization
- Category: Compliance
- Sub-Category: Corporate Compliance & Confidential Information
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This policy identifies posts containing information about layoffs and restructuring.
Use Cases and Positive Examples
(English)
Layoff Language
They just announced layoffs effective for multiple departments.
A company reorg will take place later this month.
Vacated Position
He'll be stepping down as CEO.
Phone Numbers
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): None
Description
This classifier detects the presence of US phone numbers in content.
Use Cases and Positive Examples
(All Languages)
Delineated International Phone Number
Try this line: +49 901 4498 893.
Delineated Uk Phone Number
44 102.1436.234 is the one to use.
(019467) 14413 is the best for the evening hours.
Delineated Us Phone Number
(541) 754-3010 is the one to use.
Non-Delinated International Phone Number With Phrase
Here's my cell, +34215325353.
Non-Delinated Us Phone Number With Phrase
Here's my cell, 15417543010.
CUSIP Numbers
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Inbound
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): None
Description
This classifier detects the presence of CUSIP Numbers in content.
Use Cases and Positive Examples
(All Languages)
Cusip Number
I believe that Google's CUSIP is 38259P508
SSN
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): Mask
- Match Phrase: Yes
Description
This classifier detects the presence of SSN in content.
Use Cases and Positive Examples
(All Languages)
Social Security Number With Context Phrases
Here's my ssn: 535341493.
Social Security Number With Spacings
These digits are probably sensitive: 535-34-1493.
International Bank Account Numbers
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Mask
- Match Phrase: Yes
Description
This classifier detects the presence of IBAN in content.
Use Cases and Positive Examples
(All Languages)
Iban With No Spaces
Here it is, IL143698381678293529782.
Iban With Spaces
Please send it to DE91 1000 0000 0123 4567 89.
Address
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier detects the presence of mailing addresses in content.
Use Cases and Positive Examples
(All Languages)
Street Address
I live at 123 Main St.
Individual Taxpayer Numbers
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Mask
- Match Phrase: Yes
Description
This classifier detects the presence of ITIN in content.
Use Cases and Positive Examples
(All Languages)
Delineated Itin
My ITIN is 900-72-1111.
Non-Delineated Itin
His tax number is 904711295.
US Driver Licenses
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Inbound
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Mask
- Match Phrase: Yes
Description
This classifier detects the presence of US Driver Licenses in content.
Use Cases and Positive Examples
(All Languages)
Licenses
Can you use my drivers license number? 748938930
My drivers license, 198390498, isn't used for anything else, so let's use that number please."
Canadian SIN
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier detects the presence of Canadians SIN numbers in content.
Use Cases and Positive Examples
(All Languages)
Luhn-Validated Candian Sin
Here is the sin number: 130 692 544
Credit Card Numbers
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Textual Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): Mask
- Match Phrase: Yes
Description
This classifier detects the presence of credit card numbers in content.
Use Cases and Positive Examples
(All Languages)
Luhn-Validated Number With Context Words
How about the amex linked to my BizCo account? 378282246310005
378282246310005 ccv 393
Luhn-Validated Number With Spacings
Try this one: 4111 1111 1111 1111
Usernames & Passwords
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): Mask
- Match Phrase: Yes
Description
This classifier detects the presence of username and password pairs in content.
Use Cases and Positive Examples
(All Languages)
Password Implied From Context
Here's the pwd: abc123.
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): None
Description
This classifier detects the presence of email addresses in content.
Use Cases and Positive Examples
(All Languages)
Email Address
Contact me at socialpatrol@gmail.com
PII
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier detects the presence of PII in content (excluding private messages).
Use Cases and Positive Examples
(All Languages)
Non-Specific Potentially Private Digits
The account# is 450912045.
Phi Flagged
Medical record number: A395929559
EMEA Passport Numbers
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): None
Description
This classifier detects the presence of EMEA passports in content.
Use Cases and Positive Examples
(All Languages)
Emea Passport
Passport no. Qf674456
US Passport Numbers
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Mask
- Match Phrase: Yes
Description
This classifier detects the presence of US passports in content.
Use Cases and Positive Examples
(All Languages)
Us Passport
Passport 110250822
PHI
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier detects the presence of PHI in content.
Use Cases and Positive Examples
(All Languages)
Implied Phi Number By Context Phrases
Medical record number: A395929559
Private Health Complications
Ah, he was just diagnosed with the flu.
SWIFT-BIC Codes
Categorization
- Category: Compliance
- Sub-Category: Regulated Data
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account, Brand, Location
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Mask
- Match Phrase: Yes
Description
This classifier detects the presence of SWIFT-BIC Numbers in content.
Use Cases and Positive Examples
(All Languages)
Swift-Bic
Trying to transfer money to DEUTDEFF500.
Financial Related Complaints
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
FINRA Rule 4530(b) requires that a financial firm or an associated person of the firm track and archive written customer complaints. For purposes of this Rule, "customer complaint" means any grievance by a customer or any person authorized to act on behalf of the customer involving the activities of the member or a person associated with the member in connection with the solicitation or execution of any transaction or the disposition of securities or funds of that customer.
Guidance: https://www.finra.org/
Such an event must be reported to FINRA not later than 30 calendar days after the event. Furthermore, the firm shall report to FINRA statistical and summary information regarding written customer complaints in such detail as FINRA shall specify by the 15th day of the month following the calendar quarter in which customer complaints are received by the member.
Letter from the FINRA CEO regarding this guidance: https://www.finra.org/
Use Cases and Positive Examples
(English)
Customer Complaint With Financial Term
I want my money back right now! The worst service.
Strong Financial Complaint
I was charged in excess.
Housing Discrimination
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The Fair Housing Act prohibits discrimination based on race, color, national origin, religion, sex, familial status, or handicap in the sale and rental of housing, in mortgage lending, and in appraisals of residential real property. In addition, the FHA makes it unlawful to advertise or make any statements that indicate a limitation or preference based on race, color, national origin, religion, sex, familial status, or handicap.
Guidance: https://nationalfairhousing.org/
Use Cases and Positive Examples
(English)
Discriminatory Housing Statement
Only Jewish people live in that neighborhood.
Lending Descrimination
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The Equal Credit Opportunity Act, as implemented by Regulation B, prohibits creditors from making any oral or written statement, in advertising or other marketing techniques, to applicants or prospective applicants that would discourage on a prohibited basis a reasonable person from making or pursuing an application. However, a creditor may affirmatively solicit or encourage members of traditionally disadvantaged groups to apply for credit, especially groups that might not normally seek credit from that creditor.
Furthermore, when denying credit, a creditor must provide an adverse action notice detailing the specific reasons for the decision or notifying the applicant of his or her right to request the specific reasons for the decision. This requirement applies whether the information used to deny credit comes from social media or other sources.
It is also important to note that creditors may not, with limited exceptions, request certain information, such as information about an applicant's race, color, religion, national origin, or sex.
Press release regarding guidance as it relates to social media: https://www.ffiec.gov/
Use Cases and Positive Examples
(English)
No Reason Given For Denial Of Credit
You loan application was denied.
Soliciting Potentially Discimrinatory Info When Applying For Credit
We need ask for the application for credit. How old are you?
Truth in Savings
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The Truth in Savings Act (TISA), as implemented by Regulation DD, and, for credit unions, by Part 707 of the NCUA Rules and Regulations, imposes disclosure requirements designed to enable consumers to make informed decisions about deposit accounts. Regulation DD and Part 707 require disclosures about fees, annual percentage yield (APY), interest rate, and other terms. Under Regulation DD and Part 707, a depository institution may not advertise deposit accounts in a way that is misleading or inaccurate or misrepresents the depository institution's deposit contract.< If an electronic advertisement displays a triggering term, such as "bonus" or "APY," then Regulation DD and Part 707 require the advertisement to clearly state certain information, such as the minimum balance required to obtain the advertised APY or bonus. For example, an electronic advertisement can provide the required information via a link that directly takes the consumer to the additional information.< A press release regarding guidance as it relates to social media: https://www.ffiec.gov/
Use Cases and Positive Examples
(English)
Missing Details Related To Savings Account
Get some bonus cash with this new account offer.
Wonderful opportunity to get a free checking account!
Misleading Financial Communications
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The United Kingdom's Financial Conduct Authority (FCA) Conduct of Business Sourcebook (COBS) 4.2 requires financial promotions to be fair, clear, and not misleading. A financial promotion is an advertisement for a financial product.
The relevant subsections of the compliance code are subsections 4 and 5. Subsection 4 enumerates specific instances that require caution under this rule, where the advertised product:
- Puts the client's capital at risk.
- Has short and long term prospects.
- Has a complex charging structure.
- Includes products not produced by the firm.
Guidance: https://www.fca.org.uk
Subsection 5 delineates that a firm cannot use "guaranteed", "protected" or "secure", or use a similar term unless they can explain why the use of such a term is fair, clear, and not misleading. Social media communications pose challenges to authors trying to abide by this regulation because of the restricted character limit. Oftentimes, the author will include a link to additional information\
that renders the otherwise uncompliant financial promotion to be fair, clear, and not misleading.
Additionally, authors can post content from their social media accounts as a standalone message or post content as a reply in the context of a conversation.
Further guidance: https://www.fca.org.uk.
Use Cases and Positive Examples
(English)
Contact Phrase With Financial Term
Chat with us to get a loan.
Want to get a loan? Chat with us!
Strong Misleading Financial Phrases
You could work with a professional like me.
Strong Sentiment With Financial Term
Our loans are fantastic! http://loanoffer.com.
Fantastic loan offer! http://loanoffer.com.
IIROC
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The Investment Industry Regulatory Organization of Canada (IIROC) is the national self-regulatory organization which oversees all investment dealers and trading activity on debt and equity marketplaces in Canada. IIROC Rule 3600 is analogous to FINRA Rule 2210; therefore, the same set of rules powers both classifiers. Please refer to the FINRA Retail Communications classifier for more background.
Guidance: https://www.iiroc.ca/
Use Cases and Positive Examples
(English)
Promissory Financial Guidance
I can definitely recommend bitcoin!
Promissory Language With Financial Term
I can promise to get high profit from your investments.
You can expect a high interest vehicle from me.
Strong Financial Promissory Language
I promise market beating returns.
Promissory Financial Statements
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
FINRA Rule 2210 governs broker dealers' communications with the public, including both retail communication and communication with institutional investors; all social media interactions fall into the retail communication category. Such communications must:
- Be based on principles of fair dealing and good faith, must be fair and balanced.
- Not be false, exaggerated, unwarranted, promissory or misleading.
- Not omit any material fact or qualification if the omission, in light of the context of the material presented, would cause the communications to be misleading.
- Provide balanced treatment of risks and potential benefits, consistent with the risks of fluctuating prices and the uncertainty of dividends, rates of return and yield inherent to investments.
Guidance: https://www.finra.org/
Use Cases and Positive Examples
(English)
Promissory Financial Guidance
I can definitely recommend bitcoin!
Promissory Language With Financial Term
I can promise to get high profit from your investments.
I will secure a high interest vehicle for you.
Strong Financial Promissory Language
I promise market beating returns.
Lending Disclosures Risk
Categorization
- Category: Compliance
- Sub-Category: Insurance Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The same set of rules power this classifier and Truth In Lending. Please refer to that documentation page for more background.
Use Cases and Positive Examples
(English)
Missing Details Related To A Loan
Let's get you a student loan then.
This is a steal of an APR; don't miss!
Missing Details Irrelevant Url
https://google.com Zero transaction fees apply now!
Truth in Lending
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
Advertisements relating to credit must present certain Information in a clear and conspicuous manner. It includes requirements regarding the proper disclosure of the annual percentage rate and other loan features. If an advertisement for credit states specific credit terms, it must state only those terms that actually are or will be arranged or offered by the creditor.
For electronic advertisements, such as those delivered via social media, Regulation Z permits providing the required information on a table or schedule that is located on a different page from the main advertisement if that table or schedule is clear and conspicuous and the advertisement clearly refers to the page or location.
Regulation Z requires that, for consumer loan applications taken electronically, the financial institution must provide the consumer with all Regulation Z disclosures within the required time frames. Regulation Z does not exempt applications taken via social media.
Use Cases and Positive Examples
(English)
Missing Details Related To A Loan
Let's get you a student loan then.
This is a steal of an APR; don't miss!
Missing Details Irrelevant Url
https://google.com Zero transaction fees apply now!
Unfair or Deceptive Advertising
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
Unfair, Deceptive, or Abusive Acts or Practices (UDAAPs) can cause significant financial injury to consumers, erode consumer confidence, and undermine the financial marketplace. Under the Dodd-Frank Act, it is unlawful for any provider of consumer financial products or services or a service provider to engage in any unfair, deceptive or abusive act or practice.
Guidance: https://www.cfpaguide.com/
Examples include:
- Misrepresentation about loan terms. If "3.5% fixed payment 30-year loan" is mentioned, but the actual mortgage offers is adjustable rate, then the statement is in violation.
- Inadequate disclosure of material lease terms in television advertising. If "no money down" or "$0 down" are mentioned in a statement, but more material costs were not disclosed, then the statement is in violation.
Use Cases and Positive Examples
(English)
Coercisve Financial Demands
Don't concern yourself with the details of this stock option.
Vauge Targeted Financial Demands
You must authorize it right now to see return on investment.
Financial Testimonials
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier detects testimonials language recommending or promoting or praising financial advisors or institutions.
Use Cases and Positive Examples
(English)
Strong Testimonial Phrase
I highly recommend Derrick!
Testimonial Phrase With A Financial Term
We are very happy with how our investments are growing.
Lending Risks
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier flags on language associated with mortgages that impiles lack of due diligence in the underwriting process.
Use Cases and Positive Examples
(English)
Implication Of No Due Diligence
All you need is 20% down for a conventional loan.
First time home buyers EZ to qualify.
We'll get you a line of credit with zero down.
RESPA
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
RESPA seeks to reduce unnecessarily high settlement costs by requiring disclosures to homebuyers and sellers, and by prohibiting abusive practices in the real estate settlement process. All borrowers must be given information about real estate transactions, settlement services, and relevant consumer protection laws, as well as the possibility of mortgage servicing being transferred. Borrowers are entitled to initial and annual escrow account statements, as well as itemized statements of actual settlement costs. RESPA outlaws kickbacks, referral fees, and unearned fees, prohibits sellers from requiring borrowers to purchase title insurance from specific companies, and does not allow loan servicers to require excessively large escrow accounts.
Guidance: https://files.consumerfinance.gov/
HUD's summary of RESPA: https://www.hud.gov/
Use Cases and Positive Examples
Potentially Missing Link
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This triggers when a url link is suspected to be missing.
Use Cases and Positive Examples
(English)
Phrases Implying A Link That Is Missing
Top 10 reasons to get health insurance:
Financial Promotions
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The United Kingdom's Financial Conduct Authority (FCA) Conduct of Business Sourcebook (COBS) 8.22 governs the inclusion financial promotions in outbound communications. A financial promotion is defined as an invitation to engage in investment activity or to engage in claims management activity that is communicated in the course of business.
Link to handbook: https://www.handbook.fca.org.uk/
Use Cases and Positive Examples
(English)
Promoting A Financial Learning Session
Register to learn more about trading bonds.
Register for the financial seminar!
Sponsored Post Risk
Categorization
- Category: Compliance
- Sub-Category: Cross-Industry Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This policy looks for sponsored posts.
Use Cases and Positive Examples
(English)
Usage Of A Sponsored Hashtag
By the way, this post is #spon
Material Connections Risk
Categorization
- Category: Compliance
- Sub-Category: Cross-Industry Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
Triggers on language that indicates the lack of disclosing material connections shown between advertisers and endorsers.
Use Cases and Positive Examples
(English)
General
Shown results not typical.
Sweepstakes Disclosure Risk
Categorization
- Category: Compliance
- Sub-Category: Cross-Industry Compliance Standards
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
Hosting a lottery as a non-governmental agency is illegal. A giveaway is considered a lottery when:
- The giveaway offers one or more prizes of value,
- The winners of the giveaway are chosen at random, and
- The entry requires a payment of money or other consideration.
The word "consideration" is used loosely to cover anything that is directly or indirectly of value to the company, monetary or otherwise.
FTC guidelines for social media contests: https://www.ftc.gov/tips-advice/business-center/guidance/ftcs-endorsement-guides-what-people-are-asking#socialmediacontests
Use Cases and Positive Examples
(English)
Sweepstakes Language
Enter your name to win this free iPad!
Spam Complaints
Categorization
- Category: Compliance
- Sub-Category: Other Noteworthy Activity
- Type: Text Analysis
Details
- Methodology: Machine Learning
- Directionality: Inbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier detects when content is reporting the incident of spam.
Use Cases and Positive Examples
(English)
Reporting Of Spam In Other Content
There has been a lot of spam getting through recently.
They should do more to delete these spam comments.
Customer Complaints
Categorization
- Category: Compliance
- Sub-Category: Other Noteworthy Activity
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook
- UI Element(s): None
Description
Matches on text regarding general complaints about a company or the service from its employees.
Use Cases and Positive Examples
(English)
Negative Sentiment Towards A Company
This is the worst company ever!
Legal Discussions
Categorization
- Category: Compliance
- Sub-Category: Legal Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
The American Bar Association Model Rules of Professional Conduct contain specific restrictions on lawyer advertising and solicitation of clients through in-person and virtual contact, or cold calling. This classifier applies to posts less than 500 characters.
Use Cases and Positive Examples
(English)
Promissory Legal Language
I can promise you'll get bail tomorrow.
Promotion Of Legal Services
We can offer our help with litigation.
Wronful Association With A Government Agency
Our firm has some influence at the DOJ.
Insurance Terms
Categorization
- Category: Compliance
- Sub-Category: Insurance Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook
- UI Element(s): None
Description
This classifier triggers content that contains finance terms and also contains an insurance keyword.
Use Cases and Positive Examples
(English)
Finance Term And Insurance Phrase
Take advantage of introductory rates for this new policy.
Health Insurance Terms
Categorization
- Category: Compliance
- Sub-Category: Insurance Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook
- UI Element(s): None
Description
This classifier triggers content that contains health terms and also contains an insurance keyword.
Use Cases and Positive Examples
(English)
Health Term And Insurance Phrase
The coinsurance won't be applicable until after you've met your deductible.
General Investment Terms
Categorization
- Category: Compliance
- Sub-Category: Insurance Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook
- UI Element(s): None
Description
This classifier triggers content that contains investment terms and also contains an insurance keyword.
Use Cases and Positive Examples
(English)
Investment Term And Insurance Phrase
Do you have a prospectus available before the renewal period begins?
Insurance Legal Matters
Categorization
- Category: Compliance
- Sub-Category: Insurance Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook
- UI Element(s): None
Description
This classifier triggers content that contains legal terms and also contains an insurance keyword.
Use Cases and Positive Examples
(English)
Legal Language And Insurance Phrase
Can reach a settlement over your gaps in coverage?
Life Insurance Terms
Categorization
- Category: Compliance
- Sub-Category: Insurance Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook
- UI Element(s): None
Description
This classifier triggers content that contains life insurance terms and also contains an insurance keyword.
Use Cases and Positive Examples
(English)
Life Term And Insurance Phrase
Are you interested in a universal life policy?
Property and Casualty Terms
Categorization
- Category: Compliance
- Sub-Category: Insurance Services Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook
- UI Element(s): None
Description
This classifier triggers content that contains terms related to property and also contains an insurance keyword.
Use Cases and Positive Examples
(English)
Property Term And Insurance Phrase
I want to avoid litigations around the increase in our premiums.
Health Claims
Categorization
- Category: Compliance
- Sub-Category: Life Sciences Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This classifier aims to detect health-related claims that employees of life sciences firms are not allowed to make, such as promises for healing diseases or losing weight.
Use Cases and Positive Examples
(English)
Promissory Medical Claim
This will cure allergies!
This drug prevents your disease.
Drug Usage Risks
Categorization
- Category: Compliance
- Sub-Category: Life Sciences Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
FDA regulations require identification of statements and questions about off-label use of drugs, that is, using a drug to treat unintended conditions.
Since there is currently no way of differentiating between off-label and on-label use of drugs, the current classifier is restricted to detecting questions about drugs and how to use them.
Triggers when all of the following conditions are met:
- The name of a drug. This is a list of known drugs that can be updated with customer specific drugs upon request.
- Specifically-phrased questions about dosage, safety, usage, similarity to the other drugs, shelf life, or side effects
Use Cases and Positive Examples
(English)
Investigating Question Concerning Usage Of A Drug
What is the recommended dosage?
Is it safe to use Tylenol with an antidepresant?
Adverse Drug Experience
Categorization
- Category: Compliance
- Sub-Category: Life Sciences Compliance Standards
- Type: Text Analysis
Details
- Methodology: Lexical + Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
FDA regulations require that adverse drug side effects reported by consumers be recorded and reported to the FDA.
Triggers when all of the following conditions are met:
- The name of the drug. This is a list of known drugs that can be updated with customer specific drugs upon request.
- The name of the side effect. This is a list that includes both medical and lay terminology. Can be updated with new side effects upon request.
- Sentence structure indicates drug-related cause and effect
Use Cases and Positive Examples
(English)
Description Of An Adverse Drug Effect
Claforan will result in vomiting blood.
Deceptive Advertising Risk
Categorization
- Category: Compliance
- Sub-Category: Life Sciences Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
This policy detects posts where employees of the brand like, retweet, or promote customer testimonials.
Use Cases and Positive Examples
(English)
Strong Testimonial Phrase
Let me tell you about my experience with the treatment.
testimonial: I can't get enough of this product.
HIPAA
Categorization
- Category: Compliance
- Sub-Category: Life Sciences Compliance Standards
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Outbound
- Entities: Account
- Data Source(s): Linkedin, Twitter, Facebook, Youtube
- UI Element(s): None
Description
HIPAA provides the ability to transfer and continue health insurance coverage for millions of workers and their families when they change or lose their jobs, helps reduce health care fraud and abuse, ensures industry-wide standards for health care information on electronic billing and other processes, and requires protection and confidential handling of protected health information. This classifier tries to find combinations of identifiable information and health information.
Use Cases and Positive Examples
(English)
Phi Classifier Flagged
Here's my medical record number: 4909F9129E
Sensitive Medical Information
I got the medical imaging results back today.
Political Terms
Categorization
- Category: Acceptable Use
- Sub-Category: Controversial Topics
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers on language including political speech (democrats, republicans, electoral college, SCOTUS, etc.)
Use Cases and Positive Examples
(English)
Political Speech
The government is trying to pass a new bill to the judicial branch without review.
Who cares about Trump giving the State of the Union when he's got a wall to build.
Reported Bullying
Categorization
- Category: Acceptable Use
- Sub-Category: Controversial Topics
- Type: Text Analysis
Details
- Methodology: Phrase Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): None
Description
Triggers on text that refers to a person being bullied or harrassed. It does not trigger on content that is actual bullying.
Use Cases and Positive Examples
(English)
Reports Of Bullying
They have been bullying me every day.
NHL Trademark Violations
Categorization
- Category: Compliance
- Sub-Category: Trademark Infringement
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when NHL Team names are found in content.
Use Cases and Positive Examples
(All Languages)
Nhl Trademark Violation
We are offering box seats to the Boston Bruins games for the rest of the year.
MLB Trademark Violations
Categorization
- Category: Compliance
- Sub-Category: Trademark Infringement
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when MLB Team names are found in content.
Use Cases and Positive Examples
(All Languages)
Mlb Trademark Violation
We know the New York Yankees will be back with a killer season!
NFL Trademark Violations
Categorization
- Category: Compliance
- Sub-Category: Trademark Infringement
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when NFL Team names are found in content.
Use Cases and Positive Examples
(All Languages)
Nfl Trademark Violation
Doesn't matter how much they pay the QB, the Dallas Cowboys will not make the playoffs.
NBA Trademark Violations
Categorization
- Category: Compliance
- Sub-Category: Trademark Infringement
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when NBA Team names are found in content.
Use Cases and Positive Examples
(All Languages)
Nba Trademark Violation
If only the New York Knicks could get rid of the owner.
MLS Trademark Violations
Categorization
- Category: Compliance
- Sub-Category: Trademark Infringement
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when MLS Team names are found in content.
Use Cases and Positive Examples
(All Languages)
Mls Trademark Violation
Now that our firm is thinking of sponsoring the LA Galaxy, I can't wait to see them!
Common Trademark Violations
Categorization
- Category: Compliance
- Sub-Category: Trademark Infringement
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Outbound
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when content mentions a commonly abused trademarked term.
Use Cases and Positive Examples
(All Languages)
Common Trademark Violation
Due to COVID, my firm didn't get to support March Madness this year!
Coke price is up and down like it's Wall street.
Pornographic Language
Categorization
- Category: Acceptable Use
- Sub-Category: Pornography
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when content contains pornographic language
Use Cases and Positive Examples
(All Languages)
Pornographic Language
Whether you like cock or twat it's fine.
Hate or Derogatory Language
Categorization
- Category: Acceptable Use
- Sub-Category: Abuse & Hate
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin, Darknet
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Language that indicates hate or intolerance of any group (e.g. racial hate, religious hate, etc.)
Use Cases and Positive Examples
(English)
General
That is some gay shit.
(Spanish)
General
Esos moromierda ??rabes siguen entrando.
(French)
General
Je parie que ce tapette aime les hommes.
(Other Languages)
Hate Speech De
An meiner Uni wimmelt es von Schlitzauge.
Hate Speech Dut
Die vent is een totale flikker.
Hate Speech En
That is some gay shit.
Strong Profanity
Categorization
- Category: Acceptable Use
- Sub-Category: Adult
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin, Darknet
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when content contains language considered strongly profane. Supports English and major European languages (e.g. Spanish, French, German, etc).
Use Cases and Positive Examples
(English)
Strong Profanity
Go fuck yourself!
(Spanish)
Strong Profanity
??No me importa una mierda cu??nto dinero ganaste!
(French)
Strong Profanity
Vous pouvez suivre vos conseils d'investissement et aller vous faire foutre!
(Japanese)
Strong Profanity
?????????????????????
(Other Languages)
Strong Profanity Arabic
?????????? ??????
Strong Profanity Dutch
Krijg de ziekte.
Strong Profanity German
Triff mich im verdammten Park, Arschloch.
Strong Profanity Italian
Parla di tua madre la prossima voltaanculo, fanculo.
Strong Profanity Korean
?????? ?????? ??? ??????
Strong Profanity Polish
Ona jest tak?? dziwk??.
(Portuguese)
Strong Profanity
N??o posso acreditar em voc??, seu filho da puta!
(Viatnamese)
Strong Profanity
B???n kh??ng gi??p ???????c g??, n??n ????? b???n
(Chinese)
Strong Profanity
??????????????????
Adult Language
Categorization
- Category: Acceptable Use
- Sub-Category: Adult
- Type: Text Analysis
Details
- Methodology: Lexical
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin, Darknet
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when content contains language considered adult. Supports English and some European languages (e.g. Spanish, French, German, etc).
Use Cases and Positive Examples
(All Languages)
Adult Language Nl
Ik heb net mijn grootste klant verloren gatverdamme.
Adult Language De
Ich werde dir in den Arsch treten!
Adult Language En
Well damn it, what a mess you've made!
Don't be an ass, just let know what you think about my post.
Adult Language Es
??Eres un imb??cil! ??Nunca me escuchas!
Adult Language Fr
Je crois que vous cr??ez un spectacle de merde.
Adult Language Pt
Ela partiu meu cora????o, aquela boceta.
Uploaded Video
Categorization
- Category: Other
- Sub-Category: Images, Videos, & Documents
- Type: Wip
Details
- Methodology: WIP
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Links to any video.
Domain Fraud
Categorization
- Category: Other
- Sub-Category: General
- Type: Collaborative Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): None
Description
Domain Fraud is triggered when an URL originates from a domain that the customer has added to the Domain Fraud Classifier watch/block list in Domain Discover. This way a Domain identified as fraudulent by a Domain Discover user can be automatically blocked or trigger an alert in Patrol. Positive examples will vary from customer to customer.
Any URL
Categorization
- Category: Other
- Sub-Category: General
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight
- Match Phrase: Yes
Description
Triggers when content contains any URL.
LinkedIn Recommendations
Categorization
- Category: Social Activity
- Sub-Category: Linkedin Activity
- Type: Account Analysis
Details
- Methodology: Metadata Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Linkedin
- UI Element(s): None
Description
Link to documentation: https://help.proofpoint.com/Content_Patrol_Classifier_Documentation/Descriptions/li_recommendations
No URL
Categorization
- Category: Other
- Sub-Category: General
- Type: Text Analysis
Details
- Methodology: Content Pattern Analysis
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): None
Description
Triggers when content does not contain a URL.
DarkNet Forum
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Account Analysis
Details
- Methodology: Metadata Analysis
- Directionality: Inbound
- Entities: Brand, People
- Data Source(s): Darknet
- UI Element(s): Mask
- Match Phrase: Yes
Description
Content originating from DarkNet Forum domain, one of the sources of DarkNet content.
LinkedIn Endorsements
Categorization
- Category: Social Activity
- Sub-Category: Linkedin Activity
- Type: Account Analysis
Details
- Methodology: Metadata Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Linkedin
- UI Element(s): None
Description
Link to documentation: https://help.proofpoint.com/Content_Patrol_Classifier_Documentation/Descriptions/li_skills
DarkNet Market
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Account Analysis
Details
- Methodology: Metadata Analysis
- Directionality: Inbound
- Entities: Brand, People
- Data Source(s): Darknet
- UI Element(s): Mask
- Match Phrase: Yes
Description
Content originating from DarkNet Market domain, one of the sources of DarkNet content.
Natively Uploaded Document
Categorization
- Category: Other
- Sub-Category: Images, Videos, & Documents
- Type: Link Categorization
Details
- Methodology: WIP
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Linkedin
- UI Element(s): None
Description
Matches on content with PPT, PPTS, PPTX, DOC, DOCX and/or PDF attachment. Documents can be uploaded in standard posts, group posts, or private messages
LinkedIn Invitations
Categorization
- Category: Social Activity
- Sub-Category: Linkedin Activity
- Type: Account Analysis
Details
- Methodology: Metadata Analysis
- Directionality: Inbound
- Entities: Account
- Data Source(s): Linkedin
- UI Element(s): None
Description
Link to documentation: https://help.proofpoint.com/Content_Patrol_Classifier_Documentation/Descriptions/li_invitations
DarkNet Ransomware
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Account Analysis
Details
- Methodology: Metadata Analysis
- Directionality: Inbound
- Entities: Brand, People
- Data Source(s): Darknet
- UI Element(s): Mask
- Match Phrase: Yes
Description
Content originating from DarkNet Ransomware domain, one of the sources of DarkNet content.
Known Data Leak
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Account Analysis
Details
- Methodology: Metadata Analysis
- Directionality: Inbound
- Entities: Brand, People
- Data Source(s): Darknet
- UI Element(s): None
Description
Known Data Leak is triggered when content originates from a Known Data Leak (i.e. Collection1, etc.). For example, a Known Data Leak may consist of a user database from a hacked web site that has been leaked on the DarkNet.
Promissory Images
Categorization
- Category: Compliance
- Sub-Category: Financial Services Compliance Standards
- Type: Image Analysis
Details
- Methodology: Machine Learning + Deep Learning
- Directionality: Outbound
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
Description
This classifier flags images that may communicate a financial promise. This classifier also operates on the unique frames of videos.
Potential Nudity Images
Categorization
- Category: Acceptable Use
- Sub-Category: Adult
- Type: Image Analysis
Details
- Methodology: Machine Learning + Deep Learning
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
Description
This classifier flags images that potentially contain nudity. This classifier also operates on the unique frames of videos. which uses machine learning and deep learning. These technologies are powerful and state-of-the-art but are non-deterministic, meaning it???s hard to explain predictions. 11 months ago we evaluated a number of third-party systems and chose this system based on its predictive power. Because it is a third-party system, we have control only over the threshold for flagging images. We maintain an internal set of testing images to tun this threshold. Currently the threshold is set to catch ???softcore??? and ???hardcore??? nudity. Of course, aiming to catch ???software??? nudity opens the classifier up to more false positives.
Weapons Images
Categorization
- Category: Security
- Sub-Category: Physical Security
- Type: Image Analysis
Details
- Methodology: Machine Learning + Deep Learning
- Directionality: Bidirectional
- Entities: People, Location, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
Description
This classifier flags images that contain guns in plain sight. This classifier also operates on the unique frames of videos.
Gambling Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Adult
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Sites that refer to or contain games that involve the winning or losing of money based on strategy and chance. Includes information, tips, strategies, and rules for gambling games. Examples are bookie, betting, lotto, etc.
Positive Examples
https://www.ontariobets.com
http://www.casino.ca
Network Errors
Categorization
- Category: Other
- Sub-Category: Internet Infrastructure
- Type: Wip
Details
- Methodology: WIP
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links that result in network errors when trying to load them.
Positive Examples
https://httpstat.us/503
Government Sites
Categorization
- Category: Other
- Sub-Category: Public Sector
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Government sites.
Positive Examples
http://www.dhs.gov
Spyware & Questionable Software Sites
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Linkes to sites that contain software that reports information back to a central server, including spyware and keyloggers.< May also contain include software with a legitimate purpose but is still deemed objectionable for some customers. Web analysts should not enable this classifier.
Positive Examples
http://index-of.es
Blockchain Sites
Categorization
- Category: Compliance
- Sub-Category: Other Noteworthy Activity
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to blockchain technology.
Positive Examples
https://blockchain.com
https://polygon.technology
Web Category Avoidance Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Institutional Compliance
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Anonymizers are frequently used to bypass web filtering technology. This category is applied to any site that provides a proxy of another site with the intent of circumventing filters. This includes proxies and anonymizers for surfing websites while disguising source IP address, cookies, etc. Examples are stay anonymous, anonymous email receiver, anonymous proxy, etc.
Positive Examples
http://www.instantproxies.com
Political Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Controversial Topics
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages covering laws, rules, legal assistance and issues related to immigration. Examples are immigration law, visa assistance, embassies, green card, etc.< Information, issues and other related fields on politics. Examples are elections, independent party, republicans, etc< Web pages that tackle issues and laws on legal aspects, except divorce and immigration. Examples are law firms, corporate law, court hearings, lawyer, etc.< Opinions, commentaries, annotations and other related information usually found in news and magazines. Examples are political criticism, political discussion, editorial page, etc.< Government organizations, departments, or agencies. Includes police, firefighters, elections commissions, elected representatives, and government sponsored programs and research. Examples are city of, local government, senate, etc.
Positive Examples
http://www.thehill.com
http://www.politico.com
General Interest Sites
Categorization
- Category: Other
- Sub-Category: General
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites of General interest.
Positive Examples
https://www.keystonexl.com
School Cheating Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Institutional Compliance
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Test answers, pre-written term papers and essays, full math problem solvers that show the work, and similar web sites that can be used to cheat on homework and tests. Also includes sites where students can pay to have others do their homework for them. Examples are free term papers, homework cheats, student papers, etc.
Positive Examples
http://www.homeworkjoy.com
Chat Sites
Categorization
- Category: Other
- Sub-Category: Internet Communication
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to Internet Chat services.
Positive Examples
http://www.chatib.us
http://xat.com'
Religious Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Controversial Topics
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages that discuss the beliefs, traditions and other information on Latter-Day Saints. Examples are mormons, mormonism, LDS movement, etc.< Web pages that pursue an anti-religion agenda or that challenge religious, spiritual, metaphysical, or supernatural beliefs. Also includes pages that explain or put forward a philosophy about the truth of an afterlife or similar are unknowable or impossible to prove/disprove. Examples are agnostic, apatheism, atheism etc.< Web pages that tackle beliefs, traditions and information on Catholicism. Examples are parish priest, catholic priest, catholic church, diocese, etc.< Web pages which discuss beliefs, traditions and other information on Hinduism. Examples are vedas, brahman, saivism, etc.< Web pages which speak about the beliefs, traditions and other information on Islam. Examples are quran, allah, shia islam, etc.< Web pages which tackle the beliefs, traditions and other information on Judaism. Examples are torah, judaism, etc.< Web pages that discuss the traditions, beliefs and articles on Buddhism. Examples are buddha, theravada, mahayana, etc.< Web pages the provide information on the beliefs, traditions and other related areas on Christianity excludes Catholicism. Examples are anabaptist, christian, Lutheran, Jesus, God, Virgin Mary, etc.< Information and articles on non-common religions from around the world, including their beliefs, practices, rituals, creeds, ethics and history. Examples are bahai faith, yin yang, etc.< Sites that do not fall to any specific category under Religion belong to this category, including web pages that have two or more Religion categories.'
Positive Examples
http://www.bible.com
http://www.islamweb.net
http://www.vatican.va
http://www.churchofjesuschrist.org
Greeting Card Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Greeting Cards.com.
Positive Examples
http://www.americangreetings.com
Violence Sites
Categorization
- Category: Security
- Sub-Category: Physical Security
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Location, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages that provide instructions on how to commit violence excluding Criminal Skills (e.g. bomb making. ) This includes militancy, torture, crime-scene photos, and descriptions/pictures of a violent, bloody or gory nature. It also includes sites that promote violence and sedition. Examples are mutilation, crime scene, massacre etc.
Positive Examples
https://rotten.com
Hacking Sites
Categorization
- Category: Security
- Sub-Category: Criminal & Illegal
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web sites with information or tools specifically intended to assist in online crimes, such as unauthorized access to computers or fraud. This also includes phone system hacking (aka phreaking).
Positive Examples
http://elite-hackers.com
Torrent Repository Sites
Categorization
- Category: Other
- Sub-Category: Internet Communication
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites containing Torrent Repositories.
Positive Examples
http://thepirate-bay.org
Web-based Email Sites
Categorization
- Category: Other
- Sub-Category: Internet Communication
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites that provide Web-based Email.
Positive Examples
http://www.yahoomail.com
http://www.protonmail.com
Pornographic/Sexually Explicit Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Pornography
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Any site that includes any of the following: - Images or videos depicting sexual acts, sexual arousal, or explicit nude imagery intended to be sexual in nature. - Graphic pictures of human excretion. Examples are hentai, porn, xxx, etc. - Sexual content, products or services related to sex but without nudity or other explicit pictures (even ads) on the page. This includes sex toys, escort services, erotic stories, textual how-to, pleasure guides, etc. Examples are adult webcam, sex lubricant, sex tips, etc.
Positive Examples
http://pornhub.com
http://xvideos.com
http://playboy.com
Financial Sites
Categorization
- Category: Other
- Sub-Category: Finance
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites related to Finance.
Positive Examples
http://www.morganstanley.com
http://www.forbes.com
http://www.marketwatch.com
Sex Education Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Controversial Topics
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages that cater to or discuss the gay, lesbian, bisexual or transgender lifestyle. Examples glb, lgbt, bisexual, etc.< Web pages with educational materials and clinical explanations of sex aimed at teens and children. Examples are sexual anatomy, sexual health, family planning, etc.< Web pages that push the pro-choice viewpoint or otherwise overtly encourage abortions. Also includes websites of organizations that offer the abortion procedure as a service. Examples are abortion rights, reproductive rights, women''s rights on abortion, etc.< Web pages that condemn abortion or otherwise overtly push a pro-life agenda, such as organizations dedicated to changing the laws with the goal of making abortions illegal. Sites offering alternatives to abortion should not be considered pro-life unless they explicitly push a pro-life agenda. Examples are aborticide, fetal rights, feticide, etc.< Web pages that discuss abortion from a historical, medical, legal, or other not overtly biased point of view. Examples are abortion pill, pregnancy termination, fetal abortion, etc.
Positive Examples
http://www.optionsforsexualhealth.org
Forums & Newsgroups Sites
Categorization
- Category: Other
- Sub-Category: Technology
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Forums and/or Newsgroups.
Positive Examples
http://www.reddit.com
http://www.stackexchange.com
Private IP Address Links
Categorization
- Category: Other
- Sub-Category: Internet Infrastructure
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Private IP Addresses.
Positive Examples
http://10.0.0.1
Phishing & Fraud Links
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages that impersonate other web pages usually with the intent of stealing passwords, credit card numbers, or other information. Also includes web pages that are part of scams, such as a "419" scam where a person is convinced to hand over money with the expectation of a big payback that never comes.
Positive Examples
https://zvelo.com/category-example-test/phishing/index.html
Image Sharing Sites
Categorization
- Category: Other
- Sub-Category: Images, Videos, & Documents
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages used for Image Sharing.
Positive Examples
http://www.flickr.com
http://www.imgur.com
R-Rated Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Adult
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Sites whose primary purpose and majority of content is child appropriate, but who have regular or irregular sections of the site with sexually themed, non-educational material, should be categorized as R-Rated. Examples are sex drugs, penis enlargement, rate my ass, etc.< Generic category for tasteless material or other material potentially inappropriate for children not already covered by another category such as Violence or R-Rated. Examples are satanism, etc.< Web pages that use either frequent profanity or serious profanity. A single mildly profane word does not qualify, but pervasive or offensive use of profanity does. Examples are asshole, bastard, swearword, etc.'
Positive Examples
http://www.wwtdd.com
Arts Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to the arts.
Positive Examples
http://www.moma.org
Criminal Activity Sites
Categorization
- Category: Security
- Sub-Category: Criminal & Illegal
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Location, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages providing information on how to perpetrate activity such as burglary, murder, bomb-making, lock picking, non-online scams, non-online fraud and fake drug tests. Also may include photos of illegal activities such as necrophilia and zoophilia.
Positive Examples
https://pickyourtools.com
Business Sites
Categorization
- Category: Other
- Sub-Category: General
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages that are Business related.
Positive Examples
http://www.dnb.com
https://www.lendlease.com
Personal Sites
Categorization
- Category: Other
- Sub-Category: General
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Personal sites.
Positive Examples
https://americasfreedomfighters.com
Leisure & Recreation Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages that discuss Liesure and Recreation.
Positive Examples
https://line-of-action.com
Job Search Sites
Categorization
- Category: Other
- Sub-Category: General
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Job Search sites
Positive Examples
http://www.monster.com
http://www.themuse.com
Entertainment Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Entertainment sites.
Positive Examples
http://www.entertainmentweekly.com
http://www.eonline.com
Games Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites related to Games.
Positive Examples
http://www.playstation.com
Information Security Sites
Categorization
- Category: Other
- Sub-Category: Technology
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites that discuss Information Security.
Positive Examples
http://www.proofpoint.com
http://www.siteadvisor.com
Social Networking Sites
Categorization
- Category: Other
- Sub-Category: Internet Communication
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to Social Networking.
Positive Examples
http://www.facebook.com
http://www.linkedin.com
Cult Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Controversial Topics
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Also includes extraterrestrial, folk religions, cults, mysticism etc. Examples are satanism, numerology, scientology, feng shui, etc.< Web pages that are dedicated to a group of historical polytheistic religious traditions-primarily those of cultures known to the classical world. Examples are pagans, wiccans, magic spells, etc.
Positive Examples
http://www.heavensgate.com/
http://www.rael.org/home
http://scientology.com
Child Abuse Images Sites
Categorization
- Category: Security
- Sub-Category: Criminal & Illegal
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages that show the physical or sexual abuse of children such as kiddie porn, pedophilia, or child abuse.
Positive Examples
https://zvelo.com/category-example-test/abuse/index.html
Lingerie, Suggestive & Pinup Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Adult
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Sites that contain photos and videos where the person who is the subject of the photo is wearing sexually provocative clothing such as lingerie. Examples are bikini, bustier, negligee, etc.
Positive Examples
https://www.victoriasecrets.com
Unknown Sites
Categorization
- Category: Other
- Sub-Category: Internet Infrastructure
- Type: Link Categorization
Details
- Methodology: WIP
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Mask
- Match Phrase: Yes
Description
Links to pages that cannot be classified at this time.
Positive Examples
https://www.nrpa.gov
http://cooperativadequito.com
Illegal Drug Sites
Categorization
- Category: Security
- Sub-Category: Criminal & Illegal
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages that in any way endorse or glorify common illegal drugs, the misuse of prescription drugs, the misuse of inhalants, positive references to the culture of drug use whether specific drugs are mentioned or not. This includes sites giving non-clinical descriptions or stories about being high, as well as blogs and other posts about getting high, crack, heroine, morphine, etc. < Web pages about marijuana or about smoking marijuana. It includes Web pages on legalizing marijuana, using marijuana for medicinal purposes, marijuana info pages, and pages that display pictures of marijuana plants if shown in a way that could be considered an endorsement of the drug. This does not include government sponsored Web pages, such as the Drug Enforcement Agency. Examples are cannabis, blunts, panama red etc."
Positive Examples
https://hightimes.com
https://elplanteo.com/
Computers & Technology Sites
Categorization
- Category: Other
- Sub-Category: Technology
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages discussing Computers & Technology.
Positive Examples
http://www.microsoft.com
Fake News Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Controversial Topics
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Sites with strong reputations for spreading false news stories.
Positive Examples
http://NationalReport.net
Shopping Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to Shopping.
Positive Examples
http://www.amazon.com
http://www.ebay.com
http://www.craigslist.com
Search Engines & Portals Sites
Categorization
- Category: Other
- Sub-Category: Technology
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Search Engines and/or Portals.
Positive Examples
http://www.google.com
http://www.bing.com
Streaming Media & Download Sites
Categorization
- Category: Other
- Sub-Category: Images, Videos, & Documents
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages that contain Streaming Media and/or Downloads.
Positive Examples
http://www.youtube.com
http://www.vimeo.com
Peer-to-Peer Sites
Categorization
- Category: Other
- Sub-Category: Internet Communication
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites that Peer-to-Peer networking files.
Positive Examples
http://sharedrop.io
http://hide.me
Any DarkNet
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Account Analysis
Details
- Methodology: Metadata Analysis
- Directionality: Inbound
- Entities: Brand, People
- Data Source(s): Darknet
- UI Element(s): Mask
- Match Phrase: Yes
Description
Triggers when content comes from the DarkNet.
Botnets Sites
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to command and control servers used to send commands to infected machines called bots. Bots are compromised machines running software that are used by hackers to send spam, phishing attacks, and denial of service attacks.
Positive Examples
https://zvelo.com/category-example-test/botnet/index.html
Terrorism Sites
Categorization
- Category: Security
- Sub-Category: Criminal & Illegal
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web sites that provide access to illegally obtained files such as pirated software (aka warez), pirated movies, pirated music, etc. This includes information or software available specifically for the purpose of using or stealing protected copyrighted materials without paying for them. Examples include lists of software serial numbers, "cracks", "rippers", etc.
Positive Examples
https://zvelo.com/category-example-test/terrorism/index.html
Alcohol & Tobacco Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Adult
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Sites that promote or sell tobacco products such as cigarettes, cigars, and chew.
Positive Examples
https://winstoncigarettes.com/
https://gtc.marlboro.com
https://www.jackdaniels.com/en-ca/
Real Estate Sites
Categorization
- Category: Other
- Sub-Category: General
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to Real Estate.
Positive Examples
https://www.century21.com
https://www.corcoran.com
Instant Messaging Sites
Categorization
- Category: Other
- Sub-Category: Internet Communication
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages that offer Instant Messaging.
Positive Examples
http://whatsapp.com
Dead Link
Categorization
- Category: Other
- Sub-Category: General
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links that no longer resolve or whose content is otherwise unreachable, generating Page not found errors.
Positive Examples
https://www.proofpoint.com/deadlink
Dating & Personal Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Dating sites.
Positive Examples
http://www.match.com
http://www.eharmony.com
Parked Domains
Categorization
- Category: Other
- Sub-Category: Internet Infrastructure
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links that refer to Parked Domains.
Positive Examples
http://www.superkoder.com
http://www.clcrc.com
Translator Sites
Categorization
- Category: Other
- Sub-Category: Technology
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages that provide language translation.
Positive Examples
http://www.babelfish.com
http://translate.google.com
Suspicious Links
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web sites that meet a range of generally suspicious link criteria. More specifically, this classifier triggers on URLs that meet any of the following conditions. - Whois record indicates that site is less than 30 days old. - Includes a raw IP address - Contains a .exe file extension
Positive Examples
http://suspiciouslinks.nexgate.test
http://127.0.0.1
Sports Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to Sports.
Positive Examples
http://www.espn.com
Spam Sites
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web pages that impersonate other web pages usually with the intent of stealing passwords, credit card numbers, or other information. Also includes web pages that are part of scams, such as a "419" scam where a person is convinced to hand over money with the expectation of a big payback that never comes.
Positive Examples
https://zvelo.com/category-example-test/spam/index.html
Hate & Intolerance Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Abuse & Hate
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, People, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites that promote hate and intolerance
Positive Examples
http://americannaziparty.com
News Sites
Categorization
- Category: Other
- Sub-Category: General
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages that provide News coverage.
Positive Examples
http://www.apnews.com
Health & Medicine Sites
Categorization
- Category: Other
- Sub-Category: Health
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites that discuss Health & Medicine.
Positive Examples
https://www.healthline.com
http://www.webmd.com
Piracy & Copyright Theft Sites
Categorization
- Category: Security
- Sub-Category: Criminal & Illegal
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Web sites that provide access to illegally obtained files such as pirated software (aka warez), pirated movies, pirated music, etc. This includes information or software available specifically for the purpose of using or stealing protected copyrighted materials without paying for them. Examples include lists of software serial numbers, "cracks", "rippers", etc.
Positive Examples
http://iwatchgameofthrones.net
http://magnetdl.com
Weapons Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Controversial Topics
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Guns and weapons when not used in a violent manner such as descriptions, sport hunting, gun clubs, or paintball. Also includes other weapons like crossbows, knives, etc. Examples are gun crossbow, knives, rifles, etc.
Positive Examples
http://weapons-universe.com
http://us.glock.com
Cryptocurrency Mining Sites
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Cryptocurrency Mining Websites that use cryptocurrency mining technology without user permission. This is considered a malicious category.
Positive Examples
http://shopmedalert.com
Nudity Sites
Categorization
- Category: Acceptable Use
- Sub-Category: Adult
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Sites that contain Non-sexual photographs or drawings showing nudity including butts and women's bare breasts. Examples are topfree, public bathing, bare butt, etc.
Positive Examples
https://treatsmagazine.com
Malware & Compromised Links
Categorization
- Category: Security
- Sub-Category: Cyber-Security
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Compromised web pages are pages that appear to be legitimate, but house malicious code or link to malicious websites hosting malware. These sites have been compromised by someone other than the site owner. Examples are defaced, hacked by, etc.< When viruses and spywares report stolen information back to a particular URL, or frequently checks a URL for updates, then this is considered a malware call-home address. < Web pages that host viruses, exploits, and other malware are considered Malware Distribution Points.
Positive Examples
https://zvelo.com/category-example-test/malware-call-home/index.html
https://zvelo.com/category-example-test/compromised/index.html
Cryptocurrency Sites
Categorization
- Category: Compliance
- Sub-Category: Other Noteworthy Activity
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account, Brand
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Websites that discuss cryptocurrency such as Bitcoin, Ethereum, Litecoin, and others. This category does not include cryptocurrency mining without permission but can include web pages which discuss cryptocurrency mining and web pages which perform cryptocurrency mining with user permission (opt-in).
Positive Examples
https://www.gemini.com
https://www.f2pool.com
Education Sites
Categorization
- Category: Other
- Sub-Category: Public Sector
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links related to Education.
Positive Examples
https://www.si.edu
http://www.columbia.edu'
Download Sites
Categorization
- Category: Other
- Sub-Category: Technology
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Download sites.
Positive Examples
http://www.softpedia.com
Restaurants & Dining Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to Restaurants and Dining.
Positive Examples
http://www.redlobster.com
Transportation Sites
Categorization
- Category: Other
- Sub-Category: General
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to Transportation.
Positive Examples
http://www.mule.com
Travel Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to pages related to Travel.
Positive Examples
http://www.kayak.com
Fashion & Beauty Sites
Categorization
- Category: Other
- Sub-Category: Entertainment
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to Fashion and Beauty sites.
Positive Examples
http://www.sephora.com
Advertisements & Pop-Ups Sites
Categorization
- Category: Other
- Sub-Category: Internet Infrastructure
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
This flags links that contain adverstisements and pop-ups.
Positive Examples
http://neobux.com
Non-profits & NGOs Sites
Categorization
- Category: Other
- Sub-Category: Public Sector
- Type: Link Categorization
Details
- Methodology: URL Categorization Engines
- Directionality: Bidirectional
- Entities: Account
- Data Source(s): Twitter, Youtube, Facebook, Linkedin
- UI Element(s): Highlight,Mask
- Match Phrase: Yes
Description
Links to sites for Non-Profits and/or NGOs.
Positive Examples
https://www.centralparknyc.org