The Uniform Resource Locator(URL) object describes the characteristics of a URL. Defined in RFC 1738 and by D3FEND d3f:URL.
Attributes
Section titled “Attributes”category_ids
- Type:
integer_t - Requirement: recommended
- Values:
0-Unknown: The Domain/URL category is unknown.1-Adult/Mature Content3-Pornography4-Sex Education5-Intimate Apparel/Swimsuit6-Nudity7-Extreme9-Scam/Questionable/Illegal11-Gambling14-Violence/Hate/Racism15-Weapons16-Abortion17-Hacking18-Phishing20-Entertainment21-Business/Economy22-Alternative Spirituality/Belief23-Alcohol24-Tobacco25-Controlled Substances26-Child Pornography27-Education29-Charitable Organizations30-Art/Culture31-Financial Services32-Brokerage/Trading33-Games34-Government/Legal35-Military36-Political/Social Advocacy37-Health38-Technology/Internet40-Search Engines/Portals43-Malicious Sources/Malnets44-Malicious Outbound Data/Botnets45-Job Search/Careers46-News/Media47-Personals/Dating49-Reference50-Mixed Content/Potentially Adult51-Chat (IM)/SMS52-Email53-Newsgroups/Forums54-Religion55-Social Networking56-File Storage/Sharing57-Remote Access Tools58-Shopping59-Auctions60-Real Estate61-Society/Daily Living63-Personal Sites64-Restaurants/Dining/Food65-Sports/Recreation66-Travel67-Vehicles68-Humor/Jokes71-Software Downloads83-Peer-to-Peer (P2P)84-Audio/Video Clips85-Office/Business Applications86-Proxy Avoidance87-For Kids88-Web Ads/Analytics89-Web Hosting90-Uncategorized92-Suspicious93-Sexual Expression95-Translation96-Non-Viewable/Infrastructure97-Content Servers98-Placeholders99-Other: The Domain/URL category is not mapped. See thecategoriesattribute, which contains a data source specific value.101-Spam102-Potentially Unwanted Software103-Dynamic DNS Host106-E-Card/Invitations107-Informational108-Computer/Information Security109-Internet Connected Devices110-Internet Telephony111-Online Meetings112-Media Sharing113-Radio/Audio Streams114-TV/Video Streams118-Piracy/Copyright Concerns121-Marijuana
The Website categorization identifies.
hostname
- Type:
hostname_t - Requirement: recommended
The URL host as extracted from the URL. For example: www.example.com from www.example.com/download/trouble.
path
- Type:
string_t - Requirement: recommended
The URL path as extracted from the URL. For example: /download/trouble from www.example.com/download/trouble.
port
- Type:
port_t - Requirement: recommended
The URL port. For example: 80.
query_string
- Type:
string_t - Requirement: recommended
The query portion of the URL. For example: the query portion of the URL http://www.example.com/search?q=bad&sort=date is q=bad&sort=date.
scheme
- Type:
string_t - Requirement: recommended
The scheme portion of the URL. For example: http, https, ftp, or sftp.
url_string
- Type:
url_t - Requirement: recommended
The URL string. See RFC 1738. For example: http://www.example.com/download/trouble.exe. Note: The URL path should not populate the URL string.
categories
- Type:
string_t - Requirement: optional
The Website categorization names, as defined by category_ids enum values.
resource_type
- Type:
string_t - Requirement: optional
The context in which a resource was retrieved in a web request.
subdomain
- Type:
string_t - Requirement: optional
The subdomain portion of the URL. For example: sub in https://sub.example.com or sub2.sub1 in https://sub2.sub1.example.com.
Constraints
Section titled “Constraints”At least one of: url_string, path