U.S. Securities & Exchange Commission
SEC Seal
Home | Previous Page
U.S. Securities and Exchange Commission

Searching With HTML Zones and FieldsUpdated!

The Simple and Internet search modes provide for limiting a query to a given area of HTML documents. These areas can be either zones or fields. For the purpose of searching HTML documents, consider the following sample HTML document:

<html>
<head>
<title>The Code Tag Information Page</title>
</head>
<body>
<h1>Fun with the Code Tag</h1>
Use the code tag to render code font within a document
<code>This is an example</code>
</body>
</html>

There are many other tags that HTML documents can contain, but they have been excluded for sake of brevity. Matching start and end tags (e.g., <title> and </title>) outline a zone within an HTML document. Any content between the matching start and end tags belongs to the zone. Given the previous sample HTML, here are a few zones:

Zone Content
title The Code Tag Information Page
body Fun with the Code Tag Use the code tag to render code font within a document This is an example
h1 Fun with Code
code This is an example

To search within a zone using the Simple search mode, use the = operator. To search within a zone using the Internet search mode, simply use the colon (:) operator. Here are some example search queries that would find the sample HTML document:

Mode Examples Explanations
Simple title=Information Find all documents with the term Information in their title zone.
body=code Find all documents with the term code in their p zone.
mobil and exxon Find all documents with the terms mobil and exxon.
mobil not exxon Find all documents with the term mobil but does not contain the term exxon
mobil not exxon and oil Find all documents with the terms mobil and oil but does not contain the term exxon
mobil not (exxon and oil) Find all documents with the term mobil but does not contain the terms exxon nor the term oil. The parenthesis changes the context
Internet title:Information Find all documents with the term Information in their title zone or field.
body:code Find all documents with the term code in their body zone.
body:example Find all documents with the term example in their body zone. Remember, the h1 and code zones reside within the body zone.
body:code:example Invalid query! The Internet search mode cannot perform searches on nested zones.

Finally, zones only exist for the following HTML tags:

Tag Description
<a> anchor
<abbrev> abbreviation
<acronym> acronyms
<address> address
<au> author name
<banner> banner
<base> used to resolve relative addressing
<blockquote> block quote
<body> body
<cite> citation
<code> code sample
<dfn> definition
<fn> footnote
<form> form
<h1> heading level 1
<h2> heading level 2
<h3> heading level 3
<h4> heading level 4
<h5> heading level 5
<h6> heading level 6
<head> header block
<html> html zone
<lang> alternate language
<link> provides information relating current document to other documents
<note> separated notational text
<person> person element
<q> quotation
<samp> sample
<textarea> text area of a form

 

Return to Top of Page | Go to Search Overview | Go to EDGAR Search Page
Go to Website Search Page

 

http://www.sec.gov/edgar/searchedgar/htmlzones.htm


Modified: 02/20/2003