![]() |
||||||||||||||||
|
||||||||||||||||
|
|
Searching With HTML Zones and Fields
The Simple and Internet search modes provide for limiting a query to a given area of HTML documents. These areas can be either zones or fields. For the purpose of searching HTML documents, consider the following sample HTML document:
|
| Zone | Content |
|---|---|
title |
The Code Tag Information Page |
| body | Fun with the Code Tag Use the code tag to render code font within a document This is an example |
h1 |
Fun with Code |
code |
This is an example |
To search within a zone using the Simple search mode, use the = operator. To search within a zone using the Internet search mode, simply use the colon (:) operator. Here are some example search queries that would find the sample HTML document:
| Mode | Examples | Explanations |
|---|---|---|
| Simple | title=Information |
Find all documents with the term Information in their title zone. |
body=code |
Find all documents with the term code in their p zone. | |
mobil and exxon |
Find all documents with the terms mobil and exxon. | |
mobil not exxon |
Find all documents with the term mobil but does not contain the term exxon | |
mobil not exxon and oil |
Find all documents with the terms mobil and oil but does not contain the term exxon | |
mobil not (exxon and oil) |
Find all documents with the term mobil but does not contain the terms exxon nor the term oil. The parenthesis changes the context | |
| Internet | title:Information |
Find all documents with the term Information in their title zone or field. |
body:code |
Find all documents with the term code in their body zone. | |
body:example |
Find all documents with the term example in their body zone. Remember, the h1 and code zones reside within the body zone. | |
body:code:example |
Invalid query! The Internet search mode cannot perform searches on nested zones. |
Finally, zones only exist for the following HTML tags:
| Tag | Description |
|---|---|
| <a> | anchor |
| <abbrev> | abbreviation |
| <acronym> | acronyms |
| <address> | address |
| <au> | author name |
| <banner> | banner |
| <base> | used to resolve relative addressing |
| <blockquote> | block quote |
| <body> | body |
| <cite> | citation |
| <code> | code sample |
| <dfn> | definition |
| <fn> | footnote |
| <form> | form |
| <h1> | heading level 1 |
| <h2> | heading level 2 |
| <h3> | heading level 3 |
| <h4> | heading level 4 |
| <h5> | heading level 5 |
| <h6> | heading level 6 |
| <head> | header block |
| <html> | html zone |
| <lang> | alternate language |
| <link> | provides information relating current document to other documents |
| <note> | separated notational text |
| <person> | person element |
| <q> | quotation |
| <samp> | sample |
| <textarea> | text area of a form |
Return to Top of Page | Go to Search Overview | Go to EDGAR Search Page
Go to Website Search Page
http://www.sec.gov/edgar/searchedgar/htmlzones.htm
| Home | Previous Page | Modified: 02/20/2003 |