605-16 : EDGAR Filer Manual v65 6.5.16 page 6-12

Created by: SEC Office of Structured Disclosure

Tag restrictions in 5.2.2 apply to block text XML content

Facts of type "text block" whose un-escaped content contains xml elements must satisfy a content model derived from the BODY tag as defined in 5.2.2.

This specifies the circumstances under which content containing escaped xml elements will be interpreted as HTML. HTML as accepted by EDGAR is detailed in Section 5.2.2 and is based on HTML 3.2 elements and entities, with extensions (such as allowing the "style" attribute) and restrictions (such as disallowing the "textarea" element). Text block content may additionally use HTML elements "span", "tbody", "thead", "tfoot", and the unprefixed "lang" attribute.

Other than exceptions specified in 6.10, there is no prohibition on escaped HTML elements appearing in other XBRL elements in schemas (such as link:definition) or linkbases (such as link:label), and the renderer (see 6.24) does not un-escape that content.

REMARK: Restrictions on the content of href are documented in EFM 5.2.2.1. There is a DTD (edbody.dtd in the "lib" folder) used for validation of unescaped content, but a change to Schema (and therefore disallowing named entities) may occur at some time in the future. The DTD by itself does NOT check for nested <table> elements, but they are disallowed, as variation _030ng shows.

REMARK: For reference only, here are tags forbidden, keeping in mind that all permitted tags are lower case: <acronym/> <applet/> <area/> <base/> <basefont/> <bdo/> <button/> <col/> <colgroup/> <del/> <fieldset/> <font/> <form/> <frame/> <frameset/> <iframe/> <input/> <ins/> <label/> <legend/> <map/> <meta http-equiv="name" content="content"/> <noframes/> <noscript/> <object/> <option/> <param/> <q/> <s/> <S/> <script/> <select/> <style/> <textarea/>

REMARK: Variations 018 through 022 require http://www.xbrl.org/dtr/type/nonNumeric-2010-12-16.xsd in edgartaxonomies.xml

Variations

Number Name Description References Data Inputs Result Outputs
_001ng 6.5.16 Instance with TextBlock containing head and body tags, NOGOOD. Instance with TextBlock containing head and body tags, NOGOOD. PCR 10865   
_002ng 6.5.16 Instance with TextBlock containing several disallowed HTML elements, NOGOOD. Instance with TextBlock containing several disallowed HTML elements, NOGOOD. PCR 10865   
_003ng 6.5.16 Instance has TextBlock with one completely bogus character entity and three character entities that would be allowed in EDGAR pg. 5-19/5-20 but are not defined in XHTML and therefore - NOGOOD. Instance has TextBlock with one completely bogus character entity and three character entities that would be allowed in EDGAR pg. 5-19/5-20 but are not defined in XHTML and therefore - NOGOOD. PCR 10865   
_004ng 6.5.16 Instance has TextBlock with uppercase HTML tags, NOGOOD. Instance has TextBlock with uppercase HTML tags, NOGOOD. PCR 10865   
_005ng 6.5.16 An instance with TextBlock containing a NONSENSE tag, NOGOOD. An instance with TextBlock containing a NONSENSE tag, NOGOOD. PCR 10865   
_006ng 6.5.16 An instance with a TextBlock whose prohibited HTML happens to be in a head tag, NOGOOD. An instance with a TextBlock whose prohibited HTML happens to be in a head tag, NOGOOD. PCR 10865   
_007gd 6.5.16 An instance with a TextBlock using all of the allowed character entities, GOOD. An instance with a TextBlock using all of the allowed character entities, GOOD. PCR 10865 
    valid
 
_008ng 6.5.16 An instance with a TextBlock containing an illegal href containing javascript, NOGOOD. An instance with a TextBlock containing an illegal href containing javascript, NOGOOD. PCR 10865   
_009gd 6.5.16 An instance with a TextBlock containing an href containing a www.sec.gov reference, GOOD. An instance with a TextBlock containing an href containing a www.sec.gov reference, GOOD PCR 10865SPR 14861 
    valid
 
_010ng 6.5.16 An instance with a TextBlock containing an href containing non-SEC reference, NOGOOD. An instance with a TextBlock containing an href containing non-SEC reference, NOGOOD. PCR 10865   
_011gd 6.5.16 An instance with a TextBlock containing an img containing local src reference, GOOD. An instance with a TextBlock containing an img containing local src reference, GOOD. PCR 10865 
    valid
 
_012ng 6.5.16 An instance with a TextBlock containing an img containing nonlocal src reference, NOGOOD. An instance with a TextBlock containing an img containing nonlocal src reference, NOGOOD. PCR 10865   
_013ng 6.5.16 Instance with TextBlock using CDATA encoding containing several disallowed HTML elements, NOGOOD. Instance with TextBlock using CDATA encoding containing several disallowed HTML elements, NOGOOD.    
_014ng 6.5.16 Instance with TextBlock containing several disallowed HTML elements, NOGOOD. Instance with TextBlock containing several disallowed HTML elements, NOGOOD.    
_015ng 6.5.16 Instance with TextBlock containing some disallowed HTML attributes / element combinations, NOGOOD. Instance with TextBlock containing some disallowed HTML attributes / element combinations, NOGOOD.    
_016gd 6.5.16 Instance with TextBlock containing several allowed HTML attributes / element combinations, GOOD. Instance with TextBlock containing several allowed HTML attributes / element combinations, GOOD.  
    valid
 
_017gd 6.5.16 Instance with a string containing several disallowed HTML attributes / element combinations, GOOD. Instance with a string containing several disallowed HTML attributes / element combinations, GOOD.  
    valid
 
_018ng 6.5.16 An instance with an Escaped Item containing an illegal href containing javascript, NOGOOD. An instance with an Escaped Item containing an illegal href containing javascript, NOGOOD. 11971   
_019ng 6.5.16 An instance with a Derived Escaped Item containing an illegal href containing javascript, NOGOOD. An instance with a Derived Escaped Item containing an illegal href containing javascript, NOGOOD. 11971   
_020gd 6.5.16 An instance with a String containing an illegal href containing javascript, GOOD. An instance with a String containing an illegal href containing javascript, GOOD. 11971 
    valid
 
_021ng 6.5.16 An instance with a Derived Escaped Item containing an href containing non-SEC reference, NOGOOD. An instance wwith a Derived Escaped Item containing an href containing non-SEC reference, NOGOOD. 11971   
_022ng 6.5.16 An instance with a derived Escaped Item Type containing a NONSENSE tag, NOGOOD. An instance with a derived Escaped Item Type containing a NONSENSE tag, NOGOOD. 11971   
_023ng 6.5.16 An instance with a Text Block Item Type containing an xmlns prefix binding, NOGOOD. An instance with a Text Block Item Type containing an xmlns prefix binding, NOGOOD. 11971   
_024ng 6.5.16 Well-formed XML containing unquoted comments prior to disallowed content, NOGOOD. Well-formed XML containing unquoted comments prior to disallowed content, in a type that is derived from {SomeStandardNamespace}:escapedItemType, GOOD. 11151.611968.2   
_025ng 6.5.16 Well-formed XML containing unquoted comments around disallowed content, NOGOOD. Well-formed XML containing unquoted comments around disallowed content, in a type that is derived from {SomeStandardNamespace}:escapedItemType, NOGOOD. 11151.611968.2   
_026gd 6.5.16 Disallowed content in a comment, GOOD. Disallowed javascript and an off site link inside a comment, GOOD. 11151.611968.2 
    valid
 
_027gd 6.5.16 Disallowed content in a quoted comment, GOOD. Disallowed javascript and an off site link inside a quoted comment, GOOD. 11151.611968.2 
    valid
 
_028ng 6.5.16 Undefined custom entity, NOGOOD. Undefined custom entity, NOGOOD. 11151.611968.2   
_030ng 6.5.16 Nested table, NOGOOD. Nested table, NOGOOD. 11151.611968.2   
_031ng 6.5.16 Image src attribute points to non existent local image, NOGOOD. Image src attribute points to non existent local image, NOGOOD. 11151.611968.2   
_032ng 6.5.16 Image src attribute points to a jpg that is not really an image, NOGOOD. Image src attribute points to a jpg that is not really an image, NOGOOD. 11151.611968.2   
_033ng 6.5.16 Image src attribute points to a nonlocal jpg even though it is on the SEC web site, NOGOOD. Image src attribute points to a nonlocal jpg even though it is on the SEC web site, NOGOOD. 11151.611968.2