Revisiting mime-types and file extensions
    Michel Fortin
    michel.fortin at michelf.com
    Thu Jun 21 15:12:27 EDT 2007

On 2007-06-21 at 12:35, Phil Mocek wrote:
> Can anyone confirm that having the HTML fragment identifier
> formatted as Thomas proposes is valid?  Both this [W3C design
> issue paper][1] and [Wikipedia article][2] suggest otherwise, but
> I'm not sufficiently familiar with this to say for certain.
>
> [1]: <http://www.w3.org/DesignIssues/Fragment.html>
> [2]: <http://en.wikipedia.org/wiki/Fragment_identifier>
In HTML 4, the value of the id attribute must obey [this rule][3]:
> ID and NAME tokens must begin with a letter ([A-Za-z]) and may be  
> followed by any number of letters, digits ([0-9]), hyphens ("-"),  
> underscores ("_"), colons (":"), and periods (".").
[3]: http://www.w3.org/TR/html401/types.html#type-name
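To make the rule concrete, here is a minimal Python sketch of it as a regular expression. The names `HTML4_ID` and `is_valid_html4_id` are mine, purely for illustration; nothing here comes from the spec beyond the character rule quoted above.

```python
import re

# HTML 4.01 ID/NAME token: one ASCII letter, then any number of
# letters, digits, hyphens, underscores, colons, and periods.
# (HTML4_ID and is_valid_html4_id are hypothetical names.)
HTML4_ID = re.compile(r"[A-Za-z][A-Za-z0-9\-_:.]*")


def is_valid_html4_id(value: str) -> bool:
    """Return True if value matches the HTML 4 ID token rule."""
    return HTML4_ID.fullmatch(value) is not None
```

With this, `is_valid_html4_id("v1.0.1")` is true while `is_valid_html4_id("1.0.1")` is false, because the latter starts with a digit.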
For XML, and by extension XHTML, it must obey the Name construct,  
which is defined as follows [in the spec][4]:
     NameChar ::= Letter | Digit | '.' | '-' | '_' | ':' | CombiningChar | Extender
     Name ::= (Letter | '_' | ':') (NameChar)*
[4]: http://www.w3.org/TR/REC-xml/#NT-Name
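As a rough illustration, the Name production can be approximated in Python as below. This is only a sketch under a loud assumption: Letter and Digit are restricted to ASCII, and CombiningChar and Extender are left out entirely, so it rejects many names the real XML grammar accepts. The names `XML_NAME_ASCII` and `is_ascii_xml_name` are hypothetical.

```python
import re

# ASCII-only approximation of the XML 1.0 Name production.
# Assumption: Letter = [A-Za-z], Digit = [0-9]; CombiningChar and
# Extender are omitted, so non-ASCII names are wrongly rejected.
XML_NAME_ASCII = re.compile(r"[A-Za-z_:][A-Za-z0-9.\-_:]*")


def is_ascii_xml_name(value: str) -> bool:
    """Return True if value matches the ASCII subset of XML Name."""
    return XML_NAME_ASCII.fullmatch(value) is not None
```

Unlike the HTML 4 rule, a leading underscore or colon is accepted here, but a leading digit, period, or hyphen is still rejected.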
In short, both HTML and XML disallow a digit as the first character  
of an id value. That's why the version number is preceded by a  
"v" on my profile page.
While invalid as an XML or HTML id value, a fragment identifier  
starting with a digit is still perfectly valid in a URL. Here's the  
definition of a URI fragment according to [RFC 3986][5]:
     fragment    = *( pchar / "/" / "?" )
     pchar       = unreserved / pct-encoded / sub-delims / ":" / "@"
     unreserved  = ALPHA / DIGIT / "-" / "." / "_" / "~"
     pct-encoded = "%" HEXDIG HEXDIG
     sub-delims  = "!" / "$" / "&" / "'" / "(" / ")"
                 / "*" / "+" / "," / ";" / "="
[5]: http://tools.ietf.org/html/rfc3986
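The grammar above translates fairly directly into a regular expression. Here is a minimal Python sketch; the constant and function names are mine and not part of the RFC. It checks only the fragment itself, without the leading "#".

```python
import re

# Character sets from RFC 3986, written for use inside a regex class.
UNRESERVED = r"A-Za-z0-9\-._~"
SUB_DELIMS = r"!$&'()*+,;="
PCT_ENCODED = r"%[0-9A-Fa-f]{2}"

# fragment = *( pchar / "/" / "?" )
# pchar    = unreserved / pct-encoded / sub-delims / ":" / "@"
FRAGMENT = re.compile(
    rf"(?:[{UNRESERVED}{SUB_DELIMS}:@/?]|{PCT_ENCODED})*"
)


def is_valid_fragment(value: str) -> bool:
    """Return True if value matches the RFC 3986 fragment rule."""
    return FRAGMENT.fullmatch(value) is not None
```

Here `is_valid_fragment("1.0.1")` is true, which is the point: the leading digit is a problem for the id attribute, not for the URL. A raw space is rejected, while its percent-encoded form `%20` is accepted.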
Michel Fortin
michel.fortin at michelf.com
http://www.michelf.com/
    
    