Html |
Creates a new HtmlTokenizer.
This constructor will attempt to auto-detect the appropriate encoding to use by examining the first four bytes of the stream and, if a unicode byte-order-mark is detected, use the appropriate unicode encoding. If no byte order mark is detected, then it will default to UTF-8.