public HtmlTokenizer( Stream stream, Encoding encoding, bool detectEncodingFromByteOrderMarks, int bufferSize = 4096 )
Creates a new HtmlTokenizer.
This constructor allows the encoding to be changed the first time you read from the HtmlTokenizer. If detectEncodingFromByteOrderMarks is true, the encoding is detected by examining the first four bytes of the stream: UTF-8, little-endian UTF-16, big-endian UTF-16, little-endian UTF-32, and big-endian UTF-32 text are recognized automatically when the stream starts with the corresponding byte order mark. Otherwise, the user-provided encoding is used.
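The detection rule described above can be illustrated with a small standalone sketch. It is not the tokenizer's actual implementation: BomSniffer and Detect are hypothetical names introduced only for this example, and the byte patterns are the standard Unicode byte order marks. The sketch assumes the caller has already read up to four bytes from the start of the stream.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration only; not part of the HtmlTokenizer API.
public static class BomSniffer
{
    // Returns the encoding implied by a byte order mark at the start of
    // 'prefix', or 'fallback' when no known mark is present.
    public static Encoding Detect(byte[] prefix, Encoding fallback)
    {
        int n = Math.Min(prefix.Length, 4);

        // UTF-32 is tested first because the little-endian UTF-32 mark
        // (FF FE 00 00) begins with the little-endian UTF-16 mark (FF FE).
        if (n >= 4 && prefix[0] == 0xFF && prefix[1] == 0xFE && prefix[2] == 0x00 && prefix[3] == 0x00)
            return new UTF32Encoding(false, true);   // little-endian UTF-32
        if (n >= 4 && prefix[0] == 0x00 && prefix[1] == 0x00 && prefix[2] == 0xFE && prefix[3] == 0xFF)
            return new UTF32Encoding(true, true);    // big-endian UTF-32
        if (n >= 3 && prefix[0] == 0xEF && prefix[1] == 0xBB && prefix[2] == 0xBF)
            return new UTF8Encoding(true);           // UTF-8
        if (n >= 2 && prefix[0] == 0xFF && prefix[1] == 0xFE)
            return new UnicodeEncoding(false, true); // little-endian UTF-16
        if (n >= 2 && prefix[0] == 0xFE && prefix[1] == 0xFF)
            return new UnicodeEncoding(true, true);  // big-endian UTF-16

        // No byte order mark: keep the user-provided encoding.
        return fallback;
    }
}
```

With the constructor documented here, the equivalent behavior would be requested by passing true for detectEncodingFromByteOrderMarks, for example new HtmlTokenizer( stream, Encoding.UTF8, true ), where Encoding.UTF8 serves as the fallback when the stream has no byte order mark.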