org.cyberneko.html
public class HTMLConfiguration extends ParserConfigurationSettings implements XMLPullParserConfiguration
This configuration recognizes the following features:
This configuration recognizes the following properties:
For complete usage information, refer to the documentation.
Version: $Id: HTMLConfiguration.java,v 1.9 2005/02/14 03:56:54 andyc Exp $
See Also: HTMLScanner HTMLTagBalancer
| Nested Class Summary | |
|---|---|
| protected class | HTMLConfiguration.ErrorReporter
Defines an error reporter for reporting HTML errors. |
| Field Summary | |
|---|---|
| protected static String | AUGMENTATIONS Include infoset augmentations. |
| protected static String | BALANCE_TAGS Balance tags. |
| protected static String | ERROR_DOMAIN Error domain. |
| protected static String | ERROR_REPORTER Error reporter. |
| protected boolean | fCloseStream
Stream opened by parser. |
| protected XMLDocumentHandler | fDocumentHandler Document handler. |
| protected HTMLScanner | fDocumentScanner Document scanner. |
| protected XMLDTDContentModelHandler | fDTDContentModelHandler DTD content model handler. |
| protected XMLDTDHandler | fDTDHandler DTD handler. |
| protected XMLEntityResolver | fEntityResolver Entity resolver. |
| protected XMLErrorHandler | fErrorHandler Error handler. |
| protected HTMLErrorReporter | fErrorReporter Error reporter. |
| protected Vector | fHTMLComponents Components. |
| protected Locale | fLocale Locale. |
| protected NamespaceBinder | fNamespaceBinder Namespace binder. |
| protected HTMLTagBalancer | fTagBalancer HTML tag balancer. |
| protected static String | FILTERS Pipeline filters. |
| protected static String | NAMESPACES Namespaces. |
| protected static String | NAMES_ATTRS Modify HTML attribute names: { "upper", "lower", "default" }. |
| protected static String | NAMES_ELEMS Modify HTML element names: { "upper", "lower", "default" }. |
| protected static String | REPORT_ERRORS Report errors. |
| protected static String | SIMPLE_ERROR_FORMAT Simple report format. |
| protected static boolean | XERCES_2_0_0 Parser version is Xerces 2.0.0. |
| protected static boolean | XERCES_2_0_1 Parser version is Xerces 2.0.1. |
| protected static boolean | XML4J_4_0_x Parser version is XML4J 4.0.x. |
| Constructor Summary | |
|---|---|
| HTMLConfiguration() Default constructor. | |
| Method Summary | |
|---|---|
| protected void | addComponent(HTMLComponent component) Adds a component. |
| void | cleanup()
If the application decides to terminate parsing before the xml document
is fully parsed, the application should call this method to free any
resource allocated during parsing. |
| XMLDocumentHandler | getDocumentHandler() Returns the document handler. |
| XMLDTDContentModelHandler | getDTDContentModelHandler() Returns the DTD content model handler. |
| XMLDTDHandler | getDTDHandler() Returns the DTD handler. |
| XMLEntityResolver | getEntityResolver() Returns the entity resolver. |
| XMLErrorHandler | getErrorHandler() Returns the error handler. |
| Locale | getLocale() Returns the locale. |
| void | parse(XMLInputSource source) Parses a document. |
| boolean | parse(boolean complete)
Parses the document in a pull parsing fashion.
|
| void | pushInputSource(XMLInputSource inputSource)
Pushes an input source onto the current entity stack. |
| protected void | reset() Resets the parser configuration. |
| void | setDocumentHandler(XMLDocumentHandler handler) Sets the document handler. |
| void | setDTDContentModelHandler(XMLDTDContentModelHandler handler) Sets the DTD content model handler. |
| void | setDTDHandler(XMLDTDHandler handler) Sets the DTD handler. |
| void | setEntityResolver(XMLEntityResolver resolver) Sets the entity resolver. |
| void | setErrorHandler(XMLErrorHandler handler) Sets the error handler. |
| void | setFeature(String featureId, boolean state) Sets a feature. |
| void | setInputSource(XMLInputSource inputSource)
Sets the input source for the document to parse.
|
| void | setLocale(Locale locale) Sets the locale. |
| void | setProperty(String propertyId, Object value) Sets a property. |
Parameters: complete True if the pull parser should parse the remaining document completely.
Returns: True if there is more document to parse.
Throws: XNIException Any XNI exception, possibly wrapping another exception. IOException An IO exception from the parser, possibly from a byte stream or character stream supplied by the parser.
See Also: HTMLConfiguration
Hint: To use this feature to insert the output of <SCRIPT> tags, remember to buffer the entire output of the processed instructions before pushing a new input source. Otherwise, events may appear out of sequence.
Parameters: inputSource The new input source to start scanning.
Parameters: inputSource The document's input source.
Throws: XMLConfigurationException Thrown if there is a configuration error when initializing the parser. IOException Thrown on I/O error.
See Also: HTMLConfiguration