限定符和类型 | 方法和说明 |
---|---|
Remark |
NodeFactory.createRemarkNode(Page page,
int start,
int end)
Create a new remark node.
|
Text |
NodeFactory.createStringNode(Page page,
int start,
int end)
Create a new text node.
|
Tag |
NodeFactory.createTagNode(Page page,
int start,
int end,
java.util.Vector attributes)
Create a new tag node.
|
void |
Node.doSemanticAction()
Perform the meaning of this tag.
|
NodeIterator |
Parser.elements() |
NodeList |
Parser.extractAllNodesThatMatch(NodeFilter filter)
Extract all nodes matching the given filter.
|
NodeList |
Parser.parse(NodeFilter filter) |
void |
Parser.postConnect(java.net.HttpURLConnection connection)
Called just after calling connect.
|
void |
Parser.preConnect(java.net.HttpURLConnection connection)
Called just prior to calling connect.
|
void |
Parser.setConnection(java.net.URLConnection connection)
Set the connection for this parser.
|
void |
Parser.setEncoding(java.lang.String encoding)
Set the encoding for the page this parser is reading from.
|
void |
Parser.setInputHTML(java.lang.String inputHTML)
Initializes the parser with the given input HTML String.
|
void |
Parser.setResource(java.lang.String resource)
Set the html, a url, or a file.
|
void |
Parser.setURL(java.lang.String url)
Set the URL for this parser.
|
void |
Parser.visitAllNodesWith(NodeVisitor visitor)
Apply the given visitor to the current page.
|
构造器和说明 |
---|
Parser(java.lang.String resource)
Creates a Parser object with the location of the resource (URL or file).
|
Parser(java.lang.String resource,
ParserFeedback feedback)
Creates a Parser object with the location of the resource (URL or file)
You would typically create a DefaultHTMLParserFeedback object and pass
it in.
|
Parser(java.net.URLConnection connection)
Construct a parser using the provided URLConnection.
|
Parser(java.net.URLConnection connection,
ParserFeedback fb)
Constructor for custom HTTP access.
|
限定符和类型 | 方法和说明 |
---|---|
java.net.URLConnection |
ConnectionManager.openConnection(java.lang.String string)
Opens a connection based on a given string.
|
java.net.URLConnection |
ConnectionManager.openConnection(java.net.URL url)
Opens a connection using the given url.
|
void |
ConnectionMonitor.postConnect(java.net.HttpURLConnection connection)
Called just after calling connect.
|
void |
ConnectionMonitor.preConnect(java.net.HttpURLConnection connection)
Called just prior to calling connect.
|
限定符和类型 | 方法和说明 |
---|---|
char |
Page.getCharacter(Cursor cursor)
Read the character at the given cursor position.
|
static void |
Lexer.main(java.lang.String[] args)
Mainline for command line operation
|
Node |
Lexer.nextNode()
Get the next node from the source.
|
Node |
Lexer.nextNode(boolean quotesmart)
Get the next node from the source.
|
Node |
Lexer.parseCDATA()
Return CDATA as a text node.
|
Node |
Lexer.parseCDATA(boolean quotesmart)
Return CDATA as a text node.
|
void |
Page.setConnection(java.net.URLConnection connection)
Set the URLConnection to be used by this page.
|
void |
StringSource.setEncoding(java.lang.String character_set)
Set the encoding to the given character set.
|
abstract void |
Source.setEncoding(java.lang.String character_set)
Set the encoding to the given character set.
|
void |
Page.setEncoding(java.lang.String character_set)
Begins reading from the source with the given character set.
|
void |
InputStreamSource.setEncoding(java.lang.String character_set)
Begins reading from the source with the given character set.
|
void |
Page.ungetCharacter(Cursor cursor)
Return a character.
|
构造器和说明 |
---|
Lexer(java.net.URLConnection connection)
Creates a new instance of a Lexer.
|
Page(java.net.URLConnection connection)
Construct a page reading from a URL connection.
|
限定符和类型 | 方法和说明 |
---|---|
void |
AbstractNode.doSemanticAction()
Perform the meaning of this tag.
|
限定符和类型 | 方法和说明 |
---|---|
java.lang.String |
StringExtractor.extractStrings(boolean links)
Extract the text from a page.
|
限定符和类型 | 方法和说明 |
---|---|
void |
Feedback.error(java.lang.String message,
ParserException e)
Error message.
|
限定符和类型 | 方法和说明 |
---|---|
static java.lang.String |
ScriptDecoder.Decode(Page page,
Cursor cursor)
Decode script encoded by the Microsoft obfuscator.
|
Tag |
TagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan the tag.
|
Tag |
StyleScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan for style definitions.
|
Tag |
ScriptScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan for script.
|
Tag |
Scanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Scan the tag.
|
Tag |
CompositeTagScanner.scan(Tag tag,
Lexer lexer,
NodeList stack)
Collect the children.
|
限定符和类型 | 方法和说明 |
---|---|
void |
MetaTag.doSemanticAction()
Perform the META tag semantic action.
|
void |
BaseHrefTag.doSemanticAction()
Perform the meaning of this tag.
|
限定符和类型 | 方法和说明 |
---|---|
int |
FunctionalTests.countImageTagsWithHTMLParser() |
void |
ParserTestCase.parseAndAssertNodeCount(int nodeCountExpected) |
void |
ParserTestCase.parseNodes() |
void |
ParserTest.testDoubleQuotedCharset()
Test the case of a double quoted charset directive.
|
void |
ParserTest.testFixSpaces() |
void |
ParserTest.testImageCollection() |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers1()
Test to ensure that the
Tag being created by the
CompositeTagScanner has the correct startLine and endLine
information in the TagData it is constructed with. |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers2() |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers3() |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers4() |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers5() |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers6() |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers7() |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers8() |
void |
LineNumberAssignedByNodeReaderTest.testLineNumbers9() |
void |
ParserTest.testLinkCollection() |
void |
FunctionalTests.testNumImageTagsInYahooWithoutRegisteringScanners()
Based on a suspected bug report by Annette Doyle,
to check if the no of image tags are correctly
identified by the parser
|
void |
ParserTest.testSingleQuotedCharset()
Test the case of a single quoted charset directive.
|
void |
ParserTest.testSwitchCharset()
Test the case of a charset directive different than the HTTP header.
|
void |
ParserTest.testURLWithSpaces() |
限定符和类型 | 方法和说明 |
---|---|
void |
FilterTest.testAnd()
Test and filtering.
|
void |
FilterTest.testAttribute()
Test attribute filtering.
|
void |
FilterTest.testChild()
Test child filtering.
|
void |
FilterTest.testNodeClass()
Test node class filtering.
|
void |
FilterTest.testNot()
Test not filtering.
|
void |
FilterTest.testOr()
Test or filtering.
|
void |
FilterTest.testString()
Test string filtering.
|
void |
FilterTest.testTagName()
Test tag name filtering.
|
限定符和类型 | 方法和说明 |
---|---|
void |
PageTests.test1() |
void |
PageTests.test10() |
void |
PageTests.test11() |
void |
PageTests.test12() |
void |
PageTests.test13() |
void |
PageTests.test14() |
void |
PageTests.test15() |
void |
PageTests.test16() |
void |
PageTests.test17() |
void |
PageTests.test18() |
void |
PageTests.test19() |
void |
PageTests.test2() |
void |
PageTests.test20() |
void |
PageTests.test21() |
void |
PageTests.test22() |
void |
PageTests.test23() |
void |
PageTests.test24() |
void |
PageTests.test25() |
void |
PageTests.test26() |
void |
PageTests.test27() |
void |
PageTests.test28() |
void |
PageTests.test29() |
void |
PageTests.test3() |
void |
PageTests.test30() |
void |
PageTests.test31() |
void |
PageTests.test32() |
void |
PageTests.test33() |
void |
PageTests.test34() |
void |
PageTests.test35() |
void |
PageTests.test36() |
void |
PageTests.test37() |
void |
PageTests.test38() |
void |
PageTests.test39() |
void |
PageTests.test4() |
void |
PageTests.test40() |
void |
PageTests.test41() |
void |
PageTests.test5() |
void |
PageTests.test6() |
void |
PageTests.test7() |
void |
PageTests.test8() |
void |
PageTests.test9() |
void |
AttributeTests.testAltAndSrc() |
void |
LexerTests.testAttributedTag()
Test operation with attributed tags.
|
void |
LexerTests.testCommentInScript()
See bug #1227213 Particular SCRIPT tags close too late.
|
void |
LexerTests.testConjoined()
See bug #825820 Words conjoined
|
void |
LexerTests.testDosEOL()
Test operation with Dos line endings.
|
void |
LexerTests.testDosLineEndingInName()
See bug #1493884 Lexer returns a TagNode with a 'null' name
|
void |
AttributeTests.testEmptyAltAndSrc() |
void |
LexerTests.testEOF_EOL()
Test operation with line endings near the end of input.
|
void |
LexerTests.testEscapedQuote()
See bug #899413 bug in javascript end detection.
|
void |
LexerTests.testFidelity()
Test the fidelity of the toHtml() method.
|
void |
LexerTests.testJIS()
Test case for bug #789439 Japanese page causes OutOfMemory Exception
No exception is thrown in the current version of the parser,
however, the problem is that ISO-2022-JP (aka JIS) encoding sometimes
causes spurious tags.
|
void |
LexerTests.testJsp()
See bug #880283 Character ">" erroneously inserted by Lexer
|
void |
TagTests.testMissingValueToHTML()
Test the toHTML method for a missing value attribute.
|
void |
TagTests.testMultiLine1() |
void |
TagTests.testMultiLine2() |
void |
TagTests.testMultiLine3() |
void |
TagTests.testMultiLine4() |
void |
TagTests.testMultiLine5() |
void |
TagTests.testMultiLine6() |
void |
TagTests.testMultiLine7() |
void |
AttributeTests.testNoSpace() |
void |
PageTests.testNull()
Test initialization with a null value.
|
void |
LexerTests.testPI()
Unit test for new PI parsing code.
|
void |
AttributeTests.testPredicates() |
void |
LexerTests.testPureTag()
Test operation with only tags.
|
void |
LexerTests.testPureText()
Test operation without tags.
|
void |
LexerTests.testRemark()
Test operation with comments.
|
void |
AttributeTests.testSetQuote() |
void |
AttributeTests.testSrcAndAlt() |
void |
AttributeTests.testSrcAndEmptyAlt() |
void |
LexerTests.testStackOverflow()
Check for StackOverflow error.
|
void |
TagTests.testStandAloneToHTML()
Test the toHTML method for a standalone attribute.
|
void |
LexerTests.testTagStops()
Test that tags stop string nodes.
|
void |
TagTests.testTagWithCloseTagSymbolInAttribute() |
void |
TagTests.testTagWithOpenTagSymbolInAttribute() |
void |
TagTests.testTagWithSingleQuote() |
void |
LexerTests.testUnixEOL()
Test operation with Unix line endings.
|
void |
PageTests.testURLConnection()
Test initialization with a real value.
|
void |
LexerTests.testUrlInStyle()
See bug #1227213 Particular SCRIPT tags close too late.
|
限定符和类型 | 方法和说明 |
---|---|
void |
RemarkNodeParserTest.test3DashesEnding()
Test a comment ending with ---.
|
void |
RemarkNodeParserTest.testDashEnding()
Test a comment ending with -.
|
void |
RemarkNodeParserTest.testDashesInComment()
Bug reported by John Zook [594301]
If dashes exist in a comment, they dont get added to the comment text
|
void |
RemarkNodeParserTest.testDoubleComment()
Test a comment declaration with two comments.
|
void |
RemarkNodeParserTest.testEmptyComment()
Test a comment declaration without any comments.
|
void |
StringParserTest.testEmptyLines() |
void |
RemarkNodeParserTest.testExclamationComment()
Test exclamation mark ending.
|
void |
RemarkNodeParserTest.testGetText() |
void |
RemarkNodeParserTest.testInvalidTag()
Bug reported by John Zook [594301], invalid remark nodes are accepted as remark nodes.
|
void |
CompositeTagScannerHelperTest.testIsXmlEndTagForFalseMatches() |
void |
CompositeTagScannerHelperTest.testIsXmlEndTagForRealXml() |
void |
StringParserTest.testLastLineWithOneChar()
Based on a bug report submitted by Cedric Rosa, if the last line contains a single character,
Text does not return the string node correctly.
|
void |
RemarkNodeParserTest.testNonRemark() |
void |
RemarkNodeParserTest.testRemarkBug()
Test unparsed remark node.
|
void |
RemarkNodeParserTest.testRemarkWithBlankLine()
This is the simulation of bug report 586756, submitted
by John Zook.
|
void |
RemarkNodeParserTest.testRemarkWithNothing()
This is the simulation of a bug report submitted
by Claude Duguay.
|
void |
RemarkNodeParserTest.testSingleComment()
Test a comment declaration with a comment.
|
void |
StringParserTest.testStringBeingMissedBug()
This is a bug reported by John Zook (586222), where the first few chars
before a remark is being missed, if its on the same line.
|
void |
StringParserTest.testStringWithEmptyLine() |
void |
StringParserTest.testTagCharsInText()
Bug reported by Roger Sollberger
For the following HTML : <a href="http://asgard.ch">[< ASGARD ></a><br> The string node is not correctly identified |
void |
RemarkNodeParserTest.testTagWithinRemark()
Test tag within remark.
|
void |
StringParserTest.testTextBug1()
The bug being reproduced is this :
<HTML><HEAD><TITLE>Google</TITLE> The above line is incorrectly parsed in that, the text Google is missed. |
void |
StringParserTest.testTextBug2()
Test string containing link.
|
void |
StringParserTest.testToHTML() |
void |
StringParserTest.testToPlainTextString() |
void |
RemarkNodeParserTest.testToRawString() |
限定符和类型 | 方法和说明 |
---|---|
void |
CompositeTagScannerTest.testComplexNesting() |
void |
CompositeTagScannerTest.testCompositeTagCorrectionWithSplitLines() |
void |
CompositeTagScannerTest.testCompositeTagWithAnotherTagChild() |
void |
CompositeTagScannerTest.testCompositeTagWithDeadlock() |
void |
CompositeTagScannerTest.testCompositeTagWithErroneousAnotherTag() |
void |
CompositeTagScannerTest.testCompositeTagWithErroneousAnotherTagAndLineBreak() |
void |
CompositeTagScannerTest.testCompositeTagWithNestedTag() |
void |
CompositeTagScannerTest.testCompositeTagWithOneTextChild() |
void |
CompositeTagScannerTest.testCompositeTagWithSelfChildren() |
void |
CompositeTagScannerTest.testCompositeTagWithTagChild() |
void |
CompositeTagScannerTest.testCompositeTagWithTwoNestedTags() |
void |
ScriptScannerTest.testDecodePage()
See bug #902121 StringBean throws NullPointerException
Contributed by Reza Motori (rezamotori)
|
void |
ScriptScannerTest.testDecodeScript()
See bug #902121 StringBean throws NullPointerException
Contributed by Reza Motori (rezamotori)
|
void |
CompositeTagScannerTest.testDisallowedChildren() |
void |
CompositeTagScannerTest.testEmptyCompositeTag() |
void |
CompositeTagScannerTest.testEmptyCompositeTagAnotherStyle() |
void |
CompositeTagScannerTest.testErroneousCompositeTag() |
void |
CompositeTagScannerTest.testErroneousCompositeTagWithChildren() |
void |
CompositeTagScannerTest.testErroneousCompositeTagWithChildrenAndLineBreak() |
void |
CompositeTagScannerTest.testInvalidNesting()
Extracted from "http://scores.nba.com/games/20031029/scoreboard.html"
which has a lot of table columns with unclosed DIV tags because the
closing DIV doesn't have a slash.
|
void |
CompositeTagScannerTest.testParentConnections() |
void |
CompositeTagScannerTest.testParseTwoCompositeTags() |
void |
ScriptScannerTest.testScan() |
void |
JspScannerTest.testScan()
In response to bug report 621117, wherein jsp tags
are not recognized if they occur within string nodes.
|
void |
ScriptScannerTest.testScanBug()
Test javascript tag attributes.
|
void |
ScriptScannerTest.testScanBugWG()
Test script code.
|
void |
ScriptScannerTest.testScanNoEndTag()
Tests a bug in ScriptScanner where a NPE would be thrown if the
script tag was not closed before the document ended.
|
void |
ScriptScannerTest.testScanQuotedEndTag() |
void |
ScriptScannerTest.testScanScriptWithComments() |
void |
ScriptScannerTest.testScanScriptWithJavascriptLineEndings() |
void |
ScriptScannerTest.testScanScriptWithLinks() |
void |
ScriptScannerTest.testScanScriptWithTags() |
void |
ScriptScannerTest.testScanScriptWithTagsInComment() |
void |
ScriptScannerTest.testScriptCodeExtraction() |
void |
ScriptScannerTest.testScriptCodeExtractionWithMultipleQuotes() |
void |
ScriptScannerTest.testScriptCodeExtractionWithNewlines()
There was a bug in the ScriptScanner when there was multiline script and
the last line did not have a newline before the end script tag.
|
void |
ScriptScannerTest.testScriptsWithForm()
See bug #839264 toHtml() parse error in Javascripts with "form" keyword
Contributed by Ivan Wang (xj92wang)
|
void |
ScriptScannerTest.testScriptTagComments()
Submitted by Dhaval Udani - reproducing bug 664404
|
void |
XmlEndTagScanningTest.testSingleTagParsing() |
void |
TagScannerTest.testTagExtraction() |
void |
CompositeTagScannerTest.testTwoConsecutiveErroneousCompositeTags() |
void |
JspScannerTest.testUnclosedTagInsideJsp()
Testcase submitted by Johan Naudts, demonstrating bug
717573, NullPointerException when unclosed HTML tag
inside JSP tag
|
void |
CompositeTagScannerTest.testUrlBeingProvidedToCreateTag() |
void |
CompositeTagScannerTest.testXmlTypeCompositeTags() |
限定符和类型 | 方法和说明 |
---|---|
void |
DoctypeTagTest.DocTypeElementTest()
See bug #833592 DOCTYPE element is not parsed correctly
Contributed by Trevor Watson (t007).
|
void |
LinkTagTest.testAccessKey() |
void |
TagTest.testAttributesReconstruction() |
void |
LinkTagTest.testBadImageInLinkBug() |
void |
LinkTagTest.testBaseRefLink() |
void |
BodyTagTest.testBodyEnding() |
void |
BodyTagTest.testBodyMixed() |
void |
TagTest.testBodyTagBug1()
The bug being reproduced is this :
<BODY aLink=#ff0000 bgColor=#ffffff link=#0000cc onload=setfocus() text=#000000 vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
BodyTagTest.testBodywithJsp() |
void |
LinkTagTest.testBrokenLink()
Test suggested by Cedric Rosa
A really bad link tag sends parser into infinite loop
|
void |
TagTest.testBrokenTag()
Bug report by Cedric Rosa, causing null pointer exceptions when encountering a broken tag,
and if this has no further lines to parse
|
void |
AppletTagTest.testChangeAppletClass() |
void |
AppletTagTest.testChangeAppletParams() |
void |
AppletTagTest.testChangeArchive() |
void |
AppletTagTest.testChangeCodebase() |
void |
ScriptTagTest.testCreation() |
void |
CompositeTagTest.testDigupStringNode() |
void |
ImageTagTest.testDirectRelativeLinks() |
void |
TitleTagTest.testDoubleTitleTag()
If there are duplicates of the title tag, the parser crashes.
|
void |
ImageTagTest.testDynamicRelativeImageScan() |
void |
ImageTagTest.testEmptyStringElement() |
void |
TagTest.testEmptyTag()
Ignore empty tags.
|
void |
TagTest.testEmptyTag2()
Ignore empty tags.
|
void |
TagTest.testEmptyTag3()
Ignore empty tags.
|
void |
TagTest.testEmptyTag4()
Ignore empty tags.
|
void |
TagTest.testEmptyTag5()
Ignore empty tags.
|
void |
TagTest.testEmptyTag6()
Ignore empty tags.
|
void |
TagTest.testEmptyTagParseParameter()
Test parseParameter method
Created by Kaarle Kaila (09 Jan 2003)
This test just wants the text in the element
|
void |
EndTagTest.testEndTagFind() |
void |
LinkTagTest.testErroneousLinkBug() |
void |
LinkTagTest.testErroneousLinkBugFromYahoo()
Test case based on a report by Raghavender Srimantula, of the parser giving out of memory exceptions.
|
void |
LinkTagTest.testErroneousLinkBugFromYahoo2()
Test case based on a report by Raghavender Srimantula, of the parser giving out of memory exceptions.
|
void |
TableTagTest.testErroneousTables() |
void |
ImageTagTest.testExtractImageLocnInvertedCommasBug()
This is the reproduction of a bug which causes a null pointer exception
|
void |
LinkTagTest.testExtractLinkInvertedCommasBug2()
This is the reproduction of a bug which produces multiple text copies.
|
void |
CompositeTagTest.testFindPositionOf() |
void |
LinkTagTest.testFreshMeatBug()
A bug in the freshmeat page - really bad html
tag - <A>Revision<\a>
Reported by Mazlan Mat
Note: Actually, this is completely legal HTML - Derrick
|
void |
LinkTagTest.testFTPLink() |
void |
TagTest.testHTMLOutputOfDifficultLinksWithRegisterScanners()
See bug #741026 registerScanners() mangles output HTML badly.
|
void |
TagTest.testIgnoreState() |
void |
ImageTagTest.testImageTag()
The bug being reproduced is this :
<BODY aLink=#ff0000 bgColor=#ffffff link=#0000cc onload=setfocus() text=#000000 vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
ImageTagTest.testImageTagBug()
The bug being reproduced is this :
<BODY aLink=#ff0000 bgColor=#ffffff link=#0000cc onload=setfocus() text=#000000 vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
ImageTagTest.testImageTageBug2()
The bug being reproduced is this :
<BODY aLink=#ff0000 bgColor=#ffffff link=#0000cc onload=setfocus() text=#000000 vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
ImageTagTest.testImageTagOnMultipleLines()
This is the reproduction of a bug reported
by Annette Doyle
|
void |
ImageTagTest.testImageTagsFromYahoo()
Test case to reproduce bug reported by Annette
|
void |
ImageTagTest.testImageTagsFromYahooWithAllScannersRegistered()
Test case to reproduce bug reported by Annette
|
void |
ImageTagTest.testImageTagSingleQuoteBug()
This bug occurs when there is a null pointer exception thrown while scanning a tag using LinkScanner.
|
void |
ImageTagTest.testImageWithNewLineChars() |
void |
ImageTagTest.testImageWithSpaces()
Test image url which contains spaces in it.
|
void |
TitleTagTest.testIncompleteTitle()
Testcase to reproduce a bug reported by Cedric Rosa,
on not ending the title tag correctly, we would get
null pointer exceptions..
|
void |
TagTest.testIncorrectInvertedCommas() |
void |
TagTest.testIncorrectInvertedCommas2() |
void |
TagTest.testIncorrectInvertedCommas3() |
void |
DivTagTest.testInputInDiv()
Test case for bug #735193 Explicit tag type recognition for CompositTags not working.
|
void |
LinkTagTest.testIrcIsNotAHTTPLink()
Bug #784767 irc://server/channel urls are HTTPLike?
|
void |
LinkTagTest.testJavascriptLink() |
void |
JspTagTest.testJspTag()
Check if the JSP Tag is being correctly recognized.
|
void |
JspTagTest.testJspTagsInQuotedAttribes()
See bug #772700 Jsp Tags are not parsed correctly when in quoted attributes.
|
void |
JspTagTest.testJspTagsInUnQuotedAttribes()
See bug #772700 Jsp Tags are not parsed correctly when in quoted attributes.
|
void |
LabelTagTest.testLabelsID() |
void |
LabelTagTest.testLabelWithJspTag() |
void |
LabelTagTest.testLabelWithManyCompositeTags() |
void |
LabelTagTest.testLabelWithOtherTags() |
void |
TagTest.testLargeTagBug()
The following should be identified as a tag :
<MYTAG abcd\n"+ "efgh\n"+ "ijkl\n"+ "mnop> Creation date: (6/17/2001 5:27:42 PM) |
void |
LinkTagTest.testLinkDataContents() |
void |
LinkTagTest.testLinkNodeBug()
The bug being reproduced is this :
<BODY aLink=#ff0000 bgColor=#ffffff link=#0000cc onload=setfocus() text=#000000 vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
LinkTagTest.testLinkNodeBug2()
The bug being reproduced is this :
<BODY aLink=#ff0000 bgColor=#ffffff link=#0000cc onload=setfocus() text=#000000 vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
LinkTagTest.testLinkNodeBug3()
The bug being reproduced is this :
When a url ends with a slash, and the link begins with a slash,the parser puts two slashes This bug was submitted by Roget Kjensrud Creation date: (6/17/2001 4:01:06 PM) |
void |
LinkTagTest.testLinkNodeBug4()
The bug being reproduced is this :
Simple url without index.html, doesent get appended to link This bug was submitted by Roget Kjensrud Creation date: (6/17/2001 4:01:06 PM) |
void |
LinkTagTest.testLinkNodeBug5() |
void |
LinkTagTest.testLinkNodeBugNullPointerException()
This bug occurs when there is a null pointer exception thrown while scanning a tag using LinkScanner.
|
void |
LinkTagTest.testLinkNodeMailtoBug()
This bug occurs when there is a null pointer exception thrown while scanning a tag using LinkScanner.
|
void |
LinkTagTest.testLinkNodeSingleQuoteBug()
This bug occurs when there is a null pointer exception thrown while scanning a tag using LinkScanner.
|
void |
LinkTagTest.testLinkSpacesBug()
Bug pointed out by Sam Joseph (sam@neurogrid.net)
Links with spaces in them will get their spaces absorbed
|
void |
LinkTagTest.testLinkTag()
The bug being reproduced is this :
<BODY aLink=#ff0000 bgColor=#ffffff link=#0000cc onload=setfocus() text=#000000 vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
LinkTagTest.testLinkTagBug()
The bug being reproduced is this :
<BODY aLink=#ff0000 bgColor=#ffffff link=#0000cc onload=setfocus() text=#000000 vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
LinkTagTest.testMailToIsNotAHTTPLink()
Test mail link.
|
void |
LinkTagTest.testMailtoLink() |
void |
ImageTagTest.testMapFollowImg()
Not reproducible.
|
void |
MetaTagTest.testMetaTagBug()
Tried to reproduce bug 707447 but test passes
|
void |
MetaTagTest.testMetaTagWithOpenTagSymbol()
Bug report 702547 by Joe Robbins being reproduced.
|
void |
BulletListTagTest.testMissingendtag() |
void |
ImageTagTest.testMissingEqualTo()
Based on a page submitted by Claude Duguay, the image tag has IMG SRC"somefile.jpg" - a missing equal
to sign
|
void |
LinkTagTest.testMultipleLineBug()
Bug reported by Raj Sharma,5-Apr-2002, upon parsing
http://www.samachar.com, the entire page could not be picked up.
|
void |
LabelTagTest.testNestedLabels() |
void |
LabelTagTest.testNestedLabels2() |
void |
TagTest.testNestedTags()
Bug reported by Gordon Deudney 2002-03-15
Nested JSP Tags were not working
|
void |
TitleTagTest.testNoEndTitleTag()
Testcase based on Claude Duguay's report.
|
void |
BulletTagTest.testNonEndedBullets() |
void |
LinkTagTest.testNotFTPLink() |
void |
BaseHrefTagTest.testNotHREFBaseTag() |
void |
LinkTagTest.testNotJavascriptLink() |
void |
LinkTagTest.testNotMailtoLink() |
void |
ImageTagTest.testNullImageBug()
The bug being reproduced is this :
<A HREF=>Something<A> vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
LinkTagTest.testNullTagBug()
The bug being reproduced is this :
<A HREF=>Something<A> vLink=#551a8b> The above line is incorrectly parsed in that, the BODY tag is not identified. |
void |
ObjectCollectionTest.testOneLevelNesting() |
void |
BulletTagTest.testOutOfMemoryBug() |
void |
TableTagTest.testOverFlow()
See bug #750117 StackOverFlow while Node-Iteration
Not reproducible.
|
void |
TagTest.testParameterChange()
See bug #740411 setParsed() has no effect on output.
|
void |
ScriptTagTest.testParamExtraction() |
void |
TagTest.testParseParameter3()
Test parseParameter method
Created by Kaarle Kaila (august 2001)
the tag name is here G
|
void |
TagTest.testParseParameterA()
Test parseParameter method
Created by Kaarle Kaila (august 2001)
the tag name is here A (and should be eaten up by linkScanner)
|
void |
TagTest.testParseParameterG()
Test parseParameter method
Created by Kaarle Kaila (august 2001)
the tag name is here G
|
void |
TagTest.testParseParameterSpace()
Test parseParameter method
Created by Kaarle Kaila (august 2002)
the tag name is here A (and should be eaten up by linkScanner)
Tests elements where = sign is surrounded by spaces
|
void |
ImageTagTest.testPlaceHolderImageScan()
This test has been improved to check for params
in the image tag, based on requirement by Annette Doyle.
|
void |
LinkTagTest.testQueryLink()
This is a reproduction of bug 617228, reported by
Stephen J.
|
void |
TableTagTest.testRecursionDepth()
Test many unclosed tags (causes heavy recursion).
|
void |
ImageTagTest.testRelativeImageScan() |
void |
ImageTagTest.testRelativeImageScan2() |
void |
ImageTagTest.testRelativeImageScan3() |
void |
LinkTagTest.testRelativeLinkNotHTMLBug() |
void |
LinkTagTest.testRelativeLinkScan() |
void |
LinkTagTest.testRelativeLinkScan2() |
void |
LinkTagTest.testRelativeLinkScan3() |
void |
LinkTagTest.testRelativeTypeHttpLink() |
void |
TitleTagTest.testScan() |
void |
TextareaTagTest.testScan() |
void |
TableTagTest.testScan() |
void |
StyleTagTest.testScan() |
void |
SelectTagTest.testScan() |
void |
OptionTagTest.testScan() |
void |
MetaTagTest.testScan() |
void |
LinkTagTest.testScan()
Test scan with data which is of diff nodes type
|
void |
InputTagTest.testScan() |
void |
FrameTagTest.testScan() |
void |
FrameSetTagTest.testScan() |
void |
FormTagTest.testScan() |
void |
DivTagTest.testScan() |
void |
BulletListTagTest.testScan() |
void |
BaseHrefTagTest.testScan() |
void |
AppletTagTest.testScan() |
void |
StyleTagTest.testScanBug() |
void |
StyleTagTest.testScanBug2()
This is a bug reported by Kaarle Kaaila.
|
void |
StyleTagTest.testScanBug3()
This is a bug reported by Dr.
|
void |
FormTagTest.testScanFormWithComments()
Bug 652674 - forms with comments are not being parsed
|
void |
FormTagTest.testScanFormWithComments2()
Bug 652674 - forms with comments are not being parsed
|
void |
FormTagTest.testScanFormWithLinks()
Bug reported by Pavan Podila - forms with links are not being parsed
Sample html is from google
|
void |
FormTagTest.testScanFormWithPreviousOpenLink()
Bug 656870 - a form tag with a previously open link causes infinite loop
on encounter
|
void |
MetaTagTest.testScanTagsInMeta() |
void |
FormTagTest.testSearchByName() |
void |
FormTagTest.testSearchFor() |
void |
FormTagTest.testSearchForCaseSensitive() |
void |
FormTagTest.testSetFormLocation() |
void |
TagTest.testSetText()
See bug #726913 toHtml() method incomplete
|
void |
HeadTagTest.testSH() |
void |
BodyTagTest.testSimpleBody() |
void |
HeadTagTest.testSimpleHead() |
void |
HeadTagTest.testSimpleHeadWithBody() |
void |
HeadTagTest.testSimpleHeadWithoutEndTag() |
void |
LabelTagTest.testSimpleLabels() |
void |
ObjectCollectionTest.testSimpleSearch() |
void |
ScriptTagTest.testSingleApostropheParsingBug() |
void |
JspTagTest.testSpecialCharacters() |
void |
TagTest.testStrictParsing()
Reproduction of a bug reported by Annette Doyle
This is actually a pretty good example of dirty html - we are in a fix
here, bcos the font tag (the first one) has an erroneous inverted comma.
|
void |
StyleTagTest.testStyleChildren()
See bug #900125 Style Tag Children not grouped
|
void |
TagTest.testStyleSheetTag() |
void |
TagTest.testTabText()
From oyoaha
|
void |
TagTest.testTagInsideTag() |
void |
TitleTagTest.testTitleTagContainsJspTag() |
void |
JspTagTest.testToHtml()
Check if the JSP Tag is being correctly recognized.
|
void |
TitleTagTest.testToHTML() |
void |
TextareaTagTest.testToHTML() |
void |
TagTest.testToHTML() |
void |
StyleTagTest.testToHTML() |
void |
SelectTagTest.testToHTML() |
void |
ScriptTagTest.testToHTML() |
void |
OptionTagTest.testToHTML() |
void |
MetaTagTest.testToHTML() |
void |
LinkTagTest.testToHTML() |
void |
InputTagTest.testToHTML() |
void |
ImageTagTest.testToHTML() |
void |
FrameTagTest.testToHTML() |
void |
FrameSetTagTest.testToHTML() |
void |
EndTagTest.testToHTML() |
void |
DoctypeTagTest.testToHTML() |
void |
BodyTagTest.testToHTML() |
void |
AppletTagTest.testToHTML() |
void |
InputTagTest.testToHTML2()
Reproduction of bug report 663038
|
void |
StyleTagTest.testToHtmlAttributes()
Reproducing a bug reported by Dhaval Udani relating to
style tag attributes being missed
|
void |
ScriptTagTest.testToHTMLWG()
Test raw string.
|
void |
TitleTagTest.testToPlainTextString() |
void |
LinkTagTest.testToPlainTextString() |
void |
FormTagTest.testToPlainTextString() |
void |
BodyTagTest.testToPlainTextString() |
void |
TitleTagTest.testToString() |
void |
OptionTagTest.testToString() |
void |
ObjectCollectionTest.testTwoLevelNesting() |
void |
LinkTagTest.testTypeFtp() |
void |
LinkTagTest.testTypeHttpLikeLink() |
void |
LinkTagTest.testTypeHttpLink() |
void |
LinkTagTest.testTypeHttps() |
void |
LinkTagTest.testTypeJavaScript() |
void |
LinkTagTest.testTypeNonHttp() |
void |
TableTagTest.testUnClosed2() |
void |
FormTagTest.testUnclosedOptions()
See bug #745566 StackOverflowError on select with too many unclosed options.
|
void |
ScriptTagTest.testVariableDeclarations() |
void |
TagTest.testWithoutParseParameter()
Test parseParameter method
Created by Kaarle Kaila (22 Oct 2001)
This test just wants the text in the element
|
限定符和类型 | 方法和说明 |
---|---|
void |
CharacterTranslationTest.Generate.parse(java.io.PrintWriter out)
Pull out text elements from the HTML.
|
void |
NonEnglishTest.testNonEnglishCharacters() |
void |
BeanTest.testSerializable() |
void |
BeanTest.testSerializableLinkBean() |
void |
BeanTest.testSerializableScanners() |
void |
BeanTest.testSerializableStringBean() |
void |
BeanTest.testZeroArgLexerConstructor() |
void |
BeanTest.testZeroArgPageConstructor() |
void |
BeanTest.testZeroArgParserConstructor() |
构造器和说明 |
---|
Generate()
Create a Generate object.
|
限定符和类型 | 类和说明 |
---|---|
class |
EncodingChangeException
The encoding is changed invalidating already scanned characters.
|
限定符和类型 | 方法和说明 |
---|---|
void |
ParserFeedback.error(java.lang.String message,
ParserException e) |
static void |
FeedbackManager.error(java.lang.String message,
ParserException e) |
void |
DefaultParserFeedback.error(java.lang.String message,
ParserException exception)
Print an error message.
|
限定符和类型 | 方法和说明 |
---|---|
static Parser |
ParserUtils.createParserParsingAnInputString(java.lang.String input) |
boolean |
NodeIterator.hasMoreNodes()
Check if more nodes are available.
|
boolean |
IteratorImpl.hasMoreNodes()
Check if more nodes are available.
|
Node |
NodeIterator.nextNode()
Get the next node.
|
Node |
IteratorImpl.nextNode()
Get the next node.
|
static java.lang.String[] |
ParserUtils.splitTags(java.lang.String input,
java.lang.Class nodeType) |
static java.lang.String[] |
ParserUtils.splitTags(java.lang.String input,
java.lang.Class nodeType,
boolean recursive,
boolean insideTag) |
static java.lang.String[] |
ParserUtils.splitTags(java.lang.String input,
NodeFilter filter) |
static java.lang.String[] |
ParserUtils.splitTags(java.lang.String input,
NodeFilter filter,
boolean recursive,
boolean insideTag) |
static java.lang.String[] |
ParserUtils.splitTags(java.lang.String input,
java.lang.String[] tags) |
static java.lang.String[] |
ParserUtils.splitTags(java.lang.String input,
java.lang.String[] tags,
boolean recursive,
boolean insideTag) |
static java.lang.String |
ParserUtils.trimTags(java.lang.String input,
java.lang.Class nodeType) |
static java.lang.String |
ParserUtils.trimTags(java.lang.String input,
java.lang.Class nodeType,
boolean recursive,
boolean insideTag) |
static java.lang.String |
ParserUtils.trimTags(java.lang.String input,
NodeFilter filter) |
static java.lang.String |
ParserUtils.trimTags(java.lang.String input,
NodeFilter filter,
boolean recursive,
boolean insideTag) |
static java.lang.String |
ParserUtils.trimTags(java.lang.String input,
java.lang.String[] tags) |
static java.lang.String |
ParserUtils.trimTags(java.lang.String input,
java.lang.String[] tags,
boolean recursive,
boolean insideTag) |
void |
NodeList.visitAllNodesWith(NodeVisitor visitor)
Utility to apply a visitor to a node list.
|