public class HTMLTokenMaker extends AbstractMarkupTokenMaker

This scanner was generated with JFlex, and the generated file was then modified for performance. Memory allocation needs to be almost completely removed to be competitive with the handwritten lexers (subclasses of AbstractTokenMaker), so this class has been modified so that Strings are never allocated (via yytext()), and the scanner never has to worry about refilling its buffer (needlessly copying chars around). We can achieve this because RText always scans exactly 1 line of tokens at a time, and hands the scanner this line as an array of characters (a Segment really). Since tokens contain pointers to char arrays instead of Strings holding their contents, there is no need for allocating new memory for Strings.

The actual algorithm generated for scanning has, of course, not been modified.

If you wish to regenerate this file yourself, keep in mind the following:
- The generated file will contain two definitions of both zzRefill and yyreset. You should hand-delete the second of each definition (the ones generated by the lexer), as these generated methods modify the input buffer, which we'll never have to do.
- You should not call yylex() on the generated scanner directly; rather, you should use getTokenList as you would with any other TokenMaker instance.

Modifier and Type | Field and Description |
---|---|
static int | COMMENT |
static int | CSS |
static int | CSS_C_STYLE_COMMENT |
static int | CSS_CHAR_LITERAL |
static int | CSS_PROPERTY |
static int | CSS_STRING |
static int | CSS_VALUE |
protected TokenImpl | currentToken: Used in the creation of the linked list. |
static int | DTD |
protected TokenImpl | firstToken: The first token in the returned linked list. |
static int | INATTR_DOUBLE |
static int | INATTR_DOUBLE_SCRIPT |
static int | INATTR_DOUBLE_STYLE |
static int | INATTR_SINGLE |
static int | INATTR_SINGLE_SCRIPT: lexical states |
static int | INATTR_SINGLE_STYLE |
static int | INTAG |
static int | INTAG_CHECK_TAG_NAME |
static int | INTAG_SCRIPT |
static int | INTAG_STYLE |
static int | INTERNAL_ATTR_DOUBLE: Type specific to XMLTokenMaker denoting a line ending with an unclosed double-quote attribute. |
static int | INTERNAL_ATTR_DOUBLE_QUOTE_SCRIPT: Token type specifying we're in a double-quoted attribute in a script tag. |
static int | INTERNAL_ATTR_DOUBLE_QUOTE_STYLE: Token type specifying we're in a double-quoted attribute in a style tag. |
static int | INTERNAL_ATTR_SINGLE: Type specific to XMLTokenMaker denoting a line ending with an unclosed single-quote attribute. |
static int | INTERNAL_ATTR_SINGLE_QUOTE_SCRIPT: Token type specifying we're in a single-quoted attribute in a script tag. |
static int | INTERNAL_ATTR_SINGLE_QUOTE_STYLE: Token type specifying we're in a single-quoted attribute in a style tag. |
static int | INTERNAL_CSS: Internal type denoting a line ending in CSS. |
static int | INTERNAL_CSS_CHAR: Internal type denoting a line ending in a CSS single-quote string. |
static int | INTERNAL_CSS_MLC: Internal type denoting a line ending in a CSS multi-line comment. |
static int | INTERNAL_CSS_PROPERTY: Internal type denoting a line ending in a CSS property. |
static int | INTERNAL_CSS_STRING: Internal type denoting a line ending in a CSS double-quote string. |
static int | INTERNAL_CSS_VALUE: Internal type denoting a line ending in a CSS property value. |
static int | INTERNAL_IN_JS: Token type specifying we're in JavaScript. |
static int | INTERNAL_IN_JS_CHAR_INVALID: Token type specifying we're in an invalid multi-line JS single-quoted string. |
static int | INTERNAL_IN_JS_CHAR_VALID: Token type specifying we're in a valid multi-line JS single-quoted string. |
static int | INTERNAL_IN_JS_MLC: Token type specifying we're in a JavaScript multi-line comment. |
static int | INTERNAL_IN_JS_STRING_INVALID: Token type specifying we're in an invalid multi-line JS string. |
static int | INTERNAL_IN_JS_STRING_VALID: Token type specifying we're in a valid multi-line JS string. |
static int | INTERNAL_INTAG: Token type specific to HTMLTokenMaker; this signals that the user has ended a line with an unclosed HTML tag; thus a new line is beginning still inside of the tag. |
static int | INTERNAL_INTAG_SCRIPT: Token type specific to HTMLTokenMaker; this signals that the user has ended a line with an unclosed <script> tag. |
static int | INTERNAL_INTAG_STYLE: Token type specific to HTMLTokenMaker; this signals that the user has ended a line with an unclosed <style> tag. |
static int | JAVASCRIPT |
static int | JS_CHAR |
static int | JS_EOL_COMMENT |
static int | JS_MLC |
static int | JS_STRING |
static int | PI |
protected TokenImpl | previousToken: Used in the creation of the linked list. |
static int | YYEOF: This character denotes the end of file. |
static int | YYINITIAL |
offsetShift, s, start
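The INTERNAL_* types in the table above record the state a line ends in, so that scanning of the next line can resume from that state. The following is a minimal sketch of that idea only, not HTMLTokenMaker's actual logic: it tracks a single construct (an HTML comment), and the class name LineStateSketch and the constants NULL_STATE and IN_COMMENT are hypothetical names invented for this illustration.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy illustration only: tracks whether a line ends inside an HTML comment. */
final class LineStateSketch {
    static final int NULL_STATE = 0;   // hypothetical: the line ended with nothing left open
    static final int IN_COMMENT = -1;  // hypothetical internal (negative) end-of-line state

    /** Returns the state the line ends in, given the state the previous line ended in. */
    static int lastStateOnLine(String line, int initialState) {
        int state = initialState;
        int pos = 0;
        while (pos < line.length()) {
            if (state == IN_COMMENT) {
                int close = line.indexOf("-->", pos);
                if (close < 0) {
                    return IN_COMMENT;      // comment is still open at end of line
                }
                state = NULL_STATE;
                pos = close + 3;
            } else {
                int open = line.indexOf("<!--", pos);
                if (open < 0) {
                    return NULL_STATE;      // nothing is opened on the rest of the line
                }
                state = IN_COMMENT;
                pos = open + 4;
            }
        }
        return state;
    }

    /** Scans lines one at a time, feeding each line's end state into the next line. */
    static List<Integer> scan(List<String> lines) {
        List<Integer> endStates = new ArrayList<>();
        int state = NULL_STATE;
        for (String line : lines) {
            state = lastStateOnLine(line, state);
            endStates.add(state);
        }
        return endStates;
    }
}
```

Because each line only needs the end state of the line before it, an editor can rescan a single edited line and stop as soon as a following line's end state is unchanged.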
Constructor and Description |
---|
HTMLTokenMaker(): Constructor. |
HTMLTokenMaker(java.io.InputStream in): Creates a new scanner. |
HTMLTokenMaker(java.io.Reader in): Creates a new scanner. There is also a java.io.InputStream version of this constructor. |
Modifier and Type | Method and Description |
---|---|
void | addNullToken(): Adds a null token to the end of the current linked list of tokens. |
void | addToken(char[] array, int start, int end, int tokenType, int startOffset): Adds the token specified to the current linked list of tokens. |
void | addToken(char[] array, int start, int end, int tokenType, int startOffset, boolean hyperlink): Adds the token specified to the current linked list of tokens. |
void | addToken(javax.swing.text.Segment segment, int start, int end, int tokenType, int startOffset): Adds the token specified to the current linked list of tokens. |
protected OccurrenceMarker | createOccurrenceMarker(): Returns the occurrence marker to use for this token maker. |
int | getClosestStandardTokenTypeForInternalType(int type): Returns the closest "standard" token type for a given "internal" token type (e.g. one whose value is < 0). |
boolean | getCompleteCloseTags(): Returns whether markup close tags should be completed. |
boolean | getCurlyBracesDenoteCodeBlocks(): Returns whether this programming language uses curly braces ('{' and '}') to denote code blocks. |
javax.swing.Action | getInsertBreakAction(): Returns an action to handle "insert break" key presses. |
int | getLastTokenTypeOnLine(javax.swing.text.Segment text, int initialTokenType): Returns the last token on this line's type if the token is "unfinished", or TokenTypes.NULL if it was finished. |
boolean | getMarkOccurrencesOfTokenType(int type): Returns whether tokens of the specified type should have "mark occurrences" enabled for the current programming language. |
OccurrenceMarker | getOccurrenceMarker(): Returns the object in charge of marking all occurrences of the token at the current caret position, if it is a relevant token. |
boolean | getShouldIndentNextLineAfter(Token token): The default implementation returns false always. |
Token | getTokenList(javax.swing.text.Segment text, int initialTokenType, int startOffset): Returns the first token in the linked list of tokens generated from text. |
protected void | resetTokenList(): Deletes the linked list of tokens so we can begin anew. |
static void | setCompleteCloseTags(boolean complete): Sets whether markup close tags should be completed. |
void | setLanguageIndex(int languageIndex): Sets the language index to assign to tokens moving forward. |
void | yybegin(int newState): Enters a new lexical state. |
char | yycharat(int pos): Returns the character at position pos from the matched text. |
void | yyclose(): Closes the input stream. |
int | yylength(): Returns the length of the matched text region. |
Token | yylex(): Resumes scanning until the next regular expression is matched, the end of input is encountered, or an I/O error occurs. |
void | yypushback(int number): Pushes the specified number of characters back into the input stream. |
void | yyreset(java.io.Reader reader): Resets the scanner to read from a new input stream. |
int | yystate(): Returns the current lexical state. |
java.lang.String | yytext(): Returns the text matched by the current regular expression. |
getLineCommentStartAndEnd, isMarkupLanguage
yybegin
public static final int YYEOF
public static final int INATTR_SINGLE_SCRIPT
public static final int JS_CHAR
public static final int CSS_STRING
public static final int JS_MLC
public static final int CSS_CHAR_LITERAL
public static final int INTAG_SCRIPT
public static final int CSS_PROPERTY
public static final int CSS_C_STYLE_COMMENT
public static final int CSS
public static final int CSS_VALUE
public static final int COMMENT
public static final int INATTR_DOUBLE_SCRIPT
public static final int PI
public static final int JAVASCRIPT
public static final int INTAG
public static final int INTAG_CHECK_TAG_NAME
public static final int INATTR_SINGLE_STYLE
public static final int DTD
public static final int JS_EOL_COMMENT
public static final int INATTR_DOUBLE_STYLE
public static final int INATTR_SINGLE
public static final int YYINITIAL
public static final int INATTR_DOUBLE
public static final int JS_STRING
public static final int INTAG_STYLE
public static final int INTERNAL_ATTR_DOUBLE
public static final int INTERNAL_ATTR_SINGLE
public static final int INTERNAL_INTAG
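public static final int INTERNAL_INTAG_SCRIPT
Token type specific to HTMLTokenMaker; this signals that the user has ended a line with an unclosed <script> tag.
public static final int INTERNAL_ATTR_DOUBLE_QUOTE_SCRIPT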
public static final int INTERNAL_ATTR_SINGLE_QUOTE_SCRIPT
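public static final int INTERNAL_INTAG_STYLE
Token type specific to HTMLTokenMaker; this signals that the user has ended a line with an unclosed <style> tag.
public static final int INTERNAL_ATTR_DOUBLE_QUOTE_STYLE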
public static final int INTERNAL_ATTR_SINGLE_QUOTE_STYLE
public static final int INTERNAL_IN_JS
public static final int INTERNAL_IN_JS_MLC
public static final int INTERNAL_IN_JS_STRING_INVALID
public static final int INTERNAL_IN_JS_STRING_VALID
public static final int INTERNAL_IN_JS_CHAR_INVALID
public static final int INTERNAL_IN_JS_CHAR_VALID
public static final int INTERNAL_CSS
public static final int INTERNAL_CSS_PROPERTY
public static final int INTERNAL_CSS_VALUE
public static final int INTERNAL_CSS_STRING
public static final int INTERNAL_CSS_CHAR
public static final int INTERNAL_CSS_MLC
protected TokenImpl firstToken
protected TokenImpl currentToken
protected TokenImpl previousToken
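The three fields firstToken, currentToken and previousToken are described as being used to build the linked list of tokens. A minimal sketch of how three such references can be kept in step while appending is shown here. It is an illustration under stated assumptions, not the library's code: TokenListSketch and its Node class are hypothetical stand-ins for the token maker and TokenImpl.

```java
/** Toy illustration of building a singly linked token list. */
final class TokenListSketch {
    /** Hypothetical stand-in for a token node; not the library's TokenImpl. */
    static final class Node {
        final String text;
        Node next;

        Node(String text) {
            this.text = text;
        }
    }

    Node firstToken;     // head of the list handed back to callers
    Node currentToken;   // most recently appended node
    Node previousToken;  // node appended just before currentToken

    /** Appends a node to the tail and updates all three references. */
    void add(String text) {
        Node node = new Node(text);
        if (firstToken == null) {
            firstToken = node;          // first node becomes the head
        } else {
            currentToken.next = node;   // link the old tail to the new node
        }
        previousToken = currentToken;
        currentToken = node;
    }

    /** Drops all references so the next line starts a fresh list. */
    void reset() {
        firstToken = null;
        currentToken = null;
        previousToken = null;
    }

    /** Counts the nodes reachable from firstToken. */
    int size() {
        int n = 0;
        for (Node t = firstToken; t != null; t = t.next) {
            n++;
        }
        return n;
    }
}
```

Keeping a tail reference makes each append constant-time, which matters when a list is rebuilt for every scanned line.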
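public HTMLTokenMaker()
Constructor.

public HTMLTokenMaker(java.io.Reader in)
Creates a new scanner. There is also a java.io.InputStream version of this constructor.
Parameters:
in - the java.io.Reader to read input from.

public HTMLTokenMaker(java.io.InputStream in)
Creates a new scanner.
Parameters:
in - the java.io.InputStream to read input from.

public void addToken(char[] array, int start, int end, int tokenType, int startOffset)
Adds the token specified to the current linked list of tokens.
Specified by:
addToken in interface TokenMaker
Parameters:
array - The character array.
start - The starting offset in the array.
end - The ending offset in the array.
tokenType - The token's type.
startOffset - The offset in the document at which this token occurs.

public boolean getCompleteCloseTags()
Returns whether markup close tags should be completed.
Specified by:
getCompleteCloseTags in class AbstractMarkupTokenMaker
See Also:
setCompleteCloseTags(boolean)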
public Token getTokenList(javax.swing.text.Segment text, int initialTokenType, int startOffset)
Returns the first token in the linked list of tokens generated from text. This method must be implemented by subclasses so they can correctly implement syntax highlighting.
Parameters:
text - The text from which to get tokens.
initialTokenType - The token type we should start with.
startOffset - The offset into the document at which text starts.
Returns:
The first Token in a linked list representing the syntax highlighted text.

public static void setCompleteCloseTags(boolean complete)
Sets whether markup close tags should be completed.
Parameters:
complete - Whether closing markup tags are completed.
See Also:
getCompleteCloseTags()
public final void yyreset(java.io.Reader reader)
Resets the scanner to read from a new input stream.
Parameters:
reader - the new input stream

public final void yyclose() throws java.io.IOException
Closes the input stream.
Throws:
java.io.IOException
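public final int yystate()
Returns the current lexical state.

public final void yybegin(int newState)
Enters a new lexical state.
Specified by:
yybegin in class AbstractJFlexTokenMaker
Parameters:
newState - the new lexical state

public final java.lang.String yytext()
Returns the text matched by the current regular expression.

public final char yycharat(int pos)
Returns the character at position pos from the matched text.
Parameters:
pos - the position of the character to fetch. A value from 0 to yylength()-1.

public final int yylength()
Returns the length of the matched text region.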
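public void yypushback(int number)
Pushes the specified number of characters back into the input stream.
Parameters:
number - the number of characters to be read again. This number must not be greater than yylength()!

public Token yylex() throws java.io.IOException
Resumes scanning until the next regular expression is matched, the end of input is encountered, or an I/O error occurs.
Throws:
java.io.IOException - if any I/O error occurs

public void addNullToken()
Adds a null token to the end of the current linked list of tokens.
Specified by:
addNullToken in interface TokenMaker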
public void addToken(javax.swing.text.Segment segment, int start, int end, int tokenType, int startOffset)
Adds the token specified to the current linked list of tokens.
Parameters:
segment - Segment to get text from.
start - Start offset in segment of token.
end - End offset in segment of token.
tokenType - The token's type.
startOffset - The offset in the document at which this token occurs.

public void addToken(char[] array, int start, int end, int tokenType, int startOffset, boolean hyperlink)
Adds the token specified to the current linked list of tokens.
Parameters:
array - The character array.
start - The starting offset in the array.
end - The ending offset in the array.
tokenType - The token's type.
startOffset - The offset in the document at which this token occurs.
hyperlink - Whether this token is a hyperlink.

protected OccurrenceMarker createOccurrenceMarker()
Returns the occurrence marker to use for this token maker.
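The addToken overloads above take a character array (or a Segment wrapping one) plus start and end offsets rather than a String, which is what lets tokens point into the scanned line instead of copying it. A minimal sketch of such a token follows. SharedBufferToken is a hypothetical class written for this illustration, not the library's TokenImpl, and it assumes that end is an inclusive index into the array.

```java
import javax.swing.text.Segment;

/** Toy illustration of a token that points into a shared char array. */
final class SharedBufferToken {
    final char[] buffer;   // the line's characters, shared by every token on the line
    final int start;       // index of the token's first character in buffer
    final int end;         // index of the token's last character (assumed inclusive)
    final int docOffset;   // offset in the document at which the token occurs

    SharedBufferToken(char[] buffer, int start, int end, int docOffset) {
        this.buffer = buffer;
        this.start = start;
        this.end = end;
        this.docOffset = docOffset;
    }

    /** Builds a token from a Segment without copying any characters. */
    static SharedBufferToken of(Segment line, int start, int end, int docOffset) {
        return new SharedBufferToken(line.array, start, end, docOffset);
    }

    /** Number of characters covered by the token. */
    int length() {
        return end - start + 1;
    }

    /** Allocates a String only when a caller explicitly asks for the text. */
    String lexeme() {
        return new String(buffer, start, length());
    }
}
```

Creating a token this way stores one array reference and three ints; a String is built only if lexeme() is called, for example when a tooltip or a search needs the text.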
public int getClosestStandardTokenTypeForInternalType(int type)
Returns the closest "standard" token type for a given "internal" token type (e.g. one whose value is < 0).
The default implementation returns type always, which denotes that a mapping from internal token types to standard token types is not defined; subclasses can override.
Specified by:
getClosestStandardTokenTypeForInternalType in interface TokenMaker
Parameters:
type - The token type.

public boolean getCurlyBracesDenoteCodeBlocks()
Returns whether this programming language uses curly braces ('{' and '}') to denote code blocks.
The default implementation returns false; subclasses can override this method if necessary.
Specified by:
getCurlyBracesDenoteCodeBlocks in interface TokenMaker
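For getClosestStandardTokenTypeForInternalType above, an override typically maps each negative "internal" type onto the nearest non-negative "standard" type and passes everything else through, matching the documented default of returning type unchanged. The sketch below shows that shape only. All constant names and values are hypothetical and are not the values used by HTMLTokenMaker or TokenTypes.

```java
/** Toy illustration of mapping internal (negative) token types to standard ones. */
final class InternalTypeMappingSketch {
    // Hypothetical standard (non-negative) token types.
    static final int COMMENT = 1;
    static final int STRING = 2;

    // Hypothetical internal (negative) token types marking an unfinished line.
    static final int INTERNAL_IN_COMMENT = -1;
    static final int INTERNAL_IN_STRING = -2;

    /** Returns the closest standard type, or the input unchanged if no mapping exists. */
    static int closestStandardType(int type) {
        switch (type) {
            case INTERNAL_IN_COMMENT:
                return COMMENT;
            case INTERNAL_IN_STRING:
                return STRING;
            default:
                return type;    // no mapping defined, mirroring the documented default
        }
    }
}
```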
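public javax.swing.Action getInsertBreakAction()
Returns an action to handle "insert break" key presses. The default implementation returns null. Subclasses can override.
Specified by:
getInsertBreakAction in interface TokenMaker
Returns:
null.

public int getLastTokenTypeOnLine(javax.swing.text.Segment text, int initialTokenType)
Returns the last token on this line's type if the token is "unfinished", or TokenTypes.NULL if it was finished. For example, if C-style syntax highlighting is being implemented, and text contained a line of code that contained the beginning of a comment but no end-comment marker ("*/"), then this method would return TokenTypes.COMMENT_MULTILINE for that line. This is useful for doing syntax highlighting.
Specified by:
getLastTokenTypeOnLine in interface TokenMaker
Parameters:
text - The line of tokens to examine.
initialTokenType - The token type to start with (i.e., the value of getLastTokenTypeOnLine for the line before text).
Returns:
The last token on this line's type, or TokenTypes.NULL if the line was completed.

public boolean getMarkOccurrencesOfTokenType(int type)
Returns whether tokens of the specified type should have "mark occurrences" enabled for the current programming language. The default implementation enables this only for TokenTypes.IDENTIFIER. Subclasses can override this method to support other token types, such as TokenTypes.VARIABLE.
Specified by:
getMarkOccurrencesOfTokenType in interface TokenMaker
Parameters:
type - The token type.

public OccurrenceMarker getOccurrenceMarker()
Returns the object in charge of marking all occurrences of the token at the current caret position, if it is a relevant token. If null is returned, a default OccurrenceMarker is used.
Specified by:
getOccurrenceMarker in interface TokenMaker
Returns:
The occurrence marker, or null for none.

public boolean getShouldIndentNextLineAfter(Token token)
The default implementation returns false always. Languages that wish to better support auto-indentation can override this method.
Specified by:
getShouldIndentNextLineAfter in interface TokenMaker
Parameters:
token - The token the previous line ends with.

protected void resetTokenList()
Deletes the linked list of tokens so we can begin anew. This should never have to be called by the programmer, as it is automatically called whenever the user calls getLastTokenTypeOnLine(Segment, int) or TokenMaker.getTokenList(Segment, int, int).

public void setLanguageIndex(int languageIndex)
Sets the language index to assign to tokens moving forward.
Parameters:
languageIndex - The new language index. A value of 0 denotes the "main" language, any positive value denotes a specific secondary language. Negative values will be treated as 0.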
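The rule stated for setLanguageIndex, that negative values are treated as 0, amounts to clamping the argument before storing it. A minimal sketch of that rule, using a hypothetical LanguageIndexSketch class rather than the library's own implementation:

```java
/** Toy illustration of clamping a language index so it is never negative. */
final class LanguageIndexSketch {
    private int languageIndex;  // 0 = "main" language, positive = a secondary language

    /** Stores the index, treating any negative value as 0. */
    void setLanguageIndex(int languageIndex) {
        this.languageIndex = Math.max(0, languageIndex);
    }

    int getLanguageIndex() {
        return languageIndex;
    }
}
```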