public class FoxTextUtil extends Object
Modifier and Type | Field and Description |
---|---|
static org.apache.log4j.Logger |
LOG |
static String |
tokenSpliter
Defines token.
|
Modifier and Type | Method and Description |
---|---|
protected static String[] |
_getSentences(String source)
Gets sentences.
|
static Set<Integer> |
getIndices(String token,
String tokenInput) |
static String[] |
getSentences(String source) |
static String[] |
getSentencesToken(String input) |
static String[] |
getSentenceToken(String sentence)
Gets token of one sentence, token defined by
tokenSpliter . |
static String[] |
getToken(String in)
Gets token defined by
tokenSpliter . |
static String |
htmlToText(String html)
Gets the content from html/text as plain text.
|
static String |
urlToText(String url)
Gets the content from html/text as plain text.
|
public static org.apache.log4j.Logger LOG
public static final String tokenSpliter
public static String urlToText(String url)
url
- public static String htmlToText(String html)
public static String[] getSentencesToken(String input)
input
- protected static String[] _getSentences(String source)
source
- plain text of sentencespublic static String[] getSentenceToken(String sentence)
tokenSpliter
.sentence
- (with punctuation mark)public static String[] getToken(String in)
tokenSpliter
.in
- string to splitCopyright © 2024. All rights reserved.