
public class FoxTextUtil extends Object
| Modifier and Type | Field and Description |
|---|---|
static org.apache.log4j.Logger |
LOG |
static String |
tokenSpliter
Defines token.
|
| Modifier and Type | Method and Description |
|---|---|
protected static String[] |
_getSentences(String source)
Gets sentences.
|
static Set<Integer> |
getIndices(String token,
String tokenInput) |
static String[] |
getSentences(String source) |
static String[] |
getSentencesToken(String input) |
static String[] |
getSentenceToken(String sentence)
Gets token of one sentence, token defined by
tokenSpliter. |
static String[] |
getToken(String in)
Gets token defined by
tokenSpliter. |
static String |
htmlToText(String html)
Gets the content from html/text as plain text.
|
static String |
urlToText(String url)
Gets the content from html/text as plain text.
|
public static org.apache.log4j.Logger LOG
public static final String tokenSpliter
public static String urlToText(String url)
url - public static String htmlToText(String html)
public static String[] getSentencesToken(String input)
input - protected static String[] _getSentences(String source)
source - plain text of sentencespublic static String[] getSentenceToken(String sentence)
tokenSpliter.sentence - (with punctuation mark)public static String[] getToken(String in)
tokenSpliter.in - string to splitCopyright © 2024. All rights reserved.