-
Format text using String patterns
Create a string pattern using the constant {0..n} and format command to
create a formatted string. for exampi.e. 'format(*{0}*);' will cause
each string from selected column to be surrounded with asteriks.
-
Stripping unwanted characters
To strip a set of charcters for each value in the selected column type
'strip(<characters>);' for each character in <characters>
will be removed from the given text. i.e. strip(Aa); on text 'America'
will result in 'meric'. the characters are removed from the orginial text
in a case-sensitive matter.
-
Stripping entire strings
To remove an entire substring for each value in the selected column type
'delete(<substring>);' the enitre substring will be
removed for each value in the selected column. for example; delete(ell);
on a value 'Hello World' will result in 'Ho World'. The substring is
removed in a case-senstive matter.
-
Convert UNICODE to ASCII
If your database platform does not properly support UNICODE, and ASCII is
only supported then use the command 'nativetoascii;' will perform
functionality similar to the commandline tool of the same name such that
UNICODE/native characters are encoded using escape sequences like '\u1234' for
UNICODE character # 1234.
-
Convert text to upper-case
To convert a string to upper-case place 'toUpper;' as part of the parsing options.
-
Convert text to lower-case
To convert a string to lower-case place 'toLower;' as part of the parsing options.
-
Removing whitespace
To remove whitespace at both ends of a string use the 'trim;' command.
-
Extract a substring
To extract a substring for each value use the following command;
'substring(<start-index>, <end-index>);' | 'substring(<start-index>);'
Returns a new string that is a substring of the current string. The substring
begins at the specified start-index and extends to the character at index end-index - 1.
Specify -1 for the end-index to use the end of the string, this is the default if no
end-index is specified.
-
Replaces the first occurence of a pattern with another string.
To replace the first instance of a regular expression pattern with a replacement string.
'replaceFirst(<pattern>, <replacement>);'
Replaces the first subsequence of the current string that matches the pattern with the
given replacement string. This method first resets this matcher. It then scans the current
string looking for a match of the pattern. Characters that are not part of the match are
appended directly to the result string; the match is replaced in the result by the
replacement string. The replacement string may contain references to captured
subsequences as in the appendReplacement method.
For example with a given string 'zzzdogzzzdogzzz' replaceFirst(dog, cat);
would yield the string 'zzzcatzzzdogzzz'.
-
Replaces the all occurences of a pattern with another string.
To replace all occurennces of a regular expression pattern with a replacement string.
'replaceAll(<pattern>, <replacement>);'
Replaces the all subsequences of the current string that matches the pattern with the
given replacement string. This method first resets this matcher. It then scans the current
string looking for a match of the pattern. Characters that are not part of the match are
appended directly to the result string; the match is replaced in the result by the
replacement string. The replacement string may contain references to captured
subsequences as in the appendReplacement method.
For example with a given string 'aabfooaabfooabfoob' replaceAll(a*b, -);
would yield the string '-foo-foo-foo-'.