String Manipulation Functions: Difference between revisions
(Trim and TrimLines added) |
|||
Line 271: | Line 271: | ||
Argument <i>run mode</i> is optional (defaults to 0). |
Argument <i>run mode</i> is optional (defaults to 0). |
||
Argument <i>case sensitivity</i> is optional (defaults to 0). |
Argument <i>case sensitivity</i> is optional (defaults to 0). |
||
'''Important Notes'''<br> |
|||
Proficient users of Regex should be aware of the following points regarding Media Center's implementation: |
|||
::'''1:''' The Regex() function will ONLY return results for things that are explicitly placed in a capture group, for example: |
|||
::''[\w\s]+'' is a recognised regular expression, however, in order for this to work in Media Center, it must be modified, like so: '''''('''[\w\s]+''')''''' |
|||
::Putting parens around it creates a capture group, and that is the only way MC will return a result.<br><br> |
|||
::'''2:''' All Regex() modes except ''-2'' will return exactly one result for each defined capture group.<br><br> |
|||
::'''3:''' Care must be taken when working with strings of indeterminate length, or lists with an indeterminate number of elements, as if you have more capture groups in your expression than there exist matches in the string, Regex() will return nothing at all.<br><br> |
|||
Regex mode ''-2'' was devised specifically to get around the above issues, which made it almost impossible to deal with lists with a variable number of elements using regular expressions in MC. |
|||
|- valign="top" |
|- valign="top" |
||
Line 294: | Line 302: | ||
However, captured match segments are made available for subsequent use. |
However, captured match segments are made available for subsequent use. |
||
The three captures, [R1], [R2] and [R3] are arranged in the textual output so that we get the desired year/month/day ordering, such as <span style="font-family: monospace,monospace; font-size:1em;">2011/08/19</span>.</p> |
The three captures, [R1], [R2] and [R3] are arranged in the textual output so that we get the desired year/month/day ordering, such as <span style="font-family: monospace,monospace; font-size:1em;">2011/08/19</span>.</p> |
||
<span style="font-family: monospace,monospace; font-size:1em;"><b><nowiki>Regex([Actors],/#([\w\s\.]+(?=\s\[))#/,-2)</nowiki></b></span> |
|||
<p style="margin-left:20pt;">Where '''''[Actors]''''' is a semicolon delimited list of indeterminate length, and each list item contains the actor names followed by the character played in square brackets. For example:<br> |
|||
For an [Actors] field containing:<br><nowiki>F. Murray Abraham [Antonio Salieri];Tom Hulce [Wolfgang Amadeus Mozart];Elizabeth Berridge [Constanze Mozart];Roy Dotrice [Leopold Mozart];Simon Callow [Emanuel Schikaneder];Christine Ebersole [Katerina Cavalieri];Jeffrey Jones [Emperor Joseph II];Barbara Bryne [Mrs. Weber]</nowiki>, this regular expression example will return:<br><br><nowiki>F. Murray Abraham;Tom Hulce;Elizabeth Berridge;Roy Dotrice;Simon Callow;Christine Ebersole;Jeffrey Jones;Barbara Bryne</nowiki></p> |
|||
|} |
|} |
||
<div style="text-align:right;">([[#top|Back to top)]]</div> |
<div style="text-align:right;">([[#top|Back to top)]]</div> |
Revision as of 07:54, 14 June 2020
- See also: Expression Language and Function Index
The functions in this section are used primarily to manipulate strings. Since the Media Center expression language is primarily string-oriented, these functions provide a means to manipulate field values or the output from other expressions.
Clean(…)
- Clean a string to be used for various operations.
Description | Clean(string, mode)
The Clean() function is generally used to sanitize a string by stripping empty brackets, remove superfluous dash characters, eliminate leading or trailing articles, or replace filesystem-illegal characters. It is typically employed before some operation such as Rename to clean the product of joining various fields, some of which may be empty, or to produce filesystem-safe filenames. It may be used for a variety of purposes, however. Available mode values:
| ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Examples | clean([album] - [date])
clean(The Beatles, 1)
clean(AC//DC: Back In Black, 3)
clean(\//:*?"<>|, 3)
|
Find(…)
- Finds a string or character in another string, returning its zero-based position in that string.
Description | Find(Search, Find, Start, Flags)
If no match is found, the function returns a value of -1 Regardless of where in "Search" "Start" is initiated, the result is always zero-based, counting from the first character in "Search". When a match is found, the result returned is 'zero-based', meaning that if the result is zero, that points to the first character in "Search". This means that results will be compatible for use in combination with other zero-based functions, such as Mid(), for example. |
---|---|
Examples | Find(John LenNon, N, 0, 0)
Find(John LenNon, N, 4, 0)
Find(John LenNon, N, 4, 2)
Find(John LenNon, N, 9, 3)
|
FixCase(…)
- Changes the case of a given string.
Description | FixCase(string, mode)
The FixCase() function will convert the supplied text string according to the specified mode. Available mode values:
| ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Examples | fixcase(enjoy the silence)
fixcase(enjoy the silence, 1)
fixcase(MY ALbUm IS cAlLeD: adam, 4)
fixcase(MY ALbUm IS cAlLeD: adam, 5)
|
FixSpacing(…)
- Intelligently splits adjacent camel-cased words.
Description | FixSpacing(string, mode)
The FixSpacing() function inserts spaces between adjacent camel-cased words in string. It is useful for helping to clean and convert metadata that favors compactness over standard sentence structure. Available mode values:
| ||||
---|---|---|---|---|---|
Examples | fixspacing(OneWorld)
fixspacing([name], 1)
fixspacing(Another [name])
|
Hexify(…)
- Hexifies a string to make it suitable for web usage.
Description | Hexify(string)
The Hexify() function URI encodes a string to make it useable by a browser or search engine. Hexify() is typically used by expressions generating or working on URLs in Media Center's Link Manager. |
---|---|
Examples | hexify(Oasis - /(What's The Story/) Morning Glory?)
|
Left(…)
- Retrieves a specified number of characters from the left of a string.
Description | Left(string, quantity)
The Left() function retrieves no more than quantity characters from the left of the string. |
---|---|
Examples | left([filename], 3)
|
Length(…)
- Returns the number of characters in a string.
Description | Length(string)
The Length() function returns the number of characters contained in string. |
---|---|
Examples | length(A Simple Plan)
if(compare(length([filename]), >=, 68), Long, Short)
|
Letter(…)
- Returns the starting letter or letters of a given string.
Description | Letter(string, number, mode)
The Letter() function returns the starting letter or letters of a given string, which can be given literally or as a library field. The function's mode defaults to "Ignore Articles", and for this, it uses the existing, user managed articles list. It also defaults to "number grouping", meaning that if the first character is a number, regardless of that number, a hash (#) is returned. Number specifies the number of starting characters to return, and defaults to one. Available mode values: These modes are useful when working with a selection of multiple files that may have a mix of articles and numbers and allow you to achieve whichever combination of results you require.
| ||||||||
---|---|---|---|---|---|---|---|---|---|
Examples | letter(10CC)
letter(10CC,2)
letter(10CC,2,1)
letter(The Band,2)
letter(The Band,2,3)
|
Mid(…)
- Retrieves specified characters from a string.
Description | Mid(string, start position, quantity)
The Mid() function returns a specified quantity of characters from the start position in string. The start position is 0-based (i.e. the first character is considered position 0). A quantify of -1 returns all characters from the start positionning to the end of string. |
---|---|
Examples | mid(12345)
mid(12345, 1, 2)
Additional Examples: |
MoveArticles(…)
- Takes "The Beatles" and reverses it to "Beatles, The"
Description | MoveArticles(string)
The MoveArticles() function is similar to the Swap() function, except that it operates on strings with prefixed articles (such as "The Beatles" or "A Flock of Seagulls".) |
---|---|
Examples | MoveArticles(The Beatles)
|
Regex(…)
- Regular expression pattern matching and capture.
Description | regex(string, regexp, run mode, case sensitivity)
The Regex() function performs regular expression (RE) pattern matching on a string. The string is evaluated against the regular expression regexp, and run mode dictates the values output by Regex(). The three modes allow for match testing, capture output, silent operation or return all captures in a semicolon delimited string. All match captures are placed into special variables referenced as [R1], [R2], ... [R9], which can be used in later in the expression. The contents of the captures [R1] ... [R9] are available until the end of the expression, or Regex() is run again, whereby they are replaced. The regular expression implementation used prior to Media Center 19 is the Microsoft 2010 TR1 engine, and in Media Center 19 it is the Boost engine. Additional information is available regarding the full syntax and other implementation details. Available run mode values:
The case sensitivity argument toggles the case-sensitivity of regular expression matching. Note that case insensitivity does not apply to characters inside a character class [ ]. Use both uppercase and lowercase characters when necessary to match either case (e.g. [aAbB] to match either uppercase or lowercase A or B). Available case sensitivity values:
The regular expression language assigns special meaning to many characters. A few of these meta-characters, such as forward slash /, comma , and both ( and ) are also reserved and used by the Media Center expression language. To force the Media Center expression engine to ignore the meta-characters in regexp, surround the entire regular expression with /# #/. This is one of Media Center's escapements, which tells the expression engine to ignore everything inside, so that the entire, uninterpreted regexp can be provided to the Regex() regular expression evaluator. Although surrounding regexp by /# #/ is not necessary or required when no conflicting characters are in use, and you may manually escape the expression languages meta-characters with a forward slash /, it is probably a safe practice to always encase every regexp within /# #/. Argument run mode is optional (defaults to 0). Argument case sensitivity is optional (defaults to 0). Important Notes
Regex mode -2 was devised specifically to get around the above issues, which made it almost impossible to deal with lists with a variable number of elements using regular expressions in MC. | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Examples | ifelse(regex([name], /#^(the|an|a)\b#/, 0, 1), Fix your case!)
Searches the name field for any of the lowercase articles the, and and a at the beginning of name, and outputs Fix your case! when the match succeeds. The run mode is 0 which is a test and capture mode, and case sensitivity is enabled. if(regex([artist], /#([[:punct:]])#/, 0), [R1] --> [Artist], No Punctuation) Using the default mode 0, Regex() will output a Boolean for use inside a conditional to cause some action to occur based on the match success or failure. This example matches against the artist field looking for any punctuation character. If the match succeeds (a punctuation character was found), that character is output followed by the string --> and the artist. In there was no match, the string No Punctuation is output. regex(D03T02 some track.mp3, /#^D(\d+)T(\d+)#/, 1)Disc: [R1], Track: [R2] The string is matched against the regexp that is looking for a D followed by any number of digits, followed by a T and then more digits. Those digits were captured, and later used to output the value Disc: 03, Track: 02. regex([filename (name)], /#^(\d+)-#/, -1)Track number is [R1] Using run mode -1, the file's name is pattern tested against the regexp which is looking for leading digits followed by a dash. Those digits are captured in buffer [R1] which is used later in the expression. If the file name was 2-foo.mp3, the output would be Track number is 2. regex([filename], /#(\d{1,2})\.(\d{1,2}).(\d{4})#/, -1)[R3]//[R1]//[R2] Matches and captures a date formatted as dd.mm.yyyy anywhere within the filename, and rearranges it in a standard format of yyyy/mm/dd. Since run mode is -1, no output occurs. However, captured match segments are made available for subsequent use. The three captures, [R1], [R2] and [R3] are arranged in the textual output so that we get the desired year/month/day ordering, such as 2011/08/19. Regex([Actors],/#([\w\s\.]+(?=\s\[))#/,-2) Where [Actors] is a semicolon delimited list of indeterminate length, and each list item contains the actor names followed by the character played in square brackets. For example: |
RemoveCharacters(…)
- Removes a list of characters from a string.
Description | removecharacters(string, character list, mode)
The RemoveCharacters() function will remove from string any characters in the character list. The characters removed depend upon the mode specified. The function operates in a case-sensitive manner. Available mode values:
Argument mode is optional (defaults to 0). | ||||||||
---|---|---|---|---|---|---|---|---|---|
Examples | removecharacters(Paper, Ppr)
Removes P, p, and r from Paper, resulting in ae. The default mode 0 is in effect, removing all instances of the characters specified in the character list. removecharacters(Paper, Ppr, 1) With mode 1 set, only the initial character P is removed, resulting in aper. removecharacters(Paper, Ppr, 2) In mode 2, only one character from the end of the string are removed, leaving "Pape. removecharacters(Paper, Ppr, 3) Both the front and back are affected in mode 3, causing the removal of the leading P and trailing r resulting in ape. removecharacters([artist], /(/)) Removes any ( and ) characters from anywhere within the [artist] field. |
RemoveLeft(…)
- Trims characters from the beginning of a string.
Description | removeleft(string, quantity)
The RemoveLeft() function removes a specified quantity of characters from the left side of a string. If the quantity is larger than the length of the string, the output will be empty. |
---|---|
Examples | removeleft(Good Deeds, 5)
Removes the first 5 characters from resulting in Deeds being output. |
RemoveRight(…)
- Trims characters from the end of a string
Description | removeright(string, quantity)
The RemoveRight() function removes a specified quantity of characters from the right side of a string. If the quantity is larger than the length of the string, the output will be empty. |
---|---|
Examples | removeright(03-02-1959,5)
Removes the last 5 characters from the given date, leaving only the month and year 03-02. |
Replace(…)
- Replace or remove a string segment
Description | replace(string, old, new)
The Replace() function replaces all instances of old within string with new. If new is unspecified, it defaults to an empty value, causing old to be removed. Replace() operates in a case-sensitive manner. Argument new is optional (defaults to EMPTY). |
---|---|
Examples | replace(The Daily Show with John Oliver, hn Oliver, n Stewart)
Now that John Oliver has completed his summer stand-in for Jon Stewart, it is time for a replacement. The old sequence hn Oliver will be replaced with the new sequence n Stewart, resulting in The Daily Show with Jon Stewart. replace(Sample String, s, Replaced) In this example, the original string does not contain the old value s anywhere, so no replacement occurs and the original string is returned. replace(Led Zeppelin.[remastered], .[remastered]) Removes the trailing old value .[remastered] from the original string, resulting in Led Zeppelin. Because no new string is specified, the default empty value is used as a replacement, effectively stripping the old value. |
Right(…)
- Retrieves a specified number of characters from the right of a string.
Description | right(string, quantity)
The Right() function retrieves the specified quantity of characters from the right of the string. If quantity is larger than the length of string, the original string is returned. |
---|---|
Examples | right([filename], 3)
Returns the last three characters from the filename (typically this is the file's suffix). |
Swap(…)
- Takes Firstname Lastname and swaps to Lastname, Firstname.
Description | Swap(string)
The Swap() function is used to reverse the order of a personal name in a string (converting Firstname Lastname into Lastname, Firstname). The function has special handling for strings that end with Jr., Sr., I, II, III, IV, V (so that these common suffixes are handled properly), and it can also handle semicolon-delimited string lists (such as [Artist], [Actors], and [Director]). |
---|---|
Examples | swap(Paul Simon)
swap(Sammy Davis Jr.)
swap(Paul Simon; Art Garfunkel)
|
Trim(…)
- Removes leading and trailing non-printable characters and new lines from a given string.
Description | Trim(string)
The Trim() function takes any given string, removing any leading and trailing non-printable characters and new lines |
---|---|
Examples | Trim([Lyrics]) |
TrimLines(…)
- Removes leading, trailing and double new lines from a given string.
Description | TrimLines(string, mode)
The TrimLines() function can remove leading, trailing and double newlines from a given string, which can be given literally or as a library field. The function's mode defaults to removing all three, mode 3. Available mode values:
| ||||||
---|---|---|---|---|---|---|---|
Examples | TrimLines([Lyrics])
TrimLines([Lyrics],2)
|
UnMoveArticles(…)
- Takes "Beatles, The" and reverses it to restore the normal word order, "The Beatles"
Description | UnMoveArticles(string)
The UnMoveArticles() function is the opposite of the MoveArticles() function. It is used to restore the normal order of an band name with a prefix article in a string (converting Beatles, The into The Beatles). |
---|---|
Examples | UnMoveArticles(Flock of Seagulls, A)
Please Note: These examples were simplified and ignore the commas in the string arguments. For these expressions to work properly (when entered directly), you would need to escape the comma in the string literal: UnMoveArticles(Beatles/, The). However, this function is typically used with a field value (or the output of another function), and would therefore be treated as a single argument even if the resulting string contains a comma. See Function Arguments for further details. |
Unswap(…)
- Takes Lastname, Firstname and reverses it to Firstname Lastname.
Description | Unswap(string)
The Unswap() function is the opposite of the Swap() function. It is used to restore the normal western-style order of a personal name in a string (converting Lastname, Firstname into Firstname Lastname). This command also works properly with semicolon-delimited list data (such as the [Artist] field), and like the Swap() command, it handles common name suffixes such as Jr., Sr., I, II, III, IV, V, etc. |
---|---|
Examples | Unswap(Simon, Paul)
Unswap(Simon, Paul; Garfunkel, Art)
Please Note: These examples were simplified and ignore the commas in the string arguments. For these expressions to work properly (when entered directly), you would need to escape the comma in the string literal: Unswap(Simon/, Paul; Garfunkel/, Art). However, this function is typically used with a field value (or the output of another function), and would therefore be treated as a single argument even if the resulting string contains a comma. See Function Arguments for further details. |