data:image/s3,"s3://crabby-images/ca364/ca36463f9b82c90b6f3e27c6326cdcdc31617e4c" alt=""
String operations
Encyclopedia
In computer science
, in the area of formal language theory, frequent use is made of a variety of string functions; however, the notation used is different from that used on computer programming
, and some commonly used functions in the theoretical realm are rarely used when programming. This article defines some of these basic terms.
data:image/s3,"s3://crabby-images/5f704/5f70424c7824a1bffbc63705f66c64e334677fe1" alt=""
be its alphabet. A string substitution or simply a substitution is a mapping f that maps letters in
to languages (possibly in a different alphabet). Thus, for example, given a letter
, one has
where
is some language whose alphabet is
. This mapping may be extended to strings as
data:image/s3,"s3://crabby-images/a430b/a430b415a7d658333f5c922185da574f4da65d3a" alt=""
for the empty string
, and
data:image/s3,"s3://crabby-images/88143/881434dec5aa2b0c6fec25b8a1d5e20b929707a8" alt=""
for string
. String substitution may be extended to the entire language as
data:image/s3,"s3://crabby-images/48192/48192577912f7b70b14a3ff79492851aaf8456af" alt=""
An example of string substitution occurs in regular language
s, which are closed under string substitution. That is, if the letters of a regular language are substituted by other regular languages, the result is still a regular language.
Another example is the conversion of an EBCDIC
-encoded string to ASCII
.
in formal language theory) is a string substitution such that each letter is replaced by a single string. That is,
, where s is a string, for each letter a. String homomorphisms are homomorphism
s, preserving the binary operation
of string concatenation. Given a language L, the set
is called the homomorphic image of L. The inverse homomorphic image of a string s is defined as
data:image/s3,"s3://crabby-images/0b9cf/0b9cf2a1af955acf80c9d4b1e2d795fb70d49a21" alt=""
while the inverse homomorphic image of a language L is defined as
data:image/s3,"s3://crabby-images/34f04/34f04d13b60c6592760d092a79e656cce8f275e5" alt=""
Note that, in general,
, while one does have
data:image/s3,"s3://crabby-images/49f39/49f39ce9331d09790aabe5c5677c70a7dbe4b090" alt=""
and
data:image/s3,"s3://crabby-images/a5927/a592718929d3c002e4364f4531219c23fd311976" alt=""
for any language L.
A string homomorphism is said to be
-free (or e-free) if
for all
in
. Simple single-letter substitution cipher
s are examples of (
-free) string homomorphisms.
is an alphabet, the string projection of s is the string that results by removing all letters which are not in
. It is written as
. It is formally defined by removal of letters from the right hand side:
data:image/s3,"s3://crabby-images/cb761/cb76107e922705e4106e80beb58bb5b245b84784" alt=""
Here
denotes the empty string
. The projection of a string is essentially the same as a projection in relational algebra.
String projection may be promoted to the projection of a language. Given a formal language
L, its projection is given by
data:image/s3,"s3://crabby-images/e3272/e3272d090f566fb1dae279fe69009425b0777062" alt=""
. If the string does not have a on the right hand side, the result is the empty string. Thus:
data:image/s3,"s3://crabby-images/a9162/a916273523f7fe1a2a7ff430313cb35e387814dd" alt=""
The quotient of the empty string may be taken:
data:image/s3,"s3://crabby-images/1da0e/1da0e5b0c8294ac313271f2ef968349d1bf19bba" alt=""
Similarly, given a subset
of a monoid
, one may define the quotient subset as
data:image/s3,"s3://crabby-images/823b5/823b59736a764fcaef334bb913366cda755b64e4" alt=""
Left quotients may be defined similarly, with operations taking place on the left of a string.
of a monoid
defines an equivalence relation
, called the right syntactic relation of S. It is given by
data:image/s3,"s3://crabby-images/ed5c6/ed5c6768602d74f78808bc87d9705ec63312de9d" alt=""
The relation is clearly of finite index (has a finite number of equivalence classes) if and only if the family right quotients is finite; that is, if
data:image/s3,"s3://crabby-images/2d757/2d7575da2b75b5f0bdc00bb0d6d3a92c302f0ca8" alt=""
is finite. In this case, S is a recognizable language
, that is, a language that can be recognized by a finite state automaton. This is discussed in greater detail in the article on syntactic monoid
s.
and is recursively defined as
data:image/s3,"s3://crabby-images/25bcc/25bccdeeca2c1ea8b1a801e0c6ef5b7690dad8e8" alt=""
The empty string is always cancellable:
data:image/s3,"s3://crabby-images/6bdbf/6bdbfe4745615c5ce40464d6a36d60ea71e67012" alt=""
Clearly, right cancellation and projection commute
:
data:image/s3,"s3://crabby-images/f54db/f54db2cb38150050d786612e1be7d0cd1891e6db" alt=""
data:image/s3,"s3://crabby-images/74fb3/74fb3be19b15528c4cd7cf31d174f0fda1018aca" alt=""
here
.
The prefix closure of a language is
data:image/s3,"s3://crabby-images/ecb69/ecb69c2d9bbfbe560324beb47a00f44239a40a22" alt=""
Example:
data:image/s3,"s3://crabby-images/46759/4675998f25b7148834d0d2e92204fbc5269b277d" alt=""
A language is called prefix closed if
.
The prefix closure operator is idempotent:
data:image/s3,"s3://crabby-images/c90b0/c90b0c204808f49a2f1c9a2547f9d237d058f164" alt=""
The prefix relation is a binary relation
such that
if and only if
.
Computer science
Computer science or computing science is the study of the theoretical foundations of information and computation and of practical techniques for their implementation and application in computer systems...
, in the area of formal language theory, frequent use is made of a variety of string functions; however, the notation used is different from that used on computer programming
Computer programming
Computer programming is the process of designing, writing, testing, debugging, and maintaining the source code of computer programs. This source code is written in one or more programming languages. The purpose of programming is to create a program that performs specific operations or exhibits a...
, and some commonly used functions in the theoretical realm are rarely used when programming. This article defines some of these basic terms.
Alphabet of a string
The alphabet of a string is a list of all of the letters that occur in a particular string. If s is a string, its alphabet is denoted bydata:image/s3,"s3://crabby-images/5f704/5f70424c7824a1bffbc63705f66c64e334677fe1" alt=""
String substitution
Let L be a language, and letdata:image/s3,"s3://crabby-images/63ca1/63ca1ddc341a3e2c6cb9939784df291af3588d94" alt=""
data:image/s3,"s3://crabby-images/9446c/9446cc77865e6bbcb1020723b2c884f72cf9739c" alt=""
data:image/s3,"s3://crabby-images/e816b/e816be10c4789a8fd9f2fdc86d621eaadfccd1e4" alt=""
data:image/s3,"s3://crabby-images/32950/32950b94df96c0330bef9693060c2b2db0c10fd5" alt=""
data:image/s3,"s3://crabby-images/b4de5/b4de59e4868ff1195a33134b80dc87486d637bd3" alt=""
data:image/s3,"s3://crabby-images/74cae/74cae35e8df69261cafc263b1dfad5b5470f5676" alt=""
data:image/s3,"s3://crabby-images/a430b/a430b415a7d658333f5c922185da574f4da65d3a" alt=""
for the empty string
Empty string
In computer science and formal language theory, the empty string is the unique string of length zero. It is denoted with λ or sometimes Λ or ε....
data:image/s3,"s3://crabby-images/395ea/395ea1e5858eaeafaf42ca85095a0e145213dc96" alt=""
data:image/s3,"s3://crabby-images/88143/881434dec5aa2b0c6fec25b8a1d5e20b929707a8" alt=""
for string
data:image/s3,"s3://crabby-images/6fc0f/6fc0f4661a3406397d6cc0bc10ab850ecabb60d2" alt=""
data:image/s3,"s3://crabby-images/48192/48192577912f7b70b14a3ff79492851aaf8456af" alt=""
An example of string substitution occurs in regular language
Regular language
In theoretical computer science and formal language theory, a regular language is a formal language that can be expressed using regular expression....
s, which are closed under string substitution. That is, if the letters of a regular language are substituted by other regular languages, the result is still a regular language.
Another example is the conversion of an EBCDIC
EBCDIC
Extended Binary Coded Decimal Interchange Code is an 8-bit character encoding used mainly on IBM mainframe and IBM midrange computer operating systems....
-encoded string to ASCII
ASCII
The American Standard Code for Information Interchange is a character-encoding scheme based on the ordering of the English alphabet. ASCII codes represent text in computers, communications equipment, and other devices that use text...
.
String homomorphism
A string homomorphism (often referred to simply as a homomorphismHomomorphism
In abstract algebra, a homomorphism is a structure-preserving map between two algebraic structures . The word homomorphism comes from the Greek language: ὁμός meaning "same" and μορφή meaning "shape".- Definition :The definition of homomorphism depends on the type of algebraic structure under...
in formal language theory) is a string substitution such that each letter is replaced by a single string. That is,
data:image/s3,"s3://crabby-images/78048/78048377a0930dd0126c8d4d8744cc2d598ab080" alt=""
Homomorphism
In abstract algebra, a homomorphism is a structure-preserving map between two algebraic structures . The word homomorphism comes from the Greek language: ὁμός meaning "same" and μορφή meaning "shape".- Definition :The definition of homomorphism depends on the type of algebraic structure under...
s, preserving the binary operation
Binary operation
In mathematics, a binary operation is a calculation involving two operands, in other words, an operation whose arity is two. Examples include the familiar arithmetic operations of addition, subtraction, multiplication and division....
of string concatenation. Given a language L, the set
data:image/s3,"s3://crabby-images/2e2eb/2e2eb83e7fce61c8f826ff4caf13fe858a4f9b85" alt=""
data:image/s3,"s3://crabby-images/0b9cf/0b9cf2a1af955acf80c9d4b1e2d795fb70d49a21" alt=""
while the inverse homomorphic image of a language L is defined as
data:image/s3,"s3://crabby-images/34f04/34f04d13b60c6592760d092a79e656cce8f275e5" alt=""
Note that, in general,
data:image/s3,"s3://crabby-images/a4932/a4932377d9f1cc727ae8bcd535d2cc14fb32a64e" alt=""
data:image/s3,"s3://crabby-images/49f39/49f39ce9331d09790aabe5c5677c70a7dbe4b090" alt=""
and
data:image/s3,"s3://crabby-images/a5927/a592718929d3c002e4364f4531219c23fd311976" alt=""
for any language L.
A string homomorphism is said to be
data:image/s3,"s3://crabby-images/4025c/4025c476ac4fdddc5bc4e787ea1b3582c654995d" alt=""
data:image/s3,"s3://crabby-images/d56af/d56af82ece748d2e88b74209044c6a85686589b1" alt=""
data:image/s3,"s3://crabby-images/437a4/437a418f33d0a9c41b0dd8f86f198ab62d283799" alt=""
data:image/s3,"s3://crabby-images/819f2/819f22df7ee4a027dd1c59d8cf959a65d9019e78" alt=""
Substitution cipher
In cryptography, a substitution cipher is a method of encryption by which units of plaintext are replaced with ciphertext according to a regular system; the "units" may be single letters , pairs of letters, triplets of letters, mixtures of the above, and so forth...
s are examples of (
data:image/s3,"s3://crabby-images/773fb/773fbab160e074709b1f91552c35569ed86993d7" alt=""
String projection
If s is a string, anddata:image/s3,"s3://crabby-images/6b595/6b595e81c0ffeabb342f16b69a2e38f9e811148f" alt=""
data:image/s3,"s3://crabby-images/9190a/9190a77f774bf220a7d8e59b56c4090560f1dfcc" alt=""
data:image/s3,"s3://crabby-images/312ad/312ad6e8346a5589cdff596cc7c809041e6c6eb9" alt=""
data:image/s3,"s3://crabby-images/cb761/cb76107e922705e4106e80beb58bb5b245b84784" alt=""
Here
data:image/s3,"s3://crabby-images/19b05/19b0518d4f771b9c6bd93d237f91fea4e22efe7b" alt=""
Empty string
In computer science and formal language theory, the empty string is the unique string of length zero. It is denoted with λ or sometimes Λ or ε....
. The projection of a string is essentially the same as a projection in relational algebra.
String projection may be promoted to the projection of a language. Given a formal language
Formal language
A formal language is a set of words—that is, finite strings of letters, symbols, or tokens that are defined in the language. The set from which these letters are taken is the alphabet over which the language is defined. A formal language is often defined by means of a formal grammar...
L, its projection is given by
data:image/s3,"s3://crabby-images/e3272/e3272d090f566fb1dae279fe69009425b0777062" alt=""
Right quotient
The right quotient of a letter a from a string s is the truncation of the letter a in the string s, from the right hand side. It is denoted asdata:image/s3,"s3://crabby-images/d47d3/d47d3cce4a79e880f36cd06a838c3ec3edd4a471" alt=""
data:image/s3,"s3://crabby-images/a9162/a916273523f7fe1a2a7ff430313cb35e387814dd" alt=""
The quotient of the empty string may be taken:
data:image/s3,"s3://crabby-images/1da0e/1da0e5b0c8294ac313271f2ef968349d1bf19bba" alt=""
Similarly, given a subset
data:image/s3,"s3://crabby-images/12576/12576e8992ecd49a072d1a3286619f3379a418d3" alt=""
data:image/s3,"s3://crabby-images/c151e/c151e0c812a4c7042698c897f3df131db8885d98" alt=""
data:image/s3,"s3://crabby-images/823b5/823b59736a764fcaef334bb913366cda755b64e4" alt=""
Left quotients may be defined similarly, with operations taking place on the left of a string.
Syntactic relation
The right quotient of a subsetdata:image/s3,"s3://crabby-images/8832b/8832b84530d43cb66d7d08d17891faea51367d90" alt=""
data:image/s3,"s3://crabby-images/bbd80/bbd8005b9626228f77a91f73300f629e7b644bca" alt=""
Equivalence relation
In mathematics, an equivalence relation is a relation that, loosely speaking, partitions a set so that every element of the set is a member of one and only one cell of the partition. Two elements of the set are considered equivalent if and only if they are elements of the same cell...
, called the right syntactic relation of S. It is given by
data:image/s3,"s3://crabby-images/ed5c6/ed5c6768602d74f78808bc87d9705ec63312de9d" alt=""
The relation is clearly of finite index (has a finite number of equivalence classes) if and only if the family right quotients is finite; that is, if
data:image/s3,"s3://crabby-images/2d757/2d7575da2b75b5f0bdc00bb0d6d3a92c302f0ca8" alt=""
is finite. In this case, S is a recognizable language
Recognizable language
In mathematics and computer science, a recognizable language is a formal language that is recognized by a finite state machine. Equivalently, a recognizable language is one for which the family of quotients for the syntactic relation is finite.-Definition:...
, that is, a language that can be recognized by a finite state automaton. This is discussed in greater detail in the article on syntactic monoid
Syntactic monoid
In mathematics and computer science, the syntactic monoid M of a formal language L is the smallest monoid that recognizes the language L.-Syntactic quotient:...
s.
Right cancellation
The right cancellation of a letter a from a string s is the removal of the first occurrence of the letter a in the string s, starting from the right hand side. It is denoted asdata:image/s3,"s3://crabby-images/4d17a/4d17a8900257d9f9635fc3869607debfe392bead" alt=""
data:image/s3,"s3://crabby-images/25bcc/25bccdeeca2c1ea8b1a801e0c6ef5b7690dad8e8" alt=""
The empty string is always cancellable:
data:image/s3,"s3://crabby-images/6bdbf/6bdbfe4745615c5ce40464d6a36d60ea71e67012" alt=""
Clearly, right cancellation and projection commute
Commute
Commute, commutation or commutative may refer to:* Commuting, the process of travelling between a place of residence and a place of work* Commutative property, a property of a mathematical operation...
:
data:image/s3,"s3://crabby-images/f54db/f54db2cb38150050d786612e1be7d0cd1891e6db" alt=""
Prefixes
The prefixes of a string is the set of all prefixes to a string, with respect to a given language:data:image/s3,"s3://crabby-images/74fb3/74fb3be19b15528c4cd7cf31d174f0fda1018aca" alt=""
here
data:image/s3,"s3://crabby-images/f2255/f225575c5a7eb511d8b8ea1d90373175984824f9" alt=""
The prefix closure of a language is
data:image/s3,"s3://crabby-images/ecb69/ecb69c2d9bbfbe560324beb47a00f44239a40a22" alt=""
Example:
data:image/s3,"s3://crabby-images/46759/4675998f25b7148834d0d2e92204fbc5269b277d" alt=""
A language is called prefix closed if
data:image/s3,"s3://crabby-images/0ffd3/0ffd3ee13f4c63b337edd1d22a09191becf41a73" alt=""
The prefix closure operator is idempotent:
data:image/s3,"s3://crabby-images/c90b0/c90b0c204808f49a2f1c9a2547f9d237d058f164" alt=""
The prefix relation is a binary relation
Binary relation
In mathematics, a binary relation on a set A is a collection of ordered pairs of elements of A. In other words, it is a subset of the Cartesian product A2 = . More generally, a binary relation between two sets A and B is a subset of...
data:image/s3,"s3://crabby-images/507e0/507e0664756def9b232176edd155993ec0d55335" alt=""
data:image/s3,"s3://crabby-images/8860f/8860fc1b79a655f044a130d114000d2ec9ceb5a4" alt=""
data:image/s3,"s3://crabby-images/c076b/c076b97d5e231838656fa8974d8959abdbd0c134" alt=""