During searching and indexing, some characters such as accented characters and umlauts are replaced with the equivalent characters from the Latin alphabet. This type of replacement is called normalization; it means that some words, such as Müller and Mueller, are treated as identical words. A search for either returns all instances of both.
The following table shows some of the characters that are normalized.
Character | Replacement characters |
---|---|
é | e |
ç | c |
æ | ae |
ü | ue |