TextMatch is a web service that combines language independent ways of computing the similarity between two documents using powerful linguistic tools, and provides you different measures of the document similarity. It returns a percentage reflecting the probability that one document is similar to the other.
TextMatch can compare documents in different formats (such as Microsoft document formats, OpenDocument Format, Portable Document Format, Electronic Publication Format, HyperText Markup Language, Rich Text Format, Text formats).
TextMatch recognizes the language of the document using two-stage language detection system. Specific language tokenizers, lemmatizers and other analyzers are utilized for English, Bulgarian, German, French, and Russian. A language independent comparison algorithm is used if one of the uploaded documents is in another language.
People who looked at this resource also viewed the following:
- Helsinki Finite-State Transducer Technology
- The Archive of Estonian Dialects and Finno-Ugric Languages (EMSUKA) of the Institute of the Estonian Language
- SARP - Speech Analyzer Rapid Plot. Plotting vowels in F2-F1 scatter charts with multiple data sets
- POETICON Multisensory and Multimedia Recordings of Everyday Interaction