Given a verb (the infinitive or any other verbal form), this WS outputs its verbal paradigm grouped by tense and mood. The supported languages are Catalan and Spanish.
Provider:
Universitat Pompeu Fabra (UPF)
This WS segments text into minor structural units (titles, paragraphs, sentences, etc.), detects entities not found in a dictionary (numbers, abbreviations, URLs, emails, etc.), and keeps sequences of two or more words together as a single block (dates, phrases, etc.). The input is plain text in Catalan or Spanish.
Provider:
Universitat Pompeu Fabra (UPF)
Given a lemma and a category, this WS returns the sentences of the IULA corpus in which the lemma occurs. The user can restrict the search to a domain. The supported languages are Spanish and English.
Provider:
Universitat Pompeu Fabra (UPF)
Given a list of lemmas, this WS looks up their occurrences in the IULA corpus, applies the given regular expressions, and returns all the signatures.
Provider:
Universitat Pompeu Fabra (UPF)
Given a file of sentences in which the word under study occurs, this WS applies the regular expressions to obtain the signatures.
Provider:
Universitat Pompeu Fabra (UPF)
Given a word form, this WS returns its lexical information by looking it up in the IULA lexicon. The supported languages are Catalan, Spanish, and English.
Provider:
Universitat Pompeu Fabra (UPF)
The IULA tokenizer WS splits a UTF-8 encoded plain-text file into units (tokens). The supported languages are Catalan and Spanish.
Provider:
Universitat Pompeu Fabra (UPF)
Converts the given files from one character encoding to another.
Based on the Linux command that converts text between encodings.
Provider:
Universitat Pompeu Fabra (UPF)
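The conversion described in the entry above can be illustrated with a minimal Python sketch. It does not call the WS or the underlying Linux command; the function name `reencode` and its parameters are hypothetical and chosen only for this example.

```python
def reencode(data: bytes, from_enc: str, to_enc: str) -> bytes:
    """Decode raw bytes from one encoding and re-encode them in another.

    Hypothetical helper for illustration only; it raises UnicodeError
    if the bytes are not valid in from_enc or cannot be represented
    in to_enc.
    """
    text = data.decode(from_enc)   # bytes -> Unicode string
    return text.encode(to_enc)     # Unicode string -> bytes


# Example: a Catalan word stored as Latin-1, converted to UTF-8.
latin1_bytes = "això".encode("latin-1")
utf8_bytes = reencode(latin1_bytes, "latin-1", "utf-8")
```

Working on bytes rather than file paths keeps the sketch independent of the file system; reading and writing files around it is left to the caller.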
This WS calculates several lexicometric measures (tokens, types, hapaxes, and type/token ratio) and displays them graphically. The input is a plain-text corpus with one token per line. The WS is language independent.
Provider:
Universitat Pompeu Fabra (UPF)
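The measures named in the entry above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the service's implementation: the function name `lexicometric_measures` is hypothetical, blank lines are assumed to be ignored, and no graphical display is attempted.

```python
from collections import Counter


def lexicometric_measures(corpus: str) -> dict:
    """Compute token, type and hapax counts plus the type/token ratio.

    Assumes one token per line; blank lines are skipped (an assumption
    made for this sketch).
    """
    tokens = [line.strip() for line in corpus.splitlines() if line.strip()]
    counts = Counter(tokens)
    n_tokens = len(tokens)                                # running words
    n_types = len(counts)                                 # distinct words
    n_hapaxes = sum(1 for c in counts.values() if c == 1) # words seen once
    ttr = n_types / n_tokens if n_tokens else 0.0
    return {"tokens": n_tokens, "types": n_types,
            "hapaxes": n_hapaxes, "ttr": ttr}


sample = "el\ngat\nmenja\nel\npeix\n"
measures = lexicometric_measures(sample)
```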
This WS calculates the Term Frequency (TF) and the Inverse Document Frequency (IDF) of a word in a given corpus. Combined as TF-IDF, the two values give a statistical measure of how important a word is to a document in a collection or corpus.
Provider:
Universitat Pompeu Fabra (UPF)
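As a rough illustration of the two values in the entry above, the sketch below uses one common formulation: TF as the relative frequency of the word in a document, and IDF as the natural logarithm of the number of documents divided by the number of documents containing the word. The entry does not say which variant the WS uses, so the weighting scheme and the function names `tf`, `idf`, and `tf_idf` are assumptions.

```python
import math


def tf(term: str, doc: list) -> float:
    """Relative frequency of term in a tokenized document."""
    return doc.count(term) / len(doc) if doc else 0.0


def idf(term: str, docs: list) -> float:
    """log(N / df), where df is the number of documents containing term.

    Returns 0.0 when the term occurs in no document (a choice made for
    this sketch to avoid division by zero).
    """
    df = sum(1 for d in docs if term in d)
    return math.log(len(docs) / df) if df else 0.0


def tf_idf(term: str, doc: list, docs: list) -> float:
    """Product of the two values for one term in one document."""
    return tf(term, doc) * idf(term, docs)


docs = [["a", "b", "a"], ["b", "c"], ["b", "d"]]
score_a = tf_idf("a", docs[0], docs)   # "a" is frequent here and rare elsewhere
score_b = tf_idf("b", docs[0], docs)   # "b" occurs in every document
```

Under this formulation a word that appears in every document gets an IDF of zero, so it contributes nothing however often it occurs.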
Ted Pedersen's Ngram Statistics Package, used to identify word n-grams that appear in large corpora using standard tests of association such as Fisher's exact test, the log-likelihood ratio, Pearson's chi-squared test, and the Dice coefficient.
Provider:
Universitat Pompeu Fabra (UPF)
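To make one of the association measures listed above concrete, the sketch below scores bigrams with the Dice coefficient, 2·n11 / (n1+ + n+1), where n11 is the bigram count, n1+ the number of bigrams starting with the first word, and n+1 the number of bigrams ending with the second word. It is a plain Python illustration, not code from the Ngram Statistics Package, and the function name `dice_scores` is hypothetical.

```python
from collections import Counter


def dice_scores(tokens: list) -> dict:
    """Return the Dice coefficient for every adjacent word pair."""
    bigrams = list(zip(tokens, tokens[1:]))
    n11 = Counter(bigrams)                    # joint counts
    n1p = Counter(w1 for w1, _ in bigrams)    # first-position marginals
    np1 = Counter(w2 for _, w2 in bigrams)    # second-position marginals
    return {
        (w1, w2): 2 * count / (n1p[w1] + np1[w2])
        for (w1, w2), count in n11.items()
    }


scores = dice_scores("new york and new jersey".split())
```

In this small example "new" starts two different bigrams, so ("new", "york") scores lower than ("york", "and"), whose two words only ever occur together.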
This service has been archived because it may not be active anymore (or is close to being non active).