This web service performs traditional Naive Bayes classification of instances given in a Weka file. It outputs the predicted class for each instance, together with some statistics on classification performance. The parameters it needs as input can be learnt with the estimate_bayesian_parameters web service.
Provider:
Universitat Pompeu Fabra (UPF)
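As a rough illustration of the classification step described for this service, the sketch below scores one instance from already estimated P(cue_i|class) values. It is not the service's implementation: the function name `classify`, the binary (present/absent) treatment of cues, and the default prior of 0.5 are all assumptions made for the example.

```python
import math

def classify(cues, p_member, p_nonmember, prior_member=0.5):
    """Return (label, log_odds) for one instance.

    cues: list of 0/1 flags, one per cue (binary cues are an assumption).
    p_member[i], p_nonmember[i]: assumed P(cue_i | member) and
    P(cue_i | non-member); both must lie strictly between 0 and 1.
    """
    # Start from the prior log-odds of being a class member.
    log_odds = math.log(prior_member) - math.log(1.0 - prior_member)
    for seen, pm, pn in zip(cues, p_member, p_nonmember):
        if seen:
            log_odds += math.log(pm) - math.log(pn)
        else:
            log_odds += math.log(1.0 - pm) - math.log(1.0 - pn)
    label = "member" if log_odds > 0 else "non-member"
    return label, log_odds

# Hypothetical parameters for two cues.
label, score = classify([1, 0], [0.8, 0.1], [0.2, 0.5])
```

Working in log space avoids numerical underflow when many cues are multiplied together.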
Given a training set encoded as vectors of cue (or feature) occurrences, this web service estimates the parameters P(cue_i|class): the probability of seeing each cue in members and in non-members of the class. This estimation is performed using Bayesian inference, which combines prior knowledge with observed data. The estimated parameters can be used, for example, to classify new instances with a Naive Bayes classifier. The output format is the one needed as input for the...
Provider:
Universitat Pompeu Fabra (UPF)
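The idea of combining prior knowledge with observed data can be sketched with a Beta prior over a single cue probability. The listing does not say which prior the service uses, so the Beta(alpha, beta) prior, the posterior-mean estimate, and the name `posterior_mean` are assumptions chosen only to illustrate the principle.

```python
def posterior_mean(successes, trials, alpha=1.0, beta=1.0):
    """Posterior mean of a probability under an assumed Beta(alpha, beta) prior.

    successes: number of class instances in which the cue was observed.
    trials: total number of class instances.
    """
    return (successes + alpha) / (trials + alpha + beta)

# With no data the estimate equals the prior mean; data pulls it
# towards the observed relative frequency.
no_data = posterior_mean(0, 0)      # 0.5 under the uniform prior
some_data = posterior_mean(3, 10)   # 4 / 12
```

Unlike a plain relative frequency, this estimate never reaches exactly 0 or 1 when alpha and beta are positive, which keeps later log-probability computations defined.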
This WS identifies eventive nouns in part-of-speech-tagged text (tagged with the FreeLing Morphosyntactic tagger V 3.0 WS). The classification is performed with a pre-trained Decision Tree. The output is an LMF file with the classifier's prediction for each noun. You can choose to have this prediction as:
- "scored": each noun gets a score indicating whether it is a member of the class (a score greater than 0 means class member; a score smaller than 0 means non-member)
- "filtered": the nouns are filtered according to t...
Provider:
Universitat Pompeu Fabra (UPF)
This web service creates a Weka file containing context information for a list of nouns in a given corpus. The context information for each noun is extracted using a set of regular expressions and is encoded as one vector (one line per noun in the Weka file). Each slot in the vector holds the number of times the regular expression at that position has been observed with the given noun.
Inputs:
- corpusId: Already indexed CQP corpus ID from which to extract the signatures. You can ind...
Provider:
Universitat Pompeu Fabra (UPF)
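The vector encoding described for this service can be approximated as follows. The real service extracts contexts from an indexed CQP corpus; this sketch instead matches Python regular expressions against plain sentences, and the `{noun}` placeholder convention, the function name `cue_vector`, and the two patterns are hypothetical.

```python
import re

def cue_vector(noun, sentences, patterns):
    """Count, for each pattern, how often it matches with the given noun.

    patterns: regex templates containing a "{noun}" placeholder (a
    convention invented for this sketch; templates must not use other braces).
    """
    vector = []
    for template in patterns:
        regex = re.compile(template.format(noun=re.escape(noun)))
        # One slot per pattern: total matches over all sentences.
        vector.append(sum(len(regex.findall(s)) for s in sentences))
    return vector

sentences = [
    "during the meeting we talked",
    "the meeting lasted two hours",
    "a meeting room",
]
patterns = [r"\bduring the {noun}\b", r"\b{noun} lasted\b"]
vector = cue_vector("meeting", sentences, patterns)
```

Each resulting vector corresponds to one noun and could then be written as one line of the output file.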
Given a training set encoded as vectors of cue (or feature) occurrences in Weka format, this web service computes P(cue_i|class), the probability of seeing each cue in members and in non-members of the class, using a maximum likelihood estimation (MLE) approach: it counts how often each cue appears in each class.
Inputs:
- weka_signatures: classified instances encoded as cue vectors. Each slot of the vector contains the number of times each feature has been observed for that instance. Also, we add special slots: total nu...
Provider:
Universitat Pompeu Fabra (UPF)
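A frequency-counting estimate of this kind can be sketched as below. The exact normalisation used by the service is not given in this listing, so dividing each cue's count by the total cue count of its class is an assumption, as are the function name `mle_cue_probabilities` and the in-memory input format.

```python
def mle_cue_probabilities(instances):
    """Estimate P(cue_i | class) by relative frequency.

    instances: iterable of (cue_counts, label) pairs, where cue_counts is a
    list of per-cue occurrence counts (all lists must have the same length).
    """
    totals_by_class = {}
    for counts, label in instances:
        totals = totals_by_class.setdefault(label, [0] * len(counts))
        for i, count in enumerate(counts):
            totals[i] += count
    probabilities = {}
    for label, totals in totals_by_class.items():
        denominator = sum(totals)
        # Assumed normalisation: share of the class's cue occurrences.
        probabilities[label] = [
            count / denominator if denominator else 0.0 for count in totals
        ]
    return probabilities

training = [([2, 0], "yes"), ([1, 1], "yes"), ([0, 4], "no")]
estimates = mle_cue_probabilities(training)
```

Note that a cue never seen in a class receives probability 0 here, which is the usual weakness of plain MLE compared with a Bayesian estimate.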
This WS calculates the probability of seeing a linguistic cue given a lexical class, P(cue|class). The probability is computed from the occurrences of cues in a corpus (encoded in the signatures file) and from information on whether each word belongs to each class (encoded in the indicators file). It is computed for each cue in the signatures file and for each class in the indicators file.
Provider:
Universitat Pompeu Fabra (UPF)
This WS extracts a column from tabular text input. It is useful for working with CoNLL or FreeLing annotated corpora. Language independent WS.
Provider:
Universitat Pompeu Fabra (UPF)
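Column extraction from CoNLL-style input can be illustrated with a few lines of Python. This is only a sketch: the tab separator, the zero-based column index, and the choice to keep blank lines (which usually mark sentence boundaries in such files) are assumptions, and `extract_column` is a hypothetical name.

```python
def extract_column(lines, column, sep="\t"):
    """Return the given zero-based column from each tabular line."""
    result = []
    for line in lines:
        line = line.rstrip("\n")
        if not line.strip():
            # Keep blank lines so sentence boundaries are preserved.
            result.append("")
            continue
        fields = line.split(sep)
        # Rows that are too short yield an empty value instead of an error.
        result.append(fields[column] if column < len(fields) else "")
    return result

rows = ["1\tThe\tthe\tDT\n", "2\tcat\tcat\tNN\n", "\n", "1\tHi\thi\tUH\n"]
words = extract_column(rows, 1)
```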
This service has been archived because it may not be active anymore (or is close to being non active).
This WS allows querying an already indexed corpus (see CQP indexer WS for indexing details). The WS is based on the IMS Open Corpus Workbench (CWB). Language independent WS.
Provider:
Universitat Pompeu Fabra (UPF)
CQP indexer WS based on the IMS Open Corpus Workbench (CWB). The input is an annotated corpus in tabular format. The output is the Corpus ID to be used by the CQPquery Web Service. Language independent WS.
Provider:
Universitat Pompeu Fabra (UPF)
Converts the given files from one character encoding to another.
Based on the Linux command that performs this conversion.
Provider:
Universitat Pompeu Fabra (UPF)
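For readers who want to reproduce this kind of conversion locally, a minimal Python sketch follows. It does not call the service or the underlying Linux command; `convert_encoding` is a hypothetical helper that decodes bytes from the source encoding and re-encodes them in the target encoding.

```python
def convert_encoding(data: bytes, source: str, target: str) -> bytes:
    """Re-encode bytes from the source encoding into the target encoding.

    Raises UnicodeDecodeError or UnicodeEncodeError when the data cannot be
    represented, rather than silently dropping characters.
    """
    return data.decode(source).encode(target)

latin1_bytes = "café".encode("latin-1")
utf8_bytes = convert_encoding(latin1_bytes, "latin-1", "utf-8")
```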
This service has been archived because it may not be active anymore (or is close to being non active).
This WS performs the Count function from Ted Pedersen's Ngram Statistics Package (used to identify word Ngrams that appear in large corpora using standard tests of association such as Fisher's exact test, the log likelihood ratio, Pearson's chi-squared test, the Dice Coefficient, etc.). Language independent WS.
Provider:
Universitat Pompeu Fabra (UPF)
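The basic idea of counting word n-grams can be sketched as below. This does not reproduce the Ngram Statistics Package or its output format; `count_ngrams` is a hypothetical helper, and whitespace tokenisation is an assumption made to keep the example short.

```python
from collections import Counter

def count_ngrams(tokens, n=2):
    """Count every contiguous n-gram in a list of tokens."""
    return Counter(
        tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)
    )

tokens = "the cat sat on the cat".split()
bigrams = count_ngrams(tokens, n=2)
```

Counts like these are the raw input that association measures such as the log likelihood ratio are then computed from.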
This WS is a Panacea project converter that creates GrAF documents from the output of PoS taggers (FreeLing and the IULA tagger).
Provider:
Universitat Pompeu Fabra (UPF)
This WS filters text. It either extracts part of a file using pattern matching or substitutes multiple occurrences of a string within a file.
Provider:
Universitat Pompeu Fabra (UPF)