Marc Poch Riera

Profile

Display Name:
Marc Poch Riera

Email address:
marc.pochriera [at] upf.edu

Affiliation:
IULA - UPF Universitat Pompeu Fabra

Country:
Spain

Services Submitted (63)

Displaying services 41 - 60 of 63 in total

« Previous 1 2 3 4 Next »

Soaplab validator Web Service

SOAPSoaplab

Categories:

Languages:

Language Independent

This WS verifies that a Soaplab web service is Panacea compliant.

Provider: Universitat Pompeu Fabra (UPF)

IULA paradigma Web Service

SOAPSoaplab

Categories:

Morphological Tagging

Languages:

Given a verb (infinitive or a verbal form) this WS outputs its verbal paradigm grouped according tense and mode. The languages supported are Catalan and Spanish.

Provider: Universitat Pompeu Fabra (UPF)

IULA preprocess Web Service

SOAPSoaplab

Categories:

Languages:

This WS provides a text segmentation into minor structural units (titles, paragraphs, sentences, etc.); detection of entities (not found in a dictionary: numbers, abbreviations, URLs, emails, etc.); and the keeping of sequences of two or more words in a single block (dates, phrases, etc.). The input is plain text in Catalan and Spanish.

Provider: Universitat Pompeu Fabra (UPF)

IULA concordancer Web Service

SOAPSoaplab

Categories:

Languages:

Given a lemma and a category, this WS returns the sentences of the IULA corpus where this lemma occurs. The user can perform a domain search. The languages supported are Spanish and English.

Provider: Universitat Pompeu Fabra (UPF)

Search signatures Web Service

SOAPSoaplab

Categories:

Languages:

Given a list of lemmas, the WS looks for the occurrences of them in IULA corpus, applies the given regular expressions and returns all the signatures.

Provider: Universitat Pompeu Fabra (UPF)

Regular expressions applicator Web Service

SOAPSoaplab

Categories:

Querying

Languages:

Language Independent

Given a file with sentences where the studied word occurs, this WS applies the regular expressions in order to obtain the signatures.

Provider: Universitat Pompeu Fabra (UPF)

IULA lexicon look up Web Service

SOAPSoaplab

Categories:

Lexicon/Terminology Extraction

Languages:

Given a word form, this WS returns the lexical information by looking it up in the IULA's lexicon. The languages supported are Catalan, Spanish or English.

Provider: Universitat Pompeu Fabra (UPF)

IULA tokenizer Web Service

SOAPSoaplab

Categories:

Languages:

The IULA tokenizer WS splits a file in plain text format and UTF-8 encoded into units (tokens). The languages supported are Catalan and Spanish.

Provider: Universitat Pompeu Fabra (UPF)

XSLT applicator Web Service

SOAPSoaplab

Categories:

Format Conversion

Languages:

Language Independent

A command line tool for applying XSLT stylesheets to XML documents.

Provider: Universitat Pompeu Fabra (UPF)

PDF to text converter Web Service

SOAPSoaplab

Categories:

Format Conversion

Languages:

Language Independent

This WS converts PDF documents to plain text format. Language independent WS.

Provider: Universitat Pompeu Fabra (UPF)

PANACEA converter Web Service

SOAPSoaplab

Categories:

Format Conversion

Languages:

Language Independent

This is the Panacea conversion tool.

Provider: Universitat Pompeu Fabra (UPF)

IULA text converter Web Service

SOAPSoaplab

Categories:

Format Conversion

Languages:

Language Independent

Convert encoding of given files from one encoding to another. Based on the Linux command that converts text from one encoding to another encoding.

Provider: Universitat Pompeu Fabra (UPF)

HTML to text converter Web Service

SOAPSoaplab

Categories:

Format Conversion

Languages:

Language Independent

A WS to convert HTML documents to plain text format. Language independent WS.

Provider: Universitat Pompeu Fabra (UPF)

MS Word to text converter Web Service

SOAPSoaplab

Categories:

Format Conversion

Languages:

Language Independent

A WS to convert MS Word documents to plain text format. Language independent WS.

Provider: Universitat Pompeu Fabra (UPF)

Vocabulary analyzer Web Service

SOAPSoaplab

Categories:

Statistics Analysis

Languages:

Language Independent

This WS calculates different lexicometric measures and displays them graphically (tokens, types, hapaxes and type/token ratio). The input is a plain text corpus with one token per line. Language independent WS.

Provider: Universitat Pompeu Fabra (UPF)

TF-IDF calculator Web Service

SOAPSoaplab

Categories:

Languages:

Language Independent

This WS calculates the Term Frequency (TF) and the Inverse Document Frequency (IDF) of a word in a given corpus. The two values, labeled TF-IDF, are a statistical measure used to evaluate how important a word is to a document in a collection or corpus.

Provider: Universitat Pompeu Fabra (UPF)

Ted Pedersen's Ngram Statistics Package

SOAPSoaplab

Categories:

Statistics Analysis

Languages:

Language Independent

Ted Pedersen's Ngram Statistics Package (used to identify word Ngrams that appear in large corpora using standard tests of association such as Fisher's exact test, the log likelihood ratio, Pearson's chi-squared test, the Dice Coefficient, etc.).

Provider: Universitat Pompeu Fabra (UPF)

P clue/ lexical class calculator Web Service

SOAPSoaplab

3149

This service has been archived because it may not be active anymore (or is close to being non active).

FreeLing Chunker parser Web Service

SOAPSoaplab

1242

This service has been archived because it may not be active anymore (or is close to being non active).

FreeLing Dependency parser Web Service

SOAPSoaplab

1410

This service has been archived because it may not be active anymore (or is close to being non active).

« Previous 1 2 3 4 Next »

Services Responsible For (0)

No entries found

Services Annotated (47)

Displaying services 41 - 47 of 47 in total

« Previous 1 2 3 Next »

Twitter NLP Web Service

SOAPSoaplab

Categories:

Languages:

English

This WS is based on the Twitter NLP tool developed by Noah's ARK group (Noah Smith's research group at the Language Technologies Institute, School of Computer Science, Carnegie Mellon University). A fast and robust Java-based tokenizer and part-of-speech tagger for Twitter, its training data of manually labeled POS annotated tweets, a web-based annotation tool, and hierarchical word clusters from unlabeled tweets. The language supported is English.

Provider: Universitat Pompeu Fabra (UPF)

Hungalign to GrAF converter Web Service

SOAPSoaplab

Categories:

Languages:

Language Independent

This WS creates an alignment file combining the Hunalign output and two sentences id lists extracted from GrAF documents.

Provider: Universitat Pompeu Fabra (UPF)

TGZ file compressor Web Service

SOAPSoaplab

Categories:

Format Conversion

Languages:

Language Independent

This WS creates a compress file (in TGZ format) with output documents stored on this same server using their URL

Provider: Universitat Pompeu Fabra (UPF)

File splitter Web Service

SOAPSoaplab

Categories:

Languages:

Language Independent

This WS splits an input file into smaller files containing the number of lines indicated as input parameter. Splitted files are stored in the results public directory, and the output is a file with the list of URLs pointing to each splitted file. Language independent WS.

Provider: Universitat Pompeu Fabra (UPF)

MaltParser Web Service

SOAPSoaplab

Categories:

Languages:

Spanish, Castilian

This WS calls an instance of MaltParser for Spanish trained with the IULA treebank developed in the Metanet4you project. The input of this WS is plain text. The service performs PoS tagging with FreeLing and then performs the dependency parsing using Malt parser. The output follows CoNLL format.

Provider: Universitat Pompeu Fabra (UPF)

XML/TXT to Weka converter Web Service

SOAPSoaplab

Categories:

Format Conversion

Languages:

Language Independent

Given a XML signatures file (signatures.xml) and the indicators file (indicators.txt) with the nouns that belong or not to the class, this WS creates a file in ARFF format to experiment with Weka. Warning: the default encoding for input and outputs files is ISO-8859-1. It may be changed using optional parameters, but the two input files must have the same encoding, which must be indicated in the headers of the XML file.

Provider: Universitat Pompeu Fabra (UPF)

IULA tokenizer Web Service

SOAPSoaplab

Categories:

Languages:

The IULA tokenizer WS splits a file in plain text format and UTF-8 encoded into units (tokens). The languages supported are Catalan and Spanish.

Provider: Universitat Pompeu Fabra (UPF)

« Previous 1 2 3 Next »

Favourites (0)

None