Home

Services

Services

Providers

Providers

Search by Data

Feed_icon

Home
»
Members
»
Miquel Cornudella

Miquel Cornudella

Profile

Display Name:
Miquel Cornudella

Email address:
miquel.cornudella [at] upf.edu

Affiliation:
IULA

Country:
Spain

Services Submitted (27)

Displaying services 1 - 20 of 27 in total

« Previous 1 2 Next »

Freeling Anonymizer UPF

REST

Categories:

Languages:

none Help to add a language to this...

PASSED

This WS substitutes proper nouns with tags. This process anonymizes an input text by eliminating any person, place, corporation, etc. name. The service automatically calls the FreeLing WS and makes use of its Named Entity Recognition tool to detect proper nouns. The languages supported are English, Catalan, Spanish, Asturian, Welsh, Galician, Italian, Russian and Portuguese. Details: ds_lsr_analysis : analysis : analysis_extension : input : ...

Provider: ws-iulaterm-upf-edu

CQP Analyzer Web Service

REST

Categories:

Statistics Analysis

Languages:

PASSED

This WS allows analyzing an already indexed corpus (see CQP indexer WS for indexing details). The WS returns an Excel file with some statistical metrics such as number of nouns, verbs, ngrams, etc. The languages supported are Spanish and English. ds_lsr_analysis : analysis : analysis_extension : input : description : Corpus analysis bases on the CWB corpus workbench installation : Clam Web Services default installation ...

Provider: ws-iulaterm-upf-edu

CQP Query Web Service

REST

Categories:

Corpus Processing

Languages:

none Help to add a language to this...

PASSED

This WS allows querying an already indexed corpus (see CQP indexer WS for indexing details). The WS is based on the IMS Open Corpus Workbench (CWB). Language independent WS. Input: CorpusId: {id of the corpus you created with the CQP index web service} Query: A CQP query. Output: The CQP output as it would be in the command line. Details: ds_lsr_analysis : analysis : analysis_extension : input : description : CQP Query Web ...

Provider: ws-iulaterm-upf-edu

CQP Indexer Web Service

REST

Categories:

Corpus Processing

Languages:

none Help to add a language to this...

PASSED

CQP indexer WS based on the IMS Open Corpus Workbench (CWB). The input is an annotated corpus in tabular format. The output is the Corpus ID to be used by the CQPquery Web Service. Language independent WS. Input: corpus: Annotated corpus in tabular format. structure: structure of the corpus. More info.: http://cwb.sourceforge.net/documentation.php Output: The Corpus id to be used in the CQP Query web service. Input example: corpus: http://ws02.iula.upf.edu/panacea/examples/ws/cqp/cqp...

Provider: ws-iulaterm-upf-edu

Ted Pedersen's Text Similarity Web Service

REST

Categories:

Statistics Analysis

Languages:

none Help to add a language to this...

PASSED

This WS is based on Ted Pedersen's Text Similarity module. It measures the similarity of two documents based on the number of shared words scaled by the lengths of the files. Text Similarity WS computes the F-Measure, the Dice Coefficient, the Cosine, and the Lesk measure. Language independent WS.

Provider: ws-iulaterm-upf-edu

Ted Pedersen's Ngrams Counter Web Service

REST

Categories:

Statistics Analysis

Languages:

Language Independent

PASSED

This WS performs the Count function from Ted Pedersen's Ngram Statistics Package (used to identify word Ngrams that appear in large corpora using standard tests of association such as Fisher's exact test, the log likelihood ratio, Pearson's chi-squared test, the Dice Coefficient, etc.). Language independent WS. Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: 'Funció Count del Ngram Statistics Pack...

Provider: ws-iulaterm-upf-edu

Ted Pedersen's Ngram Statistics Package

REST

Categories:

Statistics Analysis

Languages:

Language Independent

PASSED

Ted Pedersen's Ngram Statistics Package (used to identify word Ngrams that appear in large corpora using standard tests of association such as Fisher's exact test, the log likelihood ratio, Pearson's chi-squared test, the Dice Coefficient, etc.). Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: 'Ngram Statistics Package' de Ted Pedersen (s'utilitza per calcular la coocurrència entre paraules). ...

Provider: ws-iulaterm-upf-edu

Vocabulary analyzer Web Service

REST

Categories:

Statistics Analysis

Languages:

Language Independent

PASSED

This web service calculates different lexicometric measures and displays them graphically (tokens, types, hapaxes & type/token ratio). Input: plain text corpus with one token per line Input example: This is an example

Provider: ws-iulaterm-upf-edu

MaltParser Web Service

REST

Categories:

Languages:

Spanish, Castilian

PASSED

Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: Analitzador de dependències utilitzant Malt Parser es: Analizador de dependencias utilizando Malt Parser en: Dependency parsing using Malt Parser. output : installation : Clam Web Service default installation name : malt_parser type : Syntactic_Tagging

Provider: ws-iulaterm-upf-edu

Bohnet parser Web Service

REST

Categories:

Syntactic Tagging

Languages:

none Help to add a language to this...

PASSED

Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: Analitzador de dependències utilitzant Bohnet Parser es: Analizador de dependencias utilizando Bohnet Parser en: Dependency parsing using Bohnet's graph-based Parser. installation : Clam Web Service default installation output : name : bohnet_parser type : Syntactic_Tagging

Provider: ws-iulaterm-upf-edu

TMX shuffling Web Service

REST

Categories:

Alignment

Languages:

none Help to add a language to this...

PASSED

Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: . es: . en: Tmx translation units scrambler output : installation : Clam Web Service default installation name : tmx_shuffling type : Others

Provider: ws-iulaterm-upf-edu

PDF to text converter Web Service

REST

Categories:

Format Conversion

Languages:

none Help to add a language to this...

PASSED

This WS converts PDF documents to plain text format. Language independent WS. Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: conversor de pdf a txt. es: conversor de pdf a txt. en: pdf to txt converter. installation : Clam Web Service default installation output : name : pdftotext type : Format_Conversion

Provider: ws-iulaterm-upf-edu

HTML to text converter Web Service

REST

Categories:

Format Conversion

Languages:

none Help to add a language to this...

PASSED

A WS to convert HTML documents to plain text format. Language independent WS. Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: conversor d'html a txt. es: conversor de html a txt. en: Html to txt converter. installation : Clam Web Service default installation output : name : html2text type : Format_Conversion

Provider: ws-iulaterm-upf-edu

Stream editor Web Service

REST

Categories:

Corpus Processing

Languages:

none Help to add a language to this...

PASSED

This WS is used to filter text. It extracts part of a file using pattern matching or substituting multiple occurrences of a string within a file with the sed command. Sed is typically used for extracting part of a file using pattern matching or substituting multiple occurrences of a string within a file. Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: . es: . ...

Provider: ws-iulaterm-upf-edu

MS Word to text converter Web Service

REST

Categories:

Format Conversion

Languages:

none Help to add a language to this...

PASSED

Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: conversor de Word doc a txt. es: conversor de Word doc a txt. en: Word doc to txt converter. installation : Clam Web Service default installation output : name : catdoc type : Format_Conversion

Provider: ws-iulaterm-upf-edu

IULA text converter Web Service

REST

Categories:

Format Conversion

Languages:

Language Independent

PASSED

Convert character encoding of given files from one encoding to another. Based on the Linux command that converts text from one encoding to another encoding. Details: ds_lsr_analysis : analysis : analysis_extension : output : name : iconv

Provider: ws-iulaterm-upf-edu

Columns selector Web Service

REST

Categories:

Format Conversion

Languages:

Language Independent

PASSED

Processor to extract desired data by columns. Based on Linux awk. Columns: indicate the columns number you desire separated by commas. Input: Raw data. Default column separator is blank space or tabs. You can optionally specify the input and output separators. Example: Columns: 4,2 Input: http://ws02.iula.upf.edu/panacea/examples/ws/columns_selector/input_example_1.txt or http://ws02.iula.upf.edu/panacea/examples/ws/columns_selector/input_example_2.txt Output example: http://ws02.iula.upf....

Provider: ws-iulaterm-upf-edu

File splitter Web Service

REST

Categories:

Languages:

Language Independent

PASSED

Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: Donat un fitxer, el parteix en fitxers més petits, del nombre de línies indicat com a paràmetre d'entrada (defecte 1000 línies) es: Dado un fichero, lo parte en ficheros más pequeños, del número de líneas indicado como parámetro de entrada (defecto 1000 lineas). en: Given a file, split it into smaller files containing the ...

Provider: ws-iulaterm-upf-edu

Linescrambler parallel Web Service

REST

Categories:

Alignment

Languages:

none Help to add a language to this...

PASSED

This WS will scramble the lines in a parallel text corpus keeping the alignment. The goal is to make it difficult to reproduce the original text. The input size limit is 100 MB. Language independent WS. Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: . es: . en: Web service to scramble the lines in a parallel corpus. output : installation : Clam Web ...

Provider: ws-iulaterm-upf-edu

Linescrambler Web Service

REST

Categories:

Alignment

Languages:

Language Independent

PASSED

This WS scrambles the lines in a file. The goal is to make it difficult to reproduce the original text. The input size limit is 100 MB. Language independent WS.

Provider: ws-iulaterm-upf-edu

« Previous 1 2 Next »

Services Responsible For (0)

No entries found

Services Annotated (27)

Displaying services 1 - 20 of 27 in total

« Previous 1 2 Next »

Freeling Chunker Parser Web Service

REST

Categories:

Languages:

PASSED

Freeling-based chunker parser (v4.1) Languages: English, Catalan, Spanish, Asturian and Galician. Input: Plain text Output: Freeling output format, XML, XML CQP ready. Input example: http://ws02.iula.upf.edu/panacea/examples/ws/freeling_parsed/freeling_parsed.input.example.txt Output example: http://ws02.iula.upf.edu/panacea/examples/ws/freeling_parsed/freeling_parsed.output.example.txt Output XML example: http://ws02.iula.upf.edu/panacea/examples/ws/freeling_parsed/fr...

Provider: ws-iulaterm-upf-edu

Freeling Dependency Parser

REST

Categories:

Languages:

PASSED

Freeling-based dependency parser (v4.1) Languages: English, Catalan, Spanish, Asturian and Galician. Input: Plain text Output: Freeling output format, XML, XML CQP ready. Input example: http://ws02.iula.upf.edu/panacea/examples/ws/freeling_dependency/freeling_dependency.input.example.txt Output example: http://ws02.iula.upf.edu/panacea/examples/ws/freeling_dependency/freeling_dependency.output.example.txt Output XML example: http://ws02.iula.upf.edu/panacea/examples/ws/freel...

Provider: ws-iulaterm-upf-edu

Freeling Morphological Analyzer UPF

REST

Categories:

Languages:

none Help to add a language to this...

PASSED

This Web Service deploys a FreeLing-based morphological analyzer (v 3.0). The languages supported are English, Catalan, Spanish, Asturian, Welsh, Galician, Italian, Russian and Portuguese.

Provider: ws-iulaterm-upf-edu

Freeling NER UPF

REST

Categories:

Named Entity Recognition

Languages:

PASSED

This Web Service deploys a FreeLing-based Named Entity Recognition System (v 4.1). The languages supported are English, Catalan, Spanish, Asturian, Welsh, Galician, Italian, Russian and Portuguese.

Provider: ws-iulaterm-upf-edu

Freeling Sentence Splitter UPF

REST

Categories:

Languages:

PASSED

This WS performs a FreeLing-based sentence splitter (v 4.1). The WS splits a file in plain text format and UTF-8 encoded into units (tokens). Output sentences are separated by empty lines. The languages supported are English, Catalan, Spanish, Asturian, Welsh, Galician, Italian, Russian and Portuguese. Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: segmentador de textos basat en Freeling. ...

Provider: ws-iulaterm-upf-edu

Freeling Tagger UPF

REST

Categories:

Languages:

PASSED

This is the UPF Freeling-based part-of-speech tagger. Languages: English, Catalan, Spanish, Asturian, Welsh, Galician, Italian and Portuguese. Job duration: 1M words takes aprox. one minute. This depends on the server load. Input: plain text. Input example: http://ws02.iula.upf.edu/panacea/examples/ws/freeling_tagging/freeling_tagging.input.example.txt Output format: word, lemma, tag, probability, word-char-start and word-char-end all tab separated. Output format example: ...

Provider: ws-iulaterm-upf-edu

Freeling Tokenizer UPF

REST

Categories:

Languages:

PASSED

This WS deploys a FreeLing-based text tokenizer (v 4.1). The WS splits a file in plain text format and UTF-8 encoded into units (tokens). The languages supported are Catalan, English, Galician, Italian, Portuguese, Russian, Spanish, Welsh, and Asturian.

Provider: ws-iulaterm-upf-edu

Linescrambler Web Service

REST

Categories:

Alignment

Languages:

Language Independent

PASSED

This WS scrambles the lines in a file. The goal is to make it difficult to reproduce the original text. The input size limit is 100 MB. Language independent WS.

Provider: ws-iulaterm-upf-edu

Linescrambler parallel Web Service

REST

Categories:

Alignment

Languages:

none Help to add a language to this...

PASSED

This WS will scramble the lines in a parallel text corpus keeping the alignment. The goal is to make it difficult to reproduce the original text. The input size limit is 100 MB. Language independent WS. Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: . es: . en: Web service to scramble the lines in a parallel corpus. output : installation : Clam Web ...

Provider: ws-iulaterm-upf-edu

File splitter Web Service

REST

Categories:

Languages:

Language Independent

PASSED

Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: Donat un fitxer, el parteix en fitxers més petits, del nombre de línies indicat com a paràmetre d'entrada (defecte 1000 línies) es: Dado un fichero, lo parte en ficheros más pequeños, del número de líneas indicado como parámetro de entrada (defecto 1000 lineas). en: Given a file, split it into smaller files containing the ...

Provider: ws-iulaterm-upf-edu

Columns selector Web Service

REST

Categories:

Format Conversion

Languages:

Language Independent

PASSED

Processor to extract desired data by columns. Based on Linux awk. Columns: indicate the columns number you desire separated by commas. Input: Raw data. Default column separator is blank space or tabs. You can optionally specify the input and output separators. Example: Columns: 4,2 Input: http://ws02.iula.upf.edu/panacea/examples/ws/columns_selector/input_example_1.txt or http://ws02.iula.upf.edu/panacea/examples/ws/columns_selector/input_example_2.txt Output example: http://ws02.iula.upf....

Provider: ws-iulaterm-upf-edu

IULA text converter Web Service

REST

Categories:

Format Conversion

Languages:

Language Independent

PASSED

Convert character encoding of given files from one encoding to another. Based on the Linux command that converts text from one encoding to another encoding. Details: ds_lsr_analysis : analysis : analysis_extension : output : name : iconv

Provider: ws-iulaterm-upf-edu

MS Word to text converter Web Service

REST

Categories:

Format Conversion

Languages:

none Help to add a language to this...

PASSED

Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: conversor de Word doc a txt. es: conversor de Word doc a txt. en: Word doc to txt converter. installation : Clam Web Service default installation output : name : catdoc type : Format_Conversion

Provider: ws-iulaterm-upf-edu

Stream editor Web Service

REST

Categories:

Corpus Processing

Languages:

none Help to add a language to this...

PASSED

This WS is used to filter text. It extracts part of a file using pattern matching or substituting multiple occurrences of a string within a file with the sed command. Sed is typically used for extracting part of a file using pattern matching or substituting multiple occurrences of a string within a file. Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: . es: . ...

Provider: ws-iulaterm-upf-edu

HTML to text converter Web Service

REST

Categories:

Format Conversion

Languages:

none Help to add a language to this...

PASSED

A WS to convert HTML documents to plain text format. Language independent WS. Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: conversor d'html a txt. es: conversor de html a txt. en: Html to txt converter. installation : Clam Web Service default installation output : name : html2text type : Format_Conversion

Provider: ws-iulaterm-upf-edu

PDF to text converter Web Service

REST

Categories:

Format Conversion

Languages:

none Help to add a language to this...

PASSED

This WS converts PDF documents to plain text format. Language independent WS. Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: conversor de pdf a txt. es: conversor de pdf a txt. en: pdf to txt converter. installation : Clam Web Service default installation output : name : pdftotext type : Format_Conversion

Provider: ws-iulaterm-upf-edu

TMX shuffling Web Service

REST

Categories:

Alignment

Languages:

none Help to add a language to this...

PASSED

Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: . es: . en: Tmx translation units scrambler output : installation : Clam Web Service default installation name : tmx_shuffling type : Others

Provider: ws-iulaterm-upf-edu

Bohnet parser Web Service

REST

Categories:

Syntactic Tagging

Languages:

none Help to add a language to this...

PASSED

Details: ds_lsr_analysis : analysis : input : analysis_extension : description : cat: Analitzador de dependències utilitzant Bohnet Parser es: Analizador de dependencias utilizando Bohnet Parser en: Dependency parsing using Bohnet's graph-based Parser. installation : Clam Web Service default installation output : name : bohnet_parser type : Syntactic_Tagging

Provider: ws-iulaterm-upf-edu

MaltParser Web Service

REST

Categories:

Languages:

Spanish, Castilian

PASSED

Details: ds_lsr_analysis : analysis : analysis_extension : input : description : cat: Analitzador de dependències utilitzant Malt Parser es: Analizador de dependencias utilizando Malt Parser en: Dependency parsing using Malt Parser. output : installation : Clam Web Service default installation name : malt_parser type : Syntactic_Tagging

Provider: ws-iulaterm-upf-edu

Vocabulary analyzer Web Service

REST

Categories:

Statistics Analysis

Languages:

Language Independent

PASSED

This web service calculates different lexicometric measures and displays them graphically (tokens, types, hapaxes & type/token ratio). Input: plain text corpus with one token per line Input example: This is an example

Provider: ws-iulaterm-upf-edu

« Previous 1 2 Next »

Favourites (0)

None