AttributesValues
type
label
  • IULA Spanish-English Technical Corpus
Description
  • The corpus consists of a number of specialized texts (Law, Economics, Medicine, Environment and Computer Science domains) available in both Spanish and English languages. This LSP corpus has been compiled with articles from specialized publications, PhD theses, etc. It contains about a total of about 2,1 M words in 127 documents in each language.
Documentation
Annotation Mode
Annotation Standoff
  • true
Annotation Tool
  • TreeTagger
Annotation Type
Character Encoding
Contact Person
Creation Mode
Domain
  • medicine
  • economy
  • environment
  • computer science
  • law
Funding Project
Identifier
Language Code
Language Identifier
  • en
  • es
Language Name
  • English
  • Spanish
Licence
Media Type
MetaShare Identifier
  • NOT_DEFINED_FOR_V2
Mime Type
Multilinguality Type
Original Source
  • IULACT http://bwananet.iula.upf.edu
Resource Creator
Resource Name
  • IULA Spanish-English Technical Corpus
Resource Short Name
  • bilingual corpus
Segmentation Level
Size Information
Tagset
  • MULTEX/PAROLE
Url
Alternative Linked Data Views: Sponger | iSPARQL | ODE     Raw Data in: CXML | CSV | RDF ( N-Triples N3/Turtle JSON XML ) | OData ( Atom JSON )    About   
This material is Open Knowledge   W3C Semantic Web Technology [RDF Data] This material is Open Knowledge Creative Commons License Valid XHTML + RDFa
This work is licensed under a Creative Commons Attribution-Share Alike 3.0 Unported License.
OpenLink Virtuoso version 06.01.3127, on Linux (x86_64-pc-linux-gnu), Standard Edition
Copyright © 2009-2011 OpenLink Software