rogierkraf 2013-11-30+02:00 clarin.eu:cr1:p_1342181139640 CLARIN Netherlands
Resource http://openconvert.clarin.inl.nl/ OpenConvert OpenConvert 2015 http://openconvert.clarin.inl.nl Dutch Language Institute http://portal.clarin.nl/node/4224 released 2015-10-07 CLARIN-NL CLARIN in the Netherlands 184.021.003 NWO http://www.clarin.nl Jan Odijk National Coordinator
Utrecht, the Netherlands
j.odijk@uu.nl UiL-OTS Utrecht University
2009-2015
CLARIAH-CORE Common Lab Research Infrastructure for the Arts and the Humanities 184.033.101 NWO http://www.clariah.nl Jan Odijk National Coordinator
Utrecht, the Netherlands
j.odijk@uu.nl UiL-OTS Utrecht University
2015-2018
Netherlands NL The OpenConvert tools convert a number of input formats (ALTO, plain text, Word, HTML, ePub) to TEI or FoLiA. The tools are available as a Java command line tool, a web service and a web application. The OpenConvert tools were created by IVDNT in the OpenConvert project. Furthermore, as a proof of concept, the website currently provides two annotation tools: a simple tokenizer for TEI files and a part-of-speech tagger for modern Dutch.
conversion tool annotation tool written language tool corpus processing format conversion text conversion tokenisation part of speech tagging Enriching Data Linguistics Religion Studies Communication and Media Studies Cultural Sciences History Literary Studies Philosophy Political Studies no no Download Online available https://github.com/INL/OpenConvert command line interface local desktop graphical user interface web application other web service The tool service can be called as a REST web service that returns responses in XML, so it can be part of a web service tool chain. text input text text/plain application/msword text/html Input TEI, plain text, HTML text input text ALTO text/xml ALTO XML input text input text application/epub+zip ePub input directory input text directory containing files of a valid input type zipped text input text application/zip zip file (with extension .zip) containing files of a valid input type text UTF8 conversion result FoLiA https://github.com/proycon/folia/blob/master/schemas/folia.rng text/xml text UTF8 conversion result TEI text/xml other public https://github.com/INL/OpenConvert Free for academic use. Not applicable for commercial parties. CLARIN-based login required. The CLARIN federation accepts logins from many European institutions.
Please see http://www.clarin.eu/content/service-provider-federation for more details. 0 EUR servicedesk@ivdnt.org Instituut voor de Nederlandse Taal Institute for the Dutch Language http://www.ivdnt.org/ OpenConvert help user http://openconvert.clarin.inl.nl/openconvert/web/help.html eng OpenConvert help technical http://openconvert.clarin.inl.nl/openconvert/web/help.html eng http://dev.clarin.nl/sites/default/files/picture.jpg OpenConvert OpenConvert http://clarin.nl http://portal.clarin.nl/node/4224 Jan Theo Bakker jantheo.bakker@ivdnt.org Instituut voor de Nederlandse Taal Institute for the Dutch Language http://www.ivdnt.org/over-ons/contact/medewerkers Developer Jan Theo Bakker jantheo.bakker@ivdnt.org Instituut voor de Nederlandse Taal Institute for the Dutch Language http://www.ivdnt.org/over-ons/contact/medewerkers Java unknown Format conversion, tokenisation, part of speech tagging (the latter for Dutch)
input input file name (File upload) xsd:string false http://hdl.handle.net/11459/CCR_C-3825_36820064-e2e2-a526-9eea-827cff915dbb
format Format of input file xsd:string true
  tei input file mimetype is application/tei+xml
  html input file mimetype is text/html
  alto input file mimetype is text/alto+xml
  word input file mimetype is application/msword
  docx input file mimetype is application/msword
  epub input file mimetype is application/epub+zip
  text input file mimetype is text/plain
to Format of output file xsd:string true
  tei output file mimetype is application/tei+xml
  folia output file mimetype is text/folia+xml
tagger to specify the tagger or tokeniser xsd:string true
  chn-tagger Basic tagger-lemmatizer for modern Dutch
  tokenizer a TEI tokenizer
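The parameter descriptions above can be turned into a small client-side check. The following Python sketch is illustrative only: it validates the `format`, `to` and `tagger` values against the lists in this record and assembles a request URL. The endpoint `HYPOTHETICAL_ENDPOINT`, the function name `build_request`, the use of a query string, and the treatment of `tagger` as optional are all assumptions that do not come from this record; consult the OpenConvert help page for the actual calling convention. The file itself would be sent separately under the `input` parameter, which this sketch does not attempt.

```python
from urllib.parse import urlencode

# Allowed values and mimetypes, copied from the parameter descriptions in this record.
INPUT_FORMATS = {
    "tei": "application/tei+xml",
    "html": "text/html",
    "alto": "text/alto+xml",
    "word": "application/msword",
    "docx": "application/msword",
    "epub": "application/epub+zip",
    "text": "text/plain",
}
OUTPUT_FORMATS = {
    "tei": "application/tei+xml",
    "folia": "text/folia+xml",
}
TAGGERS = {"chn-tagger", "tokenizer"}

# ASSUMPTION: placeholder URL, not the real service endpoint.
HYPOTHETICAL_ENDPOINT = "http://example.org/openconvert/convert"


def build_request(fmt, to, tagger=None):
    """Validate parameter values and return a request description.

    ASSUMPTION: parameters are passed as a query string and `tagger`
    may be omitted; the record itself does not state either.
    """
    if fmt not in INPUT_FORMATS:
        raise ValueError(f"unsupported input format: {fmt!r}")
    if to not in OUTPUT_FORMATS:
        raise ValueError(f"unsupported output format: {to!r}")
    params = {"format": fmt, "to": to}
    if tagger is not None:
        if tagger not in TAGGERS:
            raise ValueError(f"unknown tagger: {tagger!r}")
        params["tagger"] = tagger
    return {
        "url": HYPOTHETICAL_ENDPOINT + "?" + urlencode(params),
        "input_mimetype": INPUT_FORMATS[fmt],
        "output_mimetype": OUTPUT_FORMATS[to],
    }


if __name__ == "__main__":
    # Example: ALTO input converted to FoLiA and tagged with the Dutch tagger.
    print(build_request("alto", "folia", tagger="chn-tagger"))
```

Checking values locally before contacting the service avoids a round trip for requests that the listed parameter values already rule out, such as an output format other than `tei` or `folia`.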