rogierkraf2013-11-30+02:00clarin.eu:cr1:p_1342181139640CLARIN NetherlandsResourcehttp://openconvert.clarin.inl.nl/OpenConvertOpenConvert2015http://openconvert.clarin.inl.nlDutch Language Institutehttp://portal.clarin.nl/node/4224released2015-10-07CLARIN-NLCLARIN in the Netherlands184.021.003NWOhttp://www.clarin.nlJan OdijkNational CoordinatorUtrecht, the Netherlandsj.odijk@uu.nlUiL-OTSUtrecht University20092015CLARIAH-CORECommon Lab Research Infrastructure for the Arts and the Humanities184.033.101NWOhttp://www.clariah.nlJan OdijkNational CoordinatorUtrecht, the Netherlandsj.odijk@uu.nlUiL-OTSUtrecht University20152018NetherlandsNLThe OpenConvert tools convert to TEI or FOLiA from a number of input formats (alto, text, word, HTML, ePub). The tools are available as a Java command line tool, a web service and a web application.The OpenConvert Tools were created by IVDNT in the OpenConvert project. The OpenConvert tools convert to TEI or FOLiA from a number of input formats (alto, text, word, HTML, ePub). The tools are available as a Java command line tool, a web service and a web application. Furthermore, as a proof of concept, the website currently provides two annotation tools: a simple Tokenizer for TEI files and a modern Dutch part of speech tagger.conversion toolannotation toolwritten language toolcorpus processingformat conversiontext conversiontokenisationpart of speech taggingEnriching DataLinguisticsReligion StudiesCommunication and Media StudiesCultural SciencesHistoryLiterary StudiesPhilosophyPolitical StudiesnonoDownloadOnline availablehttps://github.com/INL/OpenConvertcommand line interfacelocal desktopgraphical user interfaceweb applicationotherweb serviceThe tool service can be called as a REST webservice which returns responses in XML, allowing it to be part of a webservice tool chain.textinput texttext/plainapplication/mswordtext/htmlInput TEI, plain text, HTMLtextinput textALTOtext/xmlALTO XML inputtextinput textapplication/epub+zipePub inputdirectoryinput textdirectory containing files of a valid input typezipped textinput textapplication/zipzip file (with extension .zip) containing files of a valid input typeotherpublichttps://github.com/INL/OpenConvertFree for academic use. Non-applicable for commercial partiesCLARIN based login required. The Clarin federation accepts login from many europian institutions. please seehttp://www.clarin.eu/content/service-provider-federation for more details 0EURservicedesk@ivdnt.orgInstituut voor de Nederlandse TaalInstitute for the Dutch Languagehttp://www.ivdnt.org/OpenConvert helpuserhttp://openconvert.clarin.inl.nl/openconvert/web/help.htmlengOpenConvert helptechnicalhttp://openconvert.clarin.inl.nl/openconvert/web/help.htmlengOpenConvertOpenConverthttp://clarin.nlhttp://portal.clarin.nl/node/4224Jan Theo Bakkerjantheo.bakker@ivdnt.orgInstituut voor de Nederlandse TaalInstitute for the Dutch Languagehttp://www.ivdnt.org/over-ons/contact/medewerkersDeveloperJan Theo Bakkerjantheo.bakker@ivdnt.orgInstituut voor de Nederlandse TaalInstitute for the Dutch Languagehttp://www.ivdnt.org/over-ons/contact/medewerkersJavaunknownFormat Conversion, tokenisation, part of speech tagging (the latter for Dutch)
inputinput file name (File upload)xsd:stringfalsehttp://hdl.handle.net/11459/CCR_C-3825_36820064-e2e2-a526-9eea-827cff915dbbformatFormat of input filexsd:stringtrueteiinput file mimetype is application/tei+xmlhtmlinput file mimetype is text/htmlaltoinput file mimetype is text/alto+xmlwordinput file mimetype is application/msworddocxinput file mimetype is application/mswordepubinput file mimetype is application/epub+ziptextinput file mimetype is text/plaintoFormat of output filexsd:stringtrueteioutput file mimetype is application/tei+xmlfoliaoutput file mimetype is text/folia+xmltaggerto specify the tagger or tokeniserxsd:stringtruechn-taggerBasic tagger-lemmatizer for modern Dutchtokenizera TEI tokenizer