janodijk 2018-08-01+02:00 clarin.eu:cr1:p_1342181139640 CLARIN Netherlands
Resource https://webservices-lst.science.ru.nl/frog Frog Frog: An advanced Natural Language Processing Suite for Dutch (Web Service and Application) v0.15 2011-03-27 https://webservices-lst.science.ru.nl/frog/none yet published 2018-05-16 v0.15 CLARIN-NLCLARIN in the Netherlands184.021.003NWOhttp://www.clarin.nlJan OdijkNational Coordinator
Utrecht, the Netherlands
j.odijk@uu.nlUiL-OTSUtrecht University
20092015
CLARIAH-CORECommon Lab Research Infrastructure for the Arts and the Humanities184.033.101NWOhttp://www.clariah.nlJan OdijkNational Coordinator
Utrecht, the Netherlands
j.odijk@uu.nlUiL-OTSUtrecht University
20152018
NetherlandsNL Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It performs automatic linguistic enrichment such as part of speech tagging, lemmatisation, named entity recognition, shallow parsing, dependency parsing and morphological analysis. All NLP modules are based on TiMBL.
written language tool mono-lingual tool dependency parsing shallow parsing lemmatisation morphological analysis named entity recognition part of speech tagging sentence splitting tokenisation Enriching Data Linguistics general linguistics syntax yes Dutchnld yes 20 21 Online available https://github.com/LanguageMachines/frog not specified not specified POSIX not specified unknown icu not specified http://site.icu-project.org/design/cpp localDesktop libxml2 not specified https://pypi.org/project/libxml2-python/ localDesktop ticcutils not specified localDesktop timbl not specified https://github.com/LanguageMachines/timbl localDesktop libfolia not specified https://github.com/LanguageMachines/libfolia localDesktop mbt not specified https://github.com/LanguageMachines/mbt localDesktop ucto not specified https://github.com/LanguageMachines/ucto localDesktop frogdata not specified https://github.com/LanguageMachines/frogdata localDesktop graphical user interface web application web interface web service text PDF application/pdf text MS-Word application/msword utf8 text FoLiA text/folia+xml utf8 text text/plain ISO-8859-1 text text/plain ISO 8859-15 text text/plain text utf8 Tadpole Columned Output Format text/csv Discourse/Sentence Boundaries Morphosyntax/Inflection Morphosyntax/Lemma Morphosyntax/POS Morphosyntax/Word form Orthography/Token POSTags/DCOI Tagset Semantics/Named Entity Semantics/Named Entity Class NETags/Frog NE Tag Set Syntax/Chunks Syntax/Dependency Relations Syntax/Grammatical Relations Syntax/Multiword Expressions Syntax/Alpino Tagset text utf8 FoLiA text/folia+xml Discourse/Sentence Boundaries Morphosyntax/Inflection Morphosyntax/Lemma Morphosyntax/POS Morphosyntax/Word form Orthography/Token POSTags/DCOI Tagset Semantics/Named Entity Semantics/Named Entity Class NETags/Frog NE Tag Set Syntax/Chunks Syntax/Dependency Relations Syntax/Grammatical Relations Syntax/Multiword Expressions Syntax/Alpino Tagset GNU GPL 3.0 public https://spdx.org/licenses/GPL-3.0 0 EUR Antal van den Bosch
Nijmegen, the Netherlands
a.vandenbosch@let.ru.nl Center for Language and Speech Technology Radboud University Nijmegen https://www.ru.nl/clst/
Iris Hendrickx, Antal van den Bosch, Maarten van Gompel, Ko van der Sloot, Walter Daelemans. 2016.Frog: An Natural Language Processing suite for Dutch. CLST Technical Report 16-02. user https://github.com/LanguageMachines/frog/blob/master/docs/frogmanual.pdf eng Webservice Specification user https://webservices-lst.science.ru.nl/frog/info eng readme user https://github.com/LanguageMachines/frog/blob/master/README.md eng releaseNotes user https://github.com/LanguageMachines/frog/releases eng issueTracker technical https://github.com/LanguageMachines/frog/issues eng contIntegration technical https://travis-ci.org/LanguageMachines/frog eng technical report technical no Iris Hendrickx, Antal van den Bosch, Maarten van Gompel, Ko van der Sloot and Walter Daelemans. 2016.Frog: A Natural Language Processing Suite for Dutch. CLST Technical Report 16-02, pp 99-114. Nijmegen, the Netherlands. https://github.com/LanguageMachines/frog/blob/master/docs/frogmanual.pdf in proceedings scientific background yes Van den Bosch, A., Busser, G.J., Daelemans, W., and Canisius, S. (2007). An efficient memory-based morphosyntactic tagger and parser for Dutch, In F. van Eynde, P. Dirix, I. Schuurman, and V. Vandeghinste (Eds.), Selected Papers of the 17th Computational Linguistics in the Netherlands Meeting, Leuven, Belgium, pp. 99-114. http://ilk.uvt.nl/downloads/pub/papers/tadpole-final.pdf http://dev.clarin.nl/sites/default/files/froglogo.svg CGN Corpus Gesproken Nederlands NWO CGN Corpus Gesproken Nederlands NWO CLARIN-NL <funder>NWO</funder> <url/> <Contact> <Person/> <Email/> <Organisation xml:lang="eng"/> </Contact> <Duration/> </Project> <Project> <name>CLARIAH-CORE</name> <title/> <funder>NWO</funder> <url/> <Contact> <Person/> <Email/> <Organisation xml:lang="eng"/> </Contact> <Duration/> </Project> <Creator> <Contact> <Person>Antal van den Bosch</Person> <Email/> <Organisation xml:lang="eng"/> </Contact> </Creator> <Creator> <Role> project lead </Role> <Contact> <Person> Antal van den Bosch </Person> <Address>Nijmegen, the Netherlands</Address> <Email> a.vandenbosch@let.ru.nl </Email> <Department>Center for Language and Speech Technology</Department> <Organisation> Radboud University Nijmegen </Organisation> <Url> https://www.ru.nl/clst/ </Url> </Contact> </Creator> <Creator> <Role> software developer </Role> <Contact> <Person> Maarten van Gompel </Person> <Address>Nijmegen, the Netherlands</Address> <Email> proycon@anaproy.nl </Email> <Department>Center for Language and Speech Technology</Department> <Organisation> Radboud University Nijmegen </Organisation> <Url> https://www.ru.nl/clst/ </Url> </Contact> </Creator> <Creator> <Role> software developer </Role> <Contact> <Person> Ko van der Sloot </Person> <Address>Nijmegen, the Netherlands</Address> <Department>Center for Language and Speech Technology</Department> <Organisation> Radboud University Nijmegen </Organisation> <Url> https://www.ru.nl/clst/ </Url> </Contact> </Creator> <Creator> <Role> software developer </Role> <Contact> <Person> Bertjan Busser </Person> <Address>Tilburg, the Netherlands</Address> <Organisation> Tilburg University </Organisation> <Url> https://www.tilburguniversity.edu/research/institutes-and-research-groups/ticc/ </Url> </Contact> </Creator> <Creator> <Role> software developer </Role> <Contact> <Person> Iris Hendrickx </Person> <Address>Nijmegen, the Netherlands</Address> <Department>Center for Language and Speech Technology</Department> <Organisation> Radboud University Nijmegen </Organisation> <Url> https://www.ru.nl/clst/ </Url> </Contact> </Creator> <Creator> <Role> software developer </Role> <Contact> <Person> Sander Canisius </Person> <Organisation> CSI-DP </Organisation> </Contact> </Creator> <Creator> <Role> software developer </Role> <Contact> <Person> Jakub Zavrel </Person> <Organisation> MBT </Organisation> </Contact> </Creator> </SoftwareDevelopment> <TechnicalInfo> <ImplementationLanguage> <implementationLanguage>C++</implementationLanguage> <version>unknown</version> </ImplementationLanguage> </TechnicalInfo> <LRS> <Authentication>Yes. Before tool use, please register at https://webservices-lst.science.ru.nl/register.</Authentication> <Description><Description>Frog (plain text input)</Description></Description> <ToolTasks> <toolTask>dependency parsing</toolTask> <toolTask>shallow parsing</toolTask> <toolTask>lemmatisation</toolTask> <toolTask>morphological analysis</toolTask> <toolTask>named entity recognition</toolTask> <toolTask>part of speech tagging</toolTask> <toolTask>sentence splitting</toolTask> <toolTask>tokenisation</toolTask> </ToolTasks> <Input> <characterEncoding>utf8</characterEncoding> <inputType>text</inputType> <MimeType><MimeType>text/plain</MimeType></MimeType> </Input> <Output><outputType>text</outputType> <characterEncoding>utf8</characterEncoding> <Schema><schemaname>Tadpole Columned Output Format</schemaname></Schema> <MimeType><MimeType>text/csv</MimeType></MimeType> <AnnotationType> <AnnotationType>Discourse/Sentence Boundaries</AnnotationType> <AnnotationType>Morphosyntax/Inflection</AnnotationType> <AnnotationType>Morphosyntax/Lemma</AnnotationType> <AnnotationType>Morphosyntax/POS</AnnotationType> <AnnotationType>Morphosyntax/Word form</AnnotationType> <AnnotationType>Orthography/Token</AnnotationType> <TagSet>POSTags/DCOI Tagset</TagSet> </AnnotationType> <AnnotationType> <AnnotationType>Semantics/Named Entity</AnnotationType> <AnnotationType>Semantics/Named Entity Class</AnnotationType> <TagSet>NETags/Frog NE Tag Set</TagSet> </AnnotationType> <AnnotationType> <AnnotationType>Syntax/Chunks</AnnotationType> <AnnotationType>Syntax/Dependency Relations</AnnotationType> <AnnotationType>Syntax/Grammatical Relations</AnnotationType> <AnnotationType>Syntax/Multiword Expressions</AnnotationType> <TagSet>Syntax/Alpino Tagset</TagSet> </AnnotationType> </Output> <Output><outputType>text</outputType> <characterEncoding>utf8</characterEncoding> <Schema><schemaname>FoLiA</schemaname></Schema> <MimeType><MimeType>text/folia+xml</MimeType></MimeType> <AnnotationType> <AnnotationType>Discourse/Sentence Boundaries</AnnotationType> <AnnotationType>Morphosyntax/Inflection</AnnotationType> <AnnotationType>Morphosyntax/Lemma</AnnotationType> <AnnotationType>Morphosyntax/POS</AnnotationType> <AnnotationType>Morphosyntax/Word form</AnnotationType> <AnnotationType>Orthography/Token</AnnotationType> <TagSet>POSTags/DCOI Tagset</TagSet> </AnnotationType> <AnnotationType> <AnnotationType>Semantics/Named Entity</AnnotationType> <AnnotationType>Semantics/Named Entity Class</AnnotationType> <TagSet>NETags/Frog NE Tag Set</TagSet> </AnnotationType> <AnnotationType> <AnnotationType>Syntax/Chunks</AnnotationType> <AnnotationType>Syntax/Dependency Relations</AnnotationType> <AnnotationType>Syntax/Grammatical Relations</AnnotationType> <AnnotationType>Syntax/Multiword Expressions</AnnotationType> <TagSet>Syntax/Alpino Tagset</TagSet> </AnnotationType> </Output> <ActualParameters><!--0-1 --> <ActualParameter><!--1 - unbounded --> <ActualParameterName>project</ActualParameterName> <ActualParameterValue>new</ActualParameterValue> </ActualParameter> <ActualParameter><!--1 - unbounded --> <ActualParameterName>input</ActualParameterName> <ActualParameterValue>self.linkToResource</ActualParameterValue> </ActualParameter> </ActualParameters> <LRSMapping> <LRSParameterName>input</LRSParameterName> <ActualParameterName>maininput_url</ActualParameterName> </LRSMapping> </LRS> <LRS> <Authentication>Yes. Before tool use, please register at https://webservices-lst.science.ru.nl/register.</Authentication> <Description><!-- 0-1 --><Description>Frog (folia+xml input)</Description></Description> <ToolTasks><!-- 1-1 --> <toolTask>dependency parsing</toolTask> <toolTask>lemmatisation</toolTask> <toolTask>morphological analysis</toolTask> <toolTask>named entity recognition</toolTask> <toolTask>part of speech tagging</toolTask> <toolTask>sentence splitting</toolTask> <toolTask>tokenisation</toolTask> </ToolTasks> <Input> <characterEncoding>utf8</characterEncoding> <inputType>text</inputType> <Schema><schemaname>FoLiA</schemaname></Schema> <MimeType><MimeType>text/folia+xml</MimeType></MimeType> </Input> <Output><outputType>text</outputType> <characterEncoding>utf8</characterEncoding> <Schema><schemaname>Tadpole Columned Output Format</schemaname></Schema> <MimeType><MimeType>text/csv</MimeType></MimeType> </Output> <Output><outputType>text</outputType> <characterEncoding>utf8</characterEncoding> <Schema><schemaname>FoLiA</schemaname></Schema> <MimeType><MimeType>text/folia+xml</MimeType></MimeType> </Output> <ActualParameters><!--0-1 --> <ActualParameter><!--1 - unbounded --> <ActualParameterName>project</ActualParameterName> <ActualParameterValue>new</ActualParameterValue> </ActualParameter> <ActualParameter><!--1 - unbounded --> <ActualParameterName>input</ActualParameterName> <ActualParameterValue>self.linkToResource</ActualParameterValue> </ActualParameter> </ActualParameters> <LRSMapping> <LRSParameterName>input</LRSParameterName> <ActualParameterName>foliainput_url</ActualParameterName> </LRSMapping> </LRS> </ClarinSoftwareDescription> </Components> </CMD>