Edu/gum EWT 254old Standard Universal Dependencies Corpus for English, built over à la recherche d'hommes beau the source material of the English Web Treebank LDC2012T13 Contributors: Natalia Silveira, Timothy Dozat, Christopher Manning, Sebastian Schuster, John Bauer, Miriam Connor, Marie-Catherine de Marneffe, Nathan Schneider, Sam Bowman, Hanzhi Zhu, Daniel Galbraith Repository.
The.0 and.2 releases are brought forward because of their usage in the CoNLL 20 Multilingual Parsing Shared Tasks.
64 treebanks, 47 languages, released November 15, 2016.
See here for comparative statistics of Polish treebanks.
Finnish 3 377K Uralic, Finnic Finnish treebanks TDT 202K UD_Finnish-TDT is based on the Turku Dependency Treebank (TDT a broad-coverage dependency treebank of general Finnish covering numerous genres.Ancient Greek treebanks, proiel 214K, uD_Ancient_Greek-proiel is converted from the Ancient Greek data in the proiel treebank, and consists of the New Testament plus selections from Herodotus.FicTree 167K FicTree is a treebank of Czech fiction, automatically converted into the UD format.Welsh 1 - IE, Celtic Welsh treebanks CCG - Corpws Cystrawennol y Gymraeg Language documentation The language hub documentation has not yet been created or ported from the UDv1 documentation.Hungarian 1 42K Uralic, Ugric Hungarian treebanks Szeged 42K Please add a summary section to the treebank readme file Contributors: Richárd Farkas, Katalin Simkó, Zsolt Szántó, Viktor Varga, Veronika Vincze Repository master dev readme Treebank hub page Language documentation The language hub documentation has not.Romanian 2 239K IE, Romance Romanian treebanks RRT 218K The Romanian UD treebank (called RoRefTrees) (Barbu Mititelu., 2016) is the reference treebank in UD format for standard Romanian.Nynorsk 301K The Norwegian UD treebank is based on the Nynorsk section of the Norwegian Dependency Treebank (NDT which is a syntactic treebank of Norwegian.Turkish 3 74K Turkic, Southwestern Turkish treebanks imst 58K The UD Turkish Treebank, also called the imst-UD Treebank, is a semi-automatic conversion of the imst Treebank (Sulubacak., 2016).
The original sentences are from Corpus of Historical Japanese' (CHJ).
Version.1 treebanks are archived at t/11234/LRT-1478.
The data is in Simplified Chinese.
Congo, madagascar, rencontre vs burkina Faso Niger Rwanda Guinee Tchad Haiti Burundi Benin Togo En ce moment : 55063 Femmes et 57008 Hommes Sont En Ligne Derniers membres inscrits, tchat Français : Asami, Zoë, Nadia, Tatiana, Alice, Belgique, tchat Belge : Catherine, Tura, Inés, Jackie, Susan, Célibataires.Perseus 29K This Universal Dependencies Latin Treebank consists of an automatic conversion of a selection of passages from the Ancient Greek and Latin Dependency Treebank.1 See here for comparative statistics of Latin treebanks.The corpus consists of public government documents.Old Church Slavonic 1 57K IE, Slavic Old Church Slavonic treebanks proiel 57K The Old Church Slavonic (OCS) UD treebank is based on the Old Church Slavonic data from the proiel treebank and contains the text of the Codex Marianus New Testament translation.Current UD Languages, information about language families (and genera for families with multiple branches) is mostly taken from.French 6 1,134K IE, Romance French treebanks ParTUT 28K UD_French-ParTUT is a conversion of a multilingual parallel treebank developed at the University of Turin, and consisting of a variety of text genres, including talks, legal texts and Wikipedia articles, among others.English 6 576K IE, Germanic English treebanks ParTUT 49K UD_English-ParTUT is a conversion of a multilingual parallel treebank developed at the University of Turin, and consisting of a variety of text genres, including talks, legal texts and Wikipedia articles, among others.