{"id":33411,"date":"2019-03-25T16:49:11","date_gmt":"2019-03-25T15:49:11","guid":{"rendered":"http:\/\/www.h-its.org\/downloads\/pos-annotation-for-icsi-meeting-recorder-data\/"},"modified":"2019-05-23T11:00:22","modified_gmt":"2019-05-23T09:00:22","slug":"pos-annotation-for-icsi-meeting-recorder-data","status":"publish","type":"hits-software","link":"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/","title":{"rendered":"POS Annotation for ICSI Meeting Recorder Data"},"content":{"rendered":"\n<p>Here you find the Part of Speech annotation for the ICSI Meeting Recorder Data.\u00a0Please note, that the files only contain the POS information and no word information. You already have to have the ICSI corpus to use this data.\u00a0When using this data, please cite the following paper:\u00a0Margot Mieskes and Michael Strube:<br><strong>Part-of-Speech Tagging of Transcribed Speech\u00a0<\/strong><em>Proceedings of the 5th Conference on Language Resources and Evaluation (LREC 2006).\u00a0<\/em>Genua, Italy, May 22-28, 2006 (<a href=\"http:\/\/www.lrec-conf.org\/proceedings\/lrec2006\/pdf\/345_pdf.pdf\">PDF<\/a>).<br>\u00a0This paper also contains a description of the method used and detailed results.\u00a0<br>The format in the .txt files is one segment per line.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">The Gold Standard files are:<\/h2>\n\n\n\n<p>The Gold Standard files are:Bed016<br>Bed017<br>Bmr001<br>Bmr002<br>Bns003<br>Bmr003<br>Bmr004<br>Bmr005<br>Bsr001<br>Btr001<br>Btr002<br>Buw001<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Downloads:<\/h2>\n\n\n\n<p><a href=\"https:\/\/cosyne.h-its.org\/nlpdl\/dianasumm\/POSAnnotation.tar.gz\">Here<\/a>&nbsp;you find the Gold Standard manual annotation in .txt format.<\/p>\n\n\n\n<p><a href=\"https:\/\/cosyne.h-its.org\/nlpdl\/dianasumm\/ManPosGold.tar.gz\">Here<\/a>&nbsp;you find the Gold Standard manual annotation in .mmax format.<\/p>\n\n\n\n<p><a href=\"https:\/\/cosyne.h-its.org\/nlpdl\/dianasumm\/GoldStand.tar.gz\">Here<\/a>&nbsp;you find the automatic POS annotation for the Gold Standard after retraining the four taggers on the manual data in .txt format.<\/p>\n\n\n\n<p><a href=\"https:\/\/cosyne.h-its.org\/nlpdl\/dianasumm\/GoldStandAuto.tar.gz\">Here<\/a>&nbsp;you find the automatic POS annotation for the Gold Standard after retraining the four taggers on the manual data in .mmax format.<\/p>\n\n\n\n<p><a href=\"https:\/\/cosyne.h-its.org\/nlpdl\/dianasumm\/PosTags.tar.gz\">Here<\/a>&nbsp;you find the automatic POS annotation for the whole corpus after retraining the four taggers on the manual data in .txt format.<\/p>\n\n\n\n<p><a href=\"https:\/\/cosyne.h-its.org\/nlpdl\/dianasumm\/AutoPOS.tar.gz\">Here<\/a>&nbsp;you find the automatic POS annotation for the whole corpus after retraining the four taggers on the manual data in .mmax format.<\/p>\n\n\n\n<p>The four taggers used were the following:<\/p>\n\n\n\n<p>TBL Tagger: Eric Brill Some Advance in transformation based part of speech tagging In Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, Washington 1. \u2013 4. August 1994, pp. 722-727<\/p>\n\n\n\n<p><a href=\"http:\/\/www.coli.uni-saarland.de\/~thorsten\/tnt\">TnT Tagger<\/a>: Thorsten Brants TnT \u2013 A statistical Part Of Speech tagger In Proceedings of the 6th International Conference on Applied Natural Language Processing, Seattle, Washington 29. April \u2013 4. May 2000, pp. 224-231<\/p>\n\n\n\n<p><a href=\"http:\/\/nlp.stanford.edu\/software\/tagger.shtml\">Stanford NLP Library Tagger<\/a>: Kristina Toutanova and Christopher D. Manning Enriching the knowledge sources used in a maximum entropy part-of-speech tagger In Proceedings of the Joint SIGDAT Conference on Empirical methods in Natural Language Processing and very large corpus, Hong Kong 2000, pp. 63-70\u00a0<\/p>\n\n\n\n<p><a href=\"http:\/\/nlp.stanford.edu\/software\/tagger.shtml\">Stanford NLP Library Tagger<\/a>:Kristina Toutanova, Dan Klein, Christopher D. Manning and Yoram Singer Feature-Rich Part-of-Speech Tagging with a cyclic dependency network. In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Edmonton, Alberta, Canada, 27. May \u2013 1. June 2003, pp. 252-259NLP Group<\/p>\n\n\n\n<p><br><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Here you find the Part of Speech annotation for the ICSI Meeting Recorder Data.\u00a0Please note, that the files only contain &#8230;<\/p>\n","protected":false},"featured_media":0,"template":"","hits-research-group":[1302],"hits-software-category":[],"class_list":["post-33411","hits-software","type-hits-software","status-publish","hentry","hits-research-group-nlp-de"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>POS Annotation for ICSI Meeting Recorder Data - HITS gGmbH<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/\" \/>\n<meta property=\"og:locale\" content=\"de_DE\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"POS Annotation for ICSI Meeting Recorder Data - HITS gGmbH\" \/>\n<meta property=\"og:description\" content=\"Here you find the Part of Speech annotation for the ICSI Meeting Recorder Data.\u00a0Please note, that the files only contain ...\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/\" \/>\n<meta property=\"og:site_name\" content=\"HITS gGmbH\" \/>\n<meta property=\"article:modified_time\" content=\"2019-05-23T09:00:22+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Gesch\u00e4tzte Lesezeit\" \/>\n\t<meta name=\"twitter:data1\" content=\"2\u00a0Minuten\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/software\\\/pos-annotation-for-icsi-meeting-recorder-data\\\/\",\"url\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/software\\\/pos-annotation-for-icsi-meeting-recorder-data\\\/\",\"name\":\"POS Annotation for ICSI Meeting Recorder Data - HITS gGmbH\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/#website\"},\"datePublished\":\"2019-03-25T15:49:11+00:00\",\"dateModified\":\"2019-05-23T09:00:22+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/software\\\/pos-annotation-for-icsi-meeting-recorder-data\\\/#breadcrumb\"},\"inLanguage\":\"de\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.h-its.org\\\/de\\\/software\\\/pos-annotation-for-icsi-meeting-recorder-data\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/software\\\/pos-annotation-for-icsi-meeting-recorder-data\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Software\",\"item\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/software\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"POS Annotation for ICSI Meeting Recorder Data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/#website\",\"url\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/\",\"name\":\"HITS gGmbH\",\"description\":\"Heidelberg Institute for Theoretical Studies\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.h-its.org\\\/de\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"de\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"POS Annotation for ICSI Meeting Recorder Data - HITS gGmbH","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/","og_locale":"de_DE","og_type":"article","og_title":"POS Annotation for ICSI Meeting Recorder Data - HITS gGmbH","og_description":"Here you find the Part of Speech annotation for the ICSI Meeting Recorder Data.\u00a0Please note, that the files only contain ...","og_url":"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/","og_site_name":"HITS gGmbH","article_modified_time":"2019-05-23T09:00:22+00:00","twitter_card":"summary_large_image","twitter_misc":{"Gesch\u00e4tzte Lesezeit":"2\u00a0Minuten"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/","url":"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/","name":"POS Annotation for ICSI Meeting Recorder Data - HITS gGmbH","isPartOf":{"@id":"https:\/\/www.h-its.org\/de\/#website"},"datePublished":"2019-03-25T15:49:11+00:00","dateModified":"2019-05-23T09:00:22+00:00","breadcrumb":{"@id":"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/#breadcrumb"},"inLanguage":"de","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.h-its.org\/de\/software\/pos-annotation-for-icsi-meeting-recorder-data\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.h-its.org\/de\/"},{"@type":"ListItem","position":2,"name":"Software","item":"https:\/\/www.h-its.org\/de\/software\/"},{"@type":"ListItem","position":3,"name":"POS Annotation for ICSI Meeting Recorder Data"}]},{"@type":"WebSite","@id":"https:\/\/www.h-its.org\/de\/#website","url":"https:\/\/www.h-its.org\/de\/","name":"HITS gGmbH","description":"Heidelberg Institute for Theoretical Studies","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.h-its.org\/de\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"de"}]}},"publishpress_future_action":{"enabled":false,"date":"2026-05-02 16:00:07","action":"change-status","newStatus":"draft","terms":[],"taxonomy":"hits-research-group","extraData":[]},"publishpress_future_workflow_manual_trigger":{"enabledWorkflows":[]},"_links":{"self":[{"href":"https:\/\/www.h-its.org\/de\/wp-json\/wp\/v2\/hits-software\/33411","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.h-its.org\/de\/wp-json\/wp\/v2\/hits-software"}],"about":[{"href":"https:\/\/www.h-its.org\/de\/wp-json\/wp\/v2\/types\/hits-software"}],"wp:attachment":[{"href":"https:\/\/www.h-its.org\/de\/wp-json\/wp\/v2\/media?parent=33411"}],"wp:term":[{"taxonomy":"hits-research-group","embeddable":true,"href":"https:\/\/www.h-its.org\/de\/wp-json\/wp\/v2\/hits-research-group?post=33411"},{"taxonomy":"hits-software-category","embeddable":true,"href":"https:\/\/www.h-its.org\/de\/wp-json\/wp\/v2\/hits-software-category?post=33411"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}