

{"id":130,"date":"2021-08-31T09:45:59","date_gmt":"2021-08-31T07:45:59","guid":{"rendered":"https:\/\/tremolo.irisa.fr\/?page_id=130"},"modified":"2022-08-18T11:55:09","modified_gmt":"2022-08-18T09:55:09","slug":"tremolo-web","status":"publish","type":"page","link":"http:\/\/tremolo.irisa.fr\/fr\/tremolo-web\/","title":{"rendered":"Corpus TREMoLo-Web"},"content":{"rendered":"<p><strong>TREMoLo-Web <\/strong>est un corpus de 825 000 textes r\u00e9cup\u00e9r\u00e9s sur le web repr\u00e9sentant un total d&rsquo;environ 750 millions de mots. Les pages web ont \u00e9t\u00e9 r\u00e9cup\u00e9r\u00e9e automatiquement sur la base de requ\u00eates sp\u00e9cifiques aux registres familier et soutenu mais sans contrainte sur la source. Les pages ont \u00e9t\u00e9 segment\u00e9es en segments (textes du corpus) de 5000 caract\u00e8res maximum. Ces segments ont \u00e9t\u00e9 annot\u00e9s de mani\u00e8re semi-automatique dans les registres de langue (familier, courant, soutenu).<\/p>\n<p>Veuillez nous contacter si vous souhaiter obtenir le corpus.<\/p>\n<p>&nbsp;<\/p>","protected":false},"excerpt":{"rendered":"<p>TREMoLo-Web est un corpus de 825 000 textes r\u00e9cup\u00e9r\u00e9s sur le web repr\u00e9sentant un total d&rsquo;environ 750 millions de mots. Les pages web ont \u00e9t\u00e9 r\u00e9cup\u00e9r\u00e9e automatiquement sur la base de requ\u00eates sp\u00e9cifiques aux registres familier et soutenu mais sans contrainte sur la source. Les pages ont \u00e9t\u00e9 segment\u00e9es en\u2026<\/p>\n<p> <a class=\"continue-reading-link\" href=\"http:\/\/tremolo.irisa.fr\/fr\/tremolo-web\/\"><span><\/span><i class=\"crycon-right-dir\"><\/i><\/a> <\/p>\n","protected":false},"author":1285,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-130","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"http:\/\/tremolo.irisa.fr\/fr\/wp-json\/wp\/v2\/pages\/130","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/tremolo.irisa.fr\/fr\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"http:\/\/tremolo.irisa.fr\/fr\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"http:\/\/tremolo.irisa.fr\/fr\/wp-json\/wp\/v2\/users\/1285"}],"replies":[{"embeddable":true,"href":"http:\/\/tremolo.irisa.fr\/fr\/wp-json\/wp\/v2\/comments?post=130"}],"version-history":[{"count":3,"href":"http:\/\/tremolo.irisa.fr\/fr\/wp-json\/wp\/v2\/pages\/130\/revisions"}],"predecessor-version":[{"id":161,"href":"http:\/\/tremolo.irisa.fr\/fr\/wp-json\/wp\/v2\/pages\/130\/revisions\/161"}],"wp:attachment":[{"href":"http:\/\/tremolo.irisa.fr\/fr\/wp-json\/wp\/v2\/media?parent=130"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}