Commit 8bd59252 authored by Bruno Guillaume

add website content

parent 68052604
website/public
selfdoc:
	@echo " * make run --> run the server locally"
	@echo " * make lchn --> install on the production server"
	@echo " * make build --> build the website"
run:
	hugo server -w &
	open -a firefox -g http://localhost:1313/
build:
	@make -C static/parsing run
	@make -C static/parsing img
	@make -C static/deep_syntax run
	@make -C static/deep_syntax img
	hugo
lchn:
	hugo
	scp -r public/* $(slchn)/www/deep-sequoia/
purge:
	@make -C static/parsing purge
	@make -C static/deep_syntax purge
languageCode = "en-us"
title = "Deep-sequoia"
baseURL = "http://grew.fr/deep-sequoia/"
enableEmoji=true
theme = "hyde"
[params]
themeColor = "theme-base-09"
+++
Description = ""
menu = "main"
Categories = ["Development","GoLang"]
Tags = ["Development","golang"]
date = "2018-03-26T13:34:37+02:00"
title = "contact"
+++
# Contact
For any remark, suggestion or correction concerning the corpus:
* You can send an email to [deep-sequoia@inria.fr](mailto:deep-sequoia@inria.fr).
* You can file a new issue on [GitLab](https://gitlab.inria.fr/sequoia/deep-sequoia/issues) (you need to [register](https://gitlab-account.inria.fr/) first).
If you refer to a specific sentence of the corpus, please give the identifier of the sentence, i.e. something like:
* `annodis.er_00xxx`
* `emea-fr-dev_00xxx`
* `emea-fr-test_00xxx`
* `Europar.550_00xxx`
* `frwiki_50.1000_00xxx`
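
As an illustration, the identifiers listed above appear to consist of a sub-corpus prefix and a numeric suffix separated by the last underscore. The following Python sketch splits an identifier under that assumption; the function name `split_sentence_id` is hypothetical and is not part of any Deep-sequoia tooling.

```python
def split_sentence_id(sent_id: str) -> tuple[str, str]:
    """Split an identifier such as 'annodis.er_00106' into (prefix, number).

    Assumption: the numeric part is whatever follows the last underscore.
    """
    prefix, sep, number = sent_id.rpartition("_")
    if not sep or not prefix or not number.isdigit():
        raise ValueError(f"unexpected sentence identifier: {sent_id!r}")
    return prefix, number


print(split_sentence_id("annodis.er_00106"))  # ('annodis.er', '00106')
```

Splitting on the last underscore rather than the first keeps prefixes that themselves contain an underscore, such as `frwiki_50.1000`, in one piece.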
+++
Tags = ["Development","golang"]
Description = ""
date = "2017-02-27T17:44:22+01:00"
title = "index"
menu = "main"
Categories = ["Development","GoLang"]
+++
# Deep-sequoia: A French corpus with surface and deep syntactic annotations
**Deep-sequoia** is a corpus of French sentences annotated with both surface and deep syntactic dependency structures.
It is freely available under the LGPL-LR license.
The latest release is **version 8.1** (2017.10.17).
For more details about the annotations, please consult [the annotation guidelines (in French)](http://passage.inria.fr/deepwiki/node/19).
## References
**Marie Candito**, **Guy Perrier**, **Bruno Guillaume**, **Corentin Ribeyre**, **Karën Fort**, **Djamé Seddah** and **Éric de la Clergerie**. (2014) [*Deep Syntax Annotation of the Sequoia French Treebank.*](http://hal.inria.fr/docs/00/97/15/74/PDF/deep_sequoia.final_with_keywords.pdf) Proc. of LREC 2014, Reykjavik, Iceland.
**Guy Perrier**, **Marie Candito**, **Bruno Guillaume**, **Corentin Ribeyre**, **Karën Fort** and **Djamé Seddah**. (2014) [*Un schéma d’annotation en dépendances syntaxiques profondes pour le français.*](http://talc2.loria.fr/deep-sequoia/papers/syntaxe_profonde.pdf) Proc. of TALN 2014, Marseille, France.
+++
date = "2018-03-26T13:40:35+02:00"
title = "versions"
menu = "main"
Categories = ["Development","GoLang"]
Tags = ["Development","golang"]
Description = ""
+++
# Versions
The latest version of the **Deep-Sequoia** corpus is version 8.1, released in October 2017.
| Version | Date | UD-version | Description | Grew-match |
|:-------:|:----------:|:----------:|-----------|:--------------:|
| [8.1](http://talc2.loria.fr/deep-sequoia/sequoia-8.1.tgz) | 2017.10.21 | 2.1 | | [surf](http://match.grew.fr?corpus=sequoia.surf@8.1)     [deep&surf](http://match.grew.fr?corpus=sequoia.deep_and_surf@8.1) |
| [8.0](http://talc2.loria.fr/deep-sequoia/sequoia-8.0.tgz) | 2017.03.13 | 2.0 | Fix errors, change encoding of fixed expressions (see below) | [surf](http://match.grew.fr?corpus=sequoia.surf@8.0)     [deep&surf](http://match.grew.fr?corpus=sequoia.deep_and_surf@8.0) |
| [7.0](http://talc2.loria.fr/deep-sequoia/sequoia-7.0.tgz) | 2015.11.13 | | Fix errors by systematic search of inconsistency in annotation.| [surf](http://match.grew.fr?corpus=sequoia.surf@7.0)     [deep&surf](http://match.grew.fr?corpus=sequoia.deep_and_surf@7.0)|
| [1.1](http://talc2.loria.fr/deep-sequoia/deep-sequoia-1.1/deep-sequoia-1.1.conll) | 2014.06.05 | | Fix some lemmas, fix 3 sentences with multiple surface roots | |
| [1.0](http://talc2.loria.fr/deep-sequoia/deep-sequoia-1.0/deep-sequoia-1.0.conll) | 2014.05.29 | | First release | |
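
The Grew-match links in the table follow a regular pattern for versions 7.0, 8.0 and 8.1. A minimal Python sketch of that pattern is given below; the function name `grew_match_url` is hypothetical, and the pattern is only read off the rows above, so it should not be assumed to hold for versions without a Grew-match link.

```python
def grew_match_url(version: str, deep: bool = False) -> str:
    """Build a Grew-match URL following the pattern used in the table.

    `deep=True` selects the combined deep and surface corpus,
    otherwise the surface-only corpus is used.
    """
    corpus = "sequoia.deep_and_surf" if deep else "sequoia.surf"
    return f"http://match.grew.fr?corpus={corpus}@{version}"


print(grew_match_url("8.1"))             # http://match.grew.fr?corpus=sequoia.surf@8.1
print(grew_match_url("8.1", deep=True))  # http://match.grew.fr?corpus=sequoia.deep_and_surf@8.1
```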
# Notes
## Version numbers
In the 2015 release, the version number was set to **7.0** to align with the previous releases of the **Sequoia** corpus (before the introduction of deep dependencies).
## UD Versions
Since 2017, the Sequoia corpus (surface only) has been converted into the Universal Dependencies format and released as one of the UD corpora, named [UD_French-Sequoia](https://github.com/UniversalDependencies/UD_French-Sequoia/blob/dev/README.md).
## Encoding of fixed expressions
In version **7.0** and earlier, fixed expressions are encoded as a single token, with the `_` symbol as a word separator.
Since version **8.0**, these expressions are represented by several tokens linked with `dep_cpd` relations.
For instance, the two figures below show the sentence `annodis.er_00106` with its annotation in Sequoia **7.0** and **8.0**.
![example 7.0](../S7.svg)
![example 8.0](../S8.svg)
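
The change of encoding can be sketched in Python as follows. This is only an illustration of the idea, not the conversion actually used to produce version 8.0: the function name `split_fixed_expression` is hypothetical, the example token is made up for illustration, and the choice of the first component as the head of every `dep_cpd` relation is an assumption.

```python
def split_fixed_expression(token: str):
    """Turn a 7.0-style token like 'a_b_c' into 8.0-style tokens and relations.

    Returns (tokens, relations), where each relation is a
    (head_index, label, dependent_index) triple with 0-based indices.
    Assumption: the first component governs all the others.
    """
    # Drop empty pieces so that a token that is just "_" is left untouched.
    tokens = [part for part in token.split("_") if part] or [token]
    relations = [(0, "dep_cpd", i) for i in range(1, len(tokens))]
    return tokens, relations


tokens, relations = split_fixed_expression("à_cause_de")
print(tokens)     # ['à', 'cause', 'de']
print(relations)  # [(0, 'dep_cpd', 1), (0, 'dep_cpd', 2)]
```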
talc2:
	scp ud.dom ${stalc2}/resources
% this file "uni-dep-tb-all-2.0.grs" contains the declarations of POS, CAT and relations used in universal dependency treebanks
% May 21, 2015 -> fr + ko
% ====================================================================================================
features {
% field 2 of CONLL format is interpreted as the "phon" feature
phon: *;
% field 3 of CONLL format is interpreted as the "lemma" feature
lemma: *;
% field 4 of CONLL format is interpreted as the "cat" feature
cat: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, SYM, VERB, X, _,
CONJ; % V1
% field 5 of CONLL format is interpreted as the "pos" feature, unused in tiger
pos: *;
% In Tiger, sentence identifiers are given through an identifier "id_pos" and transformed into a feature sentid on the first node in this CONLL description.
sentid: *;
% Features grepped from the English Corpus
Case: Nom, Acc, Abs, Erg, Dat, Gen, Voc, Loc, Ins, Abl, Ine, Ade, All, Par, Ill, Ela, Ess, Abe, Com, Tra,
"Acc,Dat", "Acc,Nom"; % Spanish
% to be completed
Definite: Def, Ind;
Degree: Abs, Cmp, Pos, Sup;
Gender: Fem, Masc, Neut, Com, "Masc,Neut", "Fem,Neut", "Fem,Masc",
Unsp; % UD_Portuguese
Mood: Imp, Ind, Cnd, Sub, Pot;
NumType: Card, Mult, Ord, Range, Frac, Sets, "Mult,Sets";
Number: Plur, Sing, Dual,"Plur,Sing",
Unsp; % UD_Portuguese
Person: 0, 1, 2, 3;
Poss: Yes;
PronType: Art, Dem, Int, Prs, Rel, Neg, Ind,
"Int,Rel", Tot, % needed for Polish treebank
Exc, % needed for Italian data
Emp, % Czech
Rcp; % needed for Finnish treebank
Reflex: Yes;
Tense: Past, Pres, Imp, Fut,
Pqp; % UD_Portuguese
VerbForm: Fin, Ger, Inf, Part, Trans, Vnoun, Conv, Sup;
Voice: Act, Pass, Mid;
Polarity: Neg, Pos;
% Polish
AdpType:Prep, Post, Voc, Comprep,
Preppron; % UD_Portuguese
Animacy: Anim, Inan, Nhum, Hum;
Aspect: Perf, Imp;
Variant: Short, Long;
Negative: Pos, Neg;
PrepCase: Npr, Pre;
Abbr: Yes;
Hyph: Yes;
Typo: Yes;
% Finnish
PartForm: Agt, Past, Pres, Neg;
Number__psor: Plur, Sing;
Person__psor: 1, 2, 3;
Gender__psor: Fem, Masc, "Masc,Neut";
Clitic: Kin, Ko, Han, Kaan, Pa, Ka, "Ko,S", S, "Han,Ko", "Pa,S", "Han,Pa", Yes;
Style: Coll, Arch, Rare, Slng, Vrnc, Expr, Vulg;
InfForm: 1,2,3;
Connegative: Yes;
Derivation: Minen, Inen, Llinen, Lainen, Sti, U, Ton, Ja, Vs, Ttaa, Tar, Ttain;
Foreign: Yes;
% Spanish
Polite: Form;
% Czech
NumForm: Digit, Word, Roman;
NameType: Sur, Giv, Pro, Com, Geo, Oth, Nat,
"Com,Pro", "Giv,Sur", "Geo,Sur", "Com,Sur", "Geo,Giv", "Com,Giv", "Pro,Sur", "Com,Nat", "Giv,Pro",
"Com,Geo", "Geo,Oth", "Com,Oth", "Geo,Pro", "Giv,Nat", "Nat,Sur", "Oth,Sur", "Giv,Oth",
"Geo,Giv,Sur", "Com,Giv,Sur", "Giv,Pro,Sur";
NumValue: 1, "1,2,3";
ConjType: Oper;
Diat: Demsuj; Intrinsimp: Yes;
}
% ====================================================================================================
labels {
% list of labels taken from the corpora [cat *.conll | cut -f 8 | sort -u]
acl, acl:relcl,
advcl,
advmod, advmod:emph,
amod,
appos,
aux, auxpass, aux:pass, % auxpass/V1 and aux:pass/V2
aux:caus, obl:caus, obj:caus, iobj:caus,
case,
cc, cc:preconj,
ccomp,
compound, compound:prt,
conj, conj:preconj,
cop, cop:own,
csubj, csubjpass, csubj:pass,
dep,
det, det:predet, det:poss,
discourse,
dislocated,
dobj, obj, % dobj/V1 and obj/V2
expl, expl:pv, expl:impers, expl:pass,
flat, flat:foreign, flat:name,
foreign,
goeswith,
iobj,
list,
mark,
mwe, fixed, % mwe/V1 and fixed/V2
name,
neg,
nmod, nmod:npmod, nmod:poss, nmod:tmod,
nsubj, nsubjpass, nsubj:pass, % nsubjpass/V1 and nsubj:pass/V2
nummod, nummod:gov, nummod:entity,
obl, obl:agent, obl:tmod, obl:npmod,
orphan,
parataxis,
punct,
remnant,
reparandum,
root,
vocative,
xcomp,
det:numgov, det:nummod, % Polish treebank
nmod:own, compound:nn, nsubj:cop, csubj:cop, nmod:gobj, xcomp:ds, nmod:gsubj, % Finnish
% Secondary dependency relations in the Finnish and English treebanks
E:acl,
E:acl:relcl,
E:advcl,
E:advmod,
E:amod,
E:appos,
E:aux,
E:aux:pass,
E:case,
E:cc,
E:ccomp,
E:compound,
E:compound:nn,
E:compound:prt,
E:conj,
E:cop,
E:csubj,
E:csubj:cop,
E:dep,
E:det,
E:discourse,
E:dobj,
E:expl,
E:fixed,
E:flat, E:flat:name, E:flat:foreign,
E:iobj,
E:mark,
E:name,
E:neg,
E:nmod,
E:nmod:gobj,
E:nmod:gsubj,
E:nmod:own,
E:nmod:poss,
E:nsubj,
E:nsubj:cop,
E:nsubj:pass,
E:nummod, E:nummod:gov, E:nummod:entity,
E:obj, E:obl:agent,
E:obl,
E:orphan,
E:parataxis,
E:punct,
E:root,
E:vocative,
E:xcomp, E:xcomp:a, E:xcomp:ds,
E:root, E:exroot, % UD_Russian-SynTagRus
FAIL_obj.cpl, FAIL_ats,
}
sequences { main {} }
{{ partial "head.html" . }}
<body class="{{ .Site.Params.themeColor }} {{if .Site.Params.layoutReverse}}layout-reverse{{end}}">
{{ partial "sidebar.html" . }}
<div class="content container">
<div class="post">
{{ .Content }}
</div>
</div>
</body>
</html>
{{ partial "head.html" . }}
<body class="{{ .Site.Params.themeColor }} {{if .Site.Params.layoutReverse}}layout-reverse{{end}}">
{{ partial "sidebar.html" . }}
<div class="content container">
<div class="post">
{{ range .Data.Pages }}
{{if eq .Title "index" }}
{{.Content}}
{{ end }}
{{ end }}
</div>
</div>
</body>
</html>
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml"{{with .Site.LanguageCode}} xml:lang="{{.}}" lang="{{.}}"{{end}}>
<head>
<link href="http://gmpg.org/xfn/11" rel="profile">
<meta http-equiv="content-type" content="text/html; charset=utf-8">
{{ .Hugo.Generator }}
<!-- Enable responsiveness on mobile devices-->
<meta name="viewport" content="width=device-width, initial-scale=1.0, maximum-scale=1">
{{ if .IsHome }}
<title>{{ .Site.Title }}</title>
{{ else }}
<title>{{ .Title }} &middot; {{ .Site.Title }}</title>
{{ end }}
<!-- CSS -->
<link rel="stylesheet" href="{{ .Site.BaseURL }}css/poole.css">
<link rel="stylesheet" href="{{ .Site.BaseURL }}css/syntax.css">
<link rel="stylesheet" href="{{ .Site.BaseURL }}css/hyde.css">
<link rel="stylesheet" href="{{ .Site.BaseURL }}css/main.css">
<link rel="stylesheet" href="https://fonts.googleapis.com/css?family=PT+Sans:400,400italic,700|Abril+Fatface">
<!-- Prism -->
<link rel="stylesheet" href="{{ .Site.BaseURL }}css/prism.css">
<script src="{{ .Site.BaseURL }}js/prism.js"></script>
<script src="{{ .Site.BaseURL }}js/prism_grew.js"></script>
<!-- <link rel="stylesheet" href="//cdnjs.cloudflare.com/ajax/libs/highlight.js/9.6.0/styles/default.min.css">
<script src="//cdnjs.cloudflare.com/ajax/libs/highlight.js/9.6.0/highlight.min.js"></script>
<script>hljs.initHighlightingOnLoad();</script> -->
<!-- Icons -->
<link rel="apple-touch-icon-precomposed" sizes="144x144" href="/apple-touch-icon-144-precomposed.png">
<link rel="shortcut icon" href="/favicon.png">
<!-- RSS -->
<link href="{{ .RSSLink }}" rel="alternate" type="application/rss+xml" title="{{ .Site.Title }}" />
</head>
<div class="sidebar">
<div class="container">
<div class="sidebar-about">
<a href="{{ .Site.BaseURL }}"><h1>{{ .Site.Title }}</h1></a>
<p class="lead">A French corpus with surface and deep syntactic annotations </p>
</div>
<ul class="sidebar-nav">
<hr/>
<li class="section">Information</li>
<li><a href="{{ .Site.BaseURL }}">Home</a></li>
<li><a href="{{ .Site.BaseURL }}versions">Versions</a></li>
<li><a href="{{ .Site.BaseURL }}contact">Contact</a></li>
<li><a href="{{ .Site.BaseURL }}licence">Licence (LGPL-LR)</a></li>
<li class="section">Downloads</li>
<li><a href="http://talc2.loria.fr/deep-sequoia/sequoia-8.1.tgz">8.1</a></li>
<li><a href="http://talc2.loria.fr/deep-sequoia/sequoia-8.0.tgz">8.0</a></li>
<li><a href="http://talc2.loria.fr/deep-sequoia/sequoia-7.0.tgz">7.0</a></li>
<li><a href="http://talc2.loria.fr/deep-sequoia/deep-sequoia-1.1/deep-sequoia-1.1.conll">1.1</a></li>
<li><a href="http://talc2.loria.fr/deep-sequoia/deep-sequoia-1.0/deep-sequoia-1.0.conll">1.0</a></li>
</ul>
</div>
</div>
<pre><code class="language-grew">{{$file := .Get "file"}}
{{- $file | readFile | safeHTML -}}
</code></pre>
<pre><code>{{$file := .Get "file"}}
{{- $file | readFile | safeHTML -}}
</code></pre>
figures:
	find . -name "*.dot" -type f -print | sed "s/.dot$$//" | xargs -I {} make "{}.svg"
	find . -name "*.dep" -type f -print | sed "s/.dep$$//" | xargs -I {} make "{}.svg"
	find . -name "*.conll" -type f -print | sed "s/.conll$$//" | xargs -I {} make "{}.svg"
clean:
	find . -name "*.dot" -type f -print | sed "s/.dot$$//" | xargs -I {} rm -f "{}.svg"
	find . -name "*.dep" -type f -print | sed "s/.dep$$//" | xargs -I {} rm -f "{}.svg"
	find . -name "*.conll" -type f -print | sed "s/.conll$$//" | xargs -I {} rm -f "{}.svg"
.SUFFIXES: .dot .svg .dep .conll
.dot.svg:
	dot -Tsvg $< -o $@
.dep.svg:
	dep2pict $< $@
.conll.svg:
	dep2pict $< $@
hr {
margin-top: 6pt;
margin-bottom: 6pt;
border-color: #b5cfda;
}
li {
margin-left: 10pt;
}
.section {
margin-left: 0pt;
font-size: 20pt;
}
pre {
font-family: Monaco, "Courier New", monospace;
}
.sidebar {
overflow-y: auto;
}
/* http://prismjs.com/download.html?themes=prism&languages=markup+css+clike+javascript+abap+actionscript+ada+apacheconf+apl+applescript+asciidoc+aspnet+autoit+autohotkey+bash+basic+batch+c+brainfuck+bro+bison+csharp+cpp+coffeescript+ruby+css-extras+d+dart+django+diff+docker+eiffel+elixir+erlang+fsharp+fortran+gherkin+git+glsl+go+graphql+groovy+haml+handlebars+haskell+haxe+http+icon+inform7+ini+j+jade+java+jolie+json+julia+keyman+kotlin+latex+less+livescript+lolcode+lua+makefile+markdown+matlab+mel+mizar+monkey+nasm+nginx+nim+nix+nsis+objectivec+ocaml+oz+parigp+parser+pascal+perl+php+php-extras+powershell+processing+prolog+properties+protobuf+puppet+pure+python+q+qore+r+jsx+reason+rest+rip+roboconf+crystal+rust+sas+sass+scss+scala+scheme+smalltalk+smarty+sql+stylus+swift+tcl+textile+twig+typescript+verilog+vhdl+vim+wiki+xojo+yaml&plugins=line-numbers+show-invisibles+toolbar+show-language */
/**
* prism.js default theme for JavaScript, CSS and HTML
* Based on dabblet (http://dabblet.com)
* @author Lea Verou
*/
code[class*="language-"],
pre[class*="language-"] {
color: black;
background: none;
text-shadow: 0 1px white;
font-family: Consolas, Monaco, 'Andale Mono', 'Ubuntu Mono', monospace;
text-align: left;
white-space: pre;
word-spacing: normal;
word-break: normal;
word-wrap: normal;
line-height: 1.5;
-moz-tab-size: 4;
-o-tab-size: 4;
tab-size: 4;
-webkit-hyphens: none;
-moz-hyphens: none;
-ms-hyphens: none;
hyphens: none;
}
pre[class*="language-"]::-moz-selection, pre[class*="language-"] ::-moz-selection,
code[class*="language-"]::-moz-selection, code[class*="language-"] ::-moz-selection {
text-shadow: none;
background: #b3d4fc;
}
pre[class*="language-"]::selection, pre[class*="language-"] ::selection,
code[class*="language-"]::selection, code[class*="language-"] ::selection {
text-shadow: none;
background: #b3d4fc;
}
@media print {
code[class*="language-"],
pre[class*="language-"] {
text-shadow: none;
}
}
/* Code blocks */
pre[class*="language-"] {
padding: 1em;
margin: .5em 0;
overflow: auto;
}
:not(pre) > code[class*="language-"],
pre[class*="language-"] {
background: #f5f2f0;
}
/* Inline code */
:not(pre) > code[class*="language-"] {
padding: .1em;
border-radius: .3em;
white-space: normal;
}
.token.prolog,
.token.doctype,
.token.cdata {
color: slategray;
}
.token.comment {
color: #e34;
}
.token.punctuation {
color: #999;
}
/* added for grew */
.token.command {
color: #905;
}
/* added for grew */
.token.strat {
color: #D63;
}
.namespace {
opacity: .7;
}
.token.property,
.token.tag,
.token.boolean,
.token.number,
.token.constant,
.token.symbol,
.token.deleted {
color: #905;
}
.token.selector,
.token.attr-name,
.token.string,
.token.char,
.token.builtin,
.token.inserted {
color: #690;
}
.token.operator,
.token.entity,
.token.url,
.language-css .token.string,
.style .token.string {
/*color: #a67f59;*/
color: #090;
background: hsla(0, 0%, 100%, .5);
}
.token.atrule,
.token.attr-value,
.token.keyword {
color: #07a;
}
.token.function {
color: #DD4A68;
}
.token.regex,
.token.important,
.token.variable {
color: #e90;
}
.token.important,
.token.bold {
font-weight: bold;
}
.token.italic {
font-style: italic;
}
.token.entity {
cursor: help;
}
pre.line-numbers {
position: relative;
padding-left: 3.8em;
counter-reset: linenumber;
}
pre.line-numbers > code {
position: relative;
}
.line-numbers .line-numbers-rows {
position: absolute;
pointer-events: none;
top: 0;
font-size: 100%;
left: -3.8em;
width: 3em; /* works for line-numbers below 1000 lines */
letter-spacing: -1px;
border-right: 1px solid #999;
-webkit-user-select: none;
-moz-user-select: none;
-ms-user-select: none;
user-select: none;
}
.line-numbers-rows > span {
pointer-events: none;
display: block;
counter-increment: linenumber;
}
.line-numbers-rows > span:before {
content: counter(linenumber);
color: #999;
display: block;
padding-right: 0.8em;
text-align: right;
}
.token.tab:not(:empty),
.token.cr,
.token.lf,
.token.space {
position: relative;
}
.token.tab:not(:empty):before,
.token.cr:before,
.token.lf:before,
.token.space:before {
color: hsl(24, 20%, 85%);
position: absolute;
}
.token.tab:not(:empty):before {
content: '\21E5';
}
.token.cr:before {
content: '\240D';
}
.token.crlf:before {
content: '\240D\240A';
}
.token.lf:before {
content: '\240A';
}
.token.space:before {
content: '\00B7';
}
pre.code-toolbar {
position: relative;
}
pre.code-toolbar > .toolbar {
position: absolute;
top: .3em;
right: .2em;
transition: opacity 0.3s ease-in-out;
opacity: 0;
}
pre.code-toolbar:hover > .toolbar {
opacity: 1;
}
pre.code-toolbar > .toolbar .toolbar-item {
display: inline-block;
}
pre.code-toolbar > .toolbar a {
cursor: pointer;
}
pre.code-toolbar > .toolbar button {
background: none;
border: 0;
color: inherit;
font: inherit;
line-height: normal;
overflow: visible;
padding: 0;
-webkit-user-select: none; /* for button */
-moz-user-select: none;
-ms-user-select: none;
}