Datasets in this knowledge domain stand out with a significantly larger path length of traversal relations (cf. measure diameter). The modeling strategy here seems fairly concise, resulting in a low average number of types and outgoing predicates/edges per subject, which is reflected by the measures mean out degree and max partial out degree.
All datasets are clustered into three groups:
- universal-dependencies-treebank-... (63 datasets)
- apertium-rdf-... (22 datasets), and
- other (37 datasets)
Name |
# of Edges |
panlex |
291,314,466 |
getty-aat |
20,196,996 |
semanticquran |
15,741,399 |
xwn |
12,798,534 |
saldom-rdf |
8,349,115 |
lemonwiktionary |
3,501,697 |
olac |
3,133,455 |
ids |
1,984,385 |
catalan-eurowordnet-lemon-lexicon-3-0 |
1,841,180 |
jrc-names-mlode |
1,712,143 |
associations |
1,674,376 |
saldo-rdf |
1,382,823 |
basque-eurowordnet-lemon-lexicon-3-0 |
1,215,583 |
universal-dependencies-treebank-russian-syntagrus |
952,493 |
apertium-rdf-es-ast |
825,547 |
apertium-rdf-en-ca |
759,613 |
galician-eurowordnet-lemon-lexicon-3-0 |
734,979 |
apertium-rdf-es-ca |
730,513 |
lexvo |
726,674 |
apertium-rdf-eo-fr |
726,288 |
universal-dependencies-treebank-japanese-ktc |
641,336 |
apertium-rdf-eo-en |
617,779 |
universal-dependencies-treebank-catalan |
595,302 |
universal-dependencies-treebank-czech |
580,968 |
apertium-rdf-en-es |
576,329 |
universal-dependencies-treebank-spanish-ancora |
534,629 |
apertium-rdf-fr-es |
495,621 |
parole-simple-lexinfo-ontology-lexicons |
441,203 |
apertium-rdf-eo-ca |
426,308 |
apertium-rdf-en-gl |
425,124 |
asit |
422,260 |
apertium-rdf-es-ro |
400,373 |
universal-dependencies-treebank-hindi |
385,655 |
apertium-rdf-eo-es |
380,205 |
simple |
372,294 |
iwn |
368,442 |
apertium-rdf-oc-ca |
346,353 |
swefn-rdf |
339,385 |
universal-dependencies-treebank-arabic |
319,776 |
apertium-rdf-oc-es |
317,169 |
universal-dependencies-treebank-galician |
298,144 |
apertium-rdf-es-pt |
279,252 |
universal-dependencies-treebank-romanian |
269,631 |
apertium-rdf-eu-en |
265,473 |
apertium-rdf-eu-es |
262,343 |
universal-dependencies-treebank-norwegian |
262,249 |
universal-dependencies-treebank-english |
247,670 |
universal-dependencies-treebank-ancient-greek |
244,725 |
universal-dependencies-treebank-portuguese-br |
237,097 |
pdev-lemon |
236,360 |
pdevlemon |
236,360 |
apertium-rdf-pt-gl |
234,072 |
gemet-annotated |
231,297 |
universal-dependencies-treebank-estonian |
230,731 |
universal-dependencies-treebank-portuguese-bosque |
218,637 |
universal-dependencies-treebank-basque |
211,684 |
apertium-rdf-es-gl |
206,291 |
universal-dependencies-treebank-swedish |
197,505 |
apertium-rdf-ca-it |
180,858 |
universal-dependencies-treebank-ancient-greek-proiel |
178,945 |
apertium-rdf-pt-ca |
163,156 |
universal-dependencies-treebank-finnish-ftb |
161,090 |
universal-dependencies-treebank-german |
156,450 |
de-gaap-ontology-lexicon |
153,910 |
de-gaap-ontology-lexicon |
153,910 |
universal-dependencies-treebank-bulgarian |
152,113 |
universal-dependencies-treebank-slovenian |
152,041 |
apertium-rdf-fr-ca |
152,009 |
universal-dependencies-treebank-persian |
149,473 |
universal-dependencies-treebank-latin-proiel |
145,572 |
germlex |
144,203 |
universal-dependencies-treebank-slovak |
128,165 |
multext-east |
127,651 |
universal-dependencies-treebank-hebrew |
116,131 |
universal-dependencies-treebank-czech-cac |
110,830 |
universal-dependencies-treebank-italian |
110,208 |
universal-dependencies-treebank-chinese |
97,507 |
universal-dependencies-treebank-russian |
91,389 |
bll-thesaurus |
90,181 |
universal-dependencies-treebank-finnish |
86,868 |
universal-dependencies-treebank-turkish |
84,873 |
universal-dependencies-treebank-indonesian |
82,854 |
apertium-rdf-es-an |
72,004 |
universal-dependencies-treebank-polish |
70,610 |
universal-dependencies-treebank-spanish |
69,082 |
universal-dependencies-treebank-english-esl |
67,636 |
universal-dependencies-treebank-english-lines |
66,341 |
universal-dependencies-treebank-swedish-lines |
64,178 |
universal-dependencies-treebank-latin-ittb |
63,788 |
universal-dependencies-treebank-portuguese |
63,549 |
universal-dependencies-treebank-french |
61,160 |
universal-dependencies-treebank-dutch |
58,542 |
universal-dependencies-treebank-croatian |
55,726 |
universal-dependencies-treebank-greek |
55,063 |
universal-dependencies-treebank-vietnamese |
53,112 |
universal-dependencies-treebank-danish |
51,377 |
clld-sails |
50,950 |
universal-dependencies-treebank-gothic |
49,589 |
universal-dependencies-treebank-old-church-slavonic |
49,577 |
universal-dependencies-treebank-latin |
47,094 |
universal-dependencies-treebank-czech-cltt |
41,517 |
universal-dependencies-treebank-latvian |
40,263 |
universal-dependencies-treebank-dutch-lassysmall |
39,064 |
universal-dependencies-treebank-irish |
37,208 |
universal-dependencies-treebank-hungarian |
37,181 |
universal-dependencies-treebank-slovenian-sst |
31,707 |
sentiws |
30,339 |
universal-dependencies-treebank-galician-treegal |
25,241 |
universal-dependencies-treebank-japanese |
22,372 |
universal-dependencies-treebank-tamil |
22,056 |
mlsa |
20,898 |
geodomainwn |
20,557 |
news-100-nif-ner-corpus |
12,329 |
rss-500-nif-ner-corpus |
10,078 |
universal-dependencies-treebank-uyghur |
9,492 |
reuters-128-nif-ner-corpus |
7,007 |
universal-dependencies-treebank-kazakh |
6,443 |
universal-dependencies-treebank-coptic |
3,856 |
kore-50-nif-ner-corpus |
3,201 |
universal-dependencies-treebank-sanskrit |
2,418 |
universal-dependencies-treebank-ukrainian |
1,535 |
isocat |
808 |