You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+5-5
Original file line number
Diff line number
Diff line change
@@ -20,7 +20,7 @@ The first online catalogue for Arabic NLP datasets. This catalogue contains more
20
20
-`License` license of the dataset
21
21
-`Year` year of the publishing the dataset/paper
22
22
-`Language` ar or multilingual
23
-
-`Dialect` region ar-LEV: (Arabic(Levant)), country ar-EGY: (Arabic (Egypt)) or type ar-MSA: (Arabic (Modern Standard Arabic))
23
+
-`Dialect` region Levant, country ar-EGY: (Arabic (Egypt)) or type Modern Standard Arabic
24
24
-`Domain` social media, news articles, reviews, commentary, books, transcribed audio or other
25
25
-`Form` text, audio or sign language
26
26
-`Collection style` crawling, crawling and annotation (translation), crawling and annotation (other), machine translation, human translation, human curation or other
@@ -72,7 +72,7 @@ which gives the following output
72
72
'Cost': '',
73
73
'Derived From': '',
74
74
'Description': 'the first Levantine Dialect Corpus (SDC) covering data from the four dialects spoken in Palestine, Jordan, Lebanon and Syria.',
75
-
'Dialect': 'ar-LEV: (Arabic(Levant))',
75
+
'Dialect': 'Levant',
76
76
'Domain': 'social media',
77
77
'Ethical Risks': 'Medium',
78
78
'Form': 'text',
@@ -85,19 +85,19 @@ which gives the following output
85
85
'Paper Title': 'Shami: A Corpus of Levantine Arabic Dialects',
0 commit comments