Skip to content

Repo to parse the WiFiNE dataset mentioned in the paper: Transforming Wikipedia into a Large-Scale Fine-Grained Entity Type Corpus by Abbas Ghaddar, Philippe Langlais

License

Notifications You must be signed in to change notification settings

whoiswillma/wifineparse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

 1. Navigate to this link: http://rali.iro.umontreal.ca/rali/en/wifiner-wikipedia-for-et
 2. Click on 'Download most of the resources here' at the bottom of the page
 3. Download and extract Documents.tar.bz2 and FineNE.tar.bz2 from the [Google Drive Link](https://drive.google.com/drive/folders/0B6SOo3wyWh6wdGZhYkZDUHRTdkU?resourcekey=0-MIh3gE3MX-ceZlaRKlmxKw) (valid as of writing this) 
 4. Folder structure should look something like:
    
```
$ find . -type d
.
./FineNE
./FineNE/FineNoNE
./FineNE/FineEntity
./FineNE/NumericNE
./Documents
./Documents/Documents
```

About

Repo to parse the WiFiNE dataset mentioned in the paper: Transforming Wikipedia into a Large-Scale Fine-Grained Entity Type Corpus by Abbas Ghaddar, Philippe Langlais

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages