Skip to content

Commit b2e02b5

Browse files
committed
updated readme
1 parent 82aefcc commit b2e02b5

File tree

1 file changed

+11
-11
lines changed

1 file changed

+11
-11
lines changed

README.md

Lines changed: 11 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
# Crisis Consolidated Dataset
1+
# Crisis Dataset for Benchmarks
22

33

44

@@ -12,16 +12,16 @@ Before trying to start running any script, please download the dataset first. Mo
1212
Assuming that your current working directory is YOUR_PATH/crisis_datasets_benchmarks
1313
```
1414
tar -xvf crisis_datasets_benchmarks_v1.0.tar.gz
15-
mv crisis_datasets_benchmarks_v1.0/data_image $PWD/
15+
mv crisis_datasets_benchmarks_v1.0 YOUR_PATH/crisis_datasets_benchmarks
1616
```
1717

18-
**Different Versions of the dataset**
19-
* data/all_data_en
20-
* data/individual_data_en
21-
* data/event_aware_en
22-
* data/class_label_mapped
23-
* data/initial_filtering
24-
* data/data_split_all_lang
18+
**Different versions of the dataset**
19+
* data/all_data_en -- all combined english dataset used for the experiments
20+
* data/individual_data_en -- consists of data used for the experiments as individual data source such as crisisnlp and crisislex
21+
* data/event_aware_en -- all combined english dataset with event tag (fire, earthquake, flood, ...) are tagged
22+
* data/data_split_all_lang -- all combined dataset with their train/dev and test splits
23+
* data/initial_filtering -- all combined dataset duplicate removed data
24+
* data/class_label_mapped -- all combined dataset initial set of dataset where class label mapped
2525

2626

2727

@@ -172,14 +172,14 @@ nohup bash bin/bert_multiclass.sh hum-event-aware data/event_aware_en/crisis_con
172172
* *Firoj Alam, Hassan Sajjad, Muhammad Imran and Ferda Ofli, "Standardizing and Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing", arxiv. 2020.*
173173

174174
```bib
175-
@inproceedings{crisisConsolidated2020,
175+
@inproceedings{firojalamcrisisbenchmark2020,
176176
Author = {Firoj Alam and Hassan Sajjad and Muhammad Imran and Ferda Ofli},
177177
Title = {Standardizing and Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing},
178178
Year = {2020}
179179
}
180180
```
181181

182-
**and the follwing associated papers**
182+
**and the following associated papers**
183183

184184
* *Muhammad Imran, Prasenjit Mitra, Carlos Castillo. Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages. In Proceedings of the 10th Language Resources and Evaluation Conference (LREC), 2016, Slovenia.*
185185
* *A. Olteanu, S. Vieweg, C. Castillo. 2015. What to Expect When the Unexpected Happens: Social Media Communications Across Crises. In Proceedings of the ACM 2015 Conference on Computer Supported Cooperative Work and Social Computing (CSCW '15). ACM, Vancouver, BC, Canada.*

0 commit comments

Comments
 (0)