1 file changed: +4 -5 lines changed
@@ -171,21 +171,20 @@ contents of the output file will look like:
---------------------------
In order to correctly handle all possible characters in the tweets
- (think of chinese or arabic characters), the output is saved as utf-8
+ (think of Japanese or Arabic characters), the output is saved as utf-8
encoded bytes. That is why you could see text like
- "":raw-latex: `\u 3`0b1:raw-latex: `\u 3`0f3:raw-latex: `\u 3`055:raw-latex: `\u 3`07e:raw-latex: `\u 3`0fe
- ..." in the output file.
+ "\u30b1 \u30f3 \u3055 \u307e \u30fe ..." in the output file.
What you should do is open the file with the proper encoding:
.. figure:: https://user-images.githubusercontent.com/4409108/30702318-f05bc196-9eec-11e7-8234-a07aabec294f.PNG
- :alt: Example of output with chinese characters
- Example of output with chinese characters
+ Example of output with Japanese characters
TO DO
=====
+ - Twitterscraper can not retrieve retweets.
- Add caching potentially? Would be nice to be able to resume scraping
if something goes wrong and have half of the data of a request cached
or so.
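
The encoding advice in the hunk above ("open the file with the proper encoding") can be illustrated with a minimal Python sketch. It assumes the output file is JSON written with ASCII escaping (one way such ``\u30b1``-style sequences can end up in a file) and that each record carries a ``text`` field. The sample record, the ``load_tweets`` helper, and the file name are hypothetical and are not taken from this diff.

```python
import json
import os
import tempfile

# Hypothetical sample record; the "text" field name is an assumption.
sample = [{"text": "ケンさま"}]


def load_tweets(path):
    """Open the file as UTF-8 and let the JSON parser decode escapes."""
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)


# Write the sample the way json.dump does by default (ensure_ascii=True),
# which stores non-ASCII characters as backslash-u escapes.
path = os.path.join(tempfile.mkdtemp(), "tweets.json")
with open(path, "w", encoding="utf-8") as fh:
    json.dump(sample, fh)

# Reading the raw text shows the escaped form a plain editor would show.
with open(path, encoding="utf-8") as fh:
    raw = fh.read()

# Parsing the file restores the original characters.
tweets = load_tweets(path)

print(raw)
print(tweets[0]["text"])
```

Viewing the raw file shows the escaped sequences, while parsing it as JSON with an explicit ``encoding="utf-8"`` yields the readable characters.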