First Matrice project : analyse a simple text and present the outcome with simple terms
In the event of COP27, I wanted to analyse a text on the topic of environment. Remembering Greta Tunberg's direct speeches, I found interesting to analyse words frequencies and make a wordcloud out of it. Then compare 'How dare you' speech, happening in 2019 at the U.N.'s Climate Action Summit and the 'Blah blah blah' speech made two years later in Milan, few month before COP26.
After getting rid of unwanted spaces and punctuation, words were uniformed (lower case), then stored in a list without stop words (keeping 'we', 'you', 'i', 'our', 'your' for the analysis).
Finally, a function taking this list of tokens as a parameter was counting all words frequencies, stores as key-value in a dictionnary.
As a visual support, the data were presented in a horizontal histogram. Different colors were used to separate frequencies as themes seemed to appear within each group.
In the fun of testing visual support, comparison of the two speeches with simple wordcloud gave us :