A bot for collecting Earnings Announcement Transcripts from SeekingAlpha.com
DO NOT ABUSE THIS SCRAPER.
This scraper was developed for educational purposes only and serves no other purposes. SeekingAlpha TOS prohibit scraping and I make no representation that your account won't be banned if you use this program. Furthermore, should I be contacted by SeekingAlpha with a request to remove this repo, I will do so immediately.
This scraper collects the earnings announcements of a specific company (identified with ticker
or t
).
First, make sure that you're using Python 3.7+.
- Clone or download this repository.
- Run
pip install -r requirements.txt
inside this repo. Consider doing this inside of a Python virtual environment. - Create a
secret.yaml
file containing the keysusername
andpassword
with your SeekingAlpha login information. You should consider creating a dummy SeekingAlpha account. - It is highly recommended (but not required) that you use a virtual environment for your project.
usage: Scraper.py [-h] [-t ticker]
optional arguments:
-h, --help show this help message and exit
-t TICKER, --ticker TICKER Ticker Symbol of the desired company.
Run the script as follows, taking Costco (Ticker Symbol: COST) as an example.
python Scraper.py -t COST
The outputs will be saved in Outputs/Transcripts
as HTML files.
Each file will be saved in the following format:
{ticker}_transcripts_{article_title}.html
where ticker
is the ticker symbol and article_title
is the title of the article.
- At this moment, this scraper does not pass any CAPTCH tests. You should probably do it manually in the session that you start and try again.
- After the first time you sing in, you will not need to sing in again because
the
chromedriver
optionuser-data-dir
is added. - This module uses
webdriver-manager
, which handleschromedriver
versions automatically. - Lastly and importantly, this scraper is slow by design. As mentioned before, this is for educational purposes only and nothing more.