Science Data Service - A Science News Aggregator

⚠️ Work in progress.

A content aggregation and social engagement platform focused on science articles.

Features

Automated web scraping of science news websites
Article translation and summarization using OpenAI GPT API
User authentication and commenting system
Future forum for science discussions

Getting Started

Prerequisites

Python 3.8+
MongoDB
Node.js
Docker (optional but recommended for easy setup)

Setup

Clone the repository

git clone https://github.com/nbursa/science-data-service.git
cd science-data-service

Install dependencies

pip install -r requirements.txt

Start services with Docker Compose (optional)

If you prefer to use Docker, you can start MongoDB and Redis using Docker Compose:

docker-compose up -d

Running the Application

Run the Uvicorn server

uvicorn app.main:app --host 0.0.0.0 --port 8000 --reload

Start Celery worker

celery -A scraping.celery_tasks worker --loglevel=info

Start Celery beat

celery -A scraping.celery_tasks beat --loglevel=info

Scrape articles

python -m scrapy crawl <spider_name>

Usage

Accessing the Application

After starting the Uvicorn server, you can access the application at:

http://localhost:8000

API Documentation

API documentation is available at:

http://localhost:8000/docs

Environment Variables

Make sure to set the necessary environment variables for connecting to MongoDB and Redis, as well as for the OpenAI API key for article translation and summarization.

Contributing

Contributions are welcome! Please open an issue or submit a pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.idea		.idea
.vscode		.vscode
app		app
scraping		scraping
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
docker-compose.yml		docker-compose.yml
requirements.txt		requirements.txt
run_translate_task.py		run_translate_task.py
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Science Data Service - A Science News Aggregator

Features

Getting Started

Prerequisites

Setup

Clone the repository

Install dependencies

Start services with Docker Compose (optional)

Running the Application

Run the Uvicorn server

Start Celery worker

Start Celery beat

Scrape articles

Usage

Accessing the Application

API Documentation

Environment Variables

Contributing

About

Releases

Packages

Languages

nbursa/science-data-service

Folders and files

Latest commit

History

Repository files navigation

Science Data Service - A Science News Aggregator

Features

Getting Started

Prerequisites

Setup

Clone the repository

Install dependencies

Start services with Docker Compose (optional)

Running the Application

Run the Uvicorn server

Start Celery worker

Start Celery beat

Scrape articles

Usage

Accessing the Application

API Documentation

Environment Variables

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages