The automated news feedback system uses web crawlers to create a dataset of news articles, scrape article URLs, and optical character recognition technology to extract content from e-papers. The system is built with the Streamlit framework to generate graphs using the Plotly library for visualization of scraped data.
Additionally, the system includes a chatbot (powered by Gemini API) that provides perspective on the latest news for users and a Chrome extension for real-time fake news detection.
Data Acquisition
Web Scraping: Utilizes the BeautifulSoup
library along with the httpx
library to asynchronously scrape news articles from various news sources.
File Scraping: Utilizes the PyTesseract
library for image-to-text conversion and the PyMuPDF
library for PDF-to-text conversion.
Data Analysis
Gemini API: Provides sentiment analysis, media analysis, and fake news detection services.
Database Storage: Utilizes the MongoDB
database to store responses from the Gemini API.
Data Presentation
User Interface: Utilizes the Streamlit
framework to generate graphs using the Plotly
library for visualization of scraped data.
Chrome Extension: Provides real-time fake news detection on news articles (Manifest V3).
Follow these steps to set up and run the GodsEye software on your local machine, or you can watch the demo video.
-
Clone the repository to your local machine:
git clone https://github.com/areebahmeddd/GodsEye.git
-
Navigate to the project directory:
cd GodsEye
-
Create a virtual environment (optional but recommended):
python -m venv .venv
-
Activate the virtual environment:
- Windows:
.venv\Scripts\activate
- macOS and Linux:
source .venv/bin/activate
- Windows:
-
Install the project dependencies:
pip install -r requirements.txt
-
Set up the configuration files:
- Create a
secrets.toml
file in the.streamlit
directory with the following template:
[admin] email = "[email protected]" app_pass = "your_app_password" [gemini] api_key = "your_gemini_api_key"
- Create a
-
Set up the Chrome extension:
- Open Chrome and go to
chrome://extensions
. - Enable "Developer mode" (top right corner).
- Click "Load unpacked" (top left corner).
- Select the
browser-extension
folder in the GodsEye repository.
- Open Chrome and go to
-
Launch the application by clicking the run button in the top right of VS Code, or execute:
python python-app/app.py
-
Using the Browser Extension:
-
Navigate to any of the following supported websites:
NOTE: The subdomain https://www.ndtvprofit.com is currently unsupported due to a different HTML structure.
-
Select and open any article. The browser extension will be displayed on the right side of the screen. Click on the extension icon to access detailed information.
-
-
For Contributors:
- To apply changes made to the browser extension, visit
chrome://extensions
, click the "Update" button at the top left, and reload the article page to see the updates.
- To apply changes made to the browser extension, visit
This project is licensed under the Apache License 2.0.