This project scrapes specific web pages and converts the content into HTML files, organizing them into a knowledge base. The project uses Selenium for web scraping and BeautifulSoup for HTML parsing.
Prerequisites:

- Python 3.x
- pip (Python package installer)
- Google Chrome browser
- ChromeDriver
Installation:

- Clone the repository:

  ```sh
  git clone https://github.com/Glebuar/KnowledgeBaseBuilder.git
  cd knowledge-base-builder
  ```
- Create a virtual environment and activate it:

  ```sh
  python -m venv .venv
  source .venv/bin/activate  # On Windows use `.venv\Scripts\activate`
  ```
- Install the dependencies:

  ```sh
  pip install -r requirements.txt
  ```
- Download ChromeDriver:
  - Download the version of ChromeDriver that matches your installed Chrome browser from the official ChromeDriver downloads page.
  - Place the `chromedriver` executable in a known location.
- Update `config.json`:
  - Make sure the `config.json` file is present in the root directory with the correct structure, and update `chrome_driver_path` to the path where you placed the `chromedriver` executable.

  Example `config.json`:

  ```json
  {
    "chrome_driver_path": "path/to/chromedriver",
    "urls": [
      {
        "url": "https://example.com/page1",
        "children": []
      },
      {
        "url": "https://example.com/page2",
        "children": [
          {
            "url": "https://example.com/page2-1",
            "children": []
          }
        ]
      }
    ]
  }
  ```
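The `urls` field is a tree: each entry has a `url` and a list of `children` with the same shape. As a rough sketch of how such a config could be consumed (the function name `flatten_urls` is an assumption for illustration, not the project's actual code), a recursive walk collects every URL depth-first:

```python
# Sketch (assumption, not project code): load the example config and
# flatten the nested "urls" tree into a depth-first list of page URLs.
import json

def flatten_urls(entries):
    """Collect "url" values from [{"url": ..., "children": [...]}] entries, depth-first."""
    urls = []
    for entry in entries:
        urls.append(entry["url"])
        urls.extend(flatten_urls(entry.get("children", [])))
    return urls

config = json.loads("""
{
  "chrome_driver_path": "path/to/chromedriver",
  "urls": [
    {"url": "https://example.com/page1", "children": []},
    {"url": "https://example.com/page2", "children": [
      {"url": "https://example.com/page2-1", "children": []}
    ]}
  ]
}
""")

print(flatten_urls(config["urls"]))
# → ['https://example.com/page1', 'https://example.com/page2', 'https://example.com/page2-1']
```

Keeping the tree shape (rather than a flat list) lets child pages be saved under their parent's folder in the generated knowledge base.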
- Run the script:

  ```sh
  python main.py
  ```