Knowledge Base Builder

This project scrapes specific web pages and converts the content into HTML files, organizing them into a knowledge base. The project uses Selenium for web scraping and BeautifulSoup for HTML parsing.

Prerequisites

Python 3.x
pip (Python package installer)
Google Chrome browser
ChromeDriver

Installation

Clone the repository:

git clone https://github.com/Glebuar/KnowledgeBaseBuilder.git
cd knowledge-base-builder

Create a virtual environment and activate it:

python -m venv .venv
source .venv/bin/activate  # On Windows use `.venv\Scripts\activate`

Install the dependencies:
```
pip install -r requirements.txt
```
Download ChromeDriver:
- Download the correct version of ChromeDriver that matches your Chrome browser from here.
- Place the chromedriver executable in a known location.

Update config.json:

Make sure the config.json file is present in the root directory with the correct structure and update the chrome_driver_path to the path where you placed the chromedriver executable.

Example config.json:

{
    "chrome_driver_path": "path/to/chromedriver",
    "urls": [
        {
            "url": "https://example.com/page1",
            "children": []
        },
        {
            "url": "https://example.com/page2",
            "children": [
                {
                    "url": "https://example.com/page2-1",
                    "children": []
                }
            ]
        }
    ]
}

Running the Script

python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.github/workflows		.github/workflows
.gitignore		.gitignore
README.md		README.md
config.json		config.json
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Knowledge Base Builder

Prerequisites

Installation

Running the Script

About

Releases

Packages

Languages

Glebuar/KnowledgeBaseBuilder

Folders and files

Latest commit

History

Repository files navigation

Knowledge Base Builder

Prerequisites

Installation

Running the Script

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages