Skip to content
Change the repository type filter

All

    Repositories list

    • crawl4ai

      Public
      🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper
      Python
      Apache License 2.0
      1.7k000Updated Jan 8, 2025Jan 8, 2025
    • OpenRLHF

      Public
      An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
      Python
      Apache License 2.0
      348000Updated Jan 8, 2025Jan 8, 2025
    • maxun

      Public
      🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta]
      TypeScript
      GNU Affero General Public License v3.0
      537000Updated Jan 4, 2025Jan 4, 2025
    • PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker
      Python
      GNU Affero General Public License v3.0
      1.1k000Updated Dec 30, 2024Dec 30, 2024
    • Open-Sora

      Public
      Open-Sora: Democratizing Efficient Video Production for All
      Python
      Apache License 2.0
      2.3k000Updated Dec 27, 2024Dec 27, 2024
    • flink

      Public
      基于Apache Flink做实时数仓和ETL、实时监控报警等
      Java
      Apache License 2.0
      13k000Updated Dec 17, 2024Dec 17, 2024
    • hudi

      Public
      Upserts, Deletes And Incremental Processing on Big Data.
      Java
      Apache License 2.0
      2.4k000Updated Dec 17, 2024Dec 17, 2024
    • ClickHouse Java Clients & JDBC Driver
      Java
      Apache License 2.0
      546000Updated Dec 17, 2024Dec 17, 2024
    • flink-cdc

      Public
      Flink CDC is a streaming data integration tool
      Java
      Apache License 2.0
      2k000Updated Dec 17, 2024Dec 17, 2024
    • Label Studio is a multi-type data labeling and annotation tool with standardized output format
      JavaScript
      Apache License 2.0
      2.5k000Updated Dec 16, 2024Dec 16, 2024
    • Effortless data labeling with AI support from Segment Anything and other awesome models.
      Python
      GNU General Public License v3.0
      522000Updated Dec 16, 2024Dec 16, 2024
    • Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
      Python
      Apache License 2.0
      4.6k000Updated Dec 14, 2024Dec 14, 2024
    • MinerU

      Public
      A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
      Python
      GNU Affero General Public License v3.0
      1.8k000Updated Dec 13, 2024Dec 13, 2024
    • zero_nlp

      Public
      中文nlp解决方案(大模型、数据、模型、训练、推理)
      Jupyter Notebook
      MIT License
      379000Updated Dec 10, 2024Dec 10, 2024
    • TRELLIS

      Public
      Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
      Python
      MIT License
      408000Updated Dec 7, 2024Dec 7, 2024
    • g1

      Public
      g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
      Python
      MIT License
      376000Updated Dec 6, 2024Dec 6, 2024
    • Retrieval and Retrieval-augmented LLMs
      Python
      MIT License
      599000Updated Dec 6, 2024Dec 6, 2024
    • Let your Claude able to think
      TypeScript
      MIT License
      1.5k000Updated Dec 3, 2024Dec 3, 2024
    • xxl-job

      Public
      A distributed task scheduling framework.(分布式任务调度平台XXL-JOB)
      Java
      GNU General Public License v3.0
      11k000Updated Nov 30, 2024Nov 30, 2024
    • sl-chat

      Public
      基于Java + webscoket跟netty做的分布式IM中心
      Java
      Apache License 2.0
      1000Updated Nov 8, 2024Nov 8, 2024
    • Java
      0000Updated Oct 23, 2024Oct 23, 2024
    • BibleGPT

      Public
      用RAG结合大语言模型开发的圣经学习app
      Jupyter Notebook
      Apache License 2.0
      2000Updated Apr 29, 2024Apr 29, 2024