AI Tools
loading
948 repositories found
Langchain-Chatchat
38211 stars
6215 forks
Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Llama) RAG and Agent app with langchain
Python
ms-swift
14599 stars
1490 forks
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).
Python
PaddleNLP
12950 stars
3037 forks
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
Python
Chinese-Word-Vectors
12228 stars
2324 forks
100+ Chinese Word Vectors 上百种预训练中文词向量
Python
turbovec
12124 stars
1075 forks
A vector index built on TurboQuant, written in Rust with Python bindings
Python
claude-context
11938 stars
888 forks
Code search MCP for Claude Code. Make entire codebase the context for any coding agent.
TypeScript
all-in-rag
8865 stars
4408 forks
🔍大模型应用开发实战一:RAG 技术全栈指南,在线阅读地址:https://datawhalechina.github.io/all-in-rag/
Python
TencentDB-Agent-Memory
6010 stars
524 forks
TencentDB Agent Memory delivers fully local long-term memory for AI Agents via a 4-tier progressive pipeline, with zero external API dependencies.
TypeScript
UltraRAG
5604 stars
433 forks
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
Python
embedding-atlas
4836 stars
302 forks
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
TypeScript
infinity
4582 stars
428 forks
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
C++
myGPTReader
4416 stars
441 forks
A community-driven way to read and chat with AI bots - powered by chatGPT.
Python
telegram-search
3949 stars
259 forks
🔍 导出并模糊搜索 Telegram 聊天记录 | Export and fuzzy search your Telegram chat history
TypeScript
siamese-triplet
3175 stars
632 forks
Siamese and triplet networks with online pair/triplet mining in PyTorch
Python
LlamaIndexTS
3078 stars
521 forks
Data framework for your LLM applications. Focus on server side solution
TypeScript
trieve
2683 stars
247 forks
All-in-one platform for search, recommendations, RAG, and analytics offered via API
Rust
awesome-community-detection
2445 stars
357 forks
A curated list of community detection research papers with implementations.
Python
node-llama-cpp
2108 stars
200 forks
Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level
TypeScript
openTSNE
1618 stars
176 forks
Extensible, parallel implementations of t-SNE
Python
modelfusion
1320 stars
96 forks
The TypeScript library for building AI applications.
TypeScript
MyScaleDB
1033 stars
72 forks
A @ClickHouse fork that supports high-performance vector search and full-text search.
C++
chatWeb
914 stars
137 forks
ChatWeb can crawl web pages, read PDF, DOCX, TXT, and extract the main content, then answer your questions based on the content, or summarize the key points.
Python
ngram2vec
847 stars
172 forks
Four word embedding models implemented in Python. Supporting arbitrary context features
Python
gritlm
695 stars
50 forks
Generative Representational Instruction Tuning
Jupyter Notebook
VLM2Vec
659 stars
60 forks
This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]
Python
embedJs
604 stars
75 forks
A NodeJS RAG framework to easily work with LLMs and embeddings
TypeScript
openl3
594 stars
65 forks
OpenL3: Open-source deep audio and image embeddings
Jupyter Notebook
pymde
588 stars
30 forks
Minimum-distortion embedding with PyTorch
Python
ACG2vec
580 stars
24 forks
ACG2vec (Anime Comics Games to vector) are committed to creating a playground that combines ACG and Deep learning.(文本语义检索、以图搜图、语义搜图、图片超分辨率、推荐系统)
NLP_pytorch_project
569 stars
121 forks
Embedding, NMT, Text_Classification, Text_Generation, NER etc.
Python
1
2
3
4
5