Transformers
State-of-the-art Machine Learning library for Pytorch, TensorFlow, and JAX, providing thousands of pre-trained models for natural language processing, computer vision, and other areas.
Looking for an open-source alternative to spaCy? Below are 3 community-built tools that offer similar functionality — all free, open source, and ready to use or self-host. Ranked by GitHub stars.
State-of-the-art Machine Learning library for Pytorch, TensorFlow, and JAX, providing thousands of pre-trained models for natural language processing, computer vision, and other areas.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components such as models, vector databases, and file converters to pipelines or agents that can interact with your data. Best suited for building Retrieval-Augmented Generation (RAG), question answering, semantic search, or conversational agent chatbots.
🦛 CHONK your texts with Chonkie ✨ — The no-nonsense RAG chunking library. A powerful and efficient tool for text chunking and processing.
The top picks from this list are Transformers, Haystack, Chonkie — all maintained, free to use, and self-hostable.
Yes. Every tool listed here is open source and free to use. Many can be self-hosted on your own infrastructure, which means no subscription fees and full control over your data.
Most of the alternatives listed are self-hostable. Check each tool's page for hosting details, system requirements, and licensing terms.
Get notified about new tools and updates to existing ones.