GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#go #approximate_nearest_neighbor_search #generative_search #grpc #hnsw #hybrid_search #image_search #information_retrieval #mlops #nearest_neighbor_search #neural_search #recommender_system #search_engine #semantic_search #semantic_search_engine #similarity_search #vector_database #vector_search #vector_search_engine #vectors #weaviate

Weaviate is a powerful, open-source vector database that uses machine learning to make your data searchable. It's fast, scalable, and flexible, allowing you to vectorize your data at import or upload your own vectors. Weaviate supports various modules for integrating with popular AI services like OpenAI, Cohere, and Hugging Face. It's designed for production use with features like scaling, replication, and security. You can use Weaviate for tasks beyond search, such as recommendations, summarization, and integration with neural search frameworks. It offers APIs in GraphQL, REST, and gRPC and has client libraries for several programming languages. This makes it easy to build applications like chatbots, recommendation systems, and image search tools quickly and efficiently. Joining the Weaviate community provides access to tutorials, demos, blogs, and forums to help you get started and stay updated.

https://github.com/weaviate/weaviate
#jupyter_notebook #ai #azure #chatgpt #dall_e #generative_ai #generativeai #gpt #language_model #llms #openai #prompt_engineering #semantic_search #transformers

This course teaches you how to build Generative AI applications with 21 comprehensive lessons from Microsoft Cloud Advocates. You'll learn about Generative AI, Large Language Models (LLMs), prompt engineering, and how to build various applications like text generation, chat apps, and image generation using Python and TypeScript. The course includes videos, written lessons, code samples, and additional learning resources. You can start anywhere and even join a Discord server for support and networking with other learners. This helps you gain practical skills in building and deploying Generative AI applications responsibly and effectively.

https://github.com/microsoft/generative-ai-for-beginners
#rust #app_search #database #enterprise_search #faceting #full_text_search #fuzzy_search #geosearch #hybrid_search #instantsearch #rest #rust #search #search_as_you_type #search_engine #semantic_search #site_search #synonyms #typo_tolerance #vector_database #vectors

Meilisearch is a fast and powerful search engine that you can easily integrate into your apps, websites, and workflow. It offers features like hybrid search, search-as-you-type, typo tolerance, filtering, and sorting to enhance the user experience. You can customize it to fit your needs with support for multiple languages and advanced security management. It's easy to install, deploy, and maintain, and you can use their cloud service for added convenience. Meilisearch also provides extensive documentation, SDKs for various programming languages, and a supportive community through Discord and other channels. This makes it a great tool to supercharge your search capabilities quickly and efficiently.

https://github.com/meilisearch/meilisearch
#typescript #ai #chatgpt #docsgpt #hacktoberfest #information_retrieval #language_model #llm #machine_learning #natural_language_processing #python #pytorch #rag #react #semantic_search #transformers #web_app

DocsGPT is an open-source AI tool that helps you quickly find accurate answers from many types of documents and web sources without errors. It supports formats like PDF, DOCX, images, and integrates with websites, APIs, and chat platforms like Discord and Telegram. You can deploy it privately for security, customize it to fit your brand, and connect it to tools for advanced actions. This means you save time searching for information, get reliable answers with sources, and improve productivity whether you’re a developer, support team, or business user. It’s easy to set up and scales well for many users[2][3][4].

https://github.com/arc53/DocsGPT
1
#python #ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing

Memvid lets you store millions of text pieces inside a single MP4 video file using QR codes, making your data 50-100 times smaller than usual databases. You can search this video instantly in under 100 milliseconds without needing servers or internet after setup. It works offline, is easy to use with simple Python code, and supports PDFs and chat with your data. The upcoming version 2 will add features like continuous memory updates, shareable capsules, fast local caching, and better video compression, making your AI memory smarter, faster, and more flexible. This means you get a powerful, portable, and efficient way to manage and search huge knowledge bases quickly and easily.

https://github.com/Olow304/memvid
#go #agent #agentic #ai #chatbot #chatbots #embeddings #evaluation #generative_ai #golang #knowledge_base #llm #multi_tenant #multimodel #ollama #openai #question_answering #rag #reranking #semantic_search #vector_search

WeKnora is a powerful tool that helps you understand and find answers in complex documents like PDFs and Word files. It uses advanced AI to read documents, understand what they mean, and answer your questions in a simple way. This tool is useful for businesses and researchers because it can quickly find information from many documents, making it easier to manage knowledge and make decisions. It also supports multiple languages and can be used privately, ensuring your data stays safe.

https://github.com/Tencent/WeKnora
#rust #ai #change_data_capture #context_engineering #data #data_engineering #data_indexing #data_infrastructure #data_processing #etl #hacktoberfest #help_wanted #indexing #knowledge_graph #llm #pipeline #python #rag #real_time #rust #semantic_search

**CocoIndex** is a fast, open-source Python tool (Rust core) for transforming data into AI formats like vector indexes or knowledge graphs. Define simple data flows in ~100 lines of code using plug-and-play blocks for sources, embeddings, and targets—install via `pip install cocoindex`, add Postgres, and run. It auto-syncs fresh data with minimal recompute on changes, tracking lineage. **You save time building scalable RAG/semantic search pipelines effortlessly, avoiding complex ETL and stale data issues for production-ready AI apps.**

https://github.com/cocoindex-io/cocoindex