GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#python #agents #ai #multimodal #real_time #video #voice #voice_assistant

The Agents framework helps you build AI-driven programs that can interact with users in real-time through text, audio, images, or video. It integrates with OpenAI's Realtime API for ultra-low latency interactions and supports various plugins for speech-to-text, text-to-speech, and other AI services. You can use it to create voice assistants, transcription agents, and more, with easy deployment across local, self-hosted, or cloud environments. This makes it easier to develop interactive AI applications quickly and efficiently.

https://github.com/livekit/agents
#python #ai #ai_deep_fake #ai_face #ai_webcam #artificial_intelligence #deep_fake #deepfake #deepfake_webcam #faceswap #fake_webcam #real_time_deepfake #realtime_deepfake #realtime_face_changer #video_deepfake #webcam #webcamera

Deep Live Cam is a powerful tool that allows you to perform real-time face swaps and create video deepfakes with just a single click and one image. It's designed to help artists animate custom characters or use them as models for clothing, among other tasks. The software includes built-in checks to prevent processing inappropriate content and requires users to obtain consent before using someone's face.

You can easily download and use the pre-built version with CUDA support, or manually install it if you have technical skills. The tool offers features like resizable preview windows, face mapping, and various command line arguments for customization. It also supports different execution providers like CUDA, CoreML, and DirectML for optimized performance.

Using Deep Live Cam responsibly and legally is crucial; always label any output as a deepfake when sharing online. This tool simplifies complex tasks and enhances creativity, making it beneficial for artists and content creators who need advanced face swapping capabilities.

https://github.com/hacksider/Deep-Live-Cam
#python #python #sponskrub #sponsorblock #video_downloader #youtube_dl #youtube_dlc #youtube_downloader #yt_dlp

yt-dlp is a powerful command-line tool for downloading audio and video content from thousands of sites, including YouTube. Here are the key benefits and important information yt-dlp can download content from a vast array of websites, not just YouTube.
- **Advanced Format Selection** It allows merging multiple video and audio streams into a single file.
- **Post-Processing** You can customize the output filename template and directory structure using the `-o` and `-P` options.
- **Plugin Support** Automatically marks or removes sponsor segments from YouTube videos.
- **Improved Metadata Handling** Can be installed using binaries, pip, or third-party package managers. See the [installation instructions](#installation).
- **Usage** Customize filenames using templates with various fields like title, ID, and more. See [output template examples](#output-template-examples).
- **Format Selection** Convert audio/video files, embed subtitles/thumbnails, and more using `--extract-audio`, `--remux-video`, etc.
- **Configuration**//www.youtube.com/watch?v=BaW_jenozKc

# Download only the audio in mp3 format
yt-dlp -x --audio-format mp3 https//www.youtube.com/watch?v=BaW_jenozKc
```

This tool offers extensive customization and flexibility, making it a powerful alternative to

https://github.com/yt-dlp/yt-dlp
👍2
#cplusplus #android #audio_processing #c_plus_plus #calculator #computer_vision #deep_learning #framework #graph_based #graph_framework #inference #machine_learning #mediapipe #mobile_development #perception #pipeline_framework #stream_processing #video_processing

MediaPipe is a tool that helps you add smart machine learning features to your apps and devices. It works on mobile, web, desktop, and other devices. You can use pre-made solutions for tasks like vision, text, and audio processing, or customize the models to fit your needs. MediaPipe also offers tools like Model Maker and Studio to help you create and test your solutions easily. This makes it easier to delight your customers with innovative features without needing deep machine learning expertise.

https://github.com/google-ai-edge/mediapipe
#cplusplus #audio #c #c_plus_plus #dash #hevc #hls #live #live_streaming #low_latency #media_server #multimedia #prometheus_exporter #rtmp #server_side #srt #streaming #video #video_conferencing #video_streaming #webrtc

SRS (Simple Realtime Server) is a powerful and efficient video server that supports multiple streaming protocols like RTMP, WebRTC, HLS, and more. It works on various operating systems (Linux, Windows, macOS) and hardware architectures. You can easily set it up using Docker and stream videos using tools like FFmpeg or OBS. SRS is free and open-source, licensed under MIT, making it a great choice for developers to build high-quality streaming platforms. It also offers extensive documentation and community support, making it easier to get started and troubleshoot issues.

https://github.com/ossrs/srs
🔥1
#java #16k #android #exoplayer #ffmpeg #gsy #ijkplayer #player #ssl #video #videocache

This video player supports multiple video playback engines like IJKPlayer, Media3 (ExoPlayer2), MediaPlayer, and AliPlayer. It offers various features such as Supports caching while playing videos.
- **Protocols** Includes over 20 types of filters and animations.
- **Frame Capture** Offers list playback, continuous playback, speed adjustment, and rotation.
- **Display** Allows switching between different playback cores, custom layouts, and more.
- **Ads and Subtitles**: Supports pre-roll ads, skip ads, and external subtitles.

Using this player benefits you by providing a highly customizable and feature-rich video playback solution that can handle various video formats and protocols, making it versatile for different needs.

https://github.com/CarGuo/GSYVideoPlayer
#cplusplus #bittorrent_client #bt #download #download_manager #download_videos #ftp_client #http_client #libtorrent #m3u8 #magnet #qbittorrent #qt #remote_download #ssh_client #stream_downloader #torrent #transmission #video_downloader #webdav_client

File Centipede is a powerful tool that helps you manage internet files easily. It supports many protocols like HTTP, FTP, BitTorrent, and more, making it versatile for different tasks. You can download videos and audio from websites, even if they are encrypted. The software is fast, customizable, and user-friendly, with features like speed limits, proxy management, and browser integration. It also includes tools for translating, encoding, and merging files. Best of all, it's free and has no ads, making it a great choice for managing your online files efficiently.

https://github.com/filecxx/FileCentipede
#python #ai_translation #dubbing #localization #video_translation #voice_cloning

VideoLingo is a powerful tool that helps translate, localize, and dub videos, making them understandable across different languages. It uses advanced technologies like WhisperX for accurate subtitle recognition and GPT for high-quality translations. The tool ensures single-line subtitles, similar to those on Netflix, and offers dubbing alignment for a more natural viewing experience. You can use it online, in Google Colab, or install it locally on your computer. This makes it easier to share videos globally without language barriers, enhancing global knowledge sharing and communication.

https://github.com/Huanshere/VideoLingo
#typescript #chromeos #electron #epg #fair_source #iptv #iptv_m3u #iptv_player #iptv_player_application #linux #m3u #m3u8 #macos #player #player_video #playlist #pwa #snap #tauri #tv #video

IPTVnator is a free video player application that lets you watch IPTV channels using m3u and m3u8 playlists. You can add these playlists from files or internet links and even get TV guides (EPG) in XMLTV format. It supports multiple players like MPV and VLC, has features like channel search, favorite channels, and different themes. It's available for macOS, Windows, and Linux, and you can download it from the release page or install it using package managers. This app is helpful because it allows you to organize and watch your IPTV channels easily on various devices.

https://github.com/4gray/iptvnator
#python #annotation #annotation_tool #annotations #boundingbox #computer_vision #computer_vision_annotation #dataset #deep_learning #image_annotation #image_classification #image_labeling #image_labelling_tool #imagenet #labeling #labeling_tool #object_detection #pytorch #semantic_segmentation #tensorflow #video_annotation

CVAT is a powerful tool for annotating videos and images, especially useful for computer vision projects. It helps developers and companies annotate data quickly and efficiently. You can use CVAT online for free or subscribe for more features like unlimited data and integrations with other tools. It also offers a self-hosted option with enterprise support. CVAT supports many annotation formats and has automatic labeling options to speed up your work. It's widely used by many teams worldwide, making it a reliable choice for your data annotation needs.

https://github.com/cvat-ai/cvat
#svelte #collaboration #downloader #javascript #music #reddit #social_media #soundcloud #svelte #tiktok #twitter #typescript #video #vimeo #vk #webapp #youtube #youtube_downloader

Cobalt is a simple and efficient media downloader without ads, trackers, or paywalls. You just paste the link and get the file quickly. It's easy to use and doesn't bother you with unnecessary things. Cobalt is free, publicly accessible, and does not support piracy. It also has a community Discord server and Twitter for support. Using Cobalt helps you save what you love easily and quickly.

https://github.com/imputnet/cobalt
1
#go #activitypub #broadcasting #chat #decentralized #federation #fediverse #golang #hacktoberfest #hls #live #livestream #owncast #rtmp #self_hosted #streaming_video #video

Owncast is a free, open-source tool that lets you stream your videos live and control everything yourself. You can use it with popular broadcasting software like OBS or Streamlabs. It gives you full ownership over your content, interface, and audience, which means you have more freedom and control. To get started, you can visit the quickstart guide or view a demo to see how it works. This way, you don't have to rely on big streaming services and can manage your streams independently.

https://github.com/owncast/owncast
#javascript #freetube #privacy #subscriptions #video #videos #youtube

FreeTube is a free, open-source app that lets you watch YouTube videos without ads and helps keep your viewing private. It doesn't use Google's tracking cookies or JavaScript, so you can enjoy videos without being tracked by Google. Your data stays on your computer, not online. FreeTube works on Windows, Mac, and Linux, and you can subscribe to channels without needing an account. It also offers features like importing subscriptions and using external players. This makes it a good choice for people who want more privacy while watching YouTube videos.

https://github.com/FreeTubeApp/FreeTube
👍1
#go #dubbing #localization #tts #video_transcription #video_translation

Krillin AI is a tool that helps translate and dub videos easily. It supports many languages and can automatically add subtitles, translate them, and even change the voice. This tool is useful for making videos ready for different platforms like YouTube or TikTok. It saves time by doing everything in just a few clicks, making it easy to share videos with people who speak different languages.

https://github.com/krillinai/KrillinAI
1
#python #face_animation #image_animation #video_editing #video_generation

LivePortrait is a tool that uses AI to animate still photos, making them look like videos. It works by identifying key facial features and adding realistic movements. This technology helps create lifelike videos that can be used for personalized communication. The benefit to users is that they can easily create engaging animated portraits from static images, which can be fun and useful for various applications like social media or storytelling.

https://github.com/KwaiVGI/LivePortrait
#typescript #alternative #converter #data_manipulation #developer_tools #devtools #frontend #good_first_issue #image_manipulation #image_processing #javascript #pdf_manipulation #productivity #react #self_hosted #swissarmyknife #tools #typescript #video_manipulation #webapp #website

OmniTools is a self-hosted web app that helps with many tasks like image and video editing, number crunching, and more. It offers tools for resizing images, converting videos, calculating dates, and generating prime numbers. You can run it on your own computer using Docker, which means your data stays local. This app is open-source and free, allowing you to contribute new features or tools easily. Using OmniTools simplifies many everyday tasks and keeps your data private.

https://github.com/iib0011/omni-tools
👍1
#rust #fpv #gopro #gpu #gpu_computing #gyroscope #insta360 #rolling_shutter_undistortion #rust #sony_alpha_cameras #stabilization #video #video_processing

Gyroflow is a powerful video stabilization software that uses gyroscope data from cameras like GoPro, Sony, and Insta360 to make your videos smooth and steady. It corrects lens distortion, rolling shutter effects, and can even level the horizon for a professional look. You can preview changes in real-time, use GPU acceleration for fast processing, and apply stabilization directly in popular video editors with plugins. It supports many video formats and works on Windows, Mac, Linux, Android, and iOS. Using Gyroflow helps you create high-quality, cinematic videos without bulky equipment or complicated setups[1][3][5].

https://github.com/gyroflow/gyroflow
1
#python #ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing

Memvid lets you store millions of text pieces inside a single MP4 video file using QR codes, making your data 50-100 times smaller than usual databases. You can search this video instantly in under 100 milliseconds without needing servers or internet after setup. It works offline, is easy to use with simple Python code, and supports PDFs and chat with your data. The upcoming version 2 will add features like continuous memory updates, shareable capsules, fast local caching, and better video compression, making your AI memory smarter, faster, and more flexible. This means you get a powerful, portable, and efficient way to manage and search huge knowledge bases quickly and easily.

https://github.com/Olow304/memvid
#javascript #appimage #compressor #downloader #electron #electron_app #ffmpeg #flatpak #javascript #linux #linux_app #macos #nodejs #snap #ubuntu #video #windows #youtube #youtube_dl #youtube_downloader #ytdownloader

You can use ytDownloader, a modern app that lets you download videos and audio from hundreds of sites like YouTube, Facebook, Instagram, TikTok, and Twitter. It works on Windows, macOS, and Linux, offers fast downloads, supports playlists, subtitles, and video compression with hardware acceleration, and has multiple themes. It’s free of ads and trackers, making it safe and easy to use. You can install it via various methods like Flatpak, Snap, or package managers on different systems. This helps you save videos for offline viewing, enjoy faster access without ads, and keep your favorite content anytime.

https://github.com/aandrew-me/ytDownloader
#python #audio_generation #diffusion #image_generation #inference #model_serving #multimodal #pytorch #transformer #video_generation

vLLM-Omni is a free, open-source tool that makes serving AI models for text, images, videos, and audio fast, easy, and cheap. It builds on vLLM for top speed using smart memory tricks, overlapping tasks, and flexible resource sharing across GPUs. You get 2x higher throughput, 35% less delay, and simple setup with Hugging Face models via OpenAI API—perfect for building quick multi-modal apps like chatbots or media generators without high costs.

https://github.com/vllm-project/vllm-omni