GitHub Trends
10.1K subscribers
15.3K links
See what the GitHub community is most excited about today.

A bot automatically fetches new repositories from https://github.com/trending and sends them to the channel.

Author and maintainer: https://github.com/katursis
Download Telegram
#python #classification #coco #computer_vision #deep_learning #hacktoberfest #image_processing #instance_segmentation #low_code #machine_learning #metrics #object_detection #oriented_bounding_box #pascal_voc #python #pytorch #tensorflow #tracking #video_processing #yolo

Supervision is a powerful tool for building computer vision applications. It allows you to easily load datasets, draw detections on images or videos, and count detections in specific zones. You can use any classification, detection, or segmentation model with it, and it has connectors for popular libraries like Ultralytics and Transformers. Supervision also offers customizable annotators to visualize your data and utilities to manage datasets in various formats. By using Supervision, you can streamline your computer vision projects and make them more reliable and efficient. Additionally, there are extensive tutorials and documentation available to help you get started quickly.

https://github.com/roboflow/supervision
#python #agents #ai #multimodal #real_time #video #voice #voice_assistant

The Agents framework helps you build AI-driven programs that can interact with users in real-time through text, audio, images, or video. It integrates with OpenAI's Realtime API for ultra-low latency interactions and supports various plugins for speech-to-text, text-to-speech, and other AI services. You can use it to create voice assistants, transcription agents, and more, with easy deployment across local, self-hosted, or cloud environments. This makes it easier to develop interactive AI applications quickly and efficiently.

https://github.com/livekit/agents
#python #ai #ai_deep_fake #ai_face #ai_webcam #artificial_intelligence #deep_fake #deepfake #deepfake_webcam #faceswap #fake_webcam #real_time_deepfake #realtime_deepfake #realtime_face_changer #video_deepfake #webcam #webcamera

Deep Live Cam is a powerful tool that allows you to perform real-time face swaps and create video deepfakes with just a single click and one image. It's designed to help artists animate custom characters or use them as models for clothing, among other tasks. The software includes built-in checks to prevent processing inappropriate content and requires users to obtain consent before using someone's face.

You can easily download and use the pre-built version with CUDA support, or manually install it if you have technical skills. The tool offers features like resizable preview windows, face mapping, and various command line arguments for customization. It also supports different execution providers like CUDA, CoreML, and DirectML for optimized performance.

Using Deep Live Cam responsibly and legally is crucial; always label any output as a deepfake when sharing online. This tool simplifies complex tasks and enhances creativity, making it beneficial for artists and content creators who need advanced face swapping capabilities.

https://github.com/hacksider/Deep-Live-Cam
#python #python #sponskrub #sponsorblock #video_downloader #youtube_dl #youtube_dlc #youtube_downloader #yt_dlp

yt-dlp is a powerful command-line tool for downloading audio and video content from thousands of sites, including YouTube. Here are the key benefits and important information yt-dlp can download content from a vast array of websites, not just YouTube.
- **Advanced Format Selection** It allows merging multiple video and audio streams into a single file.
- **Post-Processing** You can customize the output filename template and directory structure using the `-o` and `-P` options.
- **Plugin Support** Automatically marks or removes sponsor segments from YouTube videos.
- **Improved Metadata Handling** Can be installed using binaries, pip, or third-party package managers. See the [installation instructions](#installation).
- **Usage** Customize filenames using templates with various fields like title, ID, and more. See [output template examples](#output-template-examples).
- **Format Selection** Convert audio/video files, embed subtitles/thumbnails, and more using `--extract-audio`, `--remux-video`, etc.
- **Configuration**//www.youtube.com/watch?v=BaW_jenozKc

# Download only the audio in mp3 format
yt-dlp -x --audio-format mp3 https//www.youtube.com/watch?v=BaW_jenozKc
```

This tool offers extensive customization and flexibility, making it a powerful alternative to

https://github.com/yt-dlp/yt-dlp
👍2
#cplusplus #android #audio_processing #c_plus_plus #calculator #computer_vision #deep_learning #framework #graph_based #graph_framework #inference #machine_learning #mediapipe #mobile_development #perception #pipeline_framework #stream_processing #video_processing

MediaPipe is a tool that helps you add smart machine learning features to your apps and devices. It works on mobile, web, desktop, and other devices. You can use pre-made solutions for tasks like vision, text, and audio processing, or customize the models to fit your needs. MediaPipe also offers tools like Model Maker and Studio to help you create and test your solutions easily. This makes it easier to delight your customers with innovative features without needing deep machine learning expertise.

https://github.com/google-ai-edge/mediapipe
#cplusplus #audio #c #c_plus_plus #dash #hevc #hls #live #live_streaming #low_latency #media_server #multimedia #prometheus_exporter #rtmp #server_side #srt #streaming #video #video_conferencing #video_streaming #webrtc

SRS (Simple Realtime Server) is a powerful and efficient video server that supports multiple streaming protocols like RTMP, WebRTC, HLS, and more. It works on various operating systems (Linux, Windows, macOS) and hardware architectures. You can easily set it up using Docker and stream videos using tools like FFmpeg or OBS. SRS is free and open-source, licensed under MIT, making it a great choice for developers to build high-quality streaming platforms. It also offers extensive documentation and community support, making it easier to get started and troubleshoot issues.

https://github.com/ossrs/srs
🔥1
#java #16k #android #exoplayer #ffmpeg #gsy #ijkplayer #player #ssl #video #videocache

This video player supports multiple video playback engines like IJKPlayer, Media3 (ExoPlayer2), MediaPlayer, and AliPlayer. It offers various features such as Supports caching while playing videos.
- **Protocols** Includes over 20 types of filters and animations.
- **Frame Capture** Offers list playback, continuous playback, speed adjustment, and rotation.
- **Display** Allows switching between different playback cores, custom layouts, and more.
- **Ads and Subtitles**: Supports pre-roll ads, skip ads, and external subtitles.

Using this player benefits you by providing a highly customizable and feature-rich video playback solution that can handle various video formats and protocols, making it versatile for different needs.

https://github.com/CarGuo/GSYVideoPlayer
#cplusplus #bittorrent_client #bt #download #download_manager #download_videos #ftp_client #http_client #libtorrent #m3u8 #magnet #qbittorrent #qt #remote_download #ssh_client #stream_downloader #torrent #transmission #video_downloader #webdav_client

File Centipede is a powerful tool that helps you manage internet files easily. It supports many protocols like HTTP, FTP, BitTorrent, and more, making it versatile for different tasks. You can download videos and audio from websites, even if they are encrypted. The software is fast, customizable, and user-friendly, with features like speed limits, proxy management, and browser integration. It also includes tools for translating, encoding, and merging files. Best of all, it's free and has no ads, making it a great choice for managing your online files efficiently.

https://github.com/filecxx/FileCentipede