Seamlessly integrate state-of-the-art transformer models into robotics stacks
Updated Jun 2, 2024 - Python
A web UI project for learning about large language models. It includes features such as chat, quantization, fine-tuning, prompt engineering templates, and multimodality.
Framework for processing and filtering datasets
This repository is used to collect papers and code in the field of AI.
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Orchestrate swarms of agents from any framework, such as OpenAI or LangChain, for business operation automation. Join our community: https://discord.gg/DbjBMJTSWD
A record of interesting AI-related projects.
A curated list of awesome Multimodal studies.
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought, Instruction-Tuning and Multimodality.
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
NSMusicS: multi-platform, multi-mode music software built with Electron (Vue3 + Vite + TypeScript), .NET Core, and AI.
Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics, and Fundamental Sciences such as Mathematics.
BIM-based 3D Multimodal Reconstruction for Substation Equipment Inspection Images
Generative AI suite powered by state-of-the-art models and providing advanced AI/AGI functions. It features AI personas, AGI functions, multi-model chats, text-to-image, voice, response streaming, code highlighting and execution, PDF import, presets for developers, and much more. Deploy on-prem or in the cloud.
Official code for the paper "Mantis: Multi-Image Instruction Tuning"
React component library for crafting user-friendly and engaging conversational experiences
Automatically updated paper list
Build real-time multimodal AI applications 🤖🎙️📹