rlhf

Here are 121 public repositories matching this topic...

log10-io / log10

Python client library for improving your LLM app accuracy

python debugging ai monitoring evaluations feedback logging artificial-intelligence openai agents autonomous-agents fine-tuning llms rlhf llmops anthropic

Updated Jun 11, 2024
Python

argilla-io / argilla

Star

Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.

nlp machine-learning natural-language-processing ai weak-supervision developer-tools active-learning annotation-tool text-annotation weakly-supervised-learning human-in-the-loop mlops text-labeling gpt-4 llm langchain rlhf

Updated Jun 11, 2024
Python

argilla-io / distilabel

Star

⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.

python ai openai synthetic-data synthetic-dataset-generation huggingface llms rlhf rlaif

Updated Jun 11, 2024
Python

InternLM / InternLM

Star

Official release of InternLM2 7B and 20B base and chat models. 200K context support

chatbot chinese gpt pretrained-models llm long-context rlhf large-language-model flash-attention fine-tuning-llm

Updated Jun 11, 2024
Python

OctopusMind / DPO

Star

dpo算法实现

lora dpo rlhf qwen

Updated Jun 11, 2024
Python

hiyouga / LLaMA-Factory

Star

Unify Efficient Fine-Tuning of 100+ LLMs

Updated Jun 11, 2024
Python

kyryl-opens-ml / rlfh-dagster-modal

Star

Re-usable & scalable RLHF training pipeline with Dagster and Modal.

modal dpo dagster llm rlhf

Updated Jun 11, 2024
Python

Dev1nW / Rating-based-Reinforcement-Learning

Star

Official Codebase for Rating-Based Reinforcement Learning.

alignment rlhf human-guided-reinforcement-learning

Updated Jun 10, 2024
Python

tatsu-lab / alpaca_eval

Star

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

nlp deep-learning leaderboard evaluation instruction-following foundation-models large-language-models rlhf

Updated Jun 9, 2024
Jupyter Notebook

PKU-Alignment / llms-resist-alignment

Star

Repo for paper "Language Models Resist Alignment"

alignment llama safe alpaca ai-safety vicuna llm llms rlhf safe-rlhf llama2 llama3

Updated Jun 9, 2024
Python

alexisrozhkov / llm-calib

Star

Improving LLM truthfulness via reporting confidence

alignment truthfulness llm rlhf

Updated Jun 9, 2024
Python

allenai / reward-bench

Star

RewardBench: the first evaluation tool for reward models.

preference-learning rlhf

Updated Jun 9, 2024
Python

Esmail-ibraheem / Axon

Star

AI research lab🔬: implementations of AI papers and theoretical research: InstructGPT, llama, transformers, diffusion models, RLHF, etc...

transformers pytorch llama research-paper paper-implementations arxiv-papers llms rlhf

Updated Jun 9, 2024
Jupyter Notebook

Nips20262 / Nips20262

Star

Language Models Resist Alignment

alignment theory llm rlhf instruction-tuning unalignment

Updated Jun 7, 2024
Python

Aligner2024 / aligner

Star

Achieving Efficient Alignment through Learned Correction

alignment aligner llm rlhf weak-to-strong

Updated Jun 7, 2024
Python

RLHFlow / Online-RLHF

Star

A recipe for online RLHF.

llm rlhf llama3

Updated Jun 7, 2024
Python

opening-up-chatgpt / opening-up-chatgpt.github.io

Star

Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Tracking Openness, Transparency, and Accountability in Instruction-Tuned Text Generators.” In Proceedings of the 5th International Conference on Conversational User Interfaces. doi:10.1145/3571884.3604316.

open-source transparency llm chatgpt rlhf chatgpt-free

Updated Jun 7, 2024
Python

jianzhnie / LLamaTuner

Star

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

llama ppo dpo chatgpt rlhf qlora qwen mixtral llama3

Updated Jun 7, 2024
Python

patrick-tssn / LM-Research-Hub

Star

Language Modeling Research Hub, a comprehensive compendium for enthusiasts and scholars delving into the fascinating realm of language models (LMs), with a particular focus on large language models (LLMs)

open-source api-wrapper accelerate multi-modal pretraining large-language-models llm rlhf instruction-tuning

Updated Jun 7, 2024
Python

ld-ing / qdhf

Star

Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization (ICML 2024)

text-to-image optimization-algorithms generative-ai rlhf human-ai-alignment

Updated Jun 7, 2024
Python

Improve this page

Add a description, image, and links to the rlhf topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the rlhf topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rlhf

Here are 121 public repositories matching this topic...

log10-io / log10

argilla-io / argilla

argilla-io / distilabel

InternLM / InternLM

OctopusMind / DPO

hiyouga / LLaMA-Factory

kyryl-opens-ml / rlfh-dagster-modal

Dev1nW / Rating-based-Reinforcement-Learning

tatsu-lab / alpaca_eval

PKU-Alignment / llms-resist-alignment

alexisrozhkov / llm-calib

allenai / reward-bench

Esmail-ibraheem / Axon

Nips20262 / Nips20262

Aligner2024 / aligner

RLHFlow / Online-RLHF

opening-up-chatgpt / opening-up-chatgpt.github.io

jianzhnie / LLamaTuner

patrick-tssn / LM-Research-Hub

ld-ing / qdhf

Improve this page

Add this topic to your repo