#

aisafety

Here are 9 public repositories matching this topic...

tigerlab-ai / tiger

Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)

classification data-augmentation ai-safety fine-tuning aisafety rag large-language-models llm llm-training

Updated Dec 2, 2023
Jupyter Notebook

trendmicro / ais

Toolkit for research purposes in AIS. See the website for the paper.

security safety sdr rf ais aisafety

Updated Feb 15, 2021
Python

metadriverse / cat

[CoRL'23] Adversarial Training for Safe End-to-End Driving

autonomous-vehicles adversarial-machine-learning aisafety

Updated Dec 5, 2023
Python

riceissa / aiwatch

Website to track people, organizations, and products (tools, websites, etc.) in AI safety

mysql php database dataset ai-safety data-portal aisafety ai-alignment

Updated Jun 12, 2024
HTML

ZiyueWang25 / llm-security-challenge

Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments

cybersecurity aisafety llm

Updated Aug 21, 2023
Python

kkhetarpal / ais

Common repository for our readings and discussions

reinforcement-learning ai intrinsic-motivation aisafety saferl reward-design

Updated Apr 16, 2018

kkhetarpal / safe_a2oc_delib

Safe Option Critic: Learning Safe Options in the A2OC Architecture

options-framework aisafety asynchr-advantage-option-critic safe-option-critic

Updated Dec 17, 2018
Python

endlessloop2 / UC-AI-Thinkathon-2023

Winning entry for the UC Chile AI Safety Thinkathon 2023. Coauthor @mon-b

ai alignment ai-safety aisafety gpt-3

Updated Mar 1, 2023
R

SasankYadati / mech-interp

where I learn and explore mechanistic interpretability of transformers

transformers interpretability aisafety

Updated May 24, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the aisafety topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the aisafety topic, visit your repo's landing page and select "manage topics."