Gabriel C's picture

Gabriel C

gabrielchua

·

https://gabrielchua.me

AI & ML interests

Large Language Models, AI Safety, Causal Inference

Recent Activity

updated a Space about 2 months ago

govtech/rai-bench

updated a Space about 2 months ago

govtech/lionguard-demo

updated a Space about 2 months ago

govtech/lionguard-demo

View all activity

Organizations

authored 5 papers 6 months ago

Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security

Paper • 2507.19399 • Published Jul 25, 2025 • 1

LionGuard 2: Building Lightweight, Data-Efficient & Localised Multilingual Content Moderators

Paper • 2507.15339 • Published Jul 21, 2025

Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation

Paper • 2507.11966 • Published Jul 16, 2025

Measuring What Matters: A Framework for Evaluating Safety Risks in Real-World LLM Applications

Paper • 2507.09820 • Published Jul 13, 2025

RabakBench: Scaling Human Annotations to Construct Localized Multilingual Safety Benchmarks for Low-Resource Languages

Paper • 2507.05980 • Published Jul 8, 2025 • 1

authored a paper 10 months ago

MinorBench: A hand-built benchmark for content-based risks for children

Paper • 2503.10242 • Published Mar 13, 2025 • 5

authored a paper about 1 year ago

A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20, 2024 • 22