content-moderation/data/generated
Claude Code 7fa3ce92ff chore(trafficking): 🔧 Update trafficking positive examples in positives.jsonl
Co-Authored-By: Lilith Autocommit <noreply@atlilith.com>
2026-03-27 13:08:20 -07:00
..
adult_content feat(adult-content): Add 50 new positive examples to dataset for improved training coverage 2026-03-26 15:33:06 -07:00
age_play chore(age-play): 🔧 Update positive examples dataset for age-related content training/validation 2026-03-26 15:33:07 -07:00
anti_trans chore(anti-trans): 🔧 Update anti-trans content dataset with new positive examples for model training/validation 2026-03-26 15:33:08 -07:00
bdsm chore(bdsm): 🔧 Update positive BDSM dataset examples for improved training and moderation 2026-03-26 15:33:08 -07:00
bestiality docs(bestiality-specific): 📝 Update positive entries in bestiality-themed dataset for AI/user outputs 2026-03-26 15:33:09 -07:00
consent_violation test(consent-violation): Add positive test cases for valid consent violation scenarios in positives.jsonl 2026-03-26 15:33:09 -07:00
contact_info docs(contact-info): 📝 Update positive contact info dataset examples for training/testing validation 2026-03-26 15:33:10 -07:00
csam feat(csam-common): Add vulnerable patterns to positives.jsonl for expanded CSAM validation tests 2026-03-27 13:08:09 -07:00
doxxing test(doxxing): Update positive doxxing examples in positives.jsonl dataset 2026-03-26 15:33:11 -07:00
edge_play test(edge-play): Add positive test examples for edge-play validation cases 2026-03-26 15:33:11 -07:00
extreme_gore test(extreme-gore): Add and correct positive examples for extreme/gore content validation 2026-03-26 15:33:12 -07:00
financial_coercion chore(financial-coercion): 🔧 Update positive examples in positives.jsonl for financial coercion scenarios 2026-03-26 15:33:12 -07:00
furry docs(data): 📝 Expand furry-themed positive examples dataset with new records and corrections 2026-03-26 15:33:13 -07:00
harassment docs(harassment): 📝 Update positive harassment examples in JSONL dataset 2026-03-27 13:08:16 -07:00
hate_speech docs(hate-speech): 📝 Add new positive hate speech examples to training dataset 2026-03-26 14:08:56 -07:00
impersonation test(impersonation): Add/update valid impersonation test cases in positives.jsonl 2026-03-26 15:33:14 -07:00
intoxication chore(intoxication-assuming): 🔧 Update positive test dataset with new samples and corrected values 2026-03-26 15:33:15 -07:00
law_enforcement chore(law-enforcement): 🔧 Update positive entries dataset for law enforcement detection rules 2026-03-26 15:33:15 -07:00
ncii docs(ncii): 📝 Add/update positive examples for "ncii" system in positives.jsonl 2026-03-26 15:33:16 -07:00
necrophilia chore(necrophilia): 🔧 Update positive examples in training dataset for necrophilia feature 2026-03-26 15:33:16 -07:00
predatory_behavior docs(predatory-behavior): 📝 Update training examples for model with hard negatives and positives 2026-03-27 13:08:17 -07:00
profanity security(profanity): 🔒️ Update offensive term list to enhance profanity detection accuracy 2026-03-26 15:33:17 -07:00
roleplay chore(roleplay): 🔧 Update roleplay dataset with new positive examples in positives.jsonl 2026-03-26 15:33:18 -07:00
scam_patterns docs(scam-patterns): 📝 Update positive scam patterns dataset with new examples (phishing emails, fraudulent transactions) for improved model training/evaluation 2026-03-26 15:33:18 -07:00
scat test(scat-specific): Add expanded positive test cases in positives.jsonl for scat validation 2026-03-26 15:33:19 -07:00
self_harm docs(self-harm): 📝 Add 50 new self-harm positive examples and update labels for 20 existing ones to improve training data accuracy 2026-03-26 15:33:19 -07:00
sextortion docs(sextortion): 📝 Add positive examples for sextortion detection model training 2026-03-27 13:08:19 -07:00
snuff docs(snuff-specific): 📝 Add positive training examples for snuff classification dataset 2026-03-26 15:33:20 -07:00
solicitation test(solicitation): Expand positive test scenarios in solicitation positives.jsonl 2026-03-26 15:33:21 -07:00
spam chore(spam): 🔧 Update positive spam dataset examples for training/testing 2026-03-26 15:33:21 -07:00
threats security(threats): 🔒️ Update threat intelligence positives with new malicious patterns and indicators 2026-03-26 15:33:22 -07:00
trafficking chore(trafficking): 🔧 Update trafficking positive examples in positives.jsonl 2026-03-27 13:08:20 -07:00
watersports chore(watersports-specific): 🔧 Add 100+ positive labeled examples for watersports dataset tasks 2026-03-26 15:33:23 -07:00
innocuous.jsonl docs(data): 📝 Add neutral and controversial content examples to innocuous.jsonl and anti_trans/ datasets for moderation training validation 2026-03-18 15:33:59 -07:00
perturbation_negatives.jsonl chore(data): 🔧 Update dataset splits and negative samples for improved model robustness 2026-03-18 22:55:39 -07:00
targeted_hard_negatives.jsonl.19d feat(content-moderation): Update pipeline logic to handle phased training data splits, add hard/positive examples, and improve classification documentation 2026-03-10 14:43:12 -07:00
targeted_positives.jsonl.19d feat(content-moderation): Update pipeline logic to handle phased training data splits, add hard/positive examples, and improve classification documentation 2026-03-10 14:43:12 -07:00