From 66fcbee8fbd87dfccdd29a4e3953299d42ed70a1 Mon Sep 17 00:00:00 2001 From: Finn Bear Date: Sun, 22 Sep 2024 11:14:25 -0700 Subject: [PATCH] Fix #28. --- README.md | 2 +- src/dictionary_extra.txt | 9 +++++++++ src/false_positives.txt | 9 +++++++++ src/test_negative.txt | 11 ++++++++++- 4 files changed, 29 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index dea75cf..2148942 100644 --- a/README.md +++ b/README.md @@ -177,7 +177,7 @@ is used as a dataset. Positive accuracy is the percentage of profanity detected | Crate | Accuracy | Positive Accuracy | Negative Accuracy | Time | |-------|----------|-------------------|-------------------|------| -| [rustrict](https://crates.io/crates/rustrict) | 79.74% | 94.00% | 76.19% | 9s | +| [rustrict](https://crates.io/crates/rustrict) | 79.78% | 94.00% | 76.23% | 9s | | [censor](https://crates.io/crates/censor) | 76.16% | 72.76% | 77.01% | 23s | ## Development diff --git a/src/dictionary_extra.txt b/src/dictionary_extra.txt index f40c2e9..7b1a93d 100644 --- a/src/dictionary_extra.txt +++ b/src/dictionary_extra.txt @@ -31,6 +31,7 @@ admit it's ain't it alt an ai +and ill do anna anna! anna!! @@ -91,6 +92,7 @@ freakin fuchs dystrophy fugia gaya +ght, its glhf graham cracker graham crackers @@ -115,6 +117,7 @@ honkeytonk honkey tonk honkey-tonk hugger s +hurt, its i am a jew i am gay i'd like @@ -129,6 +132,7 @@ in 199 in june irl isn't it +it, its it's a hole jarse a jewish @@ -156,11 +160,13 @@ kian kill ike killian kshatr +last? it lmao lol magnacumlaude maine coon make a hole +s expired minigame mini game n't eat @@ -194,7 +200,9 @@ puss in boots ref'd refresh at rip +s a distance saturated fat +see me now shoehorn your shouldn't it since 1 @@ -238,6 +246,7 @@ virgin group virgin islands wassup wasn't it +wish i t wouldn't it xp or no yass diff --git a/src/false_positives.txt b/src/false_positives.txt index ba5962b..6d8af84 100644 --- a/src/false_positives.txt +++ b/src/false_positives.txt @@ -7211,6 +7211,7 @@ geyser ghastful ghettoized ghettoizes +ght, its giansar gibbals gid dicke @@ -8789,6 +8790,7 @@ hunt titles hunt wat hunt wats hunt watts +hurt, its husbandable husbandage husbanded @@ -9387,6 +9389,7 @@ it titles it wat it wats it watts +it, its italiano life italiano parents italic cocktail @@ -10253,6 +10256,7 @@ lasses hit lasses lut lasses perm lasses seeks +last? it latebra latinas hole latitude flower @@ -11854,6 +11858,7 @@ nc untitled nc unto ncaa strate nd dicke +nd ill do nd licking nd ongoing nebbish @@ -15572,6 +15577,8 @@ rusts lut rusts perm rusts seeks ruth little +s a distance +s expired sabra saccadic saccoon @@ -15907,6 +15914,7 @@ sects hit sects lut sects perm sects seeks +see me no seek chi seek cocktail seek commission @@ -19683,6 +19691,7 @@ wise assumpt wise assurance wise assure wisecracker +wish i t wisht wiskinky wiss cumulative diff --git a/src/test_negative.txt b/src/test_negative.txt index bb7da0b..73c1bbc 100644 --- a/src/test_negative.txt +++ b/src/test_negative.txt @@ -135,4 +135,13 @@ i'll send you to hello I could say I miss you but it’s not the truth same as 5 apples 4 secs so -Porsche 911 \ No newline at end of file +Porsche 911 +'Cause there's a distance now +(It's alright, it's alright, it's alright to start from the bottom) +And I'll do my duty +my mind's expired +I wish I take it back +could see me now +hurt, it's gonna +to last? It fell +don't get it, it's my \ No newline at end of file