Skip to content

Commit

Permalink
Finalize integration of numbers to words
Browse files Browse the repository at this point in the history
  • Loading branch information
ZDisket committed Apr 26, 2021
1 parent 40f7a0a commit aa3764a
Showing 1 changed file with 5 additions and 3 deletions.
8 changes: 5 additions & 3 deletions TextTokenizer.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -4,15 +4,17 @@
#include <cassert>
#include <cctype>
#include <iostream>
const std::vector<std::string> first14 = { "zero", "one", "two", "three", "four", "five", "six", "seven", "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen" };
const std::vector<std::string> prefixes = { "twen", "thir", "for", "fif", "six", "seven", "eigh", "nine" };
#include <algorithm>




// Punctuation, this gets auto-converted to SIL
const std::u32string punctuation_f = U",.-;";

// For Tacotron2, including question and other marks
const std::u32string punctuation_tac = U",.;¡!¿?:";
#include <algorithm>



using namespace std;
Expand Down

0 comments on commit aa3764a

Please sign in to comment.