Commit graph

25 commits

Author SHA1 Message Date
Helium314
d183baf401 remove some unused code and todos 2023-10-20 09:43:14 +02:00
Helium314
c6efd5a843 fix some warnings, disable "with gesture typing" text 2023-09-07 15:54:08 +02:00
Helium314
33b64ac34d move dictionaries and related script to a separate repository
https://codeberg.org/Helium314/aosp-dictionaries
2023-07-28 19:19:57 +02:00
Helium314
29c252066d allow users to select dictionaries
https://github.com/openboard-team/openboard/pull/569
https://github.com/openboard-team/openboard/pull/578
modified so dictionaries use the correct type instead of always main
2023-06-26 13:25:32 +02:00
Yeghishe Piruzyan
e46783fd9d Better support for Armenian language 2022-07-19 22:00:33 -07:00
jnnkB
503498ee89 Revert "Updated German dictionary"
This reverts commit 52c0e90902.
2021-10-13 11:38:14 +02:00
RHJihan
5d38410095 Several updates for Bengali (Bangladesh) 2021-07-13 19:21:47 +06:00
Reibl Janos Daniel
ffe8d7a0fe Add Hungarian dictionary 2021-06-13 16:52:40 +02:00
so9q
13c548c79f Added Swedish and Danish dictionaries 2021-05-09 14:28:15 +02:00
Trevor Terris
206fc18ec3 Add persistence for user history dictionary
User history dictionary was not flushing to disk on close, added that logic so (unless there's a crash) the user history should be persisted.
2021-05-06 08:55:44 +02:00
Trevor Terris
8d46e0ed25 Add bigrams to English wordlist for next word suggestion
Add bigrams to all English words, essentially "pre-loading" the word suggestions without needing to learn from user input. Words derived from the American National Corpus, https://www.anc.org/data/oanc/ .
2021-05-01 15:06:32 +02:00
Martin Sand
52c0e90902 Updated German dictionary
Signed-off-by: Martin Sand <martin.sand.dev@gmail.com>
2021-03-18 11:40:31 +01:00
dslul
d3dd6236e4 replace nl dictionary 2020-09-30 22:14:01 +02:00
Codruț Constantin Gușoi
8132aeba7d Add Romanian dictionary
With Romanian you have to take special care when handling words that
contain the "sh" and "tz" character (read it as the "sh" in "shiver"
and the "zz" in "pizza"). There are two sets of characters that look
sort of the same `ş` and `ș`, `ţ` and `ț`. If you look carefully, one
has a tail connected to the body, and the other has a comma separated
from the body of the character. The correct ones are the one with the
comma separated, not the touching tail. If in doubt, switch to a
Romanian layout and type `;` and `'`, they will give you the correct
characters to use.

The HTML codes for these characters are:
- `&#x0218;` and `&#x0219;` for `Ș` and `ș`.
- `&#x21A;` and `&#x21B;` for `Ț` and `ț`.

Reference:
https://en.wikipedia.org/wiki/S-comma
https://en.wikipedia.org/wiki/T-comma

While similar in shape, this difference will break autoc ompletion.
I've replaced all of them with the proper one.

I've also tried creating a new dictionary but ran into issues...
The list of words was downloaded from:
https://raw.githubusercontent.com/hermitdave/FrequencyWords/master/content/2018/ro/ro_full.txt

This is not a quality source, and some cleaning up was done in order to
remove some mistakes, like words containing numbers, and the `,` and `.`
characters. Words that were separated with `--` were also removed as
there is no such notation in the language.

The tools from here were used to create the dictionary:
https://github.com/remi0s/aosp-dictionary-tools

They only take the top 150,000 words, from a total of 1,154,496
effectively skipping words with less than 2 occurrences. This is OK, I
guess... although it misses a lot of valid ones. A better data source
would help with this, but it's difficult to find such data.

I guess I can come back in the future to improve this.
2020-05-30 17:29:43 +01:00
Daniele Laudani
f935623456 Merge pull request #108 from sandsturm/105_improve_bulgarian_wordlist_and_create_dictionary
Added Bulgarian dictionary and missing words of #105
2020-05-27 11:35:28 +02:00
Martin Sand
81cdc03ca6
Corrected typos in bg_wordlist
Signed-off-by: Martin Sand <martin.sand.dev@gmail.com>
2020-05-26 09:58:16 +02:00
Martin Sand
2ca6ce149a
Added Bulgarian dictionary and missing words of #105
Signed-off-by: Martin Sand <martin.sand.dev@gmail.com>
2020-05-25 00:16:46 +02:00
H2172
939fea7aec
creating Dutch dictionary 2020-05-15 15:58:04 +02:00
Martin Sand
729c54a5c8 Added Portugese (pt_PT) dictionary 2020-04-28 12:28:32 +02:00
Martin Sand
7212f57c92 Added Esperanto dictionary 2020-04-28 10:37:11 +02:00
Martin Sand
e4fc435b55 Added another 30 common words to the German dictionary 2020-03-29 22:12:14 +02:00
dslul
111be29ab4 Add greek dictionary 2020-01-30 14:07:22 +01:00
dslul
fe269aa2f6 add turkish dictionary 2020-01-27 22:30:01 +01:00
dslul
7b9f21607c Add polish dictionary 2020-01-22 21:25:30 +01:00
dslul
797ee4f5c6 first commit 2019-12-31 18:19:35 +01:00