geary/sql
Michael Gratton 7e38198287 ImapDb.Database: Register new ICU-based tokeniser for FTS
The SQLite tokeniser does not deal with scripts that do not use spaces
for word breaking (CJK, Thai, etc), thus searching in those languages
does not work well.

This adds a custom SQLite tokeniser based on ICU that breaks words for
all languages supported by that library, and uses NFKC_Casefold
normalisation to handle normalisation, case folding, and dropping of
ignorable characters.

Fixes #121
2021-01-19 20:48:59 +11:00
..
meson.build engine: Convert from SQLite FTS3/4 to FTS5 for full-text-search 2021-01-19 20:48:17 +11:00
version-001.sql
version-002.sql
version-003.sql
version-004.sql
version-005.sql Remove existing contact harvesting mechanism 2019-06-13 16:28:59 +10:00
version-006.sql
version-007.sql
version-008.sql
version-009.sql
version-010.sql
version-011.sql engine: Remove in-tree unicodesn stemmer 2020-10-14 20:05:15 +11:00
version-012.sql
version-013.sql
version-014.sql
version-015.sql
version-016.sql
version-017.sql
version-018.sql
version-019.sql
version-020.sql
version-021.sql
version-022.sql
version-023.sql
version-024.sql
version-025.sql
version-026.sql Store last cleanup time in GarbageCollectionTable 2020-01-21 17:57:33 +01:00
version-027.sql sql: Add migration for bad message-id data in the database 2020-08-13 15:50:07 +10:00
version-028.sql sql: Add db migration to clean up message_ids with extra whitespace 2020-08-23 12:58:17 +10:00
version-029.sql engine: Remove in-tree unicodesn stemmer 2020-10-14 20:05:15 +11:00
version-030.sql ImapDb.Database: Register new ICU-based tokeniser for FTS 2021-01-19 20:48:59 +11:00