VirtualTam's bookmarks
6 bookmarks found
Page 1 of 1
-
Text normalization in Go
2024-11-28 Standard library:
- unicode - Data and functions to test some properties of Unicode code points
Extra libraries:
- golang.org/x/text - Supplementary Go packages for text processing, many involving Unicode
- golang.org/x/text/runes - Tansforms for UTF-8 encoded text
- golang.org/x/text/transform - Reader and writer wrappers that transform the bytes passing through
- golang.org/x/text/unicode/norm - Types and functions for normalizing Unicode strings
- golang.org/x/text/unicode - Building blocks for other packages in golang.org/x/text
Third-party libraries:
- anyascii/go - Converts Unicode characters to their best ASCII representation
RFCs:
- RFC 8264 - PRECIS Framework: Preparation, Enforcement, and Comparison of Internationalized Strings in Application Protocols
- RFC 8265 - PRECIS for Usernames and Passwords
- RFC 8266 - PRECIS for Nicknames
Resources:
-
- https://www.elastic.co/guide/en/elasticsearch/guide/current/language-intro.html
- https://www.elastic.co/guide/en/elasticsearch/guide/current/identifying-words.html
- https://www.elastic.co/guide/en/elasticsearch/guide/current/token-normalization.html
- https://www.elastic.co/guide/en/elasticsearch/guide/current/stemming.html
- https://www.elastic.co/guide/en/elasticsearch/guide/current/stopwords.html
- https://www.elastic.co/guide/en/elasticsearch/guide/current/synonyms.html
- https://www.elastic.co/guide/en/elasticsearch/guide/current/fuzzy-matching.html
-
Music Theory for Musicians and Normal People
2016-09-20 -
xkcd: Keyboard Mash
2015-05-28 I'm a normal human, typing with my human hands
-
DRY Principle: Don't Repeat Yourself
2014-12-17 "Every piece of knowledge must have a single, unambiguous, authoritative representation within a system."
The opposite is WET:
We Edit Terribly, Tumultuously, Tempestuously, Tenaciously, Too much, Timidly, Tortuously, Terrifiedly...
> I think WET also stands for "We Enjoy Typing" // DuncanBayne
Page 1 of 1