VirtualTam's bookmarks
240 bookmarks found
-
- Unicode text segmentation - https://unicode.org/reports/tr29/
- Unicode emoji - https://unicode.org/reports/tr51/#Searching
- https://jolicode.com/blog/search-for-emoji-with-elasticsearch
-
95 CSS Text Effects
2018-02-22 -
"The font that replaces every buzzword by a Comic Sans-styled censorship bar"
-
Portmanteau algorithms
2017-11-12 - https://en.wikipedia.org/wiki/Portmanteau
- https://en.wikipedia.org/wiki/List_of_portmanteaus
- https://www-users.cs.umn.edu/~kluve018/portmanteau1.html
- http://pythonexample.com/snippet/python/portmanteaupy_rfong_python
- https://github.com/jamcowl/PORTMANTEAU-BOT
- https://gist.github.com/aparrish/5416755
-
VIM Adventures
2017-10-06 -
Home Page for 20 Newsgroups Data Set
2017-09-30 -
-
http://dataconomy.com/2016/12/use-elasticsearch-nlp-text-mining%e2%80%8a-%e2%80%8apart-1/
-
http://dataconomy.com/2017/05/use-elasticsearch-nlp-text-mining-part-2/
-
https://www.elastic.co/blog/text-classification-made-easy-with-elasticsearch
-
https://www.elastic.co/guide/en/elasticsearch/guide/current/languages.html
-
https://www.elastic.co/guide/en/elasticsearch/plugins/current/ingest-attachment.html
-
https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-lang-analyzer.html
-
https://lingpipe-blog.com/2014/04/10/lucene4-document-classification/
-
https://medium.com/@textminers/how-to-use-elasticsearch-for-textmining-part-1-5589e76301d5
-
-
- https://stackoverflow.com/questions/381806/large-public-datasets
- https://aws.amazon.com/datasets/
- https://archive.ics.uci.edu/ml/datasets.html
- https://datasource.kapsarc.org/pages/home/
- https://www.kaggle.com/datasets
- https://www.reddit.com/r/datasets/
- https://zenodo.org/
- https://toolbox.google.com/datasetsearch