VirtualTam's bookmarks
305 bookmarks found
-
Publicly Available Spam Filter Training Sets
2017-05-05 - http://www.paulgraham.com/spamarchives.html
- http://untroubled.org/spam/
- https://spamassassin.apache.org/old/publiccorpus/
- https://github.com/dmitrynogin/SpamAssassin
- http://archive.ics.uci.edu/ml/datasets/Spambase + http://statweb.stanford.edu/~tibs/ElemStatLearn/
- https://inclass.kaggle.com/c/adcg-ss14-challenge-02-spam-mails-detection
- https://www.kaggle.com/rtatman/fraudulent-email-corpus
- https://www.kaggle.com/veleon/ham-and-spam-dataset
- https://www.kaggle.com/beatoa/spamassassin-public-corpus
- http://www.csmining.org/index.php/spam-assassin-datasets.html
- http://artinvoice.hu/spams/
- http://www2.aueb.gr/users/ion/data/enron-spam/
- https://security.stackexchange.com/questions/223731/spamassassin-public-corpus
-
Pure Data Introduction
2017-05-03 -
deck.gl
2017-04-07 -
Intro to Apache Spark | Databricks
2017-03-08 -
Apache Hadoop
2017-03-06 - https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/SingleCluster.html
- https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/ClusterSetup.html
- https://cloud.google.com/hadoop/setting-up-a-hadoop-cluster
- https://www-ssl.intel.com/content/dam/www/public/us/en/documents/white-papers/best-practices-for-implementing-apache-hadoop-paper.pdf
- https://www-ssl.intel.com/content/dam/doc/white-paper/cloud-computing-optimizing-hadoop-deployments-paper.pdf
-
Map Lab | WIRED
2017-02-18