- IF Archive Search Page
- searchlib/cli.py @240e47c
- Sygil-Dev/whoosh-reloaded - A fast, featureful full-text indexing and searching library implemented in pure Python
VirtualTam's bookmarks
- A summary of my bot defence systems
- Butlerian Jihad - Blog posts on the topic of fighting off spam bots, search engine spiders and other non-humans wasting the precious resources we have on Earth
- EmacsWiki's robots.txt
2025-04-09 Web page archive formats:
Tools for crawling, scraping and archiving Web pages:
- internetarchive/heritrix3 - Extensible, web-scale, archival-quality web crawler project (Java)
- internetarchive/Zeno - State-of-the-art web crawler (Go)
- internetarchive/gowarc - Read and write WARC files in Go
- webrecorder/pywb - Web Archiving Toolkit for replay and recording of web archives (Python)
Self-hosted solutions:
- ArchiveBox - A self-hosted app that lets you preserve content from websites in a variety of formats
- Wallabag - Save and classify articles, read them later
2025-02-19 - FAQ
- Zoekt Query Language Guide
- isker/neogrok - A frontend for Zoekt
- Query Language Guide by Neogrok
2024-12-17 -
2024-12-03 -
- From Map to Reality: Using PostGIS in Warfare (PDF)
- NASA Earthdata - Full and open access to NASA’s collection of Earth science data for understanding and protecting our home planet
- Visible Infrared Imaging Radiometer Suite (VIIRS) Active Fire detection
- PostgreSQL - GiST - Generalized Search Tree
- PostgreSQL - SP-GiST - Space-partitioned GiST
- H3 - Hexagonal hierarchical geospatial indexing system
- zachasme/h3-pg - PostgreSQL bindings for H3
2023-07-14 - rust-lang/regex
- google/re2
- BurntSushi/rebar - A biased barometer for gauging the relative speed of some regex engines on a curated set of tasks
- Resources about implementing regular expression search efficiently, Russ Cox
-
- Postgres Full Text Search vs the rest (MeiliSearch, OpenSearch/Elasticsearch, SQLite FTS, Typesense)
- Optimizing Full Text Search with Postgres tsvector Columns and Triggers
- Word separators for Postgres full text search with Rails
- Using PostgreSQL Full Text Search With Golang
- PostgreSQL: Full Text Search
- PostgreSQL: Preferred Index Types for Text Search
-
- UTM Parameters
- Google Ads - Conversion Tracking
- Websites use URLs to track you. Here’s how to stop them.
- Firefox’s privacy crusade now targets a key form of tracking
- ClearURLs - An addon to remove tracking elements from URLs to help protect your privacy
- iOS 17 automatically removes tracking parameters from links you click on