English
Dutch & Afrikaans
Major Search Engines did not distinguish between Dutch and Afrikaans: except for Google, they do not provide for searching only for pages in Afrikaans, and searches for pages in Dutch usually return some pages in Afrikaans as well. National domains (.nl, .be / .za) are only a rough guide to location. International domains like .com, .net, .biz, .info etc. provide no clue to the source. These lists were compiled to test various algorithms to distinguish Afrikaans from Dutch pages.
Museum Some of my ancient papers in which there has been a renewed interest
Site developed by Bill Fletcher, whose other free resources – the BNC-based online database "Phrases in English" (try proxy site PhrasesInEnglish.org if first link fails), web concordancer KWiCFinder and n-gram extractor kfNgram are already familiar to the corpus linguistics community.
Bill's legacy site miniappolis.com is hosted on the same server. It will no longer be updated, but to prevent link rot it will remain on life support for the foreseeable future.
http://webascorpus.org
launched 7 February 2007, updated 19 November 2012
Background: driftwood, Dares Beach, Maryland –
original image