MediaWiki talk:Captcha-addurl-whitelist/Archive 1

Latest comment: 9 years ago by Mr. Stradivarius in topic Update to link
Archive 1

Please use the whitelist

Could somebody please set up this whitelist to help with the related problems we've been having with CAPTCHAs recently? The related discussions are at Wikipedia:Village pump (policy)#New CAPTCHA for unregistered users are too frequent and too hard and also T45546.

Edit notice

This page should probably have an edit notice at Template:Editnotices/Page/MediaWiki talk:Captcha-addurl-whitelist so that people what it's for.

Support for use

The support for using this whitelist can be found at Wikipedia:Village pump (idea lab)/Archive 8#CAPTCHA for IP users adding extenal links. It was a limited discussion but had strong support from those joining in the discussion, including support from a well-respected admin / Edit Filter Manager familiar with the subject.

Purpose

This whitelist allows URLs to be added by unregistered users without the need for entering a CAPTCHA. Its purpose is to allow the addition of so-called "good links" that the project would like added, such as references to reliable sources like {{cite doi}} and maintenance templates using helpful external links such as {{AFC submission/draftnew}}.

Usage

This whitelist uses regular expressions, therefore any admin editing it should be familiar with them. Documentation and notes on usage may be found at mediawikiwiki:Extension:ConfirmEdit#URL and IP whitelists.

Suggestion

Below is a suggestion for the initial set up. This should be checked by an admin familiar with regular expressions to verify it does what is needed.

# -----------------------------------------------------------------------------
# PLEASE DO NOT MODIFY THIS LIST UNLESS YOU HAVE A BASIC UNDERSTANDING OF REGEX
# -----------------------------------------------------------------------------
# 
# 
# This MediaWiki page is used by Extension:ConfirmEdit. More details, 
# including usage, may be found at [[mediawikiwiki:Extension:ConfirmEdit]]
# 
# The Sites and URLs contained in this list may be entered by unregistered 
# users without having to enter a CAPTCHA. These may be known good sites 
# such as Wikipedia or other sites under the WMF umbrella. They may be 
# sites used for referencing articles such as those used in {{tl|cite doi}}.
# They may also be sites used in maintenance templates such as those used 
# in {{tl|AFC submission/draftnew}}.
# 
# 
# Syntax is as follows: 
#   * Everything from a "#" character to the end of the line is a comment
#   * Every non-blank line is a regex fragment which will only match hosts inside URLs
# 
#  
# -----------------------------------------------------------------------------
#       ---==[ Sites related to the Wikimedia Foundation & related ]==---
# -----------------------------------------------------------------------------

\.wikipedia\.org        # All language versions of Wikipedia
\.wikimedia\.org        # All Wkimedia related sites
\.toolserver\.org       # All toolserver related sites

# -----------------------------------------------------------------------------
#                  ---==[ Sites related to referencing ]==---
# -----------------------------------------------------------------------------

jstor\.org            # jstor.org  used when referencing academic journals and in Template:Find sources
worldcat\.org         # worldcat.org used for referencing in Template:OCLC, Template:ISSN and others
scholar\.google\.com  # Google search of academic papers used for referencing and in Template:Find sources
\.doi\.org            # doi.org digital object identifier (DOI) used for referencing and in Template:Cite doi
nytimes\.com          # The New York Times used for references

# -----------------------------------------------------------------------------
#               ---==[ Sites used in maintenance templates ]==---
# -----------------------------------------------------------------------------
More suggestions

We should probably include some Google links also, but this may be somewhat controversial because of the possibility of redirect spam using google.com/url?. See meta:Talk:Spam blacklist/Archives/2012-08#Google redirect spam for more details. Perhaps just links to reliable sources, like the ones below or something similar

Google books search
google.com along with tbs=bks

Google news search
google.com along with tbm=nws

Both of these are used extensively for referencing and in multiple templates such as {{find}}. There are a number of other "good" URLs used throughout the project that are included in citation templates. Most of these are listed at {{Citation Style 1}} if anybody wants to help find the "good" ones. Thanks for the help. 64.40.54.93 (talk) 13:02, 4 January 2013 (UTC)

Redundant listings

Note that the first three entries in the current version are not necessary, as domains matching

^(https?:)?//([.a-z0-9-]+\\.)?((wikimedia|wikipedia|wiktionary|wikiquote|wikibooks|wikisource|wikispecies|mediawiki|wikimediafoundation|wikinews|wikiversity)\.org|dnsstuff\.com|completewhois\.com|wikimedia\.de|toolserver\.org)(/|$)

are already whitelisted by the $wgCaptchaWhitelist configuration setting. OTOH, it doesn't hurt much to have them in here too. Anomie 02:44, 5 January 2013 (UTC)

PubMed etc.

Can an admin add the following in the correct syntax:

  • nih.gov: Contains PubMed, PubChem and other subdomains used in journal reference templates, infoboxes, and other uses

This domain will likely never be used by spam bots, but is used frequently by good-faith editors. 109.79.82.55 (talk) 22:09, 5 May 2014 (UTC)

  Done There weren't any objections within 24 hours when I advertised this request at VPT, so I've enacted it. — Mr. Stradivarius ♪ talk ♪ 09:45, 7 May 2014 (UTC)
Thanks Mr. Stradivarius. 93.107.213.48 (talk) 11:25, 7 May 2014 (UTC)

Given that the Toolserver no longer exists, its regex entry should probably be changed to be about tools.wmflabs.org, the current address (not sure how to format the regex). Also, "Wkimedia" is misspelled; it should be "Wikimedia". Jo-Jo Eumerus (talk) 11:26, 23 June 2015 (UTC)

Mr. Stradivarius - you edited this last: would you mind tending to this request as I'm not sure how it all works? — Martin (MSGJ · talk) 20:00, 23 June 2015 (UTC)
  DoneMr. Stradivarius ♪ talk ♪ 23:22, 23 June 2015 (UTC)