Jump to content

MediaWiki talk:Spam-blacklist/archives/March 2017

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

openborders.info[edit]

Vipul openly acknowledges paying for edits (overtly and covertly). Most links to this site seem to have been added by Vipul, the site owner, and he makes reference to SEO. This one's obvious as the site is not a WP:RS, so I'm posting here for logging purposes and about to blacklist. Guy (Help!) 12:30, 6 March 2017 (UTC)[reply]

@JzG/help: plus Added to MediaWiki:Spam-blacklist. --Guy (Help!) 12:31, 6 March 2017 (UTC)[reply]

Wikipedia mirrors through google books[edit]

Starting a discussion here, after Wikipedia:Village pump (technical)#Restricting Google book links from a specific publisher discussion (permanent link [1]).

Pinging editors involved in those discussions: @Ugog Nizdast, Someguy1221, and Utcursch:. Is anyone of you aware of editors who have a more than healthy tendency (i.e. spam) to add these, or do they tend to be one-off good-faith additions?

The following books are published on Google books, but contain unattributed material from Wikipedia (see Wikipedia:Potentially unreliable sources/Books that plagiarize Wikipedia for specific examples of copies). Referencing these is hence resulting in circular references, and the material is therefore hence not verifiable.

Specific links:

  • books.google.co.in/books?id=mY8X_vlVThAC&lpg=PA48&ots=ol2ZKxPMBA&dq=%22remains%20to%20be%20investigated%20whether%20its%20findings%22&pg=PA47#v=onepage&q=%22remains%20to%20be%20investigated%20whether%20its%20findings%22&f=false
  • books.google.co.in/books?id=ZsswQ9oTa0wC&lpg=PA3&dq=%22was%20the%20principal%20artist%20of%20the%20Bengal%20school%22&pg=PA3#v=onepage&q=%22was%20the%20principal%20artist%20of%20the%20Bengal%20school%22&f=false
  • books.google.co.in/books?id=zzbuJOC11BoC&lpg=PA129&pg=PA129#v=onepage&q&f=true
  • books.google.co.in/books?id=fu84nNqH-o4C&lpg=PA10&vq=Cycle%20of%20Samsara&dq=bibliogroup%3A%22Indian%20religions%20series%22&pg=PA99#v=snippet&q=Mahaparinibbana%20Sutta&f=false
  • books.google.co.in/books?id=aGiwrLKJ8mkC&lpg=PA10&vq=shaping%20of%20Western%20civilization&dq=bibliogroup%3A%22Indian%20religions%20series%22&pg=PA5#v=snippet&q=self-sufficient%20revelation&f=false
  • books.google.co.in/books?id=J7AIg5bVFCYC&lpg=PA293&vq=symbolism%20and%20iconography&dq=bibliogroup%3A%22Indian%20religions%20series%22&pg=PA77#v=snippet&q=chaitnya&f=false
  • books.google.com/books?id=gMiQMWGhoScC&pg=PA163
  • books.google.co.in/books?id=GzyLjkqbVGQC&lpg=PP1&vq=%22scholarship%20and%20have%20the%20highest%20degree%22&dq=bibliogroup%3A%22Indian%20religions%20series%22&pg=PA8#v=snippet&q=Utsarpinis%20&f=false
  • books.google.co.in/books?id=uhtzeompVAUC&lpg=PP1&dq=bibliogroup%3A%22Indian%20religions%20series%22&pg=PA60#v=onepage&q=mathematical%20%20cobra&f=false
  • books.google.co.in/books?id=_rPHSdXk1GEC&lpg=PP1&dq=bibliogroup%3A%22Indian%20religions%20series%22&pg=PA111#v=snippet&q=druj&f=false
  • books.google.co.in/books?id=vRwS6FmS2g0C&lpg=PA175&dq=%22The%20rise%20of%20Khalsa%20dominance%20in%20the%20Sikh%22&pg=PA175#v=onepage&q=%22The%20rise%20of%20Khalsa%20dominance%20in%20the%20Sikh%22&f=false
  • books.google.co.in/books?id=BCqYE087Yt8C&lpg=PP1&pg=PA17#v=onepage&q=According%20to%20the%20agenda-setting%20theory&f=false
  • books.google.ca/books?id=rFW8AgAAQBAJ&pg=PT115
  • books.google.ca/books?id=hg3qAgAAQBAJ&pg=PT48
  • books.google.com/books?id=0A3qAgAAQBAJ&pg=PT47
  • books.google.com/books?id=fms4CgAAQBAJ&pg=PT72
  • books.google.ca/books?id=XAT4AgAAQBAJ&pg=PT166
  • books.google.ca/books?id=alllAwAAQBAJ&pg=PT104
  • books.google.com/books?id=YLhBAgAAQBAJ&pg=PA281
  • books.google.ca/books?id=zU-xAAAAQBAJ&pg=PA655
  • books.google.co.in/books?id=fM33afZnVy8C&lpg=PA153&dq=%22marketers%20see%20advertising%20as%20part%20of%20an%20overall%20promotional%20strategy%22&pg=PA153#v=onepage&q=%22marketers%20see%20advertising%20as%20part%20of%20an%20overall%20promotional%20strategy%22&f=false
  • books.google.co.in/books?id=LSnewoqa5UwC&lpg=PT18&ots=4fA3Ixsidr&dq=%22remains%20to%20be%20investigated%20whether%20its%20findings%22&pg=PT20#v=onepage&q=%22remains%20to%20be%20investigated%20whether%20its%20findings%22&f=false
  • books.google.co.in/books?id=ZzjDzqtNgoQC&lpg=PT133&dq=Chanakya%20trautmann%20parvata&pg=PT133#v=onepage&q=Chanakya%20trautmann%20parvata&f=false
  • books.google.ca/books?id=ngCqCQAAQBAJ&lpg=PA139&pg=PA139
  • books.google.com/books?id=uf2pCQAAQBAJ

The regexes need to be crafted from 'books.google', followed by an 'id=<code>' to blacklist each specific book. --Dirk Beetstra T C 03:32, 20 February 2017 (UTC)[reply]

Note that the URL can be books.google.* depending on the country (e.g. .com for the US, .ca for Canada, .co.in for India, .co.uk for the UK etc.)
Here are the book IDs:
  • alllAwAAQBAJ
  • hg3qAgAAQBAJ
  • ngCqCQAAQBAJ
  • rFW8AgAAQBAJ
  • XAT4AgAAQBAJ
  • zU-xAAAAQBAJ
  • _rPHSdXk1GEC
  • aGiwrLKJ8mkC
  • BCqYE087Yt8C
  • fM33afZnVy8C
  • fu84nNqH-o4C
  • GzyLjkqbVGQC
  • J7AIg5bVFCYC
  • LSnewoqa5UwC
  • mY8X_vlVThAC
  • uhtzeompVAUC
  • vRwS6FmS2g0C
  • ZsswQ9oTa0wC
  • zzbuJOC11BoC
  • ZzjDzqtNgoQC
  • 0A3qAgAAQBAJ
  • fms4CgAAQBAJ
  • gMiQMWGhoScC
  • uf2pCQAAQBAJ
  • uf2pCQAAQBAJ
  • YLhBAgAAQBAJ
utcursch | talk 03:38, 20 February 2017 (UTC)[reply]
  • Beetstra Per your note at the pump, the two specific examples I picked up from WP:PUS are: id# VY1nTMBQ9vQC and id# zzbuJOC11BoC. The second one is included above but the first isn't. The first has been published many times by Gyan (and other publishers) and has multiple IDs I think. —SpacemanSpiff 13:29, 22 February 2017 (UTC)[reply]

List of IDs[edit]

  • Regex requested to be blacklisted: \bbooks.google.*?id\=alllAwAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=hg3qAgAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=ngCqCQAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=rFW8AgAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=XAT4AgAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=zU-xAAAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=_rPHSdXk1GEC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=aGiwrLKJ8mkC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=BCqYE087Yt8C\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=fM33afZnVy8C\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=fu84nNqH-o4C\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=GzyLjkqbVGQC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=J7AIg5bVFCYC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=LSnewoqa5UwC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=mY8X_vlVThAC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=uhtzeompVAUC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=vRwS6FmS2g0C\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=ZsswQ9oTa0wC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=zzbuJOC11BoC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=ZzjDzqtNgoQC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=0A3qAgAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=fms4CgAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=gMiQMWGhoScC\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=uf2pCQAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=uf2pCQAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=YLhBAgAAQBAJ\b
  • Regex requested to be blacklisted: \bbooks.google.*?id\=VY1nTMBQ9vQC\b
plus Added to MediaWiki:Spam-blacklist. --Dirk Beetstra T C 13:06, 7 March 2017 (UTC)[reply]
I have pulled the trigger on this subset. If more arise, please report them in a new thread, or as a new subthread in this thread. --Dirk Beetstra T C 13:07, 7 March 2017 (UTC)[reply]

immihelp.com, visajourney.com, k1-visa.net, rapidvisa.com[edit]

WP:REFSPAM for immigration-related websites. Guy (Help!) 19:06, 10 March 2017 (UTC)[reply]

plus Added to MediaWiki:Spam-blacklist. --Guy (Help!) 19:07, 10 March 2017 (UTC)[reply]

dcgpac.com[edit]

dcgpac.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

plus Added MER-C 02:42, 11 March 2017 (UTC)[reply]

Immigration law refspam round 3[edit]

List of domains

All appear to have been added by Vipul (talk · contribs · deleted contribs · logs · filter log · block user · block log), Riceissa (talk · contribs · deleted contribs · logs · filter log · block user · block log) or other paid surrogates of Vipul, most are selling legal services related to immigration, and the overall conclusion is SEO. All valid content can be drawn from more authoritative sources such as law books, pages in law faculty websites, official government sources etc. Guy (Help!) 23:24, 10 March 2017 (UTC)[reply]

plus Added to MediaWiki:Spam-blacklist. --Guy (Help!) 23:26, 10 March 2017 (UTC)[reply]
I think econlib.org needs to be removed. This is a legitimate, relatively prominent Economics blog where relatively prominent economists(Sumner, Bryan Caplan) discuss current issues in econ. Dark567 (talk) 01:31, 11 March 2017 (UTC)[reply]
It's being spammed by user:Vipul, but I will; look into it. Guy (Help!) 10:44, 11 March 2017 (UTC)[reply]

Immigration law spam round 4[edit]

Some owned by the same people as the ones above. Guy (Help!) 10:44, 11 March 2017 (UTC)[reply]

plus Added to MediaWiki:Spam-blacklist. --Guy (Help!) 10:44, 11 March 2017 (UTC)[reply]

backupify.com[edit]

plus Added to MediaWiki:Spam-blacklist. --Guy (Help!) 09:45, 12 March 2017 (UTC)[reply]

Predatory journals[edit]

I have been watching these predatory open access publishers for some months. New links keep being added, very often by users whose identities strongly suggest they are the authors of the papers. I think it is time to blacklist these ones due to repeated additions over an extended period, even though at east some of these additions are in good faith. Guy (Help!) 09:23, 13 March 2017 (UTC)[reply]

Archive-org.com[edit]

archive-org.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

This looks like a scam. For a time they offered free website archiving, then later added a paywall so you can't access the content without donating money. This is nonsense, we have many free and open source archive sites such as archive.org and dozens more .. notice they used almost the same name to confuse people. The article Floorball contains an example link archive-org.com/page/4261749/2014-07-11/http://www.freeway.org/issue4/sports/floorball.htm. Please ping me if you need further input. -- GreenC 18:23, 11 March 2017 (UTC)[reply]

A connected site is www.inarchive.com/ which lists dozens of domain names eg.archive-se.com, archive.by.com etc.. they are not all paywalled so I don't know what to make of it.

-- GreenC 20:47, 11 March 2017 (UTC)[reply]

  • Can you assemble a complete list here please? Also any users who may be spamming it? It may also be an issue for the meta blacklist as these could be used to bypass other blacklists. Guy (Help!) 09:24, 13 March 2017 (UTC)[reply]
  • @Green Cardamom: I agree with JzG, can you please compile a full list? I'd like to review the reports for more of them to see patterns. (I have disabled the links in this thread, in case we decide to blacklist).
  • @JzG: Spamming here does not necessarily need to be the 'abuse'/misuse we try to block (though we would likely pull the trigger earlier). As the domains are similar to known genuine archiving sites, I guess many of these links get added in good faith. Meta-blacklisting may be prudent, especially if there is spamming. --Dirk Beetstra T C 11:34, 13 March 2017 (UTC)[reply]
  • Can you assemble a complete list here please? --User:Green Cardamom. There are 73 domains:
Extended content
  • web-archive-uk.com
  • archive-se.com
  • archive-fr.com
  • archive-nl.com
  • archive-dk.com
  • archive-ie.com
  • web-archive-it.com
  • archive-be.com
  • archive-es.com
  • archive-at.com
  • archive-no.com
  • archive-us.com
  • archive-ca.com
  • archive-au.com
  • archive-de.com
  • archive-cz.com
  • archive-hu.com
  • archive-fi.com
  • archive-ch.com
  • archive-pl.com
  • archive-sk.com
  • archive-ro.com
  • archive-ua.com
  • web-archive-pt.com
  • archive-nz.com
  • archive-si.com
  • archive-lt.com
  • web-archive-bg.com
  • archive-lv.com
  • archive-eu.com
  • archive-com.com
  • web-archive-net.com
  • archive-org.com
  • archive-biz.com
  • archive-edu.com
  • archive-co.com
  • archive-info.com
  • web-archive-me.com
  • archive-tv.com
  • archive-mobi.com
  • archive-name.com
  • archive-am.com
  • archive-br.com
  • archive-pe.com
  • archive-cc.com
  • archive-ee.com
  • archive-fm.com
  • web-archive-ar.com
  • archive-mx.com
  • archive-hr.com
  • archive-za.com
  • archive-ht.com
  • archive-py.com
  • archive-hn.com
  • archive-ng.com
  • archive-gr.com
  • archive-cl.com
  • archive-do.com
  • archive-cr.com
  • archive-uy.com
  • archive-sv.com
  • archive-cu.com
  • archive-pr.com
  • archive-rs.com
  • archive-sg.com
  • archive-ph.com
  • archive-in.com
  • archive-by.com
  • archive-ve.com
  • archive-id.com
  • archive-tr.com
  • archive-kz.com
  • archive-ru.com
  • Also any users who may be spamming it? I searched the entire enwiki |archiveurl= fields for these domains and found only a few instances in these articles:
Extended content
  • Bozhin
  • Enlargement
  • Floorball
  • Henry
  • Ken
  • Lyu
  • Madonna
  • Maximilian
  • Nikanor
  • Poland
  • Records
  • Rehman
  • Russell
  • Shawn

Search "archive-" inside the article or regex archive[-][^.]*[.]com. -- GreenC 14:47, 13 March 2017 (UTC)[reply]

attukaldevi.com[edit]

attukaldevi.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

Attukaldevi.com is block listed, the site is more informative of Attukal Devi temple. I request you to visit this site and remove from spam list. — Preceding unsigned comment added by 106.51.20.162 (talk) 09:58, February 17, 2017‎

no Declined,  Defer to Whitelist for specific links on this domain. --Dirk Beetstra T C 08:12, 16 March 2017 (UTC)[reply]

Ongoing efforts over months to years to add this link. Doc James (talk · contribs · email) 17:50, 17 March 2017 (UTC)[reply]

Concerns were previously raised[8]. So therefore added it. Doc James (talk · contribs · email) 17:58, 17 March 2017 (UTC)[reply]

revitalisecosmetics.com.au[edit]

Being spammed into a bunch of articles in replacement of valid refs; see Special:Contributions/Moefry1. Zero value as a reference for any encyclopedic content. Jytdog (talk) 04:48, 21 March 2017 (UTC)[reply]
@Jytdog: no Declined for now, user is blocked. However, if this is now continued on other accounts, I will blacklist this immediately. --Dirk Beetstra T C 06:02, 21 March 2017 (UTC)[reply]

specificationtech.com[edit]

specificationtech.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

Also:
plus Added MER-C 03:11, 25 March 2017 (UTC)[reply]

womanitely.com[edit]

womanitely.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com

useful blog I think it shouldn't be on the blacklist. Many writers from the US contribute to it. Catalina520 (talk) 14:14, 23 March 2017 (UTC)[reply]

Note that this is blocked on the m:Spam blacklist and user has already raised it over there. Ravensfire (talk) 18:01, 24 March 2017 (UTC)[reply]
@Catalina520: not blacklisted locally,  Defer to Global blacklist to request global removal (where you already raised it), or  Defer to Whitelist to ask for local whitelisting of specific links. Note that this blog was mainly spammed by several ranges of IPs over a significant timespan, and the few (3?) genuine additions where all questionable in use as references- blogs do generally not pass our thresholds for reliable sourcing. I think this is rightfully blacklisted, and would suggest that you first show merit for specific links on specific pages through whitelisting. --Dirk Beetstra T C 11:21, 25 March 2017 (UTC)[reply]

arabic-keyboard.info[edit]

arabic-keyboard.info: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:frSpamcheckMER-C X-wikigs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot-Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: searchmeta • Domain: domaintoolsAboutUs.com
2604:2000:71C2:0:5D33:3CF9:BA1B:12E9 (talk • contribs • deleted contribs • blacklist hits • AbuseLog • what links to user page • COIBot • Spamcheck • count • block log • x-wiki • Edit filter search • WHOIS • robtex.com • Google) diff #1, diff #2
2604:2000:71C2:0:D42A:4F01:BAA2:F7EF (talk • contribs • deleted contribs • blacklist hits • AbuseLog • what links to user page • COIBot • Spamcheck • count • block log • x-wiki • Edit filter search • WHOIS • robtex.com • Google) diff #3
2604:2000:71C2:0:F530:80EC:DB07:9C1F (talk • contribs • deleted contribs • blacklist hits • AbuseLog • what links to user page • COIBot • Spamcheck • count • block log • x-wiki • Edit filter search • WHOIS • robtex.com • Google) diff #4, diff #5

The diffs are from the past few days only, when I've kept track of them, but I know I've seen them before. - Tom | Thomas.W talk 16:44, 24 March 2017 (UTC)[reply]

plus Added MER-C 02:59, 25 March 2017 (UTC)[reply]
Note:
maimed this report, and hit the blacklist on this link. --Dirk Beetstra T C 05:30, 26 March 2017 (UTC)[reply]