Shortcut: COM:BRFA

If you want to run a bot on Commons, you must get permission first. To do so, file a request following the instructions below.

Please read Commons:Bots before making a request for bot permission.

Requests made on this page are automatically transcluded in Commons:Requests and votes for wider comment.

Requests for permission to run a bot

Before making a bot request, please read the new version of the Commons:Bots page. Read Commons:Bots#Information on bots and make sure you have added the required details to the bot's page. A good example can be found here.

When complete, pages listed here should be archived to Commons:Bots/Archive.

Any user may comment on the merits of the request to run a bot. Please give reasons, as that makes it easier for the closing bureaucrat. Read Commons:Bots before commenting.

Embedded Data Bot (talk · contribs) (adminbot)

Operator: Zhuyifei1999

Bot's tasks for which permission is being sought: Because human admins have real lives and can't monitor the bot's speedy deletion requests every single hour, this bot request for approval is for an extension to the original task: to delete such files on sight according to COM:CSD#F9 if and only if all of the following conditions are satisfied (to reduce false positives to a minimum):

  • The file ending is determined precisely with a parser according to the format specifications, i.e. not with a remuxer or converter like ffmpeg or jpegtran.
  • The MIME detection of the embedded part returns any of "application/x-rar", "application/zip", "application/x-7z-compressed", or another archival format (if abuse is found). The list is subject to change if the MIME types themselves change (e.g. if "application/x-rar" is renamed to "application/rar" in the future, this list changes as well).
  • The file has a hit on Special:AbuseFilter/166, checked via API action=query&list=abuselog&aflfilter=166&afltitle=.... (If the false negative rate gets too high because of this condition, I'll file another BRFA to remove it.)
  • The file has only one entry in its upload history. This is to prevent anyone from making the bot delete an arbitrary file just by overwriting it with a file subject to deletion.
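
For illustration only, the four conditions above can be read as a single all-or-nothing check. The following is a minimal sketch, not the bot's actual code: the names FileFindings, may_delete and abuselog_query_params are hypothetical, the findings are assumed to have been gathered elsewhere (format parser, MIME detection, AbuseLog lookup, upload history), and format=json is an added assumption on top of the API parameters quoted in the request.

```python
from dataclasses import dataclass

# MIME types named in the request; the operator notes this list may change.
ARCHIVE_MIME_TYPES = frozenset({
    "application/x-rar",
    "application/zip",
    "application/x-7z-compressed",
})

ABUSE_FILTER_ID = 166  # Special:AbuseFilter/166, as cited in the request


@dataclass(frozen=True)
class FileFindings:
    """Hypothetical container for what has been learned about one file."""
    end_found_by_format_parser: bool  # file ending located by a spec-based parser
    embedded_mime: str                # MIME type detected for the embedded part
    abuse_filter_hit: bool            # the file title has a hit on filter 166
    upload_history_entries: int       # number of versions in the upload history


def abuselog_query_params(title: str) -> dict:
    """Build the API parameters quoted in the request for one file title."""
    return {
        "action": "query",
        "list": "abuselog",
        "aflfilter": str(ABUSE_FILTER_ID),
        "afltitle": title,
        "format": "json",  # assumption: JSON output is wanted
    }


def may_delete(findings: FileFindings) -> bool:
    """Return True only if every one of the four conditions holds."""
    return (
        findings.end_found_by_format_parser
        and findings.embedded_mime in ARCHIVE_MIME_TYPES
        and findings.abuse_filter_hit
        and findings.upload_history_entries == 1
    )
```

Under this reading, a file that has been overwritten even once (two upload history entries) is never deleted automatically, whatever the other findings are.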

Automatic or manually assisted: Automatic

Edit type (e.g. Continuous, daily, one time run): Continuous

Maximum edit rate (e.g. edits per minute): 6 per minute

Bot flag requested: (Y/N): N

Programming language(s): python: pywikibot

Zhuyifei1999 (talk) 09:56, 14 January 2017 (UTC)

Discussion

  • FWIW, I've granted @Steinsplitter: access to the bot on tool labs. --Zhuyifei1999 (talk) 09:56, 14 January 2017 (UTC)
  •   Support Zhuyifei1999 is doing valuable work with this task, and the admin request is reasonable. --Krd 11:34, 14 January 2017 (UTC)
  • I don't have objections, but it would be a good idea to hear opinions about the error rate from administrators who have deleted such files in the past. --EugeneZelenko (talk) 15:01, 14 January 2017 (UTC)
    Well, the conditions laid out above are much stricter than those used to actually add {{embedded data}} tags, and some false positives are visible in Special:Contributions/Embedded_Data_Bot. Anyways, pinging @Jdx, Didym, Ronhjones, Srittau, Hedwig in Washington:@Josve05a, Herbythyme: (sorry if I missed anyone) who did some related deletions, @Ninjastrikers: who speaks their language and actively monitors the abuse, and @Revent: whom I talked to about this on IRC. --Zhuyifei1999 (talk) 16:11, 14 January 2017 (UTC)
  •   Support Yes please. The sooner the better. Noted that there have been 35 embedded files in the last 24h. I know Telenor is bringing in the Fair Use Policy next week, and this might slow down the attacks (as the Telenor users will not be able to download anything over 150MB per day), but it's probably only a matter of time before another loophole is found. I suspect filter 166 could be tweaked to improve its function: all recent files have been the first download of a new user, and I think 166 allows accounts up to 7 days old, which could be reduced. Ronhjones  (Talk) 16:29, 14 January 2017 (UTC)
    Filter 166 is currently set to match uploads made within 7 days of account creation by users with an edit count below 50 prior to the upload. This time range is obviously meant to prevent sleeper accounts. As for restricting the condition to the first upload (the threshold is currently 50), surely you don't want them to go undetected by the filter after simply uploading a small file first, right? I'd suggest a number no smaller than 10, preferably no smaller than 20, if we really need to reduce it. The latest five false positives (well, not-yet-deleted is more accurate) from the filter are Agatyr's File:1665_Girl_with_a_Pearl_Earring.jpg, ProgramaConecta's File:Alisson_Wanderfillk.png, Shameem_Reza's File:সাহেব_বাড়ি।.jpg, Fizwizviz's File:Musee-education-tunis-1.ogg, and B235R's File:Actin_UR5_Robotic_Tool_Path.gif, all within the first 5 edits/uploads (well, except the last, but his sixth upload triggered the filter anyway), so I doubt that reducing the edit count requirement will reduce the filter's false positive rate.
    For comparison, the bot uses a condition that the user has an edit count of no more than 200. The reason isn't the false positive rate, but the workload. --Zhuyifei1999 (talk) 16:59, 14 January 2017 (UTC)
  •   Support for the reasons outlined above. Zhuyifei1999 is also very receptive to suggestion and additions, so I am confident that all potential problems that arise will be addressed. Sebari – aka Srittau (talk) 16:55, 14 January 2017 (UTC)
  •   Strong support This should have been an RFA instead, since what is requested here is the admin flag and not approval for a new bot task, but either way it doesn't matter too much. Zhuyifei1999 is a very trusted bot operator and an admin; in fact he is the one who revived FlickrreviewR and Panoramio review bot, so he is definitely trusted to run this kind of adminbot. (I know this is not a vote, I just want to provide my support...) -- Poké95 08:11, 15 January 2017 (UTC)
  •   Support I support this task, and admin rights for it. It would help to post a link to this request on rfa if it hasn't already been done. --99of9 (talk) 00:45, 17 January 2017 (UTC)
  •   Strong support as per above comments. NinjaStrikers «» 04:04, 17 January 2017 (UTC)
  •   Strong support This is basically, as noted, a request for both Zhuyifei AND Steinsplitter, as bot operators. The bot will, per this request, only delete files that are explicitly positive for containing an embedded archive, with limits that mainly serve to reduce what must be checked. Such files would be deleted as CSD F9 violations anyhow. - Reventtalk 08:24, 17 January 2017 (UTC)
  •   Support Sure! Thanks for getting rid of this pirated material. Regards, Yann (talk) 09:45, 17 January 2017 (UTC)
  •   Support Storkk (talk) 10:17, 17 January 2017 (UTC)
  •   Support Good work! - Jcb (talk) 10:57, 17 January 2017 (UTC)
  •   Support. -- Geagea (talk) 14:22, 17 January 2017 (UTC)
  •   Support - I generally don't like admin bots however in this case it is a great idea. Go careful but many thanks for the help. --Herby talk thyme 15:10, 17 January 2017 (UTC)
  •   Support A very reasonable adminbot use by two trusted admins. Pi.1415926535 (talk) 17:04, 17 January 2017 (UTC)
    •   Comment @Zhuyifei1999: Would it be better to have all versions of the file be by the same user, rather than having to have one version? That would prevent uploaders trying to evade the bot by uploading multiple versions of the same image. Pi.1415926535 (talk) 17:05, 18 January 2017 (UTC)
      • I was thinking about this earlier. The downside would be that I won't want anyone to abuse this bot to delete their own images either; it would add the complexity of checking if the latest version and the version before that < 7d, and the file is unused (COM:CSD#G7). That's do-able, but complex. --Zhuyifei1999 (talk) 17:16, 18 January 2017 (UTC)
  •   Support matanya talk 18:50, 17 January 2017 (UTC)
  •   Support Very sensible, trusted operator. ~ Rob13Talk 23:23, 17 January 2017 (UTC)
  •   Comment I suspect AbuseFilter 166 (Large files by newbies) will be obsolete once Telenor's 150 MB/day data limit kicks/kicked in as pirates will have to split archives. Dispenser (talk) 05:58, 18 January 2017 (UTC)
  •   Support --sasha (krassotkin) 12:23, 18 January 2017 (UTC)
  •   Support Natuur12 (talk) 13:08, 18 January 2017 (UTC)
  •   Strong support Definitely. Jianhui67 talkcontribs 14:00, 18 January 2017 (UTC)
  •   Support --DCB (talk) 19:22, 18 January 2017 (UTC)

Cewbot 2 (talk · contribs)

Operator: Kanashimi

Bot's tasks for which permission is being sought: Upload images from 慕光之城 蛾類世界. The images are licensed under {{cc-by-3.0-tw}}.

Automatic or manually assisted: automatic

Edit type (e.g. Continuous, daily, one time run): Weekly

Maximum edit rate (e.g. edits per minute): 12 per minute

Bot flag requested: (Y/N): No

Programming language(s): JavaScript (CeJS)

Kanashimi (talk) 09:54, 10 January 2017 (UTC)

Discussion

There should be thousands of images. --Kanashimi (talk) 09:54, 10 January 2017 (UTC)

  • Please can we have a translation of the site's copyright statement you linked to? --99of9 (talk) 00:51, 17 January 2017 (UTC)
  • Some of these images (e.g. [1]) appear to be aggregated from Facebook posts (e.g. [2]) shared to this group: [3]. But there is no indication that I can see of a license release from the photographer. --99of9 (talk) 01:03, 17 January 2017 (UTC)

  I withdraw my nomination. I contacted the administrator of the site and found that the license covers only the results of the search system and does not cover the images. @99of9: Thank you for your reply and advice. --Kanashimi (talk) 12:27, 18 January 2017 (UTC)

Cewbot (talk · contribs)

Operator: Kanashimi

Bot's tasks for which permission is being sought: Add links to species photos on TaiBNET. The images are licensed under {{cc-by-nd-2.5}}.

Automatic or manually assisted: automatic

Edit type (e.g. Continuous, daily, one time run): Weekly

Maximum edit rate (e.g. edits per minute): 12 per minute

Bot flag requested: (Y/N): No

Programming language(s): JavaScript (source code)

Kanashimi (talk) 14:13, 30 December 2016 (UTC)

Discussion

Please make a few test uploads. How many images do you plan to upload in total? --Krd 14:48, 30 December 2016 (UTC)

I am writing the code now. There should be thousands of images, with the number increasing irregularly. --Kanashimi (talk) 14:48, 31 December 2016 (UTC)
I have done some test edits. There are 21K+ images. Please review the result and give me some suggestions, thank you. --Kanashimi (talk) 11:22, 8 January 2017 (UTC)
As you stated above, http://taibnet.sinica.edu.tw/eng/policy.php says the site is under cc-by-2.5 unless otherwise noted. Sadly, all the source pages of the images you transferred state otherwise, i.e. cc-by-nc. Please let me know if I'm missing something. --Krd 11:58, 8 January 2017 (UTC)
Thank you for your speedy reply. Well... You are right, although the messages on the guest board of TaiBNET say it is OK to use the data in Wikipedia articles. I think their main concern is that they do not want the works to be used in a commercial product. But there are many images on the site. What if I change the bot task so that, for species that exist on Commons, it adds a link to TaiBNET? Is this a good idea? --Kanashimi (talk) 13:53, 8 January 2017 (UTC)
I don't get that, please say again in different words or give an example. --Krd 14:33, 8 January 2017 (UTC)
For example, add a link:
to Category:Albara reversaria, and add a link:
to Category:Neohelota. --Kanashimi (talk) 14:40, 8 January 2017 (UTC)
Understood. I'm not sure if this is helpful, awaiting additional opinions. --Krd 14:53, 8 January 2017 (UTC)
Thanks for the reply. I will wait for some time if other contributors have any advice. --Kanashimi (talk) 15:36, 8 January 2017 (UTC)
I oppose external links to non-free images from our category pages. If someone wants non-free images, they can easily search. --99of9 (talk) 00:55, 17 January 2017 (UTC)
  •   Oppose Ah, I see this was also raised above. --99of9 (talk) 00:55, 17 January 2017 (UTC)
I find there are many sample photos on the site that the wiki does not have. So how about just adding a link to the page that contains the sample photo? I think this can help visitors who want to know more about the species. --Kanashimi (talk) 12:37, 18 January 2017 (UTC)
If that's the case, why don't we add links to Google Images? No, advertising for non-free media is not within our scope. --Zhuyifei1999 (talk) 13:47, 18 January 2017 (UTC)