Category:Media needing categories (cyrillic names)

A large part (if not the majority) of files in Category:Media needing categories (cyrillic names) are currently categorised one way or another. Could they be removed from the category by a bot? For example, User:TaxonBot... 18:25, 18 November 2017 (UTC)

I checked a couple of files. Most had no categories and thus are correctly flagged. I think the category gets removed automatically when two or more categories are present. --Schlurcher (talk) 07:15, 13 December 2017 (UTC)
@Schlurcher:, please check some more files. I've been browsing this category for some time (not only from this IP), and I can assure you that a large part of files in it do not lack categories any more. And no, Category:Media needing categories (cyrillic names) does not get removed automatically. For example, these are just some (not even close to all!) of the files from the first 200 in the category, and many were categorised more than a year ago...

Double category redirects

Several double {{category redirect}}s (that is, where the target page contains a {{category redirect}} again) certainly exist: e.g. Dnipropetrovsk airport and Aircraft at Dnipropetrovsk airport. There is no Toolserver anymore, and I don’t know how to find out how numerous they are. Could somebody take care of it?

Happy New Year! Incnis Mrsi (talk) 11:02, 31 December 2017 (UTC)

I think it's normally done by User:RussBot on the fly, as one can see in its log (scroll down a bit). However, it looks like it hasn't run for the past two or three days, pinging R'n'B. --Achim (talk) 12:14, 31 December 2017 (UTC)
Thanks for the bot link, but one can easily check in the edit histories that the Dnipropetrovsk_airport problem has stood for 1½ years without any action, and the second one for 11 months. While the bot fixes Category: pages in some circumstances (e.g. #REDIRECTs as far as I spotted), it certainly isn’t tasked with detection of {{category redirect | ↦ {{category redirect|…}}}} cases. Incnis Mrsi (talk) 12:59, 31 December 2017 (UTC)
Thanks for the ping. I kicked the defloginator, and the bot has started up again. As for Incnis Mrsi's comment: you're right. The bot will only detect and fix a double-redirect if someone populates the double-redirected category, since the main task of the bot is looking for non-empty redirected categories, and fixing double-redirects is just a byproduct. --R'n'B (talk) 14:46, 31 December 2017 (UTC)
Thanks to Incnis Mrsi for pointing this out. At first glance, at least 8000 category redirects seem to be affected. I'll have a closer look over the next few days. --Achim (talk) 11:27, 1 January 2018 (UTC)
Thank you all. A bot (or new task for an existing bot) that fixes these automatically would be helpful.   — Jeff G. ツ please ping or talk to me 14:39, 1 January 2018 (UTC)
Commons:Bots/Requests/Gabrielchihonglee-Bot/2 Gabrielchihonglee (talk) 02:48, 13 January 2018 (UTC)
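The detection discussed in this thread (a {{category redirect}} whose target is itself a {{category redirect}}) can be sketched offline as follows. This is only a minimal illustration, not the code of RussBot or of any bot mentioned here. It assumes the wikitext of the category pages has already been fetched into a dictionary; the function names, the simplified template pattern, and the page titles in the usage note are all hypothetical.

```python
import re

# Simplified pattern for {{category redirect|Target}}. Template aliases and
# unusual parameter layouts are NOT covered; this is an assumption made for
# the sketch, not a description of what Commons bots actually parse.
REDIRECT_RE = re.compile(
    r"\{\{\s*category[ _]redirect\s*\|\s*(?:1\s*=\s*)?(?:Category:)?([^|}]+)",
    re.IGNORECASE,
)


def redirect_target(wikitext):
    """Return the redirect target named in the wikitext, or None."""
    match = REDIRECT_RE.search(wikitext)
    if match is None:
        return None
    # Underscores and spaces are equivalent in page titles.
    return match.group(1).strip().replace("_", " ")


def find_double_redirects(pages):
    """Map each double-redirected category to its final target.

    `pages` maps category titles (without the "Category:" prefix) to their
    wikitext. Categories caught in a redirect loop are mapped to None.
    """
    targets = {}
    for title, text in pages.items():
        target = redirect_target(text)
        if target is not None:
            targets[title] = target

    doubles = {}
    for title, target in targets.items():
        if target not in targets:
            continue  # single redirect, nothing to fix
        # Follow the chain until a non-redirect page or a loop is reached.
        seen = {title}
        current = target
        while current in targets and current not in seen:
            seen.add(current)
            current = targets[current]
        doubles[title] = None if current in seen else current
    return doubles
```

With hypothetical pages "Old name" redirecting to "Middle name" and "Middle name" redirecting to "Final name", the function reports "Old name" with the final target "Final name", which is the value a fixing bot would write into the first redirect. Loops are reported with None so that they can be reviewed by hand.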

Files with U+FFxx in names

There are a lot of files with full-width forms in the name:＀&to=⦅&namespace=6 . IMHO they should be summarily renamed applying un<wide> compatibility equivalence where possible (of course, there are ASCII symbols whose use in page names is prohibited by MediaWiki). Possibly this should be done by a bot, although, alternatively, I can make a transliteration script to use with Special:MovePage. Note: I’m not a Commons regular and can’t assure anybody that this mass renaming can be substantiated by point 6 or some other rule. Incnis Mrsi (talk) 15:00, 1 January 2018 (UTC)

I think we need more explanation as to why this should be done. Are wide characters deprecated? Are there downsides? Do we need to check outputs from "Unicode Text Converter" too?
-- List file names made up only of ASCII and full-width ASCII characters,
-- wrapping each run of full-width characters in an error span.
SELECT CONCAT("* [[:File:", img_name, "|",
  REGEXP_REPLACE(CAST(img_name AS CHAR CHARACTER SET utf8mb4), "([！-～]+)", "<span class=error>\\1</span>"),
  "]]") AS Filename
FROM image
WHERE CAST(img_name AS CHAR CHARACTER SET utf8mb4) REGEXP "[！-～]" /* U+FF01 - U+FF5E */
  AND CAST(img_name AS CHAR CHARACTER SET utf8mb4) REGEXP "^[!-~ ！-～]+$";
The query above searches for filenames consisting only of ASCII (U+0021–U+007E) and full-width ASCII (U+FF01–U+FF5E) characters. These should not pose problems for conversion. I've included a selection from the 1,874 results. —Dispenser (talk) 22:53, 1 January 2018 (UTC)
Fixed that. I was deriving it from memory and assumed the ASCII control block would be repeated. The query and results were correct though. —Dispenser (talk) 23:22, 1 January 2018 (UTC)
Wide characters are perfectly valid, but there is IMHO no reason to tolerate them as substitutes for (most of) ASCII equivalents in file names because of problems with accessibility, evasion of detection by Special:PrefixIndex and AllPages, larger (in bytes) links (both wiki text and URLs), potential corruption by text editors, bulky visual appearance, and maverick sorting. Incnis Mrsi (talk) 23:42, 1 January 2018 (UTC)
But it would be reasonable to consult CJK users because they produced most of names in question. A volunteer to go to C J K venues? Incnis Mrsi (talk) 23:59, 1 January 2018 (UTC)
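The renaming proposed in this thread (replace full-width forms U+FF01–U+FF5E with their ASCII equivalents, except where MediaWiki prohibits the ASCII character) could be sketched as below. This is a minimal illustration only, not an existing script. The function name is hypothetical, and the set of prohibited characters is an assumption based on the usual MediaWiki title and file-name restrictions; a real run would need to check it against the wiki's actual configuration.

```python
import unicodedata

# ASCII characters assumed to be unusable in file page names (an assumption:
# the usual title restrictions plus ":", "/" and "\" for file names).
FORBIDDEN_IN_TITLES = set("#<>[]|{}:/\\")

FULLWIDTH_FIRST = 0xFF01  # FULLWIDTH EXCLAMATION MARK
FULLWIDTH_LAST = 0xFF5E   # FULLWIDTH TILDE


def narrow_fullwidth(name):
    """Replace U+FF01..U+FF5E with ASCII where the result stays title-safe."""
    out = []
    for ch in name:
        if FULLWIDTH_FIRST <= ord(ch) <= FULLWIDTH_LAST:
            # NFKC applies the <wide> compatibility mapping to this character.
            ascii_ch = unicodedata.normalize("NFKC", ch)
            if ascii_ch in FORBIDDEN_IN_TITLES:
                out.append(ch)  # keep the full-width form
            else:
                out.append(ascii_ch)
        else:
            out.append(ch)  # everything else is left untouched
    return "".join(out)
```

For example, narrow_fullwidth("ＡＢＣ.jpg") returns "ABC.jpg", while a full-width vertical line is kept because "|" cannot appear in a page name. Normalizing one character at a time, rather than the whole string, keeps the change limited to the U+FF01–U+FF5E block and leaves other compatibility characters in the name alone.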

Full-text search in SVG files

Can all SVGs containing (in the latest revision) «-Bold», «-Italic» and so on be identified and, preferably, categorized? Can the same be done for other perversions of Adobe Illustrator?

Attention: I am speaking about searching in files – SVG code is a kind of text. I do not request searching in «File:» wiki pages.
Incnis Mrsi (talk) 14:25, 13 January 2018 (UTC)

@Incnis Mrsi: It's not impossible, but it would currently require all 1.2 million SVGs to be downloaded on someone's computer or on the WMF cloud servers. At that point it would be better to have the bot periodically reupload all such files (for fixes which are very unlikely to cause rendering issues, anyway). This would also help with fixing invalid XML declarations/<svg> tags (librsvg stopped rendering files without an xmlns about 7 years ago but there are still some files which don't have an xmlns yet), and possibly other librsvg/Inkscape/etc. formatting and rendering artefacts if someone is aware of how to fix those things. Jc86035 (talk) 14:39, 13 January 2018 (UTC)
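Once the SVG source is available locally, the two checks mentioned in this thread (font names carrying a style suffix such as «-Bold» or «-Italic», and an <svg> tag without an xmlns) could be sketched as below. This is a rough illustration under stated assumptions: it works on text already read from a file, it uses regular expressions instead of an XML parser because some of the files in question are not valid XML, and the suffix list beyond «-Bold» and «-Italic» as well as the issue labels are hypothetical.

```python
import re

# font-family values (attribute or CSS) ending in a style suffix.
# Suffixes other than Bold and Italic are illustrative assumptions.
STYLE_SUFFIX_RE = re.compile(
    r"""font-family\s*[:=]\s*["']*[^;"'>]*-(?:Bold|Italic|BoldItalic|Oblique)\b""",
    re.IGNORECASE,
)

# The opening <svg ...> tag and a plain xmlns attribute inside it.
SVG_TAG_RE = re.compile(r"<svg\b[^>]*>", re.IGNORECASE)
XMLNS_RE = re.compile(r"\sxmlns\s*=")  # does not match xmlns:xlink=


def inspect_svg(source):
    """Return a list of issue labels found in SVG source text."""
    issues = []
    if STYLE_SUFFIX_RE.search(source):
        issues.append("styled-font-name")
    root = SVG_TAG_RE.search(source)
    if root is not None and not XMLNS_RE.search(root.group(0)):
        issues.append("missing-xmlns")
    return issues
```

A bot could run inspect_svg over each downloaded file and use the returned labels to decide on a maintenance category or a candidate fix. Downloading the files and editing the file pages are outside this sketch.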

Add Category:Images from Turismo Bahia or Category:Images from Fotos GOVBA

Hi! There are a lot of images from these two Flickr users, but the associated categories were only created today. So my request is to add Category:Images from Turismo Bahia to Turismo Bahia's images and Category:Images from Fotos GOVBA to Fotos GOVBA's images. See Special:Search/insource:turismobahia, Special:Search/"Turismo Bahia", Special:Search/insource:agecombahia, Special:Search/"Fotos GOVBA". Thanks, Luan (talk) 17:20, 17 January 2018 (UTC)

The same goes for Category:Images from Bahia Notícias, Special:Search/insource:bahianoticias, Special:Search/"Bahia Notícias". --Luan (talk) 21:36, 17 January 2018 (UTC)

You can do this yourself with Help:Cat-a-lot. It works on search results, just press the button to display 500 or type in a bigger number in the URL to show, say, 2000 images at a time. -- (talk) 21:43, 17 January 2018 (UTC)
Great! I didn't know about it. Thanks!! --Luan (talk) 21:53, 17 January 2018 (UTC)
Done. --Luan (talk) 22:08, 17 January 2018 (UTC)