Notice If you want to see Python source code that supports some of my projects, go to Github and help yourself. The code is not written with reuse in mind... -- (talk) 15:57, 15 May 2018 (UTC)
Notice

If you are concerned that a category gets flooded with automated uploads, check that a template like {{Disambig}}, {{Photographs}}, {{Categorise}}, {{CatDiffuse}} or {{CatCat}} has been applied before complaining. In the case of my batch upload projects, any category marked this way will not be added to new photographs. -- (talk) 16:32, 20 September 2018 (UTC)

Archives.png

2017
2018
2019
2020
2021

Silverton

Recatagorized all images in the Category:Silverton to Category:Silverton, New South Wales.

Thanks~Cessna_208_Caravan#/media/File:Cessna_208_Caravan_I,_Seawings_(Jet-Ops)_AN1347237.jpg

It is helping young students learn that the cockpit is not really that complicated ~ and all those fancy buttons and such do have a purpose and are very easy to learn ~ Mitchellhobbs (talk) 21:25, 13 April 2019 (UTC) — Preceding unsigned comment added by Mitchellhobbs (talk • contribs) 21:30, 13 April 2019 (UTC)

Prośba

Bardzo proszę o napisanie do mnie po polsku, bo niestety, ale nie znam angielskiego i nie wiem w czym jest problem. Pozdrawiam:) Keres 40 (dyskusja) 16:01, 12 gru 2019 (CEST)

Traffic Signs....

For archive purposes are the working drawings on this site suitable for commons? https://www.gov.uk/government/publications/traffic-signs-working-drawings-tsrgd-2002

which are a fairly comprehensive set of working drawings for a previous revision of the secondary legislation governing Traffic Signs in the UK.

The current set is at: https://www.gov.uk/government/collections/traffic-signs-signals-and-road-markings#traffic-signs-images-and-drawings but makes reference to several S and T series drawings which are in the 2002 set. (Given the numbering on SOME of the files, I am thinking that the drawings themselves perhaps only have minor differences.)

A check of the ( https://www.gov.uk/guidance/traffic-sign-images#images-in-eps-format Traffic Signs Image Database ) is also possibly suggested.

If you were really passionate, asking the DFT for a set of all diagram images in the 2016/2017 regulations is possible ( they should mostly be in the database linked above), but would need some kind of FOIA request from someone higher in the Wikimedia Community, I think.ShakespeareFan00 (talk) 14:42, 20 November 2020 (UTC)

Checking one example at https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/545597/tsrgd-2002-index.csv/preview, these are stated as Crown Copyright. As the default of OGL is only if nothing else is specified, I think a mass upload is not defensible. Some signs will be too simple for the Crown claim to be credible, but not all. -- (talk) 20:12, 20 November 2020 (UTC)
Thanks for your comments, DR filed in respect of some existing items uploaded in Good faith. If the DR doesn't get anywhere in a week, I'll mark them for speedy deletion due to an ambiguous (at best) status. ShakespeareFan00 (talk) 23:53, 20 November 2020 (UTC)
Please comment in the relevant DR's as your input would be most useful. ShakespeareFan00 (talk) 01:05, 21 November 2020 (UTC)
I suspect the DRs will go through uncontested unless someone want to raise the too-simple-for-copyright on specific types, like the drawings which are nothing but font sets. -- (talk) 10:09, 21 November 2020 (UTC)
Well, I plan to file a massive DR for some of UK traffic signs content as well other content as well, based on the ambiguity that's arisen, even though there is a previous OTRS in respect of some of it. I've marked some specific items I'd contributed in good faith as :-
{{sdelete|See DR and OTRS#Ticket#2020112110000535, The response received this morning was that the source wasn't able to "complete the request" mentioned in that Ticket. Thus unless someone else is able to clarify with the source concerned as to OGL applicability, I cannot assume that items like this that are derived from the relevant working drawings are still covered under Open Government License Terms.}}
I think this is being over-cautious, but the source apparently wasn't able to 'complete' (their wording) an OTRS request. Whilst not an outright view that its not covered, it's not the kind of positive confirmation Commons generally prefers. Do you know anyone in GLAM circles that might be able to reach the right people more directly? @Jdforrester: perhaps? ShakespeareFan00 (talk) 12:49, 21 November 2020 (UTC)
An opinion by JDF would be convincing, but if the cases are clear then a discussion at COM:VPC might give a suitable conclusion and could be reused as a consensus. Keep in mind that if looking for an arbitrator for what is or is not OGL, curators at National Records may be more helpful than getting replies from an official Crown bureaucrat. -- (talk) 12:56, 21 November 2020 (UTC)
As would exemplar DR, with input from the aformentioned contributor. ShakespeareFan00 (talk) 13:18, 21 November 2020 (UTC)
I also have a note in my e-mail archives (in relation to OTRS Tickets#2015063010023764 and Ticket#2015070210009947) where it was stated that the Traffic Signs Manual (at that date was OGL covered.), and that the example signs in the working drawings were OGL covered. This does need someone like JDF to read through the tickets again, but my view is that the working drawings should be covered based on a careful reading of what was said., (notwithstanding the "(C) Crown Copyright" notice in them.). However, I've still marked my uploads for speedy, until someone with more expertise can take a look. I would welcome someone else taking the lead on this at WP:VPC. (Aside: It is also noted that File:DfT-circular-01-2016.pdf outright says resuse under OGL is permitted, so that file isn't disputed.).
Email address(es) for member(s) of "curators at National Records" would be helpful, in the tickets or via emailuser.   — Jeff G. please ping or talk to me 17:40, 22 November 2020 (UTC)

Getting a full OTRS might not be essential, but it would certainly resolve some ambiguities. ShakespeareFan00 (talk) 15:34, 21 November 2020 (UTC)

Thread opened at VPC - Commons:Village_pump/Copyright#UK_Traffic_Sign_Working_drawings_-_OGL_or_not? , Let's get this resolved. ShakespeareFan00 (talk) 17:30, 22 November 2020 (UTC)

IA Books with images in the IA Flikr stream?

Example IA-identifier: americanspecimen00amer

Any chance of a script to identify the IA identifers for image uploads like these, and possibly grabbing hi-res scans (djvu or PDF) of the relevant books etc, for potential (very long term) Wikisource use?

You were doing an upload of these images back in 2015, and I thought you might want to eventually also try and cross-reference against the IA books upload project? ShakespeareFan00 (talk) 10:53, 27 November 2020 (UTC)

Probably, I may run a test to see what issues arise. You may wish to examine Commons:Undeletion requests#File:The siyar-ul-Mutakherin, a history of the Mahomedan power in India during the last century (IA siyarulmutakheri00ghulrich).pdf. -- (talk) 18:18, 27 November 2020 (UTC)

Sample test:

 1901  2039 accountofcrustac04sars
 -> An account of the Crustacea of Norway, with short descriptions and figures of all the species (IA accountofcrustac04sars).pdf 
 1903  2040 accountofcrustac05sars
 -> An account of the Crustacea of Norway, with short descriptions and figures of all the species (IA accountofcrustac05sars).pdf 
 1913  2041 accountofcrustac06sars
 -> An account of the Crustacea of Norway, with short descriptions and figures of all the species (IA accountofcrustac06sars).pdf 
 1919  2045 accountofcrustac07sars
 -> An account of the Crustacea of Norway, with short descriptions and figures of all the species (IA accountofcrustac07sars).pdf 
 1906  2047 accountofalcyona01thom
 1967  2048 accountofgenusse1967prae
 1903  2050 analysisofgothi01bran
 1849  2051 analysisofgothic02branuoft
 1972  2053 analysisofenviro00harv
 -> An analysis of environmental data for use in updating low frequency propagation loss forecasts. (IA analysisofenviro00harv).pdf 
 1805  2054 analysisofhorsem01adam
 1961  2055 animalecology00kend
 1902  2056 animallifeworldo119021903lond
 1902  2057 animallifeworldo219031904lond
 1834  2058 animalandvegetab01roge
 -> Animal and vegetable physiology, considered with reference to natural theology, by Peter Mark Roget .. (IA animalandvegetab01roge).pdf 
 1836  2060 animalvegetable01roge
 1940  2062 animalbiology1940wolc
 1914  2063 animalcastration01whit
 -> Animal castration, a book for the use of students and practitioners; (IA animalcastration01whit).pdf 
 1920  2065 animalcastration02whit
 -> Animal castration; a book for the use of students and practitioners (IA animalcastration02whit).pdf 
 1914  2067 animalcastration00whit
 1924  2069 animallifeinyose00grinrich

So there are some observations:

  1. Some relevant pdfs exist and have not been added to the category the extracted pages are in. There's no obvious way of working out what the 'parent' category is for the the extracts.
  2. Post 1925 extracts exist, one might presume that a default PD-USGov license would do for any new pdfs uploaded for these...
  3. Where pdfs do not exist, one could add all the jpgs that refer to the same ident to a hidden tracking category, like Category:IA books animalecology00kend, rather than attempting to create a good English category name automatically, or deduce what an existing parent cat for the book might be.

-- (talk) 20:19, 27 November 2020 (UTC)

This is at an early test stage, but the category of categories is at Category:Internet Archive index categories. -- (talk) 13:54, 28 November 2020 (UTC)

speedy....

IA Books uploads...

These still happening en-masse? I'm seeing harmonisation and housekeeping as well :) (And thanks.)

BTW Would it be possible to use the IA identifiers to match up possible mutliple volume sets? ( I'd been using {{Morevols}} but I am seeing sequence gaps where Commons has randomid02regexp but not randomid01regexp which is volume 2 and volume respectively. Not sure how this could be even semi automated as you'd still need a human reviewer pairing things up... but someting to consider.

I hope that my efforts to find the (so far) thankfully small number of items Commons can't host isn't overloading your talk page. ShakespeareFan00 (talk) 21:35, 10 December 2020 (UTC)

DRs will be swept up eventually.
The repeated upload attempts are still running for some collections, there's 3 or 4 current. I just terminated one with about 8 PDFs left that had over 80 upload attempts each, clearly they are the 'residuals' with basic unacceptable formatting for the WMF servers to handle, not because of size alone. These run until either there's no pending uploads, or a manual termination.
If you have an idea for another library collection feel free to suggest it, I have not been looking but might in a few days. I'd like the collection to be over a million, as that makes for a good VP notice to remind folks they can assist with using it, or if they wish to complain about it as invariably some will. -- (talk) 22:29, 10 December 2020 (UTC)
I didn't currently, but if I think of something, or you do :) ShakespeareFan00 (talk) 23:37, 10 December 2020 (UTC)
I can think of another archive of niche works, but I am not sure of the copyright status of some them.. (i.e I was searching by IA subject categories) not collections. I'm not sure your script is able to do targeted queries as such.

ShakespeareFan00 (talk) 00:00, 11 December 2020 (UTC)

Parking these for consideration:-

https://archive.org/details/cornell
https://archive.org/details/georgetown-university-law-library

It would need someone to take a deeper look before they could be considered as forks, due to the need to filter out post 1925 and non-US works still in copyright. ShakespeareFan00 (talk) 00:00, 11 December 2020 (UTC)

Running a soak test to see if
Wash\.|Ala\.|Ore\.|Mass\.|N\.Y\.|Va\.|Minn\.|Conn\.|Cal\.|Francisco|New\ York|Boston|Albany|Washington|Philadelphia = USloc
against publisher would be good for the cornell collection. There are many with blank publishers.
A positive filter for US location is logically going to be easier than trying to filter all non-US locations. Again, sad that IA metadata has no publication country field. -- (talk) 11:24, 11 December 2020 (UTC)
Provisional extra logic for non-Fed collection:
  1. If positive USloc match in publisher -> PD 1925
  2. If non/unknown USloc and year < 1900 -> PD-old-100-expired
  3. Reject the rest
Category:Scans from Cornell University Library
-- (talk) 12:11, 11 December 2020 (UTC)
Another possibility is to look at the "Current location" field for the IA uploads, which if processed down will generate the names of collections on IA ( some possibly nested.) which could based on frequency in exisiting items (and the number of DR's being filed for those collections) be used to determine possible areas for mirroring. ShakespeareFan00 (talk) 10:56, 12 December 2020 (UTC)

FYI, I was vaguely thinking of putting out a summary/end of year VP notice today, but not really up to it. It sort of makes sense to leave it for end week1 or week2 2021 anyway, as there may be some discussion and other projects relating to public domain day. -- (talk) 11:35, 31 December 2020 (UTC)

Harmonization: Finished?

Did this complete or is it still ongoing, as I'm not seeing as many Related changes for it?

I'm in the process of going through Faebot's upload log to find the "harmoinizing" uploads made before the category, so they can be added.

(Also doing some "catlog record updating" (i.e adding creator tags where I can.) ShakespeareFan00 (talk) 20:30, 15 December 2020 (UTC)

Nowhere near. It's on image ~8,800, streetrailwayj271906newy. This is around 2% progress. As the order of images is related to their type on IA, it's likely to have run into a sequence where every PDF is un-uploadable, and it takes 40 minutes each to work that out. It also is taking about 78s per jpg to add the categories, though it is doing that in parallel with the next PDF being examined. As the project page says, it's a slow process. -- (talk) 21:12, 15 December 2020 (UTC)
Update. Though the routine is 'productive', it does rely on the search engine to generate prospects. This is limited to 10,000 returns, while the potential sample space is 470,000. It's not going to go wrong, but the way this is done might need some redesigning. No hurry, it's telling me that we are still at the 1% tested right now.
It's taking about 5 minutes to check 10 candidate files. Without errors, and not counting the delays waiting for PDF failures to expire, this would be 163 days. -- (talk) 13:40, 17 December 2020 (UTC)

Bain collection

I have noticed that there are some gaps in your upload of the Bain collection, some GGBain numbers are missing. I find this when I am matching images at Flickr Commons to Wikimedia Commons. At Flickr Commons there is a project that is ongoing to identify the event or the person in each image. I am creating entries for the people at Wikidata and adding the photo if the event or person already has an entry. Are there any plans to look for the missing ggbain numbers and upload them? --RAN (talk) 02:44, 23 December 2020 (UTC)

I'll try to get around to looking at this in a couple of months. The script is around, but I was rewriting the LOC scripts more than a year ago and never got back to finishing them. No idea why some numbers may have been missed, the whole collection should have been uploaded, so there may be issues like the TIFFs being rejected at the WMF server stage, or there may be/have been an issue with image availability at the LOC end. -- (talk) 10:36, 23 December 2020 (UTC)

File:Cranberries; - the national cranberry magazine (1989) (20515720878).jpg

Suspected copyright violation notice removed. File:Cranberries; - the national cranberry magazine (1989) (20515720878).jpg (edit|talk|history|links|watch|logs)

User who nominated the file for deletion (Nominator) : ShakespeareFan00.

And also:

File:This terrified baby was almost the only human being left alive in Shanghai's South Station after brutal Japanese bombing HD-SN-99-02790.jpg

 
File:This terrified baby was almost the only human being left alive in Shanghai's South Station after brutal Japanese bombing HD-SN-99-02790.jpg has been listed at Commons:Deletion requests so that the community can discuss whether it should be kept or not. We would appreciate it if you could go to voice your opinion about this at its entry.

If you created this file, please note that the fact that it has been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with it, such as a copyright issue.

Please remember to respond to and – if appropriate – contradict the arguments supporting deletion. Arguments which focus on the nominator will not affect the result of the nomination. Thank you!

A barnstar for you!

  The Photographer's Barnstar
Thanks for uploading The Pisciarelli (a hot spring) issuing from the cone of the Wellcome V0025267.jpg that was selected as Picture of the month on the Neapolitan Wikisource. --Ruthven (msg) 19:22, 9 January 2021 (UTC)

Companion versions (in other formats) of IA scans...

A recent scan uploaded as PDF proved to be rendered as illegible due to limitations in the PDF thumbmnailing support provided by mediawiki.

In this specfic instance I was able to use a DJVU version of the same file that had been uploaded independently.

Would it be possible to task Faebot with uploading the DJVU versions of IA scans (for scans already uploaded as PDF), so that contributors and downstream users can make use of the format most appropriate for thier use case? Sometimes the DJVU is better than the PDF, or in comparison the PDF has pages missing from the DJVU version etc. Having both versions makes it's easier for contributors to choose and repair either file.

One of your scripts was already harmonizing for image matching.. Would it be possible for a script to 'harmonize', pairs of DJVU and PDF versions of essentially the same scans in the same way? (OR upload the equivalent companion PDF/DJVU based on the IA identifier where a scan (PDF or DJVU already exists on Commons)?

This is requested because at present it's not possible to use IA-Upload to do this, as it sees the IA identifier, and won't let me upload the companin format, if the PDF or DJVU for a given IA identifer tag is already present on Commons.

Longer term, it would be nice if there was a way to have something on Commons simmilar to IA's book viewing interface, which uses high quality JPEG or TIFF scans directly. For more recent scan efforts at IA, these scansets are in ZIP files, and I don't think it would be too hard for someone experienced to develop a gadget or extension to work with these more directly than PDF/DJVU.

ShakespeareFan00 (talk) 10:47, 10 January 2021 (UTC)

This can be done. But
  1. we need a consensus to keep DjVu and PDF versions as both have uses,
  2. PDFs can be kept as they are, but a project should examine how to remake the files, or to debug mediawiki if the problem lies with WMF rendering,
  3. a better proposal needs to be made for reading all document formats on Commons, it's unrealistically bad compared to the IA UI which itself is open source,
  4. testing and remaking files is not something I can do using my current 10 year old kit
-- (talk) 11:26, 10 January 2021 (UTC)

British Museum objects

Hi Fæ! I wanna import this photo https://www.britishmuseum.org/collection/object/A_1919-0101-0-99 . May I ask you whether there is a template for BM? This object's Museum number is 1919,0101,0.99 , which doesnt seem to fit any template.--Roy17 (talk) 13:46, 13 January 2021 (UTC)

I played around with these 11 years ago, creating templates on Wikipedia and Commons. Have a look at:
{{British-Museum-object}}
{{British Museum}}
{{British Museum online}}
You don't have to use them. -- (talk) 16:42, 13 January 2021 (UTC)

File:Rub al Khali ESA360605.jpg

File:Rub al Khali ESA360605.jpg (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Rub al Khali ESA360605.jpg StellarHalo (talk) 10:17, 14 January 2021 (UTC)

File:Barents bloom ESA364568.jpg

File:Barents bloom ESA364568.jpg (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Barents bloom ESA364568.jpg StellarHalo (talk) 11:48, 14 January 2021 (UTC)

File:Rome ESA375054.jpg

File:Rome ESA375054.jpg (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Rome ESA375054.jpg StellarHalo (talk) 12:38, 14 January 2021 (UTC)

Notification about possible deletion

Bundle DR:
Commons:Deletion requests/Files in Category:Jenő Haranghy

Affected:

And also:

Yours sincerely, Regasterios (talk) 11:45, 15 January 2021 (UTC)

Category:Examiner

@:, I was wondering if it would make sense to create a Category:The Examiner (1808–1886) as per the article, so the files in the Category:Examiner could be added to subcategories like Category:The Examiner (1808), Category:The Examiner (1809) etc. Thank you for your time. Lotje (talk) 12:18, 15 January 2021 (UTC)

Yes. These choices are not done automatically but the main 'bucket' category is key sorted by IA reference, which therefore sequences by date. There are, of course, several large microfilm digitization collections now uploaded, and that large picture is the project priority.
Normal search plus cat-a-lot would fairly easily subcat these, but from a project perspective, it's a minor improvement for the reader or reuser. Other ways of slicing the content based on writers or historical political contents, like the politics of abolitionism, might be of higher educational curation value and could be a better investment of volunteer time. Fortunately though often the scans are not great quality, the text is searchable. -- (talk) 13:11, 15 January 2021 (UTC)

File:Socialism, poster, political ad, László Ördögh-graphics Fortepan 91350.jpg

File:Socialism, poster, political ad, László Ördögh-graphics Fortepan 91350.jpg (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Socialism, poster, political ad, László Ördögh-graphics Fortepan 91350.jpg Regasterios (talk) 09:41, 17 January 2021 (UTC)

File:Aquila (IA aquila911984magy).pdf

File:Aquila (IA aquila911984magy).pdf (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Aquila (IA aquila911984magy).pdf Regasterios (talk) 15:45, 17 January 2021 (UTC)

File:Aquila (IA aquila831976magy).pdf

File:Aquila (IA aquila831976magy).pdf (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Aquila (IA aquila831976magy).pdf Regasterios (talk) 15:46, 17 January 2021 (UTC)

File:Aquila (IA aquila921985magy).pdf

File:Aquila (IA aquila921985magy).pdf (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Aquila (IA aquila921985magy).pdf Regasterios (talk) 15:48, 17 January 2021 (UTC)

File:Aquila (IA aquila9697198990magy).pdf

File:Aquila (IA aquila9697198990magy).pdf (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Aquila (IA aquila9697198990magy).pdf Regasterios (talk) 15:48, 17 January 2021 (UTC)

File:Aquila (IA aquila981991magy).pdf

File:Aquila (IA aquila981991magy).pdf (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Aquila (IA aquila981991magy).pdf Regasterios (talk) 15:49, 17 January 2021 (UTC)

File:Bear street art (Unsplash).jpg

File:Bear street art (Unsplash).jpg (edit|talk|history|links|watch|logs)
Commons:Deletion requests/File:Bear street art (Unsplash).jpg A1Cafel (talk) 04:25, 18 January 2021 (UTC)

Notification about possible deletion

Bundle DR:
Commons:Deletion requests/Files in Category:Batcolumn

Affected:


Yours sincerely, JWilz12345 (Talk|Contrib's.) 13:06, 18 January 2021 (UTC)