Commons:Batch uploading

This page has a backlog that requires the attention of experienced editors.
Please remove this notice if it won't be needed in the future.


Shortcut
COM:BATCH
Commons Batch Uploading is a project to centralize the uploading of collections of files whose creators have released their work into the public domain (PD) or under any Commons-compatible license. Each request is assigned to a bot operator, who works out how it can be fulfilled. (To upload batches from Flickr, please make requests at Commons:Flickr batch uploading.)

Before you request a batch upload here, please read the guide to batch uploading.

See w:Wikipedia:Public domain image resources for potential future batch uploads.

Scripters

Currently inactive

Tools

Scripts, Examples and Information

Requests

Images of listed buildings by Stephen Richards on Geograph.org.uk

  • Source to upload from: http://www.geograph.org.uk
    • Do the media URLs follow a pattern? Yes: http://www.geograph.org.uk/photo/[ID]
    • Does the site have an API? Yes: http://www.geograph.org.uk/help/api
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Don't know
    • Did you contact the site owner? No need
  • Describe the works to be uploaded in detail (audio files, images by …):
All photographs of listed buildings by this user are of high quality and are tagged [listed building]. They would be very useful to have on Commons as every listed building has an item on Wikidata. I'd like them to be uploaded en masse and given the categories Category:Listed buildings in [county or London borough] and Category:Images by Stephen Richards. I could then further refine the listed building categories manually. However, the terms "Grade I", "Grade II*" and "Grade II" (the three listing grades for buildings in England and Wales) appear in the image descriptions, so is there a way that these could be picked out and used to categorise the images on Commons?
  • Which license tag(s) should be applied?
{{Geograph}}
  • Is there a template that could be used on the file description pages? Do you think a special template should be created?
{{Geograph}}

Ham II (talk) 19:50, 16 November 2017 (UTC)
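
On the question of picking the listing grade out of the image descriptions: a minimal sketch of one way this could work, assuming each description is available as plain text. The function names and the grade-specific category name are hypothetical; the real Commons category scheme would need to be checked before use.

```python
import re

# Longest alternative first, so "Grade II*" is not read as "Grade II" or "Grade I".
GRADE_PATTERN = re.compile(r"\bGrade\s+(II\*|II|I)(?![\w*])", re.IGNORECASE)


def listing_grade(description):
    """Return "I", "II*", "II", or None if no single grade is found."""
    grades = {m.group(1).upper() for m in GRADE_PATTERN.finditer(description)}
    # Descriptions naming several different grades are left for manual review.
    return grades.pop() if len(grades) == 1 else None


def categories_for(description, area):
    """Build the category list requested above for one photo."""
    cats = [
        f"Category:Listed buildings in {area}",
        "Category:Images by Stephen Richards",
    ]
    grade = listing_grade(description)
    if grade:
        # Hypothetical category name, used here for illustration only.
        cats.append(f"Category:Grade {grade} listed buildings in {area}")
    return cats
```

Descriptions that mention more than one grade return no grade at all, so those files would still fall to manual refinement.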

Opinions

Assigned to | Progress | Bot name | Category

Edo period coin collecting catalogues

  • Source to upload from:

The website of the University of California at Santa Barbara. 🎓

    • Do the media URLs follow a pattern?

Kokin kousei, Shinsen zeni kagami (Corrected Against Past and Current Records, A New Selected Mirror of Cash Coins):

cover, 1, 2, 3, 4, 5, 6, 7, 8, 9, and 10.

Shinpan kaisei, Kosen nedantsuke, Narabi ni bantsuki (Improved New Edition: Price List of Old Coins, Together with Rarity Ranking):

Book 📚 cover, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, and 11.

    • Does the site have an API?

Not that I’m aware of.

    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Not my subject of expertise.

    • Did you contact the site owner?

Not successfully. I attempted to contact him through the contact information provided here, which may or may not be out of date, and after that I tried here, which also produced no result. These images are in the public domain, so his permission is not needed. 📚

  • Describe the works to be uploaded in detail (audio files, images by …):

The images are of two books, both published in Edo period Japan. The images themselves are “large JPG” files of around 200 KB each (yes, “large”; note 📝 that this website was last updated in 2003 or so) and generally show two pages with a description. The description should not be copied, as it was written by Dr. Luke Roberts himself.

The first book is owned by Dr. Luke Roberts and dates from 1842, while the second book was published in Nagoya in 1799 and is (or was?) owned by collector Sam Leung. Both are too old to be copyrighted. Also note that the authorship of the second book 🕮 is (currently) unknown.

The books are coin collecting catalogues containing images of various Chinese, Japanese, Korean, Vietnamese (Annamese), and sometimes Muslim coins, with their prices and rarity written next to them. The illustrations generally show only the obverse of the coins unless the reverse is notable as well. These books 📚 are old but well preserved, so they are still easy to read, and the scans (though “small” by contemporary standards) are of high quality.

  • Which license tag(s) should be applied?

Public domain from Japan. 🗾

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

PD-scan.

I also think that the page from which each image 🖼 was taken should be properly linked 🔗, so that the information about the coins and the translations of the pages are easily accessible.

Sent from my Microsoft Lumia 950 XL with Microsoft Windows 10 Mobile 📱. --Donald Trung (Talk 💬) (Sockuppets 🎭) 09:49, 22 September 2017 (UTC)

Opinions

Assigned to | Progress | Bot name | Category

Illustrations of Vietnamese cash coins from Ed Toda's "Annam and its minor currency"

I would like to request a bot to fetch all images and their related text from Ed Toda’s Annam and its minor currency. I would really like to do this myself, but I have very little experience uploading public domain files and extremely little free time, and there are almost 300 images. 😅 I wouldn't request this here if I thought I had many other options, so here goes...

Sent from my Microsoft Lumia 950 XL with Microsoft Windows 10 Mobile 📱.

I plan on using all of these in the Wikipedia article “Vietnamese cash”, so I do have an immediate usage for them. I hope that these ideas 💡 are plausible. 🙇🏻

  • Source to upload from:

Ed. Toda’s Annam and its minor currency, hosted on Art-Hanoi, a website operated by Sema (known on Wikipedia as @Pyvanet~commonswiki:).

    • Do the media URLs follow a pattern?

11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 (and technically all coins here).

Do not copy anything after 21, as page 22 was created by Sema himself. Sema created the Wikipedia article I wish to add these images to and wanted to upload them himself, but he eventually gave up because too many of his files of more recent South Vietnamese banknotes were deleted. I would request those separately from him, but a ticket 🎟 would have to be obtained, and the material would need to be distinguished more clearly, as more recent currencies may violate Vietnamese copyright ©.

    • Does the site have an API?

Not that I’m aware of.

    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
    • Did you contact the site owner?

Yes, I did. The other images would have to be uploaded by him; these, however, are the images of Ed. Toda and are mere scans, so they do not pass the threshold of originality and are in the public domain.

Note 📝: The owner really does want his content here but quit here after some of his images got deleted due to Vietnamese copyright © laws.

For context: this page describes the scans in detail and is entitled “Read this first”.

  • Describe the works to be uploaded in detail (audio files, images by …):

The files are all images (scans) from Eduardo Toda’s 1882 book 📚 “Annam and its minor currency”, the authoritative English-language “classic” on Vietnamese cash coins. It covers in full detail the history and circumstances of the coinage up to its publication, both economic (resource management) and cultural (the various “religious” (read: superstitious) reasons behind the content of the alloys). The descriptions of the coins should also be uploaded (see further below). All of these files are hand-drawn illustrations of Vietnamese (Annamese/Annamite) cash coins, though they are somewhat inaccurate because they follow only a single Chinese calligraphic style. They are all from the same book and number exactly --- images.

How the files should be organised:

The files should all be named “Toda Nr.[number]”, followed if possible by their inscription in Chinese characters. The description below each file could look like this, for example: title = “Toda No. 1. 太平興寶”, description = “Quote: “(Barker: 1.6-1.16) Obverse: 太平興寶 Thai-binh-hung-bao. Reverse: The character 丁 Dinh, the name of the Dynasty.” - Ed Toda, Annam and its minor currency, 1882”, perhaps again with a link 🔗 to the appropriate page. The bot should copy all the text up to the next image; it should not copy any text before that chapter’s first image. Even long text should be copied, as these illustrations mostly rely on their context.

By “the appropriate page” I mean that if a file were uploaded from “http://art-hanoi.com/toda/15.html”, then that specific link 🔗 should be listed as “the source”.

Where white space exists between paragraphs, a line break (“<br />”) could (or should) be added.

The files should be placed in a new category called “Category:Illustrations from Annam and its minor currency by Ed Toda”, which itself falls under “Category:Coins of Vietnam” and “Category:Eduard Toda”.

  • Which license tag(s) should be applied?

The license that should be applied is:

“The author died in 1941, so this work is in the public domain in its country of origin and other countries and areas where the copyright term is the author's life plus 75 years or less.

This work is in the public domain in the United States because it was published (or registered with the U.S. Copyright Office) before January 1, 1923.

This file has been identified as being free of known restrictions under copyright law, including all related and neighboring rights.”

This applies because the book was published in Shanghai. (The writer is Hispanic, and if the book had been published in Spain the copyright © would still be valid until 2022, but it falls completely outside of this under both Chinese and U.S. law.)

For the author, each file should list “Eduardo Toda y Güell (though Dr. R. Allan Barker, “the Qui-Gon Jinn of Vietnamese cash coins”, hypothesises that Toda’s wife drew them)” (I put that part in there as I'm a joker 🃏) or, on a more serious note, “Eduardo Toda y Güell (though Dr. R. Allan Barker hypothesises that Toda’s wife drew them)”, as the authorship isn't 100% known. The source should list the appropriate page at “Art-Hanoi”, a website operated by Sema (known on Wikipedia as @Pyvanet~commonswiki:), for appropriate attribution.

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

"Template:PD-scan"

Donald Trung (talk) 13:17, 24 July 2017 (UTC)

How and where I would use these images on Wikipedia

To see how I would use these images, please see w:nl:Gebruiker:Donald Trung/Lijst van Vietnamese kèpèngs (geïdentificeerde munten), where I plan to add an illustration next to each listed coin where applicable, so that users can see more information ℹ on the coins by clicking on the images. Next to the Toda illustrations I would add an actual photograph of the coin if one is available on Wikimedia Commons (most, unfortunately, are not). --Donald Trung (Talk 💬) ("The Chinese Coin Troll" 👿) (Articles 📚) 11:32, 27 November 2017 (UTC)

Why this book (and its illustrations) are (still) important

The text itself is public domain and according to the Numismatic Bibliomania Society this book is still considered to be the main English language reference when dealing with Vietnamese cash coins.

“The E-Sylum: Volume 9, Number 47, November 19, 2006, Article 13. ONLINE BOOK: ANNAM AND ITS MINOR CURRENCY BY ED. TODA”

“Regarding last week's featured website (art-hanoi.com/collection), Dave Kellogg writes: "I know of a related website that I could not reach through the link above. It is an on-line copy of Ed Toda's book (1882) on ancient Annamese (Vietnamese) cash coins. It contains an interesting coin identifier feature which is a huge help for those of us who are marginally illiterate with Chinese characters. Here it is: coin identifier"

This book 📖 is still widely used by novice collectors of Annamese (Vietnamese) cash coins. The website I want these images uploaded from, operated by Sema (or Pyvanet~commonswiki), is the only online host I could find for this book 📖. My experiences with him have been positive so far, so if the batch scripter wants to know how the website is coded, or needs the underlying code to script against, I’m sure they could simply ask Sema.

“[The book is titled ANNAM AND ITS MINOR CURRENCY. From the web page: "This book written over 100 years ago still remains the main reference for all collectors of Vietnamese cash coins." -Editor]”

Annam and its minor currency 💲 has been used as the main English language reference for over a century, and I’m sure that these illustrations will be very useful for this subject. Even if the images won't immediately be used on Wikipedia it would still be of great educational value for Wikimedia Commons to host them.

“Dave adds: "That web page statement, "remains the main reference", is a bit of an exaggeration. That would have been true a couple of years ago if it said, "in English". There are some French references with an excellent reputation. Then, in 2004 Barker published Part 1 of what will undoubtedly become the definitive English reference. Nevertheless, Toda's on-line copy is still really helpful, and readers should know of it. Perhaps, it is still the main reference in that more people have access to it."

- © Wayne Homren, Editor”

So this book 📖 is no longer the main English-language reference, but unlike Barker’s it is in the public domain. There are indeed many French-language books 📚 that are superior to Toda’s Annam and its minor currency; however, I do not have (online) access to any of them, nor do I know which of them are in the public domain. --Donald Trung (Talk 💬) ("The Chinese Coin Troll" 👿) (Articles 📚) 09:42, 29 November 2017 (UTC)

Examples

An example of all coins from the first page would be like this:

Toda No. 1 太平興寶

Description
English: No. 1. (Barker: 1.6-1.16)

Obverse: 太平興寶 Thai-binh-hung-bao.

Reverse: The character 丁 Dinh, the name of the Dynasty.
Date
Source Annam and its minor currency (Art-Hanoi)
Author Eduardo Toda y Güell

Toda No. 2 太平興寶

Description
English: No. 2. (Barker: 1.17-1.19)

Obverse: same as before. Reverse: plain. Coins made by the king 先皇 TIEN-HOANG. White copper. Note: Though the reign title was 太平, all coins actually bear the legend 大平興寶 instead of 太平興寶 as it was stated in the original Toda's book. The former 黎 Le Dynasty. - 981-1010. General LE-HOAN ascended the throne under the name of 天福 THIEN-PHUOC, and, following the policy initiated by his predecessor, secured peace on the frontiers by successful wars against China and Ciampa.

His son and successor, called LONG-VIET, was murdered by his brother 臥朝 NGOA-TRIEU, three days after he had come to power. This prince, whose conduct was extremely cruel and bad, soon afterwards proclaimed himself king and committed every kind of excess and crime, inventing new tortures and ruining the country in every way. With his death the LE Dynasty came to an end.
Date
Source Annam and its minor currency (Art-Hanoi)
Author Eduardo Toda y Güell

Toda No. 3 天福鎮寶

Description
English: No. 3. (Barker: 2.1-2.7)

Obverse: 天福鎮寶 Thien-phuc-tran-bao, or provincial coin of Thien-phuoc. At that time, as some fifty years before in China, the provinces of Annam were called 鎮 TRAN.

Reverse: The character 黎 LE, the name of the Dynasty.
Date
Source Annam and its minor currency (Art-Hanoi)
Author Eduardo Toda y Güell

Toda No. 4 黎

Description
English: No. 4. (Barker: none)

Obverse: Only the character 黎 LE in the lower part of the square hole. Reverse: plain.

The above two coins were cast in the 5th moon of the 5th year of 大行 DAI-HANH (986). They were made principally of white copper, and are rather smaller than the ordinary Chinese cash.
Date
Source Annam and its minor currency (Art-Hanoi)
Author Eduardo Toda y Güell

I hope that these examples are clear enough for any potential scripters; if asked, I will try to make them more “upload-friendly”. 😉

Sent 📩 from my Microsoft Lumia 950 XL with Microsoft Windows 10 Mobile 📱. --Donald Trung (Talk 💬) ("The Chinese Coin Troll" 👿) (Articles 📚) 09:42, 29 November 2017 (UTC)
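
For potential scripters, the example pages above could be generated along these lines. This is only a sketch under stated assumptions: it follows the “Toda No. [number] [inscription]” titles used in the examples (the request text also mentions “Toda Nr.[number]”), fills the date with the book's 1882 publication year although the examples leave it blank, and uses the standard {{Information}} and {{en}} templates. The function names are hypothetical, and the license tag is deliberately left out.

```python
def toda_title(number, inscription=""):
    """File title following the examples above, e.g. 'Toda No. 1 太平興寶'."""
    return f"Toda No. {number} {inscription}".strip()


def toda_description(number, text, source_url):
    """Build wikitext for one illustration using the {{Information}} template."""
    # Blank lines between paragraphs of the quoted text become <br /> tags.
    body = "<br />".join(p.strip() for p in text.split("\n\n") if p.strip())
    return (
        "{{Information\n"
        f"|description={{{{en|1=No. {number}. {body}}}}}\n"
        "|date=1882\n"  # assumption: the book's publication year
        f"|source=[{source_url} Annam and its minor currency (Art-Hanoi)]\n"
        "|author=Eduardo Toda y Güell\n"
        "}}"
    )
```

The source link is passed in per file, so each description points at the specific Art-Hanoi page the scan came from.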

Opinions

Assigned to | Progress | Bot name | Category

USDA NRCS Plants Database

  • Source to upload from: http://plants.usda.gov/
    • Do the media URLs follow a pattern? Yes.
    • Does the site have an API? No.
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) valid XHTML
    • Did you contact the site owner? No.
  • Describe the works to be uploaded in detail (audio files, images by …): Public domain: 10771 photos and 7064 line drawings, with species information for categorization. There are other copyrighted images as well, some of which may be freely licensed.
  • Which license tag(s) should be applied?

{{PD-USGov-USDA-NRCS}}

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

Opinions

@Guanaco: There is a lot of copyrighted material among these images, e.g. [1] [2]. (Just because this is a U.S. government website does not mean that all the material is U.S. government material and therefore freely usable!) Actually, I have not found many images that really can be used (e.g. [3]). You should at least provide a procedure for distinguishing between copyrighted and free material. --Reinhard Kraasch (talk) 11:02, 9 July 2017 (UTC)

@Reinhard Kraasch: The gallery search function [4] has a filter by copyright status. [5]
I've found that the URLs linked by the thumbnails provide species information within <title>: https://plants.usda.gov/core/profile?symbol=HACA2&photoID=haca2_003_ahp.jpg#
The search is navigable with &page=2, 3, 4, etc.
I'm actually interested in scripting this myself now, though it would be my first batch upload task. Guanaco (talk) 14:23, 9 July 2017 (UTC)
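
A rough sketch of the helpers such a script might need, using only the Python standard library: reading the species information from a page's <title>, stepping through the search with the page parameter, and pulling the photoID out of a profile URL. The function and class names are hypothetical, the search URL itself is not reproduced here and is passed in by the caller, and nothing below contacts the site.

```python
from html.parser import HTMLParser
from urllib.parse import parse_qs, urlencode, urlparse, urlunparse


class TitleParser(HTMLParser):
    """Collect the text found inside <title> elements."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def page_title(html):
    """Return the stripped <title> text of an HTML document."""
    parser = TitleParser()
    parser.feed(html)
    return parser.title.strip()


def with_page(search_url, page):
    """Return search_url with its page query parameter set to `page`."""
    parts = urlparse(search_url)
    query = parse_qs(parts.query)
    query["page"] = [str(page)]
    return urlunparse(parts._replace(query=urlencode(query, doseq=True)))


def photo_id(profile_url):
    """Extract the photoID value from a profile URL, or None if absent."""
    values = parse_qs(urlparse(profile_url).query).get("photoID")
    return values[0] if values else None
```

The copyright filter would still have to be applied in the search URL itself, since none of these helpers can tell free images from copyrighted ones.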
@Guanaco: Well, just go on... On the other hand it always is a good idea to have a second opinion with such a batch upload - especially for the non-technical aspects. --Reinhard Kraasch (talk) 20:52, 10 July 2017 (UTC)
Assigned to | Progress | Bot name | Category

US National Archives

I am hoping to begin a bulk upload of media from the US National Archives in the next few weeks. This will be a very different approach from the first upload, which was based on uploading files from an offline drive and scraping HTML for the metadata. This time around, NARA has an API for our online catalog, and so I am building a bot, using mwclient, to upload using the live metadata and files from the API. Some details:

Dataset

The dataset includes all PD materials at https://catalog.archives.gov (API: https://catalog.archives.gov/api/v1). I plan to begin with a series of ~100,000 WWI-era photos. Technically, there are over 15 million files (and counting) in this dataset.

File names

The script is currently configured to name files with the following formula.

For single-page items:

  • "File:[TITLE] - NARA - [NAID].ext"
    Where "[TITLE]" is the catalog record's title field, and "[NAID]" is the National Archives Identifier. If this is over the character limit, "[TITLE]" is automatically truncated, with "(...)" appended.

For multi-page items (since the above formula would give all files belonging to one catalog record the same title):

  • "File:[TITLE] - NARA - [NAID] (page X).ext"
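
As an illustration of the two naming formulas, here is a minimal sketch. It is not the bot's actual code: the function name is hypothetical, and the 240-byte default limit is an assumption chosen to stay under MediaWiki's 255-byte title limit, since the exact character limit is not stated above.

```python
def nara_filename(title, naid, ext, page=None, max_bytes=240):
    """Build 'File:[TITLE] - NARA - [NAID].ext', truncating TITLE if needed."""
    suffix = f" - NARA - {naid}"
    if page is not None:
        # Multi-page items get a page marker so that file names stay unique.
        suffix += f" (page {page})"
    suffix += f".{ext}"
    marker = "(...)"
    if len((title + suffix).encode("utf-8")) > max_bytes:
        budget = max(0, max_bytes - len((marker + suffix).encode("utf-8")))
        # Cut on a byte budget, then drop any partial trailing character.
        cut = title.encode("utf-8")[:budget].decode("utf-8", errors="ignore")
        title = cut.rstrip() + marker
    return f"File:{title}{suffix}"
```

Truncating by bytes rather than characters matters for titles containing non-ASCII text, where one character can take several bytes.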

Metadata

We are developing a custom metadata mapping, since NARA does not adhere to a metadata standard. You can see the metadata template we use here: {{NARA-image-full}}. Some notes:

While all the records in this catalog come from NARA or partner institutions, there are many different facility locations, and some NARA facilities already have their own institution templates (e.g. US presidential libraries). Therefore, I am creating institution templates for all NARA locations, and the script will insert the correct institution template based on a mapping.

NARA's authority file is not yet mapped to Wikidata, though that would definitely be useful in the future. For now, we will upload files with NARA's creator and author names, their NAIDs, and links back to the catalog authority record. Including the NAIDs in a Commons template field means that, in the future, Wikidata could be used to make creator templates appear instead. Any help with this would be appreciated.

Licenses

Because NARA records are nearly all (>99%) derived from the records of US federal agencies, these uploads will use {{PD-USGov}} or its subtemplates. Most NARA records are in one of about 600 record groups based on their creating agency, so I am using a mapping of NARA record groups to Commons PD-USGov templates so that the bot can apply the more specific agency templates in most cases. Help filling out this mapping would be appreciated.

Nearly all holdings of the US National Archives are in the public domain as a work of the federal government (or, otherwise, due to age). This is marked in the "use restriction" field in the catalog, with a value of "Unrestricted" indicating public domain determination by the archivists. Therefore, the script will be configured to skip over any records in which the use restriction is anything other than "unrestricted" (even "possibly" ones, which could ultimately be PD, but need a human determination).
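
The license logic described in this section could be sketched roughly as follows. This is an illustration rather than the bot's implementation: the function name and argument shapes are hypothetical, and the single mapping entry is an example, not taken from the real record-group mapping.

```python
def choose_license(use_restriction, record_group, group_templates):
    """Return a license template for a record, or None if it must be skipped."""
    # Only records the archivists marked "Unrestricted" are uploaded;
    # anything else, including "possibly" values, needs a human decision.
    if use_restriction.strip().lower() != "unrestricted":
        return None
    # Prefer the agency-specific template, else fall back to {{PD-USGov}}.
    return group_templates.get(record_group, "{{PD-USGov}}")


# Illustrative entry only; the real mapping covers about 600 record groups.
EXAMPLE_TEMPLATES = {111: "{{PD-USGov-Military-Army}}"}
```

Returning None for skipped records keeps the skip decision separate from the upload loop, which makes it easy to log which records were passed over.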

Categories

All uploads will be automatically categorized by the metadata template into Category:Media contributed by the National Archives and Records Administration and a category for the series they belong to (such as Category:US National Archives series: DOCUMERICA: The Environmental Protection Agency's Program to Photographically Document Subjects of Environmental Concern, compiled 1972 - 1977). Eventually, the script will be designed to create the series category if a file is uploaded for a series which does not yet have one.

When it comes to topical categories, past NARA uploads utilized the {{uncategorized}} tag to encourage the community to add topical tags. However, since this creates work for the community, I am planning this time around to run uploads in small batches (hundreds to a few thousand files at a time), so that I can upload them with one or more topical categories that apply to all records in the batch, rather than uncategorized.

Code

You can find the upload bot's code at https://github.com/usnationalarchives/wikimedia-upload. This project is being developed in public on NARA's official GitHub account. I would welcome collaboration (pull requests or otherwise) there. In addition, the Commons community is welcome to file issue reports on that repo.

Examples

The most recent test uploads can be viewed in Category:US National Archives series: American Unofficial Collection of World War I Photographs. I am still polishing the upload script, but these examples essentially represent what should be expected from the bot once it gets started.

Opinions

The bot account is technically already flagged from the last bulk upload a couple of years ago; however, I would like to submit the current plan for community review before restarting uploads. If there are any opinions on the bot's design, the format of uploads, or other issues, I am happy to hear them. We'd also like to know whether to limit what is uploaded in any way: would Commons actually be interested in 15 million files, or might some of these, like the millions of census cards, not be of interest? Also, if anyone is interested in helping out with the coding or other tasks, please feel free to let me know. This is a big undertaking. Thanks! Dominic (talk) 17:25, 31 May 2017 (UTC)


Assigned to | Progress | Bot name | Category
User:Dominic | Coding | User:US National Archives bot | Category:Media contributed by the National Archives and Records Administration

Catharijne Convent

  • Source to upload from: http://museumcommons.wikimedia.nl/
    • Do the media URLs follow a pattern?
    • Does the site have an API? No
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
    • Did you contact the site owner? Yes; this site was specifically set up for this particular upload, as the museum itself claimed not to be able to host the images online.
  • Describe the works to be uploaded in detail (audio files, images by …): 572 high-res TIFF files of objects held by the Museum Catharijneconvent in Utrecht, The Netherlands. The metadata were provided as XML and are available as well.


  • Which license tag(s) should be applied?

Probably [6].

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?


AWossink (talk) 13:17, 14 April 2016 (UTC)

Opinions

AWossink, I could look into uploading this one as well. The linked source only has one file currently, and do you have the xml? Basvb (talk) 13:59, 14 April 2016 (UTC)
Aah I see that I likely misunderstood, you're planning on uploading this batch yourself using GWToolset Arne? See phabricator:T131841 Basvb (talk) 14:16, 14 April 2016 (UTC)
@Basvb: Yes, I was planning to do that in order to get some experience in the GWToolset. However, since this batch has been sitting around for quite a while and uploading with the GWtoolset seems to be a lot slower than your script, it would be great if you are still interested in taking care of this upload as well! Let me know if that suits you - otherwise I am happy to continue as I originally planned. Best, AWossink (talk) 14:26, 14 April 2016 (UTC)
Is it really slow with GWtoolset (more than a day for all, nice thing with GWToolset is that it can run while your PC is off)? The file sizes are quite large compared to the tropenmuseum upload, so any script will be slower. Basvb (talk) 15:04, 14 April 2016 (UTC)
Assigned to | Progress | Bot name | Category
AWossink | Resolved | Uploaded with Pattypan instead of GWT | Category:Textiles in Museum Catharijneconvent

Trainpix

  • Source to upload from: http://trainpix.org
    • Did you observe a URL pattern:
      Yes. Each photo on this site has an individual number from 1 to 145250 (the last number as of 11:50, 12 October 2015 (UTC)) and is available at a link like trainpix.org/photo/<number of photo>. To get the direct URL of the JPG file:
      • pad the photo number with leading zeros to get a seven-digit number
      • remove the last digit
      • split the remaining six digits into three two-digit numbers separated by "/"
      • insert these numbers between trainpix.org/photo/ and the number without leading zeros, and append .jpg

For example, the direct file link of photo with number 4 (http://trainpix.org/photo/4) is http://trainpix.org/photo/00/00/00/4.jpg ; for photo with number 12345 (http://trainpix.org/photo/12345) is http://trainpix.org/photo/00/12/34/12345.jpg
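
The URL rule above can be written as a short function. This is a sketch with a hypothetical function name; it assumes photo numbers of at most seven digits, as implied by the padding step.

```python
def trainpix_jpg_url(photo_number):
    """Return the direct JPG URL for a Trainpix photo number."""
    padded = f"{photo_number:07d}"  # pad to seven digits, e.g. "0012345"
    prefix = padded[:-1]            # remove the last digit, e.g. "001234"
    # Split the six remaining digits into three two-digit path segments.
    parts = [prefix[i:i + 2] for i in range(0, 6, 2)]
    return f"http://trainpix.org/photo/{'/'.join(parts)}/{photo_number}.jpg"
```

Called with 4 and 12345, it reproduces the two example links given above.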

    • Do you know whether the site has an API — No
    • What else can ease uploading (is the site valid XHTML, WCM they use…)?
    • Did you contact the site owner? — No
  • Describe the works to be uploaded in detail (audio files, images by …):

Trainpix is a Russian photo gallery of railway rolling stock (the default language is Russian; the site is also available in English, Ukrainian and Belarusian). The first line under each photo gives the name of the rolling stock (model and number), which should be used as the file name on Commons (it can also be followed by the photo's number on this site, as with bot-uploaded photos from Flickr). The second line gives the place where the photo was taken. The third line gives the author's name and the date. On the left of the page, each photo has a license tag (in Russian, Лицензия), one of copyright, zero, by, by-sa, by-nc, by-nc-sa, by-nd, by-nc-nd and mark. Any registered user of the site can select a license in the upload form (screenshot), which also includes a “Please read before license selection” hyperlink to a page describing the Creative Commons 3.0 licenses. I do not understand what the mark license means (its description in the upload form is “No author rights”) or whether it is similar to Public Domain, but all photographs with the free licenses zero, by, and by-sa should be uploaded to Commons. Note that some numbers belong to deleted photos (example: http://trainpix.org/photo/1/).


  • Which license tag(s) should be applied?
    • {{cc-zero}} — for images with license tag "zero"
    • {{cc-by 3.0}} — for images with license tag "by"
    • {{cc-by-sa 3.0}} — for images with license tag "by-sa"


  • Is there a template that could be used on the file description pages?

No

  • Do you think a special template should be created?

Maybe. Each rail vehicle on this site has a description table with its registration railway, model, current condition, etc.

Xenotron (talk) 15:34, 21 August 2015 (UTC)

Opinions

Assigned to | Progress | Bot name | Category

40,000 from zeno.org

Any chance of getting a new set of high-resolution images from Directmedia/zeno to replace the Yorck Meisterwerk Project images? Even in 2005 many of them looked completely inadequate because of the very low (40%) JPEG quality (see Commons talk:10,000 paintings from Directmedia/Requests for improvement). Zeno.org has much better images available now. There are so far over 2700 images in Category:Images from zeno.org, but they appear to have been uploaded ad hoc by individuals, not as part of a particular project (or by a particular bot). That compares to 10,374 in Category:PD-Art (Yorck Project), and Zeno.org now claims to have 40,000 Meisterwerk, which appear to be of much higher quality than the original set. No doubt the original 10,000 are included in the current 40,000.

Because the Directmedia/Yorck Meisterwerk Project was the first big batch upload of artwork, the images are heavily used throughout the Wikipedia sites. It is precisely for this reason that updating them is important. So: is it possible to replace the original 2005 images on Commons? If not, then after uploading the new images, all of the old images would need to be marked with {{superseded}} and all of the pages that use the old images updated to use the new ones (by bot).

Would it be easier to get a new set of images directly from Directmedia as part of GLAM? They might even provide higher resolution images than what is on their own site.

Note that the zeno.org site is in German, and they have a lot of other things besides paintings. Laura1822 (talk) 10:12, 13 July 2015 (UTC)

  • Source to upload from:   
    • Do the media URLs follow a pattern?   
    • Does the site have an API?   
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)   
    • Did you contact the site owner?   
  • Describe the works to be uploaded in detail (audio files, images by …):   
  • Which license tag(s) should be applied?   
  • Is there a template that could be used on the file description pages? Do you think a special template should be created?   

Opinions

Assigned to | Progress | Bot name | Category

ESA-Rosetta-NAVCAM

  • Describe the works to be uploaded in detail (audio files, images by …):
Images of the comet 67P/Churyumov-Gerasimenko taken by the NAVCAM on the Rosetta spacecraft.


  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

Yann (talk) 14:32, 6 June 2015 (UTC)

Opinions

Assigned to Progress Bot name Category

Old requests (over two years)

Batch uploads in progress

AucklandMuseumCCBY

Request


General notes

There was a Wikimedian in Residence over 5 weeks in the summer when some uploads were done. AM blog post

At the time of upload, the API returned 70,789 objects marked with a CCBY license. It is uncertain how many valid images this relates to; based on the first tranche of uploads, this may result in ~250,000 photographs.

Technical notes

Files are in the format File:<title> (AM <accession number>-<sequential image number>).jpg

If there is only one image for the object, then no sequential image number is used. All images are added to a gallery using the "other versions" parameter of the Artwork template. Should any one image fail to load, the gallery will not be fixed automatically.
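The naming rule above can be sketched as a small helper. This is only an illustration, not the uploader's actual code: the function names are hypothetical, and it assumes that sequential image numbers start at 1, which the notes do not state.

```python
def build_filename(title, accession_number, image_index=None):
    """Build a name of the form File:<title> (AM <accession>-<n>).jpg."""
    # The "-<n>" part is left out when no sequential number is given.
    suffix = f"-{image_index}" if image_index is not None else ""
    return f"File:{title} (AM {accession_number}{suffix}).jpg"


def filenames_for_object(title, accession_number, image_count):
    """Return the file names for every image of one object."""
    if image_count == 1:
        # A single image gets no sequential image number.
        return [build_filename(title, accession_number)]
    # Assumption: numbering starts at 1.
    return [
        build_filename(title, accession_number, index)
        for index in range(1, image_count + 1)
    ]
```

For example, `filenames_for_object("Badge", "1996.71.1", 2)` (an invented title and accession number) would give two names ending in `-1` and `-2`, which is also the list a gallery in "other versions" would be built from.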

AM API

The API swagger page seems to never have been created, so the API usage is suck-it-and-see. Requests to the API are specified as limited to 1000 per day, though in practice this has yet to be an issue.

The API path for discovering the URIs of the data needed is:

(search) -> (objects) -> (media or dimensions)

Though the (search) can return many objects, the object requests have to be one at a time. It is unclear if the API throttling actually counts all different types of request.
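A rough sketch of that traversal, with a request counter, might look as follows. Everything API-specific here is an assumption: `fetch` is a caller-supplied function that turns a URI into a parsed dictionary, and the keys `"objects"` and `"media"` are placeholders, not the real field names of the AM API. Because it is unclear which request types count towards the limit, the sketch conservatively counts all of them.

```python
def walk_search_page(fetch, search_uri, daily_limit=1000):
    """Follow (search) -> (objects) -> (media), one object request at a time.

    Returns a list of (object_uri, media_uris) pairs and the number of
    requests made. The dictionary keys used are hypothetical placeholders.
    """
    requests_made = 0

    def counted_fetch(uri):
        nonlocal requests_made
        # Conservative: every request type counts towards the daily budget.
        if requests_made >= daily_limit:
            raise RuntimeError("daily request budget exhausted")
        requests_made += 1
        return fetch(uri)

    results = []
    search = counted_fetch(search_uri)
    for object_uri in search["objects"]:
        # Objects cannot be requested in bulk, so fetch them individually.
        obj = counted_fetch(object_uri)
        results.append((object_uri, list(obj.get("media", []))))
    return results, requests_made
```

Passing `fetch` in as a parameter keeps the sketch independent of how the header field required by the API is supplied.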

Searching is arbitrarily chosen to be for 100 objects at a time (originally 10); each object may have many images. The upload comment shows the search page, the number of the object within that page and the sequential number of the image for the object. This is for debugging or restarts; it has no long-term value. From day to day, the API search appears to return the objects in different orders. It is unclear why this happens, but it is probably because the search returns the most recently changed catalogue entries first. This means that a restarted batch upload should be run from the first page, rather than jumping to the last used position.

The API structure is complex, with hierarchical families of relationships. For the most part these appear redundant for Commons' purposes, for example identities of curators or massive detail of material composition, when the descriptive text probably suffices.

Metadata mapping

Dates may be difficult to extract from the metadata. Field names may not be obvious and there is no dictionary to reference. For example, precise-looking "period" dates may be estimates even though they are not marked as such, and the "exact" date appears to be used for the date of acquisition even when it is recorded against the object "made" metadata. It is not possible to tell whether this is by design or whether practical usage has ended up mapping things differently from the literal descriptions.

Questions may arise from the date parameter including several dates, such as "20th century; 1935-1949; 1996". In these cases the metadata is picking up different ways of describing the object's creation period, and the year the object was donated to the museum is also included, which will always be the most recent year given. For copyright purposes, the earliest definitive date is the most useful, and the most accurate.
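Picking the earliest definitive year out of such a field could be sketched like this. It is an illustration only: the function name is hypothetical, and it assumes that a "definitive" date is any standalone four-digit year, so centuries and decades such as "1850s" are ignored.

```python
import re

# A standalone four-digit year: not part of a longer number, and not a
# decade such as "1850s". This definition of "definitive" is an assumption.
YEAR = re.compile(r"(?<!\d)(\d{4})(?!\d|s)")


def earliest_year(date_field):
    """Return the earliest four-digit year in the field, or None."""
    years = [int(year) for year in YEAR.findall(date_field)]
    return min(years) if years else None
```

With the example above, `earliest_year("20th century; 1935-1949; 1996")` picks 1935 and ignores the 1996 donation year. A field that contains only one year cannot be distinguished from a donation year this way, so single-year results would still need a sanity check.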

The description is based on the object's content field. Where notes exist against the object, they are added separately. Where the text of an object note exactly matches the text in content, it is skipped.
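As a minimal sketch of that rule, assuming "exactly matches" means plain string equality and using a hypothetical function name:

```python
def build_description(content, notes):
    """Return the description parts: content first, then any distinct notes."""
    parts = [content] if content else []
    for note in notes:
        # A note that exactly repeats the content adds nothing, so skip it.
        if note == content:
            continue
        parts.append(note)
    return parts
```

Returning a list rather than a joined string leaves it to the caller to decide how the notes are laid out on the file description page.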

Dimensions were added late in the batch run. These have to be detected as a property of the object, then a dictionary of URIs has to be interrogated. This may add to the API throttle count. Dimensions look like (type, value) where type can be anything like {length, height, note, string of multiple dimensions}.

Copyright

Some objects returned include modern era works, such as lacework or clothing. Though the photograph is CC-BY, it is possible that some of these will have sufficient artwork content to be counted as non-utilitarian objects and potentially be copyrightable, at which point copyright can often be determined by age.

Most modern objects appear to be mass produced design rather than drawings or other art. It is presumed that files selected by the museum do not represent any obvious copyright issue, including photographs of objects from the 1960s or later. Initial uploads show some images will be problematic under Commons' policies, for example modern political badges with drawings or product labels later than the 1940s with potentially copyrighted imagery. As these appear to be at the level of less than 1% of uploads, and not obvious from searching metadata at source, they can be weeded out as housekeeping.

Reference deletion requests
Failure types
  • Image not retrievable, e.g. http://api.aucklandmuseum.com/id/media/v/390708 gives Error 500. Image is skipped after failing.
  • Page not found, e.g. http://api.aucklandmuseum.com/id/humanhistory/object/15043 gives Error 400, skipped.
  • No am:accessionNumber found. Accession number is replaced with the database object number. As accession numbers always appear to have periods, this should not lead to any confusion. The root cause is likely to be that the object is in a non-AM collection, in these cases the am:creditLine gives more detail on alternate collection reference numbers.
  • Blank images exist, such as http://api.aucklandmuseum.com/id/media/v/358003. These are detected and skipped if the object has one image and the image is 800x800 (the default blank card size).
  • Corrupt downloads. Because a header field needs to be passed to the API with any image download request, a custom Python method of opening the file has been used, which is less reliable for larger binary files than Python's urlretrieve function. In a handful of cases this unexpectedly led to partly corrupted downloads, missing some of the last rows of pixels. This might have been an intermittent issue with a poor home internet connection. An additional test which "looks" at the last row of the downloaded image is used (relying on PIL modules), and the download is reattempted a few times, which should stop this recurring.
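Three of the failure rules above can be sketched in a few lines. This is not the uploader's code: the function names are hypothetical, `download` and `looks_complete` are caller-supplied callables (the real check inspects the last image row with PIL, which is not reproduced here), and the default of three attempts is an assumption standing in for "a few times".

```python
def accession_or_object_number(accession_number, object_number):
    """Fall back to the database object number when no accession number exists."""
    return accession_number if accession_number else str(object_number)


def is_blank_card(image_count, width, height, blank_size=(800, 800)):
    """Treat a lone image of the default blank card size as a blank image."""
    return image_count == 1 and (width, height) == blank_size


def download_with_retries(download, looks_complete, attempts=3):
    """Retry a download until the completeness check passes.

    Returns the downloaded data, or None if every attempt failed the check.
    """
    for _ in range(attempts):
        data = download()
        if looks_complete(data):
            return data
    return None
```

Returning `None` after the last attempt lets the caller skip the image, in the same way as the other failure types.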
Housekeeping

Due to the varied nature of failure types, rather than working out where images are failing to be uploaded before adding predicted galleries to image pages, a housekeeping process goes through the collection looking for broken links. When these are in a gallery, and the image was uploaded at least a day ago, the link is removed. Example diff.
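The housekeeping rule could be sketched as below. The details are assumptions made for illustration: gallery entries are taken to be lines of the form `File:Name.jpg|caption`, `file_exists` is a caller-supplied lookup, and the function name is hypothetical.

```python
from datetime import datetime, timedelta


def prune_gallery(gallery_lines, file_exists, uploaded_at, now,
                  min_age=timedelta(days=1)):
    """Drop gallery entries whose file is missing, once the page is a day old."""
    if now - uploaded_at < min_age:
        # Too recent: sibling images may still be on their way.
        return list(gallery_lines)
    kept = []
    for line in gallery_lines:
        # Assumed entry format: "File:Name.jpg|caption".
        name = line.split("|", 1)[0].strip()
        if file_exists(name):
            kept.append(line)
    return kept
```

Waiting a day before pruning avoids removing links to images that are simply later in the same upload run.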

Categories

The main/bucket category is Category:Images from Auckland Museum where the institution template is displayed and sub-categories are added.

Type category

New categories are created based on the object type. Examples:

Some of these may turn out to be not great, but it's a starting point.

Date category

Where the date fields include "19th century" or some other match to the regular expression "\d?\d(st|th) century", then a century category of the format Category:19th century in Auckland Museum is added. As the date fields are flexibly used, this will miss many items with periods or ranges, such as "George V (1910 - 1936)/House of Windsor/English reign".

When a date field matches a decade, like 1850s, or another match to "\d{4}s", then a decade category of the form Category:1850s in Auckland Museum is added. These are initially placed in the top category, but should be manually moved under the century as they arise.
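The two matching rules can be sketched with the regular expressions quoted above. The function name is hypothetical, and taking only the first match of each pattern is an assumption.

```python
import re

CENTURY = re.compile(r"\d?\d(st|th) century")
DECADE = re.compile(r"\d{4}s")


def date_categories(date_field):
    """Return century and decade categories found in a date field."""
    categories = []
    for pattern in (CENTURY, DECADE):
        match = pattern.search(date_field)
        if match:
            categories.append(f"Category:{match.group(0)} in Auckland Museum")
    return categories
```

As the text notes, a value such as "George V (1910 - 1936)/House of Windsor/English reign" matches neither pattern and yields no category. The century pattern as quoted also cannot match "2nd" or "3rd" centuries.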

Progress

Assigned to Progress Bot name Category
  • 24 Nov 2017 Started
  • 26 Nov 2017 Metadata usage improved, such as date categories, credit line and dimensions. Not retrospective.
  • 6 Dec 2017 Housekeeping of broken links in galleries created.
  • 7 Dec 2017 Change to 100 objects per page rather than 10, speeding up re-starts after recent outage problems with WMF servers.
  • 10 Dec 2017
    • Where a value is set for am:onDisplayFlag, this is taken as current display location and shown under exhibition history.
    • Restarts now jump to results page rather than iterating through them. It presumes returned pages are always the expected perpage value.
NA Images from Auckland Museum

Specific search

Batch uploads on hold

Past batch uploads

2005 - 2009

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
10,000 paintings from Directmedia 10,000 public domain images digitized by the Yorck project and contributed to commons 10,000 Eloquence File Upload Bot (Eloquence) PD-Art (Yorck Project)
Picswiss project Roland Zumbühl agreed to release his images, depicting various areas and subjects in Switzerland, under the GFDL. 5,000 of 13,000 Dake Dake Images from Picswiss
Bundesarchiv From the German Federal Archive, the images depict Germany between the 19th and 20th century including valuable photographs of the Nazi era and World War II. 100,000 Duesentrieb BArchBot Information fetch Images from the German Federal Archive Bundesarchiv <id>, <desc>
Starr images Images of plants of Hawaii 60,000 Multichill Multichill Images from Forest & Kim Starr Starr <date>-number <taxon/desc>
Wenceslas Hollar Digital Collection A collection of 2700 high resolution images of engravings of Wenceslas Hollar, about 90% of his life works 2,700 Dcoetzee Dcoetzee University of Toronto Wenceslas Hollar Digital Collection
National Portrait Gallery Various portraits of famous people between the 16th and 19th century. 3,000 Dcoetzee Dcoetzee National Portrait Gallery, London
Deutsche Fotothek Images from Deutsche Fotothek mainly about east Germany between the 19th and 20th century including the Bombardment of Dresden and other events. Only 25% of the images have been uploaded till now. 62,104 of 250,000 Multichill FotothekBot Tools used Images from the Deutsche Fotothek Fotothek <id> <desc>
Berger Collection A collection of high resolution images of paintings and other works from the Berger Collection, depicting British art, culture and people. 140 Dcoetzee Dcoetzee Berger Collection
Great Images in NASA Images from Great Images in NASA 1,400 TheDJ Multichill Great Images in NASA
Alaska-Yukon-Pacific Exposition of 1909 High-resolution scans of documents from the Alaska-Yukon-Pacific Exposition found here. 700 Dcoetzee Dcoetzee Alaska-Yukon-Pacific Exposition
Commanster Pictures of plants, animals, birds and insects of Commanster, Belgium by James Lindsey 6,000 Sarefo Sarefo Pictures by James Lindsey
WLANL Images from Wiki Loves Art Netherlands imported from the Flickr group pool, depicting the Netherlands and its different museums. 4,000 Multichill BotMultichillT Images from Wiki Loves Art Netherlands WLANL - <team> - <desc>
FEMA site All the images found on the US Federal Emergency Management Agency Disaster Photo Library were copied to Commons, depicting US environmental disasters and emergency actions. 20,000 Multichill BotMultichillT script PD US FEMA FEMA - <id> - Photograph by <photographer> taken on <date> in <location>
AntWeb images All the images found on http://www.antweb.org/ depicting different species of ants. 32,000 Dave Thau File Upload Bot (AntWeb) Images from AntWeb <desc> <specimenID> profile <viewnumber>
Images of erosion All the images found on http://picasaweb.google.com/VolkerPrasuhn depicting erosions. 700 Leyo manual Images by Volker Prasuhn
livepict.com All the images found on http://livepict.com/ depicting bands. 1000 Justass Justass Images from LivePict
Tropenmuseum A partnership with Tropenmuseum 40,000 Multichill KITbot svn Images from the Tropenmuseum COLLECTIE TROPENMUSEUM <desc> TMnr <id>

2010 - 2013

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
Randolph Caldecott All pages in The complete collection of pictures & songs / by Randolph Caldecott 510 Diaa abdelmoneim Dudubot upload.py The complete collection of pictures & songs by Randolph Caldecott Randolph Caldecott collection-page <page>
Rob Lavinsky Mineral images from Rob Lavinsky on mindat.org 34,917 Reinhard Kraasch RKBot upload.py + pyodbc Images by Rob Lavinsky <mineral1>[-<mineral2>[<mineral3>]]-<mindatID>
Rob Lavinsky Mineral images from Rob Lavinsky on irocks.com 20,582 Reinhard Kraasch RKBot upload.py + pyodbc Images by Rob Lavinsky <mineral1>[-<mineral2>[<mineral3>]]-<irocks file name>
Bibliothèque Nationale de France Books provided by the Bibliothèque Nationale de France (French National Library) as part of a partnership with Wikimédia France 1,413 Seb35 (with help from Plyd and Jean-Fred) BnF import, operated by Tim Starling svn Books provided by the BNF <Author> - <Title>.djvu
Erling Mandelmann Portraits of notable people donated from Erling Mandelmann 581 Diaa abdelmoneim Dudubot Photographs by Erling Mandelmann <Title> - <Author>
Travelers in the Middle East Archiven Historical images from books about the Middle East from Travelers in the Middle East Archive, provided by Rice University 2,277 Diaa abdelmoneim Dudubot Images from the Travelers in the Middle East Archive "<Title>" (<Year>) - TIMEA
Fonds Eugène Trutat Photographs by famous French photographer Eugène Trutat, donated by the City Archives of Toulouse as part of a partnership with Wikimédia France 200 Jean-Frédéric TrutatBot GitHub Fonds Trutat - Archives municipales de Toulouse <Title> (<Year>) - <Id> - Fonds Trutat
Nordiska Museet A collection of early photographs, donated by Nordiska Museet as part of a collaboration with Wikimedia Sverige. 1,000 Prolineserver NordiskaMuseetBot Toolserver Images from Nordiska museet <Title> - Nordiska Museet - <Id>.jpg
Commons:Batch uploading/Adams Ansel Adams National Park Service photographs 221 User:Kaldari User:File Upload Bot (Kaldari) Perl Category:2011 Ansel Adams donation from U.S. National Archives Ansel Adams - National Archives - 79-AA-<digit digit>.jpg
Web Gallery of Art Large collection of well documented artworks. Uploaded ~15k new files and synchronized metadata for ~6k already uploaded files 21,700 Jarekt JarektUploadBot UploadWGA.py, FixWGAMetadataInfo.py, FixWGAMetadataArt.py Images from Web Gallery of Art <Author> - <Title> - WGA<ID>.jpg
Commons:Batch_uploading/Monument_lists Images of German cultural heritage monuments 3000? User:Elya, User:Raymond User:SternthalerBot cat <STRING>-Nr. <##>, <STRING> (<####>).jpg
Commons:Chris's Acorns Large collection of Acorn computer hardware and peripherals from Chris's Acorns 1700 Smallman12q Smallbot C#4 w/ LINQ and MSHTML interop Chris's Acorns just filename...no format
Commons:Batch uploading/Flickr Fotostream of NOAA Photo Library Botanical images  ? User:Kobac Category:Images_from_NOAA
Walters Art Museum Collection of 3D and 2D artworks from around the world 19,000 Kaldari File Upload Bot (Kaldari) modified botclasses.php Media contributed by the Walters Art Museum <Author> - <Title> - Walters <ID> - <View>.jpg
Commons:Bible Illustrations Bible illustrations 2993 Smallman12q OrophinBot VBScript, XHR, XMLDOM, MSHTML, COM Media contributed by the Sweet Publishing <name> <chapter>-<section> (Bible Illustrations by Sweet Media).jpg
Flora Batava Illustrations of all plants in the Netherlands 1582 Rillke FloraUploadR own implementation using VB6/COM/C++ Files uploaded from Flora Batava by FloraUploadR <latin plant name> — Flora Batava — Volume v<number>.jpg
Commons:Bots/Requests/Smallbot 2 Oregon Historical County Records Guide 4273 Smallman12q Smallbot VBScript, XHR, XMLDOM, MSHTML, COM Category:Images_from_Oregon_Historical_County_Records_Guide <name> (<Countyname> County, Oregon scenic images) (<id>).jpg
The World's Columbian Exposition PD-Photos of The World's Columbian Exposition 115 Rillke RillkeBot own implementation using VB6/COM/C++ World Columbian Exposition taken by Press Chicago Photo-Gravure Co. <caption> — Official Views Of The World's Columbian Exposition — <file number>.jpg
Defenselink Defense.gov News Photos 14572 Slick Slick-o-bot pywikipediabot and some bash scripts Defense.gov News Photos to check Defense.gov News Photo <VRIN>[ - description].jpg
U.S. Army Map Service Maps of India and Pakistan from the U.S. Army Map Service 304 Slick Slick-o-bot pywikipediabot and some bash scripts India maps by U.S. Army Map Service Map India and Pakistan 1-250,000 Tile <tile name>.jpg
Defense.gov Photo Essays Defense.gov Photo Essays 23106 Slick Slick-o-bot pywikipediabot and some bash scripts Category:Defense.gov photo essays to check Defense.gov photo essay <VRIN>.jpg
Navy SEAL pics and vids Navy SEAL pics and vids 682 Slick Slick-o-bot pywikipediabot and some bash scripts Category:United States Navy SEALs Images to check United States Navy SEALs <NUMBER>.jpg
Beaverton, Oregon Historical Photo Gallery Beaverton, Oregon Historical Photo Gallery 305 Smallman12q Smallbot VBScript, XHR, XMLDOM, MSHTML, COM Category:Beaverton, Oregon Historical Photo Gallery <name> (Beaverton, Oregon Historical Photo Gallery) (<number>).jpg
ForestWander Mostly nature photos from West Virginia 2600 Rillke Forestwander Nature Photography upload bot own implementation using VB6/COM/C++ Category:Bot-uploaded files from Forestwander Nature Photography <name> - [West Virginia|Virginia] - ForestWander.jpg
Navy SEAL pics and vids U.S. Navy SEALs pictures and videos 681 pics, 56 vids Slick Slick-o-bot pywikipediabot and some bash scripts Category:United States Navy SEALs Images to check Category:United States Navy SEALs Videos to check images: United States Navy SEALs <number>.jpg, videos: different
Umair Zafar fashion shoot Umair Zafar fashion shoot 91 Slick Slick-o-bot pywikipediabot and some bash scripts Category:Images from Umair Zafar fashion shoot to check different
New Orleans Bee New Orleans Bee 136667 Slick Slick-o-bot pywikipediabot and some bash scripts Category:The_New_Orleans_Bee_by_year The New Orleans Bee <year> <month> <number>.pdf
Brooklyn Museum Brooklyn Museum 3629 Slick Slick-o-bot pywikipediabot and some bash scripts Category:African art in the Brooklyn Museum Brooklyn Museum <ID> <SHORT DESC>.jpg
U.S. Marines Corps U.S. Marines Corps 77288 Slick Slick-o-bot pywikipediabot and some bash scripts Category:Marines.mil_images_to_check USMC-<NUMBER>.jpg or USMC-<VRIN>.jpg
Photographic History of the Civil War Photographic History of the Civil War 3668 Mattwj2002, Slick Mattwj2002, Slick-o-bot pywikipediabot and some bash scripts Category:The_Photographic_History_of_The_Civil_War The Photographic History of The Civil War Volume <VOLUME> Page <NUMBER>.jpg
Rijksdienst voor het Cultureel Erfgoed Photos of historic buildings in the Netherlands (Rijksmonumenten) 4650000 Multichill Multichill pywikibot based Category:Images from the Rijksdienst voor het Cultureel Erfgoed <title> - <id> - RCE.jpg
Commons:Batch uploading/AELG Photos of Galician writers 800 User:Smallman12q User:Smallbot Category:Images from AELG <NAME> (AELG)-<N>.jpg
Defence Imagery (UK) High quality selected photographs by the UK Ministry of Defence (MoD), released on the Open Government Licence (equivalent to Public Domain with an attribution requirement) 2,880 pywikipediabot Category:Images from MoD uploaded by Fæ <MoD title> MOD <file number>.jpg
Weather maps Weather maps of the USA, daily and weekly from the U.S. National Oceanic and Atmospheric Administration 20,000 (10 year archive) and ongoing at 5 new maps per day User:Fæ User:Fæ pywikipediabot Category:NCEP maps by year <YYYY-MM-DD> <map type> NOAA.png
Los Angeles County Museum of Public Art Art history - photographs of artifacts from LACMA 22,000 pywikipediabot Category:Images from LACMA uploaded by Fæ <LACMA description> LACMA <Accession Number>.jpg
LSH Objects in the LSH-museum collections 19,961 (approx 1,500 missing from / misnamed on drive and still to be uploaded) Lokal_Profil LSHuploadBot own script Images from Livrustkammaren och Skoklosters slott med Stiftelsen Hallwylska museet <description> - <mueseum> -_ <imageid>.<filetype>

2014

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
Fonds Trutat − Muséum de Toulouse Historical images by Eugène Trutat 213 Jean-Frédéric TrutatBot GitHub Category:Media contributed by the Muséum de Toulouse <Title> - Fonds Trutat - <Id>
Archives Nationales (France) Archive documents from the French history 77 Jean-Frédéric ArchivesNationalesBot GitHub Category:Media contributed by the Archives Nationales (France) <Title> <Page> - Archives Nationales - <Id>
Commons:Batch uploading/World Digital Library Old books from WDL - - Pywikibot
geo/map-marker icons by Nicolas Mollet more than 700 free icons to use as placemarks for POI (points of interest) locations on maps 6,880 Rillke GeoUploadR node.js / nodemw Category:Map icons by Nicolas Mollet – Uploaded by GeoUploadR Map marker icon – Nicolas Mollet – <Title> – <Category> – <Style>.png
EnergieagenturNRW Contemporary - active North Rhine-Westphalia (German) politicians 2,249 (61% of the Flickrstream) pywikipediabot EnergieagenturNRW <Flickr title> (<Flickr ID>).jpg
RA Coat of Arms drawn by the National Archive of Sweden 336 André Costa (WMSE) RA-uploadbot PyCJWiki (modified) Coats of arms by the National Archives of Sweden <name> <type>vapen - Riksarkivet Sverige.png
Atlas de Wit 17th-century Dutch atlas of the lower countries from the collections of the Koninklijke Bibliotheek (Dutch National Library) 145 Husky HuskyBot Pywikibot (script) Atlas de Wit 1698 Atlas de Wit 1698-<page>-KB PPN 145205088.jpg
goodfreephotos.com different public domain images, landscapes, objects and so on ... 3547 Slick Slick-o-bot pywikipediabot and some bash scripts Category:Images_from_goodfreephotos.com and Subcats of Category:Import by User:Slick-o-bot/Images from goodfreephotos.com (based on galleries for maintenance) Gfp-<name>.jpg
Sustainable Sanitation Alliance Contemporary photographs of sustainable sanitation, Africa 9,810 pywikipediabot Files created by Sustainable Sanitation Alliance (SuSanA) <Flickr title> (<Flickr ID>).jpg
KNBLO Images of the Vierdaagse (walking event) from 1910-1940 1,183 GWToolset (Basvb) Basvb GWToolset Images from KNBLO <description> - <id> - KNBLO.jpg
(upload) (description fixes) Wigman Images of nature photographer A.B. Wigman 576 Basvb GA Ede (upload) BasBot (description fixes) Uploadwizard (upload) pywikibot (description adding) A.B. Wigman/Images from Gemeentearchief Ede (could be filled with other images as well) <description> - A.B. Wigman - <id>.jpg
Commons:Batch uploading/Atlas of Mutual Heritage Old maps 2479 User:Husky and User:Gerritdeveer1597 User:HuskyBot Category:Media from Atlas of Mutual Heritage AMH-NNNN-XX <description>.jpg
Commons:Batch uploading/Wellcome Images Medical history 99,000 - pywikibot
RCE shipwrecks Images of Shipwrecks in the Netherlands 18,568 Basvb BasBot pywikibot Images of shipwrecks from the Rijksdienst voor het Cultureel Erfgoed <description> - <shipwreck> - <id> - RCE.jpg

2015

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
Commons:Batch uploading/Manuscripts by Srečko Kosovel Images of writings by Srečko Kosovel 1050 User:Sporti User:Sporti semi-automatic Category:Manuscripts by Srečko Kosovel Srečko Kosovel - <title>.jpg
Commons:Batch uploading/US Army Research Laboratory Eniac A few images of ENIAC-era Army computer systems 13 BMacZero BMacZero C# custom Category:ENIAC, Category:EDVAC, Category:ORDVAC, Category:BRLESC-I etc
Commons:Batch uploading/Freshwater and Marine Image Bank PD images related to all things marine and limnological 20747 User:BMacZero User:BMacZeroBot C# custom Category:Images from the Freshwater and Marine Image Bank FMIB NNNNN <title>.jpeg
VanderGrinten Images of 19th century buildings in Nijmegen 808 GWToolset (Basvb) Basvb GWToolset Images from Evert van der Grinten <address>/Nijmegen - <description> - <collectionid> - Van der Grinten.jpg

2016

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
Codex Aureus

Descriptionis Ptolemaicæ avgmentvm

NLS collection (establishing XML workflow) 1,503 + 393 + 265 & PeterKz GWT / https://github.com/peterk/suecia2commons Codex Aureus (A 135) Codex Aureus (A 135) p<page>.tif

<title> (SELIBR <libris>)-<page>.tif

Commons:Batch uploading/Fortepan.HU Fortepan photographs, Hungary 69,857 Custom Images from Fortepan <autogenerated title> Fortepan <image number>.jpg
Commons:Batch uploading/Imperial Encyclopaedia 18th Century Gujin Tushu Jicheng 800+ User:Fæ NA Custom Gujin Tushu Jicheng .
Photographs by Adolf and Carl Dransfeld Photographs by Adolf and Carl Dransfeld 1304 Reinhard Kraasch RKBot Custom (pywikibot based) Photographs by Adolf and Carl Dransfeld HANSif<image#> <title>.tif
HANSif<image#> <title>.jpg (cropped version)

2017

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
NPS Maps Public domain maps of U.S. National Parks, published by the National Park Service. 1968 Reinhard Kraasch RKBot Custom (pywikibot based) Files from the National Park Service uploaded by RKBot NPS <title>.<file type>
Incompetech music CC-BY-3.0 music files 1,277 NA Pywikibot Category:Audio files from Incompetech <title> (ISRC <ref>).mp3

Failed

Date Name (Subpage) Fail Reason
Flickr Imre Solt collection denied because the UAE doesn't have FOP laws, which results in most images being copyvios.
Commons:Batch uploading/Modern Egypt Digital Archive Egyptian copyright doesn't have a limit for copyright of photographs, only that they become PD 50 years after the author's death. Not enough images for a batch.
Commons:Batch uploading/Images from LIFE Most of the images didn't have a clear copyright label.
Commons:Batch uploading/Gathering the Jewels Images don't appear to be free.
Commons:Batch uploading/Staffordshire Gold Hoard (en.Wikipedia front page news) the images were quickly changed from Share Alike to Non-commercial on the same day.
Commons:Batch uploading/World War II in Africa from Flickr user gbaku User wasn't author of the album, only purchased the images.
Commons:Batch uploading/Kartrummet Website did not show interest for partnership, license verification not possible.
Commons:Batch uploading/beeldengeluidwiki unclear situation of authorship
Commons:Batch uploading/Dermnet Owner of the website doesn't own the images.
Commons:Batch uploading/Ekta Media Not done, dead link
EVDeportes Already uploaded on commons.
Commons:Batch uploading/Media of "banco de imágenes" of Ministry of Education of Spain cc-by-nc
Commons:Batch uploading/Sir William MacArthur Botanical Images Low quality
Commons:Batch uploading/Spanking Art Wiki
Commons:Batch uploading/Land Air Sea Warfare unclear what to upload. incomplete request and no response from user.
Commons:Batch uploading/WWII unclear situation of authorship
Commons:Batch uploading/US Coast Guard dead link
Commons:Batch uploading/Nasa Technical Reports Server (NTRS) Public NTRS access suspended indefinitely.
Commons:Batch uploading/KROK2009 Out of scope (portraits)