Commons:Batch uploading

Bot help and list · Requests to operate a bot · Requests for work to be done by a bot  · Requests for batch uploads
Gnome-system-run.svg

This page has a backlog that requires the attention of experienced editors.
Please remove this notice if it won't be needed in the future.

Bahasa Indonesia  Boarisch  català  Deutsch  Deutsch (Sie-Form)‎  English  español  français  galego  italiano  magyar  Nederlands  polski  português  português do Brasil  sicilianu  svenska  Türkçe  Ελληνικά  беларуская (тарашкевіца)‎  български  македонски  українська  বাংলা  മലയാളം  ไทย  한국어  日本語  中文(台灣)‎  中文(简体)‎  فارسی  +/−

Shortcut
COM:BATCH
Nuvola apps kcmsystem.svg

Commons Batch Uploading is a project to centralize the uploading of a collection of files, that have released their work as PD or any Commons compatible license. The files would be assigned to a bot operator who would see how the request would be fulfilled. (To upload batches from Flickr, please make requests on Commons:Flickr batch uploading)

Before you request a batch upload here, please read the guide to batch uploading first.

See w:Wikipedia:Public domain image resources for potential future batch uploads.

ScriptersEdit

Currently inactiveEdit

ToolsEdit

  • See Commons:Upload tools. The Python Wikipedia Bot framework supports image uploads and is particularly versatile.
  • Commonist - free Java program to upload large numbers of files to Commons
  • d:Help:QuickStatements - tool for batch upload of metadata to Wikidata, which can be than accessed by {{Artwork}} and other templates.
  • Flickrripper allows batch uploading from a set, group or a user id on flickr.
  • GLAMwiki toolset was built by Europeana to quickly get whole collections into Wikimedia. (As of 2019, no longer actively maintained. Pattypan is the recommended tool)
  • We need tools to facilitate rapid, accurate categorization of many images at once.
    • Please explain. (Which images? Before upload or after? ...) --Slick (talk) 08:12, 3 March 2018 (UTC)

Scripts, Examples and InformationEdit

RequestsEdit

Biblioteca Digital HispánicaEdit

  • Source to upload from: Photography collection from the Biblioteca Digital Hispánica: search query
    • Do the media URLs follow a pattern? metadata permalink, viewer permalink, JPEG deep link
    • Does the site have an API? No
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) The HTML is quite well-formed and follows an homogeneous structure, although metadata tabulation is a bit weird.
    • Did you contact the site owner? No
  • Describe the works to be uploaded in detail (audio files, images by …): This request is for a subset of this Digital Library covering photographies and engravings. Note that the JPEG deep link provided above is valid only to fetch the first page of the document. For this collection, most (all?) works are a single page.
  • Which license tag(s) should be applied? It depends on the work. I think it should generally be PD-old-assumed, and in some cases PD-old-70 and PD-old-100.

I already have a scraper and (work in progress) page generator for this collection. So I can help to provide everything in the required format. Anyway, I think the bulk of pending work is probably identifying author and the right license tag for each work.

MarioGom (talk) 21:15, 12 October 2020 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

Perry–Castañeda Library Map CollectionEdit

  • Source to upload from: http://legacy.lib.utexas.edu/maps/ams/
    • Do the media URLs follow a pattern?
      The urls themselves, so far as I can work out, don't, but in the same way as in Adobe Acrobat Pro you can set it to go down a list of web links to generate a single pdf, a bot may be able to too
    • Does the site have an API?
      Bit technical, but I dont' think so
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
      I don't know
    • Did you contact the site owner?
      No
  • Describe the works to be uploaded in detail (audio files, images by …): vast series of maps generated by the US Army Map Service (i.e., PD-USGov-Military) in the Perry–Castañeda Library Map Collection, The University of Texas at Austin
  • Which license tag(s) should be applied? PD-USGov-Military
  • Is there a template that could be used on the file description pages? Do you think a special template should be created? In terms of the file naming convention, this could follow that of the site, i.e, the top of each page has the series, a credit to the US AMS, and the date, then each map file has the name of the map, the sheet number (for the index pages, cross-references from adjoining maps etc), and the scale

NB there are already some files at Category:India maps by U.S. Army Map Service (plus various other individual uploads etc within Category:Maps by the United States Army Map Service), and it looks from below on this page and eg this commons image that "Slick-o-bot" may have been used in 2012 to upload some or all of these (I'm most keen on the various Japan-related maps (especially the 3x Honshu 1:50,000 series) but imagine every region would benefit).
This would be a mind-bogglingly great addition, thank you, Maculosae tegmine lyncis (talk) 14:08, 13 August 2020 (UTC)

PS, these are much more detailed than google maps - and the labelling is in English (with some Japanese too), Maculosae tegmine lyncis (talk) 19:27, 21 August 2020 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

Claremont Colleges Digital LibraryEdit

  • Describe the works to be uploaded in detail (audio files, images by …):

All photos in the Boynton Collection of Early Claremont, all of which are dated prior to 1925. If it's not too much trouble, it would also be very nice to have all photos in the Claremont Colleges Photo Archive and City of Claremont History Collection dated prior to 1925.

  • Is there a template that could be used on the file description pages? Do you think a special template should be created? Not sure

Sdkb (talk) 07:42, 8 August 2020 (UTC)

OpinionsEdit

Were these photos published prior to 1925, or merely taken prior to then? Publication needs to be pre-1925 for {{PD-US-expired}} to be allowed. Pi.1415926535 (talk) 08:25, 8 August 2020 (UTC)

@Pi.1415926535: The about page states The collection ... is believed to have come to Pomona College included with the papers of Charles Luther Boynton, a Pomona College alumnus and missionary to China. Boynton himself graduated from Pomona around 1900. I can't say for sure the year his papers came into possession of the college, though (which I assume would be the date of publication?). The library would probably tell us if we asked, though. Sdkb (talk) 05:47, 10 August 2020 (UTC)
Acquisition by the college would not be considered publication for the purposes of copyright. Only use in a publicly released printed material, or on a webpage, is considered publication. Pi.1415926535 (talk) 06:48, 10 August 2020 (UTC)
@Pi.1415926535: does being added to a library not count as publication? The collection has presumably been housed in the special collections department and publicly available to anyone who requested access since it was obtained. Sdkb (talk) 20:55, 10 August 2020 (UTC)
A collection merely being in a library does not constitute publication, by my reading. Under copyright law, publication is the distribution of copies or phonorecords of a work to the public by sale or other transfer of ownership or by rental, lease, or lending. Offering to distribute copies or phonorecords to a group of people for purposes of further distribution, public performance, or public display also constitutes publication. (From here.) Is the death date of Boynton known? If it was before 1950, then {{PD-old-70}} applies. Pi.1415926535 (talk) 23:14, 10 August 2020 (UTC)
@Pi.1415926535: According to here, Boynton died in 1961, so not quite. The above would seem to me to indicate being in a library counts, though, because of lending, which is what a library does. Sdkb (talk) 19:11, 11 August 2020 (UTC)
A collection in the library would be the originals (not copies) and is likely for use only in the library (not lending). I understand that you wish to have this collection available on Commons, but from the available evidence I do not believe the images are public domain. Pi.1415926535 (talk) 20:58, 11 August 2020 (UTC)
Assigned to Progress Bot name Category

Balinese Lontar from Internet ArchiveEdit

  • Source to upload from: http://archive.org/details/Bali
    • Do the media URLs follow a pattern? yes
    • Does the site have an API? yes
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) N/A
    • Did you contact the site owner? yes
  • Describe the works to be uploaded in detail (audio files, images by …):
    • Balinese Lontar (palm-leaf manuscripts) from the Internet Archive's Bali collection
    • Each manuscript is a PDF containing photographs of the originals
    • This batch upload is in connection with an active project grant.
  • Which license tag(s) should be applied?

{{PD-scan}}, following the behavior of the ia-upload tool.

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

Yes. I will follow the ia-upload template closely when doing the batch upload. I will use a short python script that aggregates info from the Internet Archive API and sends each upload request via pywikibot. If necessary I will create a bot account for this purpose. There are approximately 2700 items to upload.

Lautgesetz (talk) 01:03, 4 July 2020 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

Catalog of Copyright EntriesEdit

  • Source to upload from: https://archive.org/details/copyrightrecords?&sort=-date
    • Do the media URLs follow a pattern? Unsure.
    • Does the site have an API? Unusre, but there seems to be an RSS feed - Not sure if it contains all entries.
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Commons has tools for upload transfer from IA.

    • Did you contact the site owner?

No.

  • Describe the works to be uploaded in detail (audio files, images by …):

Scanned volumes (647) consisting of the Catlog of Copyright Entries volumes for the United States for the period 1891-1977/8)

  • Which license tag(s) should be applied?

{{PD-USgov}}

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

No new templates are required, additional fields could be added in {{Book}} or {{Information}}

ShakespeareFan00 (talk) 07:37, 3 June 2020 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

Commons:Batch uploading/Modern SketchEdit

  • Source to upload from: This Complete Gallery
    • Do the media URLs follow a pattern? There are 39 links. Inside of each link there are all the pages of every issue, in order.
    • Does the site have an API? I don't know
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
    • Did you contact the site owner? It's Public Domain
  • Describe the works to be uploaded in detail (audio files, images by …):

Each one of the 39 issues of Chinese magazine "Modern Sketch". They are in public domain for the reasons given in the following parametre. All the pages can be uploaded.

  • Which license tag(s) should be applied?

PD-China and PD-1996

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

No Special Template: PD-China and PD-1996 as license and Category:Modern Sketch as Category. TaronjaSatsuma (talk) 10:29, 18 February 2020 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category Modern Sketch

Japanese Homes and their surroundingsEdit

  • Source to upload from: List of files, List of illustrations with names assigned to each number. It would be really nice if the figures contained their original names in teh uploaded filenames.
    • Do the media URLs follow a pattern?:
    • Does the site have an API?:
      • I assume that Gutenberg has an API. If someone can point me at instructions on how to use it with Commons, I might be able to do this myself; I assume this is a beaten path...
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?):
    • Did you contact the site owner?:
      • No, for Gutenberg this seems redundant.
      • I uploaded some manually already, with permission, from another site (names files with pattern https://www.kellscraft.com/JapaneseHomes/JapanHomes001.jpg, to JapanHomes301.jpg, 129-130 are duplicates, figure numbers do not align with file names, so combined illustrations cause no disruption to sequential numbering). The Gutenberg images are in better in many, but not all, cases (higher-res, better scan).
      • The same book is also at [1], but the images seem to be worse.
  • Describe the works to be uploaded in detail (audio files, images by …):
  • Which license tag(s) should be applied?:
    • {{PD-old-70-1923}}
    • note: five years from PD-100
  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

Thank you! HLHJ (talk) 04:17, 4 February 2020 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

Baseball Hall of FameEdit

Is there a practical way to batch extract and upload files that are tagged with "http://rightsstatements.org/vocab/NoC-US/1.0/" under the "Copyright note" section? They basically confirm which files are in the public domain. Or they will sometimes post in that same section "The National Baseball Hall of Fame and Museum is not aware of any U.S. copyright or any other restrictions in the documents."

Oaktree b (talk) 02:16, 23 November 2019 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

OpenUp RBINS Beetles collectionEdit

  • Source to upload from: http://projects.biodiversity.be/openuprbins/
    • Do the media URLs follow a pattern? Yes: http://projects.biodiversity.be/openup/rbins/pictures_only/<PICTURE_ID>.jpg
    • Does the site have an API? No
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Since I helped build the website, I have a CSV file containing metadata for each picture: scientific name, family, location where the beetles was collected, photographer name, ...
    • Did you contact the site owner? Yes. They approve the upload of medium resolution images (such as on the existing website), and may approve later higher resolution versions of those.
  • Describe the works to be uploaded in detail (audio files, images by …): 4,074 detailed pictures of 1,926 different beetles species. See content on http://projects.biodiversity.be/openuprbins/
  • Which license tag(s) should be applied? {{CC-BY-SA-4.0}}
  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

Niconoe (talk) 09:12, 26 June 2019 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

GeoDILEdit

There are 3096 pictures of rocks and minerals.

  • Source to upload from: https://geodil.dperkins.org/
    • Do the media URLs follow a pattern? The images themselves are /i/NUMBER.jpg. The pages for the images are /h/NUMBER.html. Numbers range from 1-3144 with some gaps.
    • Does the site have an API? No.
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) The site owner uses a script to generate the HTML and the sitemap, /sitemap.xml. That data could be modified if it would make uploading significantly easier. On the back end, information is stored in a CSV, /db/details.csv, should that be useful.
    • Did you contact the site owner? Site owner: Douglas Perkins.
  • Describe the works to be uploaded in detail (audio files, images by …): JPGs of rocks and minerals. Most of these were taken by people working on the GeoDIL project at the University of North Dakota, 2001-2002.
  • Which license tag(s) should be applied? 2,711 are CC Zero, and 14 are government works and PD. The remainder are not freely licensed. All licensed images are noted as such on their HTML pages, and it's also in the sitemap.
  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

Douglas Perkins (talk) 01:14, 10 March 2019 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

NPGalleryEdit

  • Describe the works to be uploaded in detail (audio files, images by …):

"NPGallery supports a wide array of digital asset file types (images, MS office formats, adobe pdfs, audio files, videos)." We would, I think, be primarily interested in their photographs of national parks.

  • Which license tag(s) should be applied?

{{PD-USgov}} may apply to many images, but they need to be checked individually. This could probably be automated to some degree.

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

Standard templates such as {{Photograph}} should be acceptable.

This was spotted by Animalparty on COM:VP. BMacZero (talk) 00:12, 22 January 2019 (UTC)

OpinionsEdit

  • Comments by Animalparty.
  • {{PD-USGov}} would be the most inclusive template, but is rather vague. More specific templates include {{PD-USGov-NPS}} and {{PD-USGov-Interior}}. Any Photographer field that says "NPS Staff" or "NPS Photo" (e.g. [2]) should automatically get PD-USGov-NPS.
  • I think {{Photograph}} or {{Information}} are fine, ideally with detailed semi custom fields for keywords, collection, location, etc., as seen in the Library of Congress images uploaded by User:Fæ (example).
  • The more pre- or auto-categorization, or at least clearly noting collection, yeear/decade, geographic unit, etc., the better, else we dump thousands of unsorted of images into already cluttered categories like Yosemite National Park.
  • There may be overlap with some material on Archives.gov , individual National Park Flickr feeds/websites, and such material already uploaded. But I think the value of the images uploaded at their largest file size and with curated metadata outweigh the inconvenience of some duplication.
  • Many files have geographical coordinates, but I suspect that many are generic coordinates of the center of the National Park or Monument, rather than being unique to the photograph.
  • Thanks for initiating this, sorry if these comments are basic/obvious to experienced mass uploaders. --Animalparty (talk) 01:29, 22 January 2019 (UTC)


On some more inspection, certain images may be a bit problematic in terms of copyright, namely works of art (e.g. paintings and sculptures) not explicitly credited to NPS employees, but that are nonetheless labeled "Public domain:Full Granting Rights". Some of these appear to be created by Artist-in-Residence programs (e.g. this gallery and this one), and from browsing elsewhere it appears that different parks may have different rules regarding copyrights. Rocky Mountain National Park states "Artists are also required to provide the copyright for this artwork to the National Park Service. The National Park Service will not allow the commercial use of any donated artwork once it is selected and accessioned into the Park's permanent museum collection", which is a restriction against public domain. Perhaps no art from Rocky Mountain was transferred to NPGallery? These 2 images from the U.S.S. Arizona memorial are labeled PD on NPGallery, yet on a different NPS page their status is ambiguous, with the included usage disclaimer "Multimedia credited with a copyright symbol (indicating that the creator may maintain rights to the work) or credited to any entity other than NPS must not be presumed to be public domain; contact the host park or program to ascertain who owns the material" (emphasis added).

Side note: I think every photograph I've viewed on NPGallery has the Copyright disclaimer "Permission must be secured from the individual copyright owners to reproduce any copyrighted materials contained within this website. Digital assets without any copyright restrictions are public domain.", but every file is also labeled Public domain in the Constraints Information.

Another snag I've noticed, just from browsing the term "Artist", are that some images are scans/photographs from newspapers that were most likely not originally created by Federal employees (although the derivative scans/photos are): for instance Louis Grell illustration album, with cartoons by Louis Grell published in World War I.[3] These are still PD via pre-1924 publication (and possibly by {{PD-USGov-Military}}), but it hinders accurate bot-designation of PD template.

And public domain rationale is ambiguous on this vido, with Copyright" "Photo courtesy of Betty Maya Foott, Colorado Plateau Dark Sky Cooperative" (so, probably not a federal employee), yet is nonetheless labeled "Public domain:Full Granting Rights". I may have just found a relative handful of exceptions. But there are also probably a good deal of historical photographs that are PD-1923 or PD-no-notice yet not US Government works. Perhaps a generic umbrella template similar to {{Flickr-no known copyright restrictions}} could be used to encapsulate different possibilities, like {{PD-NPGallery}}.

I think it would be a good idea to contact someone at NPGallery to double check that all media labeled public domain is in fact public domain, for some reason, especially when rationale is ambiguous or lacking. We also might want to consider not transfering the somewhat intimidating, potentially misleading Copyright message "Permission must be secured from the individual copyright owners to reproduce any copyrighted materials contained within this website. Digital assets without any copyright restrictions are public domain." This may be a liability disclaimer on NPGallery's end, but ideally, everything we transfer to Commons would be in the public domain, and so no permission need be secured. --Animalparty (talk) 11:45, 25 January 2019 (UTC)

  Working on adapting my bot to handle this. I'll contact them, and also start with only things that are obviously PD. BMacZero (talk) 17:50, 9 February 2019 (UTC)
I e-mailed NPGallery a while back about the public domain statuses of images and neglected to share here. Unfortunately got a not-too-helpful response essentially saying that the licenses and attributions are not "consistent" and "there is not a good way to assure an asset id is truly in the public domain, or not". We'll have to figure out what types of signals we can rely on to decide whether {{PD-USGov-NPS}} or other templates apply. Of course, publication pre-1924 will be a good one to start. BMacZero (talk) 04:30, 11 April 2019 (UTC)
I'm currently harvesting a list of all the images. It's going a bit slow but it should only a take a few days. After that I'll start downloading the metadata, which may take several days. BMacZero (talk) 04:45, 12 April 2019 (UTC)
Ah, a shame about the inconsistent licensing criteria. I guess pre-1924 and files credited to "NPS staff" or similar can be prioritized for now. --Animalparty (talk) 19:13, 12 April 2019 (UTC)
Started downloading the item metadata. You can check on the progress on this fun page I made. BMacZero (talk) 15:49, 13 April 2019 (UTC)
BRFA filed (Commons:Bots/Requests/BMacZeroBot 6). BMacZero (talk) 05:35, 10 May 2019 (UTC)
Started uploading last night, will probably be ongoing for quite a while. See Category:Images from NPGallery to check to help with validation and categorization! – BMacZero (🗩) 16:35, 29 June 2019 (UTC)
Assigned to Progress Bot name Category
User:BMacZero   In progress User:BMacZeroBot Category:Images from NPGallery to check

See AlsoEdit

APPLAUSEEdit

  • Does the site have an API? Yes: 101_xxxx (x is a variable number)
  • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) https://www.plate-archive.org/query/
  • Did you contact the site owner? No
  • Describe the works to be uploaded in detail (audio files, images by …): Historical astronomical plates, logbooks, envelopes or notes https://www.plate-archive.org/applause/info/gallery/ (we don't need to upload all, but I think the plates would be insteresting.
  • Which license tag(s) should be applied?

The database is licensed under CC-0 (https://www.plate-archive.org/applause/project/disclaimer/)

  • Is there a template that could be used on the file description pages? Do you think a special template should be created? Yes, I think a template should be created.

Habitator terrae 🌍 16:37, 27 October 2018 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

PauloGuedesEdit

This url generates 94 results pages, each linking to 10 individual image pages. Each image page url is
http://arquivomunicipal2.cm-lisboa.pt/X-arqWeb/ContentPage.aspx?ID=code&Pos=1&Tipo=PCD
while the image in it is at
http://arquivomunicipal2.cm-lisboa.pt/X-arqWeb/ContentDisplay.aspx?ID=code&Pos=1&Tipo=PCD&Thb=0
with code being a 20-digit lower-case hex number — which has no bearing with the official identification references (cota — see below).
  • Does the site have an API? dunno
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) consistent, machine-generated HTML (parsable, even if not necessarily valid)
    • Did you contact the site owner? No
  • Describe the works to be uploaded in detail (audio files, images by …): Smallish batch (711, according to inventory, or 933, according with the database search report) of scanned b/w photos in various hardcopy formats.
  • Is there a template that could be used on the file description pages? Do you think a special template should be created? {{AMLx}}; it needs to be fed at least {{{cota}}} (given also as código de referência), a slashed crumbthread-like alphanumeric string of variable length; other values to be (trivially) extracted from each image page are:
  • Título
  • Assunto
  • Data(s)
  • Dimensão e suporte
  • Nota(s)
  • Cotas antigas or Cotas or Cota(s)
The filenames can be constructed from Título (possibly trimmed) and the two last crumbs of {{{cota}}}, in parenthesis, devoided of the slash (which is one of the Cotas)

-- Tuválkin 16:54, 30 June 2018 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

VOA News filesEdit

  • Source to upload from: https://web.archive.org/web/*/https://www.voanews.com/mp3/voa/english/nnow/NNOW_HEADLINES.mp3
    • Do the media URLs follow a pattern? They all have the same name. The date when archived is given in 14 digits, with the first eight digits being the year, month, and day respectively, with the remaining digits being the time of day archived, in UTC.
    • Does the site have an API? Don't know.
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Don't know.
    • Did you contact the site owner? No need to, since U.S. government works so public domain.
  • Describe the works to be uploaded in detail (audio files, images by …): VOA world news headline newscast audio files for (almost) every day spanning from 5 May 2009 to 6 July 2019.
  • Is there a template that could be used on the file description pages? Do you think a special template should be created? Just use the standard one. Upload as "VOA News Headlines (MONTH DAY, YEAR)". If possible, upload them in FLAC, WAV, and OGG.

Illegitimate Barrister (talkcontribs), 13:07, 26 May 2019 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

HiRISEEdit

  • Describe the works to be uploaded in detail (audio files, images by …):
    Images by HiRISE (High Resolution Imaging Science Experiment)
  • Which license tag(s) should be applied?
  • As explained in each image's description page for example: "All of the images produced by HiRISE and accessible on this site are within the public domain: there are no restrictions on their usage by anyone in the public, including news or science organizations. We do ask for a credit line where possible: NASA/JPL/University of Arizona"
  • PD-USGov-NASA or a variation of it to include JPL and University of Arizona must be used.
  • Is there a template that could be used on the file description pages? Do you think a special template should be created?
There is no template yet. It must be created to include all the relevant data e.g. Acquisition date, Latitude , Longitude , etc. from the label files.
  • Note: Due to JPEG2000 not being currently supported on Wikimedia Commons, a conversion to PNG is also needed. File sizes may be large!

Meisam (talk) 21:58, 20 June 2018 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

freepd.comEdit

Site contains production music tracks, in various genres, mp 3 format.

  • Source to upload from:

http://freepd.com/

    • Do the media URLs follow a pattern?

None found. Tracks seem to be in sub-directories related to nominal genre, MP3 files are named for the track title apparently.

    • Does the site have an API?

Unknown.

    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Unknown.

    • Did you contact the site owner?

Site owner not contacted.

  • Describe the works to be uploaded in detail (audio files, images by …):

"Production music", in various genres., in MP3 format.

  • Which license tag(s) should be applied?

Site claims tracks are in the public domain:- http://freepd.com/faq.html ; However some of these tracks were previously under CC-BY on the site owners other site at incompetech.

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

{{Information}} with additional field as was done on the previous batch upload for incompetech.

ShakespeareFan00 (talk) 10:20, 18 December 2017 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

Commons:Batch uploading/timbeek.com/Edit

  • Source to upload from:

http://timbeek.com/ in particular music tracks listed in http://timbeek.com/royalty-free-music/isrc/

    • Do the media URLs follow a pattern?

No general pattern, but there's a master list (not sure if it's complete) of track pages here - http://timbeek.com/royalty-free-music/isrc/, Donwload links in the UI seem to link to numbered subdirectories, but general pattern undetermined or not obvious.

    • Does the site have an API?

Unknown.

    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Unknown

    • Did you contact the site owner?

Site owner not contacted.

  • Describe the works to be uploaded in detail (audio files, images by …):

A Small set of 'production music' tracks, in assorted genres.


  • Which license tag(s) should be applied?

See: http://timbeek.com/royalty-free-music/license/ , assuming attribution requirments are met the music appears to be under CC-BY 4.0. (see also: http://timbeek.com/royalty-free-music/faq/ and http://timbeek.com/royalty-free-music/copyright/)

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

{{Information}} with additional fields as was previously implemented for the incomptech.com batch upload(this site seems to use a simmilar approach).

ShakespeareFan00 (talk) 19:05, 15 December 2017 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

Images of listed buildings by Stephen Richards on Geograph.org.ukEdit

  • Source to upload from: http://www.geograph.org.uk
    • Do the media URLs follow a pattern? Yes: http://www.geograph.org.uk/photo/[ID]
    • Does the site have an API? Yes: http://www.geograph.org.uk/help/api
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Don't know
    • Did you contact the site owner? No need
  • Describe the works to be uploaded in detail (audio files, images by …):
All photographs of listed buildings by this user are of high quality and are tagged [listed building]. They would be very useful to have on Commons as every listed building has an item on Wikidata. I'd like them to be uploaded en masse and given the categories Category:Listed buildings in [county or London borough] and Category:Images by Stephen Richards. I could then further refine the listed building categories manually. However, the terms "Grade I", "Grade II*" and "Grade II" (the three listing grades for buildings in England and Wales) appear in the image descriptions, so is there a way that these could be picked out and used to categorise the images on Commons?
  • Which license tag(s) should be applied?
{{Geograph}}
  • Is there a template that could be used on the file description pages? Do you think a special template should be created?
{{Geograph}}

Ham II (talk) 19:50, 16 November 2017 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

USDA NRCS Plants DatabaseEdit

  • Source to upload from: http://plants.usda.gov/
    • Do the media URLs follow a pattern? Yes.
    • Does the site have an API? No.
    • What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) valid XHTML
    • Did you contact the site owner? No.
  • Describe the works to be uploaded in detail (audio files, images by …): Public domain: 10771 photos and 7064 line drawings, with species information for categorization. There are other copyrighted images as well, some of which may be freely licensed.
  • Which license tag(s) should be applied?

{{PD-USGov-USDA-NRCS}}

  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

OpinionsEdit

@Guanaco: There is a lot of copyrighted material within these images, e.g. [4] [5]. (Just because this is a U.S. government web site this does not mean all the material is U.S. government material and by this means freely usable!) Actually I have not found too many images that really can be used (e.g. [6]). You should at least provide a procedure how to distinguish between copyrighted and free material. --Reinhard Kraasch (talk) 11:02, 9 July 2017 (UTC)

@Reinhard Kraasch: The gallery search function [7] has a filter by copyright status. [8]
I've found that the URLs linked by the thumbnails provide species information within <title>: https://plants.usda.gov/core/profile?symbol=HACA2&photoID=haca2_003_ahp.jpg#
The search is navigable with &page=2, 3, 4, etc.
I'm actually interested in scripting this myself now, though it would be my first batch upload task. Guanaco (talk) 14:23, 9 July 2017 (UTC)
@Guanaco: Well, just go on... On the other hand it always is a good idea to have a second opinion with such a batch upload - especially for the non-technical aspects. --Reinhard Kraasch (talk) 20:52, 10 July 2017 (UTC)
Assigned to Progress Bot name Category

US National ArchivesEdit

I am hoping to begin a bulk upload of media from the US National Archives in the next few weeks. This will be a very different approach from the first upload, which was based on uploading files from an offline drive and scraping HTML for the metadata. This time around, NARA has an API for our online catalog, and so I am building a bot, using mwclient, to upload using the live metadata and files from the API. Some details:

Dataset

The dataset includes all PD materials at https://catalog.archives.gov (API: https://catalog.archives.gov/api/v1). I plan to begin with a series of ~100,000 WWI-era photos. Technically, there are over 15 million files (and counting) in this dataset.

File names

The script is currently configured to name files with the formula: For single-page items:

  • "File:[TITLE] - NARA - [NAID].ext"
    Where "[TITLE]" is the catalog record's title field, and "[NAID]" is the National Archives Identifier. If this is over the character limit, "[TITLE]" is automatically truncated, with "(...)" appended.

For multi-page items (since the above formula would give all files belonging to one catalog record the same title):

  • "File:[TITLE] - NARA - [NAID] (page X).ext"
Metadata

We are developing a custom metadata mapping, since NARA does not adhere to a metadata standard. You can see the metadata template we use here: {{NARA-image-full}}. Some notes:

While all the records in this catalog come from NARA or partner institutions, there are many different facility locations, and some NARA facilities have their own institutions templates already (e.g. US presidential libraries). Therefore, I am creating institution templates to go along with all NARA locations, and the script will insert the correct institution template based on a mapping.

NARA's authority file is not yet mapped to Wikidata, however that is definitely something that would be useful in the future. For now, we will upload files with NARA's creator and author names and their NAIDs and links back to the catalog authority record. However, including the NAIDs in a Commons template field means that in the future, Wikidata could be used to make creator templates appear instead. Any help with this would be appreciated.

Licenses

Because NARA records are nearly all (>99%) derived from the records of US federal agencies, these uploads will use {{PD-USGov}} or its subtemplates. Most NARA records are in one of about 600 record groups based on their creating agency, so I am using a mapping of NARA record groups to Commons PD-USGov templates so that the bot can apply the more specific agency templates in most cases. Help filling out this mapping would be appreciated.

Nearly all holdings of the US National Archives are in the public domain as a work of the federal government (or, otherwise, due to age). This is marked in the "use restriction" field in the catalog, with a value of "Unrestricted" indicating public domain determination by the archivists. Therefore, the script will be configured to skip over any records in which the use restriction is anything other than "unrestricted" (even "possibly" ones, which could ultimately be PD, but need a human determination).

Categories

All uploads will be automatically categorized by the metadata template into Category:Media contributed by the National Archives and Records Administration and a category for the series they belong to (such as Category:US National Archives series: DOCUMERICA: The Environmental Protection Agency's Program to Photographically Document Subjects of Environmental Concern, compiled 1972 - 1977). Eventually, the script will be designed to create the series category if a file is uploaded for a series which does not yet have one.

When it comes to topical categories, past NARA uploads utilized the {{Uncategorized}} tag to encourage the community to add topical tags. However, since this creates work for the community, I am planning this time around to run uploads a small batch (hundreds to a few thousand) at a time, so I can upload them with one or more topical categories that apply to all records in the batch, rather than uncategorized.

Code

You can find the upload bot's code at https://github.com/usnationalarchives/wikimedia-upload. This project is being developed in public on NARA's official GitHub account. I would welcome collaboration (pull requests or otherwise) there. In addition, the Commons community is welcome to file issue reports on that repo.

Examples

The most recent test uploads can be viewed in Category:US National Archives series: American Unofficial Collection of World War I Photographs. I am still polishing the upload script, but these examples essentially represent what should be expected from the bot once it gets started.

OpinionsEdit

The bot account is technically already flagged from the last bulk upload a couple of years ago, however I would like to submit the current plan to community review before restarting uploads. If there are any opinions on the bot's design or the format of uploads or other issues, I am happy to hear them. We'd also like to know whether to limit what is uploaded in any way—as in, would Commons actually be interested in 15 million files, or might some of these, like the millions of census cards, not be of interest. Also, if anyone is interested in helping out with the coding or other tasks, please feel free to let me know. This is a big undertaking. Thanks! Dominic (talk) 17:25, 31 May 2017 (UTC)


Assigned to Progress Bot name Category
User:Dominic Coding User:US National Archives bot Category:Media contributed by the National Archives and Records Administration

ESA-Rosetta-NAVCAMEdit

  • Describe the works to be uploaded in detail (audio files, images by …):
Images the comet 67P/CHURYUMOV-GERASIMENKO by the NAVCAM on the Rosetta spacecraft.


  • Is there a template that could be used on the file description pages? Do you think a special template should be created?

Yann (talk) 14:32, 6 June 2015 (UTC)

OpinionsEdit

Assigned to Progress Bot name Category

Old requests (over two years)Edit


Batch uploads in progressEdit

Batch uploads on holdEdit

Past batch uploadsEdit

2005 - 2009Edit

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
10,000 paintings from Directmedia 10,000 public domain images digitized by the Yorck project and contributed to commons 10,000 Eloquence File Upload Bot (Eloquence) PD-Art (Yorck Project)
Picswiss project Roland Zumbühl agreed on releaseing his images as GFDL, depicting various areas and subjects in Switzerland. 5,000 of 13,000 Dake Dake Images from Picswiss
Bundesarchiv From the German Federal Archive, the images depict Germany between the 19th and 20th century including valuable photographs of the Nazi era and World War II. 100,000 Duesentrieb BArchBot Information fetch Images from the German Federal Archive Bundesarchiv <id>, <desc>
Starr images Images of plants of Hawaii 60,000 Multichill Multichill Images from Forest & Kim Starr Starr <date>-number <taxon/desc>
Wenceslas Hollar Digital Collection A collection of 2700 high resolution images of engravings of Wenceslas Hollar, about 90% of his life works 2,700 Dcoetzee Dcoetzee University of Toronto Wenceslas Hollar Digital Collection
National Portrait Gallery Various portraits of famous people between the 16th and 19th century. 3,000 Dcoetzee Dcoetzee National Portrait Gallery, London
Deutsche Fotothek Images from Deutsche Fotothek mainly about east Germany between the 19th and 20th century including the Bombardment of Dresden and other events. Only 25% of the images have been uploaded till now. 62,176 of 250,000 Multichill FotothekBot Tools used Images from the Deutsche Fotothek Fotothek <id> <desc>
Berger Collection A collection of high resolution images of paintings and other works from the Berger Collection, depicting British art, culture and people. 140 Dcoetzee Dcoetzee Berger Collection
Great Images in NASA Images from Great Images in NASA 1,400 TheDJ Multichill Great Images in NASA
Alaska-Yukon-Pacific Exposition of 1909 High-resolution scans of documents from the Alaska-Yukon-Pacific Exposition found here. 700 Dcoetzee Dcoetzee Alaska-Yukon-Pacific Exposition
Commanster Pictures of plants, animals, birds and insects of Commanster, Belgium by James Lindsey 6,000 Sarefo Sarefo Pictures by James Lindsey
WLANL Images from Wiki Loves art Netherland imported from the flickr group pool, depicting Netherland and its different museums. 4,000 Multichill BotMultichillT Images from Wiki Loves Art Netherlands WLANL - <team> - <desc>
FEMA site All the images found on US Federal Emergency Management Agency Disaster Photo Librarywas copied to Commons, depicting US environmental disasters and emergency actions. 20,000 Multichill BotMultichillT script PD US FEMA FEMA - <id> - Photograph by <photographer> taken on <date> in <location>
AntWeb images All the images found on http://www.antweb.org/ depicting different species of ants. 32,000 Dave Thau File Upload Bot (AntWeb) Images from AntWeb <desc> <specimenID> profile <viewnumber>
Images of erosion All the images found on http://picasaweb.google.com/VolkerPrasuhn depicting erosions. 700 Leyo manual Images by Volker Prasuhn
livepict.com All the images found on http://livepict.com/ depicting bands. 1000 Justass Justass Images from LivePict
Tropenmuseum A partnership with Tropenmuseum 40,000 Multichill KITbot svn Images from the Tropenmuseum COLLECTIE TROPENMUSEUM <desc> TMnr <id>

2010 - 2013Edit

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
Randolph Caldecott All pages in The complete collection of pictures & songs / by Randolph Caldecott 510 Diaa abdelmoneim Dudubot upload.py The complete collection of pictures & songs by Randolph Caldecott Randolph Caldecott collection-page <page>
Rob Lavinsky Mineral images from Rob Lavinsky on mindat.org 34,917 Reinhard Kraasch RKBot upload.py + pyodbc Images by Rob Lavinsky <mineral1>[-<mineral2>[<mineral3>]]-<mindatID>
Rob Lavinsky Mineral images from Rob Lavinsky on irocks.com 20,582 Reinhard Kraasch RKBot upload.py + pyodbc Images by Rob Lavinsky <mineral1>[-<mineral2>[<mineral3>]]-<irocks file name>
Bibliothèque Nationale de France Books provided by the Bibliothèque Nationale de France (French National Library) as part of a partnership with Wikimédia France 1,413 Seb35 (with help from Plyd and Jean-Fred) BnF import, operated by Tim Starling svn Books provided by the BNF <Author> - <Title>.djvu
Erling Mandelmann Portraits of notable people donated from Erling Mandelmann 581 Diaa abdelmoneim Dudubot Photographs by Erling Mandelmann <Title> - <Author>
Travelers in the Middle East Archiven Historical images from books about the Middle East from Travelers in the Middle East Archive, provided by Rice University 2,277 Diaa abdelmoneim Dudubot Images from the Travelers in the Middle East Archive "<Title>" (<Year>) - TIMEA
Fonds Eugène Trutat Photographs by famous French photographer Eugène Trutat, donated by the City Archives of Toulouse as part of a partnership with Wikimédia France 200 Jean-Frédéric TrutatBot GitHub Fonds Trutat - Archives municipales de Toulouse <Title> (<Year>) - <Id> - Fonds Trutat
Festivals - - User:Esby - - Comédie du Livre 2010 - Supported by Wikimédia France -
Nordiska Museet A collection of early photographs, donated by Nordiska Museet as part of a collaboration with Wikimedia Sverige. 1,000 Prolineserver NordiskaMuseetBot Toolserver Images from Nordiska museet <Title> - Nordiska Museet - <Id>.jpg
Adams Ansel Adams National Park Service photographs 221 User:Kaldari User:File Upload Bot (Kaldari) Perl 2011 Ansel Adams donation from U.S. National Archives Ansel Adams - National Archives - 79-AA-<digit digit>.jpg
Web Gallery of Art Large collection of well documented artworks. Uploaded ~15k new files and synchronization metadata for ~6k already uploaded files 21,700 Jarekt JarektUploadBot UploadWGA.py
FixWGAMetadataInfo.py
FixWGAMetadataArt.py
Images from Web Gallery of Art <Author> - <Title> - WGA<ID>.jpg
Monument_lists Images of German cultural heritage monuments 3000? User:ElyaUser:Raymond User:SternthalerBot <STRING>-Nr. <##>, <STRING> (<####>).jpg
Commons:Chris's Acorns Large collection of Acorn computer hardware and peripherals from Chris's Acorns 1700 Smallman12q Smallbot C#4 w/ LINQ and MSHTML interop Chris's Acorns just filename...no format
Minerals from various sources on mindat - 902 User:Reinhard Kraasch - - Files by Leon Hupperichs
Files by Christian Rewitzer from mindat
-
Flickr Fotostream of NOAA Photo Library Botanical images ? User:Kobac Images_from_NOAA
Walters Art Museum Collection of 3D and 2D artworks from around the world 19,000 Kaldari File Upload Bot (Kaldari) modified botclasses.php Media contributed by the Walters Art Museum <Author> - <Title> - Walters <ID> - <View>.jpg
Commons:Bible Illustrations Bible illustrations 2993 Smallman12q OrophinBot VBScript, XHR, XMLDOM, MSHTML, COM Media contributed by the Sweet Publishing <name> <chapter>-<section> (Bible Illustrations by Sweet Media).jpg
Flora Batava Illustrations of all plants in the Netherlands 1582 Rillke FloraUploadR own implementation using VB6/COM/C++ Files uploaded from Flora Batava by FloraUploadR <latin plant name> — Flora Batava — Volume v<number>.jpg
Commons:Bots/Requests/Smallbot 2 Oregon Historical County Records Guide 4273 Smallman12q Smallbot VBScript, XHR, XMLDOM, MSHTML, COM Images from Oregon Historical County Records Guide <name> (<Countyname> County, Oregon scenic images) (<id>).jpg
The World's Columbian Exposition PD-Photos of the The World's Columbian Exposition 115 Rillke RillkeBot own implementation using VB6/COM/C++ World Columbian Exposition taken by Press Chicago Photo-Gravure Co. <caption> — Official Views Of The World's Columbian Exposition — <file number>.jpg
Defenselink Defense.gov News Photos 14572 Slick Slick-o-bot pywikipediabot and some bash scripts Defense.gov News Photos to check Defense.gov News Photo <VRIN>[ - description].jpg
U.S. Army Map Service Maps of India and Pakistan from the U.S. Army Map Service 304 Slick Slick-o-bot pywikipediabot and some bash scripts India maps by U.S. Army Map Service Map India and Pakistan 1-250,000 Tile <tile name>.jpg
Defense.gov Photo Essays Defense.gov Photo Essays 23106 Slick Slick-o-bot pywikipediabot and some bash scripts Defense.gov photo essays to check Defense.gov photo essay <VRIN>.jpg
Navy SEAL pics and vids Navy SEAL pics and vids 682 Slick Slick-o-bot pywikipediabot and some bash scripts United States Navy SEALs Images to check United States Navy SEALs <NUMBER>.jpg
Beaverton, Oregon Historical Photo Gallery Beaverton, Oregon Historical Photo Gallery 305 Smallman12q Smallbot VBScript, XHR, XMLDOM, MSHTML, COM Beaverton, Oregon Historical Photo Gallery <name> (Beaverton, Oregon Historical Photo Gallery) (<number>).jpg
ForestWander Mostly nature photos from West Virginia 2600 Rillke Forestwander Nature Photography upload bot own implementation using VB6/COM/C++ Bot-uploaded files from Forestwander Nature Photography <name> - [West Virginia|Virginia] - ForestWander.jpg
Navy SEAL pics and vids U.S. Navy SEALs pictures and videos 681 pics, 56 vids Slick Slick-o-bot pywikipediabot and some bash scripts United States Navy SEALs Images to check
United States Navy SEALs Videos to check
images: United States Navy SEALs <number>.jpg, videos: different
Umair Zafar fashion shoot Umair Zafar fashion shoot 91 Slick Slick-o-bot pywikipediabot and some bash scripts Images from Umair Zafar fashion shoot to check different
New Orleans Bee New Orleans Bee 136667 Slick Slick-o-bot pywikipediabot and some bash scripts The New Orleans Bee by year The New Orleans Bee <year> <month> <number>.pdf
Brooklyn Museum Brooklyn Museum 3629 Slick Slick-o-bot pywikipediabot and some bash scripts African art in the Brooklyn Museum Brooklyn Museum <ID> <SHORT DESC>.jpg
U.S. Marines Corps U.S. Marines Corps 77288 Slick Slick-o-bot pywikipediabot and some bash scripts Marines.mil images to check USMC-<NUMBER>.jpg or USMC-<VRIN>.jpg
Photographic History of the Civil War Photographic History of the Civil War 3668 Mattwj2002, Slick Mattwj2002, Slick-o-bot pywikipediabot and some bash scripts The Photographic History of The Civil War The Photographic History of The Civil War Volume <VOLUME> Page <NUMBER>.jpg
Rijksdienst voor het Cultureel Erfgoed Photos of historic buildings in the Netherlands (Rijksmonumenten) 4650000 Multichill Multichill pywikibot based Images from the Rijksdienst voor het Cultureel Erfgoed <title> - <id> - RCE.jpg
AELG Photos of Galician writers 800 User:Smallman12q User:Smallbot Images from AELG <NAME> (AELG)-<N>.jpg
Defence Imagery (UK) High quality selected photographs by the UK Ministry of Defence (MoD), released on the Open Government Licence (equivalent to Public Domain with an attribution requirement) 2,880 pywikipediabot Category:Images from MoD uploaded by Fæ <MoD title> MOD <file number>.jpg
Weather maps Weather maps of the USA, daily and weekly from the U.S. National Oceanic and Atmospheric Administration 20,000 (10 year archive) and ongoing at 5 new maps per day User:Fæ User:Fæ pywikipediabot Category:NCEP maps by year <YYYY-MM-DD> <map type> NOAA.png
Los Angeles County Museum of Public Art Art history - photographs of artifacts from LACMA 22,000 pywikipediabot Category:Images from LACMA uploaded by Fæ <LACMA description> LACMA <Accession Number>.jpg
LSH Objects in the LSH-museum collections 19,961 (approx 1,500 missing from / missnamed on drive and still to be uploaded) Lokal_Profil LSHuploadBot own script Images from Livrustkammaren och Skoklosters slott med Stiftelsen Hallwylska museet <description> - <mueseum> -_ <imageid>.<filetype>

2014Edit

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
Fonds Trutat − Muséum de Toulouse Historical images by Eugène Trutat 213 Jean-Frédéric TrutatBot GitHub Category:Media contributed by the Muséum de Toulouse <Title> - Fonds Trutat - <Id>
Commons:Batch uploading/Art of Japan in the Rijksmuseum - 213 User:Fæ User:Fæ - Category:Art of Japan in the Rijksmuseum -
Archives Nationales (France) Archive documents from the French history 77 Jean-Frédéric ArchivesNationalesBot GitHub Category:Media contributed by the Archives Nationales (France) <Title> <Page> - Archives Nationales - <Id>
Commons:Batch uploading/World Digital Library Old books from WDL - - Pywikibot
geo/map-marker icons by Nicolas Mollet more than 700 free icons to use as placemarks for POI (Point of Interests) locations on maps 6,880 Rillke GeoUploadR node.js / nodemw Category:Map icons by Nicolas Mollet – Uploaded by GeoUploadR Map marker icon – Nicolas Mollet – <Title> – <Category> – <Style>.png
EnergieagenturNRW Contemporary - active North Rhine-Westphalia (German) politicians 2,249 (61% of the Flickrstream) pywikipediabot EnergieagenturNRW <Flickr title> (<Flickr ID>).jpg
RA Coat of Arms drawn by the National Archive of Sweden 336 André Costa (WMSE) RA-uploadbot PyCJWiki (modified) Coats of arms by the National Archives of Sweden <name> <type>vapen - Riksarkivet Sverige.png
Atlas de Wit 17th-century Dutch atlas of the lower countries from the collections of the Koninklijke Bibliotheek (Dutch National Library) 145 Husky HuskyBot Pywikibot (script) Atlas de Wit 1698 Atlas de Wit 1698-<page>-KB PPN 145205088.jpg
goodfreephotos.com different public domain images, landscapes, objects and so on ... 3547 Slick Slick-o-bot pywikipediabot and some bash scripts Category:Images_from_goodfreephotos.com and Subcats of Category:Import by User:Slick-o-bot/Images from goodfreephotos.com (based on galleries for maintenance) Gfp-<name>.jpg
Sustainable Sanitation Alliance Contemporary photographs of sustainable sanitation, Africa 9,810 pywikipediabot Files created by Sustainable Sanitation Alliance (SuSanA) <Flickr title> (<Flickr ID>).jpg
KNBLO Images of the Vierdaagse (walking event) from 1910-1940 1,183 GWToolset (Basvb) Basvb GWToolset Images from KNBLO <description> - <id> - KNBLO.jpg
(upload) (description fixes) Wigman Images of nature photographer A.B. Wigman 576 Basvb GA Ede (upload) BasBot (description fixes) Uploadwizard (upload) pywikibot (description adding) A.B. Wigman/Images from Gemeentearchief Ede (could be filled with other images as well) <description> - A.B. Wigman - <id>.jpg
Commons:Batch uploading/Atlas of Mutual Heritage Old maps 2479 User:Husky and User:Gerritdeveer1597 User:HuskyBot Category:Media from Atlas of Mutual Heritage AMH-NNNN-XX <description>.jpg
Commons:Batch uploading/Wellcome Images Medical history 99,000 - pywikibot
RCE shipwrecks Images of Shipwrecks in the Netherlands 18,568 Basvb BasBot pywikibot Images of shipwrecks from the Rijksdienst voor het Cultureel Erfgoed <description> - <shipwreck> - <id> - RCE.jpg

2015Edit

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
Commons:Batch uploading/Manuscripts by Srečko Kosovel Images of writings by Srečko Kosovel 1050 User:Sporti User:Sporti semi-automatic Category:Manuscripts by Srečko Kosovel Srečko Kosovel - <title>.jpg
Commons:Batch uploading/US Army Research Laboratory Eniac A few images of ENIAC-era Army computer systems 13 BMacZero BMacZero C# custom Category:ENIACCategory:EDVACCategory:ORDVACCategory:BRLESC-I etc
Commons:Batch uploading/Freshwater and Marine Image Bank PD images related to all things marine and limnological 20747 User:BMacZero User:BMacZeroBot C# custom Category:Images from the Freshwater and Marine Image Bank FMIB NNNNN <title>.jpeg
VanderGrinten Images of 19th century buildings in Nijmegen 808 GWToolset (Basvb) Basvb GWToolset Images from Evert van der Grinten <address>/Nijmegen - <description> - <collectionid> - Van der Grinten.jpg

2016Edit

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
Codex Aureus

Descriptionis Ptolemaicæ avgmentvm

NLS collection (establishing XML workflow) 1,503 + 393 + 265 & PeterKz GWT / https://github.com/peterk/suecia2commons Codex Aureus (A 135) Codex Aureus (A 135) p<page>.tif

<title> (SELIBR <libris>)-<page>.tif

Fortepan.HU Fortepan photographs, Hungary 69,857 Custom Images from Fortepan <autogenerated title> Fortepan <image number>.jpg
Imperial Encyclopaedia 18th Century Gujin Tushu Jicheng 800+ User:Fæ NA Custom Gujin Tushu Jicheng .
Photographs by Adolf and Carl Dransfeld Photographs by Adolf and Carl Dransfeld 1304 Reinhard Kraasch RKBot Custom (pywikibot based) Photographs by Adolf and Carl Dransfeld HANSif<image#> <title>.tif
HANSif<image#> <title>.jpg (cropped version)
Archives Nationales (France) 367 Jean-Frédéric ArchivesNationalesBot Custom (pywikibot based) Media contributed by the Archives Nationales (France)/1-8
Tropenmuseum Expeditions Basvb BasBot Files from the Nationaal Museum van Wereldculturen
Catharijne Convent AWossink

2017Edit

Date Name (Subpage) Description Images Scripter Uploader Script Category File naming
NPS Maps Public domain maps of U.S. National Parks, published by the National Park Service. 1968 Reinhard Kraasch RKBot Custom (pywikibot based) Files from the National Park Service uploaded by RKBot NPS <title>.<file type>
Incompetech music CC-BY-3.0 music files 1,277 NA Pywikibot Category:Audio files from Incompetech <title> (ISRC <ref>).mp3
Edo period coin collecting catalogues Public domain Japanese coin collecting catalogues 25 NA Donald Trung NA Category: Kokin kousei, Shinsen zeni kagami and Category:Shinpan kaisei, Kosen nedantsuke, Narabi ni bantsuki .jpg
Zeno images Public domain images 23,834 NA Pywikibot Category:Images from zeno.org <title> (Zeno <collection>).jpg

2018Edit

FailedEdit

Date Name (Subpage) Fail Reason
Flickr Imre Solt collection denied because the UAE doesn't have FOP laws which result in most image being copyvios.
Commons:Batch uploading/Modern Egypt Digital Archive Egyptian copyright doesn't have a limit for copyright of photographs, only that it becomes pd 50 years after the author is dead. Not enough images for a batch.
Commons:Batch uploading/Images from LIFE Most of the images didn't have a clear copyright label.
Commons:Batch uploading/Gathering the Jewels Images don't appear to be free.
Commons:Batch uploading/Staffordshire Gold Hoard (en.Wikipedia front page news) the images were quickly changed from Share Alike to Non-commercial on the same day.
Commons:Batch uploading/World War II in Africa from Flickr user gbaku User wasn't author of the album, only purchased the images.
Commons:Batch uploading/Kartrummet Website did not show interest for partnership, license verification not possible.
Commons:Batch uploading/beeldengeluidwiki unclear situation of authorship
Commons:Batch uploading/Dermnet Owner of the website doesn't own the images.
Commons:Batch uploading/Ekta Media Not done, dead link
EVDeportes Already uploaded on commons.
Commons:Batch uploading/Media of "banco de imágenes" of Ministry of Education of Spain cc-by-nc
Commons:Batch uploading/Sir William MacArthur Botanical Images Low quality
Commons:Batch uploading/Spanking Art Wiki
Commons:Batch uploading/Land Air Sea Warfare unclear what to upload. incomplete request and no response from user.
Commons:Batch uploading/WWII unclear situation of authorship
Commons:Batch uploading/US Coast Guard dead link
Commons:Batch uploading/Nasa Technical Reports Server (NTRS) Public NTRS access suspended indefinitely.
Commons:Batch uploading/KROK2009 Out of scope (portraits)