Commons:500px licensing data
After discussion at Village Pump (check it), Wikimedia Commons volunteers decided to import files... with caution; as there are ~210k files saved by Archive Team, some are out of scope (the minority), and some copyvio also. So the community develop a tool to select files, and import via OATH.
- import-500px It's in development, if you want to help, use the talk page here.
You can also contribute using to import, stills in developing, so bugs will eventually happen.
The tool allow volunteers to see the pictures, the metadata, select, and than import. Solving the issue raised by the community, that saw as problematic import all files at once. Also a better solution that volunteers download all files, select and than massive uploads...
The only draw back is the Internet Archive server, that is slooow, so to help in this project, you will need to be zen. We can use the 500px server to import, but they already deleted a good part of the photos, so you will not be able to see all pictures.
How the tool verify the licenseEdit
After the announce of 500px announce, they removed from their pages any Creative Commons licenses tags, and didn't delete the majority of files, as promised, infringing the cc licenses...
So to verify the license you can "manually" ding into the source code of the 500px page (until this moment is on air) and also to the 500px page stored at web.archive.org.
Using File:Cereal Field At Sunrise (199200911).jpeg as example:
Going to the correspondent 500px page you can see the license snippet in the source code, including a link to the license:
<a about="https://drscdn.500px.org/photo/199200911/q%3D80_m%3D2000/v2?webp=true&sig=51421b73dbd5c42c71736cd4ac48b2e511d4a4ff63fa645fa67a10b4265d195b" href="https://creativecommons.org/licenses/by/3.0/" id="server_photo_cc_license" rel="license"></a>
Check the existence of the snippet
"license_type": 4 on that page by viewing the source code.
This all does not at all mean all photos marked with valid CC licenses in 500px are fine from a license perspective, FoP and other considerations may apply. And there are copyright violations that were uploaded directly to 500px. For that reason, the software developed allows us select files.
- Find a way to import files verifying the license, and able to select the files; (done)
- Creation of tool (already on, but with room to improvements);
- Importation using import-500px (~28% read);
- Categorizations, rename, improving description;