Open main menu

Shortcut: COM:BR · COM:BWR

Bot help and list · Requests to operate a bot · Requests for work to be done by a bot  · Requests for batch uploads
Gnome-system-run.svg

SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days.

Contents

Corrupt jpgEdit

Watch uploads and find corrupt files, notify uploader and tag file, if unchanged 7 days later, tag for speedy deletion.

Search all jpg files already uploaded, and notify uploader and tag file, then tag for speedy delete 30 days later if unchanged.

  1. File:OS GATOS DA CONA DA TUA CASA.jpg
  2. File:Krishna Ramchandani .jpg
  3. File:ViBIM office 2019.jpg
  4. File:ViBIM office 2019.jpg
  5. File:Vietcorset7.jpg
  6. File:Kodak Brownie Starlet.jpg
  7. File:Campus EMBA.jpg
  8. File:Rodriguesalves1.jpg.--BevinKacon (talk) 18:16, 30 April 2019 (UTC)
Is there a way to check for corrupt files without downloading them first? If not, I would consider this basically unfeasible, as it would mean to download all files first. --Schlurcher (talk) 07:36, 16 May 2019 (UTC)
@BevinKacon: Programmatically, File:OS GATOS DA CONA DA TUA CASA.jpg and File:Kodak Brownie Starlet.jpg appear to be valid JPG images? Do you know how you would expect a script/bot to be able to tell if they are not valid? I was able to verify both with pillow. --TheSandDoctor (talk) 23:47, 26 May 2019 (UTC)
@TheSandDoctor: I just tested with Embedded Data Bot code:
>>> detection.detect('testfiles/OS_GATOS_DA_CONA_DA_TUA_CASA.jpg')
Traceback (most recent call last):
  File "detection/by_ending/pillow.py", line 35, in detect
    image.tobytes()
  File "[...]/venv/lib/python2.7/site-packages/PIL/Image.py", line 733, in tobytes
    self.load()
  File "[...]/venv/lib/python2.7/site-packages/PIL/ImageFile.py", line 232, in load
    "(%d bytes not processed)" % len(b))
IOError: image file is truncated (19 bytes not processed)
image.tobytes() should do. --Zhuyifei1999 (talk) 22:01, 28 May 2019 (UTC)
That worked, thank you Zhuyifei1999! Does that work properly for other image formats (ie png, gif etc) or just jpg? I tried it on a png I know is good and it returned as such. I just don't have many corrupt images to check ;) --TheSandDoctor (talk) 02:15, 29 May 2019 (UTC)
I forcibly made my own corrupt png (opening in text processor and deleting chunks) and the code appeared to work correctly. The more frequent error I was able to generate was unrecognized data stream contents when reading image file, however. Would images like that also be tagged as something (different, I assume)? --TheSandDoctor (talk) 02:19, 29 May 2019 (UTC)
@Schlurcher: you could only download the thumbnails and filter by new users only. For already uploaded files, get a tool to run directly on server. @TheSandDoctor: those images are over half of a single solid mass of color, a tool should be able to detect that. It would be almost impossible for a real photograph to have that.--BevinKacon (talk) 08:17, 1 June 2019 (UTC)

Category:Files from the MassDOT flickr stream needing curationEdit

This might be doable with VFC, but I don't have the expertise to do so (but glad to learn!). Category:Files from the MassDOT flickr stream needing curation (~2200 files) was uploaded with {{PD-author}} (which is probably valid, but {{PD-MAGov}} is the true license). Some files have been license reviewed, but some have not. This creates a need for two tasks:

Thanks! Pi.1415926535 (talk) 20:13, 29 May 2019 (UTC)