Archives:

User talk:ShakespeareFan00/Archive1
User talk:ShakespeareFan00/Archive 3
User talk:ShakespeareFan00/Archive 4
User talk:ShakespeareFan00/Archive5

A vote of supportEdit

For what's worth, I believe the Metapedians were way out of line with their dismissive attitude and thinly-veiled threats. As I stated on the Administrator's Noticeboard, the issues you've raised are of vital importance and should not be pushed aside under the catch-all phrase of "Not Censored". I seriously hope you'll reconsider your leave of absence: as far as I can see, you're not "too close to the problem"; you're one of the few people to grasp the full implications of the proposed age filter and - unlike many others - are capable of approaching it with a clear head and a rational mind. All the best. AshFriday (talk) 01:09, 17 March 2019 (UTC)

AshFriday, I've spoken to ShakespeareFan00 since leaving and I'm sure he'll appreciate your kind words. I'd like to clarify that both mine and Nick's opinions were not based only on the one incident. SF00 understands there are basic levels of competence and trust required and is worried that he is not currently due to his moral and legal panics able to maintain them. I'm sure he will return in the near future with a fresh pair of eyes once his break is over and rediscover the Excellent community that WikiMedia Projects have. — Preceding unsigned comment added by RhinosF1 (talk • contribs) 06:46, 17 March 2019 (UTC)

PD-US-expiredEdit

Books like File:The apple- king of fruits (IA applekingoffruit00powe).pdf that are published in the US before 1925 do not need any review. The fact that authors may have lived on later into the 20th C. is not relevant in US law, the work is public domain by default. It's only relevant to start questioning death dates for non-US publications before 1925.

Unfortunately there's no consistent way of testing for country of publication in the IA metadata. -- (talk) 22:30, 20 June 2020 (UTC)

@: . With that one it wasn't clear from the metadata, when the exact publication date was. If there's a more appropriate tag to say this metadata may be incomplete...
I was using {{Placeholder date}} to mark very ambiguous dates, Placing the category that adds to pages into the appropriate maintenance category and backlogs would be appreciated.ShakespeareFan00 (talk) 07:42, 21 June 2020 (UTC)
The discussions at Commons:Village_pump/Proposals#Mass_"Evacuation"_copy_of_Public_domain_resources_from_Internet_Archive_to_Commons. also seem to be moving in a certain direction. ShakespeareFan00 (talk) 07:44, 21 June 2020 (UTC)
If we are talking hundreds of books with not-good metadata but are very probably unlikely to be challenged as public domain, it may be simpler to create our own housekeeping category like Category:Books uploaded from Internet Archive with incomplete dates or other metadata. Someone prepared to surf to WorldCat or visit the source archives, may then be able to double check before removing from the housekeeping backlog. -- (talk) 09:50, 21 June 2020 (UTC)
@: That might be a good idea anyway. Although a concern raised in the linked proposal was to do with the time , potential reviewers and researchers have to undertake such reviews and research anyway. {{Placeholder date}} was a pragmatic response to an immediate issue, it isn't necessarily intended as permanent long term answer. 10:22, 21 June 2020 (UTC)

ResidualsEdit

There are a fair number of un-uploadable files. Apart from size, I'm not sure how they might be identified automatically. Here's the first automated upload fork with now 9 persistant residuals (just re-trying them while writing this, it's had 4 tries so far):

001 motionpicturenew121unse    252.7 MB (☒ 4) 
002 motionpicturenew153unse    138.3 MB (☒ 4) 
003 motionpicturenew171unse    259.9 MB (☒ 4) 
004 motionpicturenew172unse    327.3 MB (☒ 4) 
005 motionpicturenew192unse_0  332.9 MB (☒ 4) 
006 motionpicturenew212unse    183.3 MB (☒ 4) 
007 radi19241101219241925radi  318.7 MB (☒ 4) 
008 radio192317919231924radi   278.3 MB (☒ 4) 
009 talkingmachinewo13bill     274.4 MB (☒ 4) 

Any thoughts on what might connect these? Note that for #9, someone has left a review comment saying the jp2000 zip was corrupted. -- (talk) 12:02, 10 July 2020 (UTC)

I do not know why these might be residuals... 12:11, 10 July 2020 (UTC)
More usefully User talk:Fæ/IA books/residuals is now automatically created. Anyone that wants to try different upload methods, might find this a handy backlog to attempt. -- (talk) 11:10, 20 July 2020 (UTC)

BlankEdit

Any thoughts on File:Medical Heritage Library (IA b30417983).pdf? It's not rendering, but this may be a failure at the IA end as though it's viewable in the IA site itself, when I try to look at the PDF in my browser using the IA PDF download link, it's all blanks too, plus the file size looks suspiciously small. -- (talk) 07:57, 21 July 2020 (UTC)

Would suggest re-generating this from the original scans locally, if you can get a torrent. ShakespeareFan00 (talk) 08:01, 21 July 2020 (UTC)

Lint concern...Edit

:(copied from Template talk:Image generation)

A sample invocation like :-

{{Igen|I}}

generates :

<span style="vertical-align:middle"><div class="mw-content-ltr">
{| cellspacing="0" style="color:#000;background:#F8F9FA;border:1px solid #BAB;margin:.1em;width:;float:left;vertical-align:middle" class="createdwithtemplate layouttemplate"
| style="width:1.2em;height:1.2em;padding:.2em;" | [[File:Inkscape Logo.svg|20px|link=File:Inkscape Logo.svg|center]]
| style="font-size:.85em;padding:.2em;vertical-align:middle;"|This &#32;[[:w:Computer graphics|graphic]]&#32;was created with [[:w:Inkscape|Inkscape]]..
|}</div></span>

This is clearly not good as neither a DIV or a table can be place inside a nominal SPAN.

The template is too convoluted for me to figure out what it's calling to make any sense of where the DIV SPAN flip actually takes place, and so it would be much appreciated if someone could look into resolving this, or converting this template into something slightly more readable. ShakespeareFan00 (talk) 20:40, 20 September 2020 (UTC)


Hi Shakespeare, seems that you know a lot about such things – you can help to make it better.
The code you found is generated by Created with, an old template from 2009, now used one million times. I had not found yet where the SPAN comes from, but DIV & table origin by that template, which is transcuded at the end of a chain of other templates.
I had discussions about the best method to draw boxes; while all the older templates generate them as tables, some newer do it with SPAN - I just converted Projekt Wappen, which can now cause other problems. With my reduced knowledge I managed that the output boxes of Valid SVG, Created with and others ared displayed in the same line. May be that not all needed workarounds comply to all rules. Of course it will be possible to get the same output with clearer code!

The problem I mentioned is: As can be seen at e.g. Carniola Arms.svg three boxes are displayed side-by-side. It is intended that the third box (about heraldry) should appear in the same line when there is enough space, or otherwise in the next line. But currently the third box can be split up to two or even more lines, which looks ugly and is not at all intended. You can see these reactions by zooming the page. Do you know a solution for that small problem - display in the same line when all fits, or in the next line when not, but never break it?

The other inconveniencies with mixed DIV, SPAN and table can then also be converted and solved. -- sarang사랑 12:05, 21 September 2020 (UTC)

@Sarang: CSS has an attribute for handling whitespace wrap behivour - https://developer.mozilla.org/en-US/docs/Web/CSS/white-space. You might want to look into that (The thought here is that the 'box' is wrapped, that way it will break 'before' or after the box dependent on space avialable, but not within the box itself, as I understand it.). Also examine if display:inline-block is applicable to the use case. Also I strongly suggest moving inline CSS that's static into a Templatestyle. I don't claim to be an expert on HTML/CSS matters though. ShakespeareFan00 (talk) 12:13, 21 September 2020 (UTC)
Thank you, the problem with the third box seems now solved; and I can treate other templates with similar problems the same way. The method with inline-block needs that all boxes are under the control of one instance(?) which is rather not applicable. The div/span/table-mix runs since more than ten years, a few weeks more won't be too bad, so I will try to find out in peace a real good solution. I'll keep it in my mind, and it's time will come -- sarang사랑 14:27, 21 September 2020 (UTC)

CSS problemsEdit

Thank you for editing Projekt Wappen. Because this is only one of several boxes which can be displayed one-beside-of the others (as long as the page width allows) I tried to generate all these boxes by one new template; my intention is that all these boxes have the identical height - independent of the size of the icon, no matter when the icon is too large to fit into the box completely. I thought that the attribute height will care for that – but it does not. I made an example with four boxes and got different heights. Can you help me? -- sarang사랑 15:39, 23 September 2020 (UTC)

IA books copyright statement searching in OCR textEdit

Got around to a fairly simple page search for copyright statements. You may have ideas for a better search expression, currently using '©|copyright|all rights reserved' (case insensitive). In the obvious example of Category:Documents from the US Naval Postgraduate School Library, the first handful of matches show up as:

172 96 A computational tool for the rapid design and prototyping of propellers for underwater vehicles (IA acomputationalto109453007) 
  1 ©2007 Kathryn Port D’Epagnier. All rights reserved. 
257 61 Acoustic characterization of a stationary field synchronous motor (IA acousticcharacte1094524382) 
  2 © E.C. Woodward Jr., 2001. All rights reserved. Permission is granted to MIT for 
282 97 Acoustic scattering of broadband echolocation signals from prey of Blainville's beaked whales modeling and analysis (IA acousticscatteri109452330) 
  1 c Benjamin A. Jones, 2006. All rights reserved. 
283 163 Acoustic scattering from sand dollars (dendraster excentricus) modeling as high aspect ratio oblate objects and comparison to experiment (IA acousticscatteri109453694) 
  1 e Gregory C. Dietzen,2008. All rights reserved. 
290 158 Acoustic travel time perturbations due to an internal tide and internal wave field in the Barents Sea. (IA acoustictravelti00rayd) 
  6 All rights reserved. 
389 120 Adapting e-management to support geographically dispersed military training (IA adaptingemanagem109453724) 
  11 Software applications by Powersim (Copyright Powersim Software 

This shows how deep in the category the file is, the number of pages in the pdf, the filename, then on the next line the pdf page number(s) with the matching lines text, currently only looking as far as page 25. This may not be reliable as a way of identifying copyright issues for certain, but it could populate a category for later review. The same script could be adapted for any useful pdf text searching, on the presumption that the OCR at IA leaves metadata behind in the same way. -- (talk) 21:00, 1 November 2020 (UTC)

Looks good so far. There was already a category for moving items to be reviewed to.. ShakespeareFan00 (talk) 22:17, 1 November 2020 (UTC)
If the technique Works you could also look for strings like "Hellenic Navy" (I.E Greek navy) and so on to look for Non-US Personell? ShakespeareFan00 (talk) 22:17, 1 November 2020 (UTC)
Or in the other hand for strings that are an obvious indication of potential 'Federal' status? ShakespeareFan00 (talk) 22:17, 1 November 2020 (UTC)
As this is "processing cheap", because it's not actually looking at the PDF but only the metadata held in FileInfo via the API, I'll run an experiment with ©|copyright|all rights reserved|Hellenic Navy and put these in a sub-cat. We can run again if there's something to learn about poor matches or better phrases to use.
Being processing cheap, it may be reasonable later on to run on all 800,000 uploads based on date and highly likely copyright trigger phrases, for files that have not had any human review post-upload. At least then, we have a smart filter of suspect files to recommend a backlog of reviews for. This may also turn up collections that we have no idea might have been an issue.
Refer to Category:IA books copyright review automatically suggested
One issue is non-English texts, whether it's worth doing more in that area, I have no idea but we may be limited by the amount of programming time I have sufficient energy to invest at this moment. -- (talk) 12:41, 2 November 2020 (UTC)
@: The page numbers, Are these 'scan' page numbers (I.E what the Commons selector uses, or internal Page numbering for the work?) ShakespeareFan00 (talk) 13:19, 2 November 2020 (UTC)
They are IA's OCR scanned page numbers. They might be always one less than Commons', not sure. It would be easy to +1 to these in the edit comments if that's better.
Gone ahead and made that change and restarted, we'll see if it is less misleading.
By the way, I think this can work at around 2,000 documents an hour, it may be a bit less. The on-going issue with the API returning truncation errors might mean it keeps on returning the same files wrongly when reading the category, so I'm seeing the same files coming up more than once. Anyway, it sets the principle, if there are bugs in the API, it's something to investigate separately. -- (talk) 13:43, 2 November 2020 (UTC)
Thanks for this. Can you consider doing a seperate run against the /reviewed subcat.. (It shouldn't produce any genuine results). ShakespeareFan00 (talk) 13:47, 2 November 2020 (UTC)
It's about 601 files - Category:Documents from the US Naval Postgraduate School Library/reviewed
Also It might be worth looking at stuff in Category:Documents_from_the_US_Naval_Postgraduate_School_Library_for_license_review to identify other indicators of Non-US Gov status.ShakespeareFan00 (talk) 13:47, 2 November 2020 (UTC)
Will do, may be tomorrow.
Changed the query method, but still got bugs, so now stepped down so that rather than 500 entries a time from a category list, it's getting 8. Phab:T255981 is the unfixed bug.
So it's slower, maybe half previous rate, but still a magnitude faster than downloading each PDF to search it. -- (talk) 14:48, 2 November 2020 (UTC)

Notification about possible deletionEdit

 
Some contents have been listed at Commons:Deletion requests so that the community can discuss whether they should be kept or not. We would appreciate it if you could go to voice your opinion about this at their entry.

If you created these pages, please note that the fact that they have been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with them, such as a copyright issue.

Please remember to respond to and – if appropriate – contradict the arguments supporting deletion. Arguments which focus on the nominator will not affect the result of the nomination. Thank you!

Affected:

And also:

Yours sincerely, ShakespeareFan00 (talk) 23:14, 20 November 2020 (UTC)

Notification about possible deletionEdit

 
Some contents have been listed at Commons:Deletion requests so that the community can discuss whether they should be kept or not. We would appreciate it if you could go to voice your opinion about this at their entry.

If you created these pages, please note that the fact that they have been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with them, such as a copyright issue.

Please remember to respond to and – if appropriate – contradict the arguments supporting deletion. Arguments which focus on the nominator will not affect the result of the nomination. Thank you!

Affected:

And also:

Yours sincerely, ShakespeareFan00 (talk) 23:43, 20 November 2020 (UTC)

File:UK traffic sign 776.svgEdit

 
File:UK traffic sign 776.svg has been listed at Commons:Deletion requests so that the community can discuss whether it should be kept or not. We would appreciate it if you could go to voice your opinion about this at its entry.

If you created this file, please note that the fact that it has been proposed for deletion does not necessarily mean that we do not value your kind contribution. It simply means that one person believes that there is some specific problem with it, such as a copyright issue.

Please remember to respond to and – if appropriate – contradict the arguments supporting deletion. Arguments which focus on the nominator will not affect the result of the nomination. Thank you!

Taivo (talk) 10:00, 26 November 2020 (UTC)

Renewal databasesEdit

Hello. You may know this already but US copyright and renewal records can be queried via several websites, most notably the Online Books Registration and Renewal Page (see also 1925 to 1949 records). Browsing these before nominating a file or category for deletion may prove beneficial. Copyrighted publications needed to be renewed after 28 years, which is critically absent info that should be explained at Commons:Hirtle chart and {{PD-US-not renewed}} (I proposed a modification to clarify over a year ago to no avail). Cheers, --Animalparty (talk) 04:14, 13 December 2020 (UTC)

license questionEdit

I am confused by this File:Rules for compositors and readers ... at the University press, Oxford (IA rulesforcomposit00oxforich).pdf. The author is Oxford University Press. The year of publication is 1912. I read the United Kingdom stuff here. Non-government entity? If not, Crown or Parlimentary? If so, how to proceed? The link to "outofcopyright" is not resolving where I am.--RaboKarbakian (talk) 19:14, 17 December 2020 (UTC)

@RaboKarbakian: OUP is private, normal UK rule on works of joint authorship apply. ShakespeareFan00 (talk) 19:15, 17 December 2020 (UTC)

Thank you! Its probably not the Oxford comma book tho.--RaboKarbakian (talk) 19:21, 17 December 2020 (UTC)

File:Books from the Biodiversity Heritage Library (IA eleventhannualre00hayd).pdfEdit

Hi ShakespeareFan00 noticing your request for rename, I was wondering if adding also the date (1877) would be an option to retrace easier. Though the printing date was 1879 as described. Thank you for your time. Lotje (talk) 07:51, 20 December 2020 (UTC)

I had considered that but thought it was excessive given the alreayd long title. ShakespeareFan00 (talk) 09:19, 20 December 2020 (UTC)
  Done Lotje (talk) 10:20, 20 December 2020 (UTC)

DR and speedyEdit

Hi. Please don't nominate files for DR and then for speedy - it just creates extra work. I understand the frustration that the admins don't action DR's as fast as you would like but we will get to it Gbawden (talk) 09:08, 27 December 2020 (UTC)

Which files got DR and then marked for speedy? This was a situation I was actively trying to avoid.ShakespeareFan00 (talk) 09:37, 27 December 2020 (UTC)

Category for IA speediesEdit

I appreciate the work you're doing to clean up incidental copyvios from the IA mass upload. Would you be willing to add a tracking category (Category:IA book uploads for speedy deletion or whatnot) to these? That would allow me or others to run VFC on the whole tracking category, since those images don't need to be individually checked like the regular copyvo queue. Cheers, Pi.1415926535 (talk) 20:06, 28 December 2020 (UTC)

The reason I haven't done that as such, is that even though they are speedy, there should still be a human in the loop somewhere. On evaluation some of the media I thought was speedy, turned out to in fact need a licence change instead. I was already using VFC to mark entire categories as "speedy" or to file DR's. ShakespeareFan00 (talk) 22:32, 28 December 2020 (UTC)

Commons:Deletion requests/File:Bonner zoologische Beiträge - Herausgeber- Zoologisches Forschungsinstitut und Museum Alexander Koenig, Bonn (1976) (20204767260).jpgEdit

FYI, I changed the speedy tag to a discussion. De728631 (talk) 00:32, 29 December 2020 (UTC)

Massachusetts public documentsEdit

I went over a few of your discussions that dealt with Massachusetts public documents without knowing that they were in the public domain; would you willing to relicense those and close those discussions? I don’t have a list of discussions, but they were started (by you) after 16 December 2020. TE(æ)A,ea. (talk) 01:57, 14 January 2021 (UTC).

If they can be re-licensed I'll certainly look into it, a list of affected files would be appreciated. ShakespeareFan00 (talk) 08:53, 14 January 2021 (UTC)
  • I only found two that I looked at specifically and you hadn’t changed, but I think there may be a few more: 1 (not applicable) and 2. There’s also one from Maryland, but I don’t know if there are laws from there allow that. I will look for more in the future. TE(æ)A,ea. (talk) 14:09, 14 January 2021 (UTC).

Re: deletion requests (and other works)Edit

I started my work by going through the deletion requests you made, as most of them were for these works. I went through reverse-chronologically, and am now working at 08 December 2020 requests. I have made comments on all requests, with the exception of works that were clearly under copyright, and Chinese works, as I do not know how registrations for those works would be classified. It would be nice to get some of these images renamed, but unless I am able to add the captions to a list, I don’t want to have to manually make hundreds upon hundreds of rename requests. Thank you for pointing out the PetScan query; I shall go through it after I finish going through these deletion requests, and some during, as well. TE(æ)A,ea. (talk) 18:48, 17 January 2021 (UTC).