Wikisource:Bot requests

Bot requests

This page allows users to request that an existing bot accomplish a given task. Note that some tasks may require that an entirely new bot or script be written. This is not the place to ask for help running or writing a bot.

A bot operator performing a task should make a note of it so that other bots don't attempt the same task. Tasks that are permanently assigned or scheduled for long-term execution are listed on Persistent tasks.

Unassigned requests

Merge multiple gap templates

Some editors have been using multiple instances of {{gap}} (i.e. {{gap}}{{gap}}{{gap}}{{gap}}{{gap}}) instead of {{gap|10em}}. Can a bot please fix these, as in this edit? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:21, 27 April 2021 (UTC)

We can get a bot to do the fix, though finding the pages is the issue. I am unaware of a tool that counts the uses of a template on a page and judges its correctness. If you have a list of pages, or you have /special:prefixindex/ then I can have a look to see what we can do. — billinghurst sDrewth 13:14, 24 May 2021 (UTC)
Being pedantic, one reason to repeat "gap" is to be sure that several "gap"s always give a multiple spacing of "gap", regardless of the default value, should this be changed in future (e.g. see a case like Page:The complete poems_of Emily Bronte.djvu/112). It depends on how likely this scenario is and possible consequences. Maybe a 'per work' approach would be more robust, making explicit the spacing also in case of a single "gap"? Mpaa (talk) 20:03, 5 September 2021 (UTC)
The opposite is true; if the default gap changes, then multiples of it will change at the same time; but 10em will always be 10em. As for finding them, the bot would need to scan all pages that transclude the template. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:43, 6 September 2021 (UTC)
Then, in the page I posted, you also need to explicitly state the spacing for the 'single gap' occurrences, not just collapse the 'multiple gaps'. Mpaa (talk) 20:15, 6 September 2021 (UTC)
@Pigsonthewing: It would have been polite to advise me, for what you termed 'abuse' of gap or whatever; I couldn't decide if you were joking. I do this habitually and am happy to discuss why … elsewhere. CYGNIS INSIGNIS 21:57, 5 September 2021 (UTC)
Why would I? This is a generic issue, not a complaint about your behaviour. If you have objections, you should raise them here, or they are likely to be overlooked. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:43, 6 September 2021 (UTC)
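For whoever picks this up, the substitution itself could look roughly like the sketch below. It assumes a bare {{gap}} is 2em wide (implied by the request, where five gaps become 10em) and only touches runs of two or more bare {{gap}} calls with nothing between them. The function name merge_gaps and the constant are made up for illustration, and finding the pages to run it on is a separate problem.

```python
import re

# Assumption: a bare {{gap}} renders as 2em, as implied by five gaps -> 10em.
DEFAULT_GAP_EM = 2

# One bare {{gap}} call (no parameters); the first letter may be either case.
SINGLE_GAP = re.compile(r"\{\{\s*[Gg]ap\s*\}\}")

# Two or more bare {{gap}} calls directly following one another.
REPEATED_GAP = re.compile(r"(?:\{\{\s*[Gg]ap\s*\}\}){2,}")


def merge_gaps(wikitext: str) -> str:
    """Collapse each run of repeated bare {{gap}} into one explicit-width gap."""

    def replace(match: re.Match) -> str:
        count = len(SINGLE_GAP.findall(match.group(0)))
        return "{{gap|" + str(count * DEFAULT_GAP_EM) + "em}}"

    return REPEATED_GAP.sub(replace, wikitext)
```

Single {{gap}} calls and calls that already carry a width are left alone, so the point raised above about making the spacing explicit for single gaps would need a separate per-work decision.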

Move all subpages of Who's Who in the Far East (June) 1906-7 to use title case

I was informed by User:Beeswaxcandle that I should use title case instead of all caps in article names. So I request to move all subpages of Who's Who in the Far East (June) 1906-7 to use title case. Although I can use a bot to move them myself, that would leave tons of redirects for admins to delete. But if an admin can easily batch-delete a list of pages, I can move them myself and then provide the list of pages to delete. I'm sorry for the inconvenience. Thanks, --Stevenliuyi (talk) 08:58, 6 May 2021 (UTC)

@Stevenliuyi: Please review the list at Wikisource:Bot requests/sandbox. I notice that there is at least one English name that needs to be fixed, and the Chinese names didn't convert on the regex that I used. Would you fix or create the target (only) in the pair list, and I will get it done. No need to fix those that are broken, though you should fix the previous/next links of the articles either side. Note that, as I did for your other work, I will look to get a work-specific template in place, though I will do that afterwards. — billinghurst sDrewth 13:10, 24 May 2021 (UTC)
I suppose that I really do want to ensure that the Chinese names are capitalised properly. — billinghurst sDrewth 02:57, 25 May 2021 (UTC)

Add {{R from case citation}} to all redirects from case citations to cases

Would it be possible for a bot to detect redirects pointing from a case citation (e.g. 347 U.S. 483) to a case (here, Brown v. Board of Education), and add {{R from case citation}} to these? It would be useful to have all case citations in one category, given the obvious utility of Wikisource as a caselaw database for lawyers. BD2412 T 04:25, 24 May 2021 (UTC)

@BD2412: What's the identifying feature of a case page? Is it in a particular category? Does it contain a unique template? Once identified, should all redirects to these be so tagged, or only a subset of them? Do the redirects to these have a distinguishing feature to identify them as distinct from all other redirects on the site?
Adding a category to a set of pages is pretty straightforward, so the challenge is how to identify those pages automatically. Doing something to one page (the redirect) based on properties of another page (the actual case page) can also be challenging depending on the details (it may require writing a custom bot rather than just running one of the existing scripts for pywikibot).
Also, how many of these are there? If the criteria are complex, and the number of pages relatively low, it may be better to do it manually or semi-automated (a user script in the browser that finds and tags redirects to the current page on request, say). Xover (talk) 04:06, 25 May 2021 (UTC)
I have been thinking about exactly those issues. I don't know that we have any case citation redirects from non-U.S. cases, so the group to start with would be documents in the category tree under Category:United States case law by court. The redirects themselves will all be in a [Number] [Reporter] [Number] format, so for example the first page of results from a search for pages starting with "1" is almost entirely redirects to cases (everything from 100 L.Ed. 1003 on, with a few exceptions). So, anything in that format redirecting to something in that category tree should be a case citation redirect. I would also note that a great many of these were generated by User:BenchBot when that bot was active, and just counting those from the bot's contributions, there are over 9,000. I suppose I could do those manually, or use BD2412bot, but that will leave stray case citation redirects added by others. BD2412 T 04:25, 25 May 2021 (UTC)
I am doing some manually to see if there are any hitches that come up that way. BD2412 T 19:30, 25 May 2021 (UTC)
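As a rough illustration of the title test described above, a check for the [Number] [Reporter] [Number] format might be sketched as follows. The pattern is an assumption about what reporter abbreviations look like (letters, digits, full stops and spaces, starting with a letter), and parse_citation is a hypothetical helper name. It covers only the title format; confirming that the redirect target sits in the category tree under Category:United States case law by court would still have to be done separately.

```python
import re
from typing import Optional, Tuple

# Assumed shape: volume number, reporter abbreviation, first-page number.
# The reporter must start with a letter and may contain letters, digits,
# full stops and spaces (e.g. "U.S." or "L.Ed.").
CITATION = re.compile(r"^(\d+)\s+([A-Za-z][A-Za-z0-9. ]*?)\s+(\d+)$")


def parse_citation(title: str) -> Optional[Tuple[int, str, int]]:
    """Return (volume, reporter, page) if the title looks like a case citation."""
    match = CITATION.match(title)
    if match is None:
        return None
    return int(match.group(1)), match.group(2), int(match.group(3))
```

A title that parses is only a candidate; the few exceptions mentioned above would still need the category check or a manual look.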

ReOCR Index:This side of paradise (IA sideofparadise00fitzrich).pdf

This is a text that has no proofreading done on it. The OCR is from 2006 and is of quite poor quality. I am requesting that the text be reOCRed. Since the image quality is quite poor, it also might make sense to create a fresh DJVU from the IA scans. Languageseeker (talk) 02:27, 18 October 2021 (UTC)

  Not done I am unaware of anyone having an OCR bot. For the new version of the output, best to put a request to [[WS:S]] rather than here. — billinghurst sDrewth 13:32, 18 October 2021 (UTC)
The text for the djvu version seems OK to me. Maybe reuploading the djvu file is enough. Mpaa (talk) 17:58, 18 October 2021 (UTC)

Wikidata bulk edit

I made a query for works on enWS that have WD items with no "instance of" statement. The criteria I used are:

  • Pages in mainspace
  • No redirects or disambiguation pages (this includes Versions and Translations btw)
  • Does not contain a forward slash in the page name (in order to exclude subpages)
  • Is linked to Wikidata, and linked Wikidata item does not have a P31 statement

This query returns 13889 results, which is more than even QuickStatements can handle. Would it be possible for a bot to update these Wikidata items with P31=Q3331189 (instance of = version, edition, or translation)?

Thanks :) —Beleg Tâl (talk) 13:22, 1 November 2021 (UTC)

I think we could be more specific for certain groups, e.g. I have addressed "Presidential Radio Address" articles as "instance of speech". There are several groups of articles that can be identified and then addressed with QuickStatements. After that, the bot can be run on what is left. Mpaa (talk) 23:13, 1 November 2021 (UTC)
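If the remaining items end up going through QuickStatements in smaller pieces, the command text could be generated along these lines. This is only a sketch: it assumes the query results are available as (page title, Wikidata item ID) pairs, it uses the tab-separated QuickStatements version 1 command format, and the function name and batch_size default are arbitrary choices for illustration. The slash filter repeats the subpage criterion from the request; the other criteria are assumed to have been applied by the query already.

```python
from typing import Iterable, List, Tuple

TARGET_CLASS = "Q3331189"  # version, edition, or translation (from the request)


def quickstatements_batches(
    pages: Iterable[Tuple[str, str]], batch_size: int = 1000
) -> List[str]:
    """Build batches of 'item <TAB> P31 <TAB> Q3331189' commands.

    pages: (page title, Wikidata item ID) pairs from the query.
    batch_size: hypothetical chunk size, not a documented tool limit.
    """
    # Skip subpages, i.e. titles containing a forward slash.
    commands = [
        f"{qid}\tP31\t{TARGET_CLASS}" for title, qid in pages if "/" not in title
    ]
    # Split the command list into chunks that can be submitted one at a time.
    return [
        "\n".join(commands[start:start + batch_size])
        for start in range(0, len(commands), batch_size)
    ]
```

Groups that deserve a more specific class, such as the speeches mentioned above, would be taken out of the input list first and given their own value.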

Assigned requests