Wikisource:Scriptorium/Help/Archives/2021

Warning Please do not post any new comments on this page.
This is a discussion archive first created in , although the comments contained were likely posted before and after this date.
See current discussion or the archives index.

Putting a letter "inside" another letter

I'm in a situation where I (guess?) I have a logo that says TC, but in that original logo, the T is larger and the C is inside it. See Page:A Dog's Love (1914).webm/1. Is there some trick to emulate this on Wikisource, or do we just not do it at all? (Not that it's incredibly important to the transcription...I mean it's just the letters "TC" that could be found anywhere.) PseudoSkull (talk) 17:23, 1 January 2021 (UTC)

Logos are done as images rather than textually re-created. Just rip the image from the scan. Beeswaxcandle (talk) 17:32, 1 January 2021 (UTC)

Page doesn't display in read mode

After my last edit THIS PAGE no longer displays the read view. Already purged it and was curious what causes this and how can I deal with it myself when I come across such issues in the future.— Ineuw (talk) 18:43, 1 January 2021 (UTC)

The problem is in {{sc|{{nop}} at the top. What do you want to achieve by that? --Jan Kameníček (talk) 19:04, 1 January 2021 (UTC)
@Ineuw: Use {{nopt}} instead of {{nop}} for tables. And you'll often run into trouble if you to use {{sc}} on whole blocks of stuff. Since you have a {{ts}} on every table cell there, I suggest you move the small-caps formatting there instead ({{ts|sc}}). --Xover (talk) 19:19, 1 January 2021 (UTC)
I just did not see any small caps in the original, so that is why I asked about the purpose. But now I can see that Ineuw removed the template and the page works. --Jan Kameníček (talk) 19:44, 1 January 2021 (UTC)
I feel terrible, so let me start with wishing you and everyone else a healthy and happy new year. This was a Logitech keyboard and mouse error and Windows 10, with which I am dealing with currently. It has nothing to do with Wikisource, or the Firefox browser which I originally suspected. Logitech's problems also affected my main editing tool AutoHotkey, which triggered the macro generating {{sc}} and I didn't notice it. Thanks to everyone for the help.— Ineuw (talk) 19:55, 1 January 2021 (UTC)
@Ineuw: No problem, funny things sometimes happen to all of us :-) Happy new year to you too! --Jan Kameníček (talk) 20:11, 1 January 2021 (UTC)

Source file with Corrigendum published later

I'm considering to import the Singapore Arms and Flag and National Anthem Rules 2003, in which some images are later omitted by a corrigendum. So how should I deal with the corrigendum (i.e. follow the original version or the version amended by the corrigendum)? Many thanks.廣九直通車 (talk) 09:20, 4 January 2021 (UTC)

@廣九直通車: I think create the two documents (Rules 2003 and Corrigendum) as two separate documents, following the source documents, and make a note in the header of each about the presence of the other. Creating versions of laws at every stage of amendment often turns into a mess, so I'd steer clear of synthesising an "as-amended" version. Inductiveloadtalk/contribs 10:49, 4 January 2021 (UTC)

Tag: Bad markup

I've just spotted that a recent edit I made here has been tagged with bad markup. I'm not sure what I have done wrong, but assume it's to do with my adding two images. Can anyone advise what problem I have caused? I may have created more of these before without noticing them, sorry for that. Sp1nd01 (talk) 23:25, 9 January 2021 (UTC)

I'm not seeing any warning message, but I do see you've introduced a paragraph break that isn't present in the original. --EncycloPetey (talk) 00:21, 10 January 2021 (UTC)
@Sp1nd01: This is because you have used the align="right" HTML attribute in the table. The use of the align attribute is deprecated the the HTML standards, and the better modern markup is to use something like CSS: style="float:right;" (the equivalent for align=center is margin:auto;). It's unlikely to cause major issues on export, because the export tool actually removes them and replaces with CSS. The original motivation was tables were coming out as text-align:center; for align=center, but it turned out to be an issue with the "fix" in the export tool, rather than programs not handling align. There's a patch on the way. phab:T270807.
Also, use of the tags <big>, <small>, <center>, <tt>, <font> set off the filter, as these tags are also deprecated. These will also trigger a linter error). Deprecated attributes are not (yet) a linter error; phab:T173944 is a task to add them as such.
I will write some documentation for this, as it is indeed not clearly stated.
This markup will not cause major harm in the short (or mid) term, but over the longer term, we will need to clean it up one day as phab:T26529 progresses. Inductiveloadtalk/contribs 00:40, 10 January 2021 (UTC)
I tried using {{img float}} with the signature in the cap parameter. It seems to work well, but if you disagree, feel free to revert. --Jan Kameníček (talk) 00:59, 10 January 2021 (UTC)
Thanks for the detailed explanation, its useful to know the reason. I wasn't aware of these html details, and to be honest I usually just try and copy what style (if any) had been previously used on a book for consistency, so I'm sure there are going to be other pages with the same problems present in that volume. I do like the img float solution, and I didn't know that an image could be added for the caption, that's nice to know. If it's ok I will use that approach for any future images I add, and if I can track any other examples will change them also. Sp1nd01 (talk) 10:52, 10 January 2021 (UTC)

Fostered content...

Page:An Index of Prohibited Books (1840).djvu/73

I fail to see where the 'fostered' content is coming from. Suggestions? ShakespeareFan00 (talk) 18:16, 11 January 2021 (UTC)

@ShakespeareFan00: Why are you using a table for this anyway? This seems to work OK:

Example

{{plainlist|1=
* {{di|T}}he quick brown fox
* Jumped over the
* Lazy dog
}}
  • The quick brown fox
  • Jumped over the
  • Lazy dog
Wrong answer Not sure about the fostered content yet.
Looks like your fostered content was because you had an extra table header in the page header field (but it was scrolled off the bottom): Special:Diff/10829638. Inductiveloadtalk/contribs 18:34, 11 January 2021 (UTC)

Too many short chapters

This volume has 75 Chapters with many adjacent Chapters are one page long. Is it permitted to name the main namespace page as Chapters 36 to 39. I am using anchors to access these embedded chapters from the TOC and elsewhere.— Ineuw (talk) 05:05, 17 January 2021 (UTC)

I would keep them as a subpage per chapter. By doing so, you're keeping better to the author's/editor's intent. I'm dealing with a similar situation on Anecdotes of Great Musicians, which has 300 short items, some only a couple of paragraphs long. Beeswaxcandle (talk) 06:32, 17 January 2021 (UTC)
That is what I am planning but the main ns page name should also indicate the chapter numbers subbed beneath it. These short chapters are 1st level TOC entries.— Ineuw (talk) 11:48, 17 January 2021 (UTC)

Uploading of an original translation

Dear Sir/Madam,

I am a newcomer to Wikisource and need some guidance on how to go about submitting a text. I recently contributed to the transcription of a renaissance Italian text from the original manuscript on Wikisource of "Peregrinaggio di tre giovani figliuoli del re di Serendippo.djvu/1" by Christoforo Armeno (1557) with the purpose of creating a source text for translations. The translation in English was recently completed and I would like to upload it to Wikisource. For your information, translations in various European languages have been created since the original was published in 1557, but they vary a great deal in adherence to the original, as for them being a literary translation or even the adherence to the original text. The English translation I would like to upload is close to the original Italian version and is original, so not subject to copyright restrictions. I intend to make it available to Wikisource under a Creative Commons Attribution-ShareAlike 3.0 Unported License (“CC BY-SA”), and GNU Free Documentation License (“GFDL”) (unversioned, with no invariant sections, front-cover texts, or back-cover texts).

Since I have never uploaded a text to Wikisource, I am looking for guidance as to the various steps to take to complete a successful upload.

Your guidance is appreciated;

Regards,MvRwiki1944 (talk) 00:33, 19 January 2021 (UTC)

Mark van Roode Wikisource username: MvRwiki1944 vanroodemark@aol.com

@MvRwiki1944: Welcome to Wikisource. It sounds like your translation will be a great fit and a welcome addition. The steps are:
There are some (sparse) details at Wikisource:Translations. Let me know if there's anything you are not clear on. Inductiveloadtalk/contribs 11:50, 19 January 2021 (UTC)

ENGVAR?

Please can someone advise on WikiSource's policy regarding the variation of English spelling used? I was reading The Bronze Ring, written by a Scottish author, and noticed that the text uses American spelling. Is this something that should be changed? Regards, DesertPipeline (talk) 11:22, 19 January 2021 (UTC)

@DesertPipeline: We use the text as it was published in the edition being reproduced. I will note that we have a variety of UK editions and US editions, especially noting that may of the scanned works come out of US libraries and can therefore have a bias to US editions for main works. If you don't know which was used in the edition being reproduced, then do nothing. — billinghurst sDrewth 11:39, 19 January 2021 (UTC)
It depends on the edition the text comes from. In this case, the text comes from Project Gutenberg, so the source document is unknown and they may have made editorial changes. Ideally, we'd be looking to transition this to a known edition with scans to prove it. For example Index:Lange - The Blue Fairy Book.djvu, which was published in London and New York and has UK spellings, e.g. "colours". Inductiveloadtalk/contribs 11:42, 19 January 2021 (UTC)
Thank you both for the responses; I'll leave it alone then and let someone with more experience in these matters handle it :) DesertPipeline (talk) 12:03, 19 January 2021 (UTC)
@DesertPipeline: you're more that welcome to proofread the work against the scan if you'd like to help, and I can assist with queries you may have. "Letting someone with more experience in these matters handle it" effectively means that probably no-one will touch it for years and years unless it becomes a WS:POTM or someone else really wants to move it over, because our backlog of unsourced or PG copy-dumps is...large. Inductiveloadtalk/contribs 12:15, 19 January 2021 (UTC)
Is the scan available somewhere for me to view? I can possibly do that tomorrow if so. DesertPipeline (talk) 16:48, 19 January 2021 (UTC)
@DesertPipeline: Indeed it is! Index:Lange - The Blue Fairy Book.djvu, along with some of the tricksier formatting done already. The Bronze Ring starts at Page:Lange - The Blue Fairy Book.djvu/29. Inductiveloadtalk/contribs 16:52, 19 January 2021 (UTC)
Hmm, the second link is red for some reason. Also I notice that link was sent earlier – not sure how I missed it :) I'm not sure what the filetype .djvu is though. Something like pdf? DesertPipeline (talk) 16:58, 19 January 2021 (UTC)
Strange, it's blue now... did you create the page while I wasn't looking or was that just a bug? :) DesertPipeline (talk) 16:59, 19 January 2021 (UTC)
@DesertPipeline: I just created that page for you as a demo. DjVu is quite like PDF, it's a container of images and text in a single file. It has a few benefits over PDF for the purposes of scanned documents, but the difference isn't really important for now. You don't really need to worry about it—the point is that there is a Page:XXX page for each page of the book. Inductiveloadtalk/contribs 17:02, 19 January 2021 (UTC)
Very well, thank you :) I'll write this down on a note on my desktop so I don't forget to do it tomorrow. See you around, and thanks again! DesertPipeline (talk) 17:07, 19 January 2021 (UTC)
Good luck and remember you can leave me a message here (use {{ping|Inductiveload}} or on my talk page if you get stuck. Some basic instructions are at H:Formatting conventions, but some things aren't covered clearly (yet). Inductiveloadtalk/contribs 17:09, 19 January 2021 (UTC)

Moving/copying a source from the wrong wiki

I've recently found a source that has (it seems to me) been uploaded to the wrong place. Here is the URL:

https://wikisource.org/wiki/Index:Melancholy_loss_of_the_whale-fishing_ship_Oscar,_of_Aberdeen,_on_Thursday,_April_1,_1813.pdf

It's a short pamphlet (in English) that has been loaded to the main wikisource domain/namespace/whatever instead of English wikisource. Is it possible to move it across to English wikisource, retaining its edit history and leaving a redirect in the old place? Is it actually necessary to do that, and is it perfectly OK where it is?

My intention is to transclude these pages into a single one, then put a link to it at https://en.wikipedia.org/wiki/The_Wreck_of_the_Oscar_(1_April_1813)

Chuntuk (talk) 20:56, 15 January 2021 (UTC)

I imported the pages but now realise the file is also at mulWS, not Commons. Could a Commons transwiki importer please import https://wikisource.org/wiki/File:Melancholy_loss_of_the_whale-fishing_ship_Oscar,_of_Aberdeen,_on_Thursday,_April_1,_1813.pdf to Commons? Inductiveloadtalk/contribs 21:58, 15 January 2021 (UTC)
The move has now been done by Billinghurst. See Talk:Melancholy loss of the whale-fishing ship Oscar, of Aberdeen, on Thursday, April 1, 1813. Inductiveloadtalk/contribs 16:57, 24 January 2021 (UTC)
@Inductiveload: There is now a feature "FileImporter" that anyone can use at any wiki to move files to Commons. It doesn't need any special requirements just the ability to have an account to upload to Commons. Detail at mw:Help:Extension:FileImporter. Works WAY better than transwiki as it grabs the whole underlying construct of the file and moves it. — billinghurst sDrewth 22:25, 24 January 2021 (UTC)
  This section is considered resolved, for the purposes of archiving. If you disagree, replace this template with your comment. Inductiveloadtalk/contribs 16:57, 24 January 2021 (UTC)

There's a problem in which I can't get the Template:rule in the right place on the TOC. I tried. Anyone have any solutions to get a line in between those two row items? PseudoSkull (talk) 16:25, 23 January 2021 (UTC)

Issue resolved by User:ShakespeareFan00 PseudoSkull (talk) 17:37, 23 January 2021 (UTC)

Hanging indent paragraphs spanning multiple pages?

Page:History of Woman Suffrage Volume 5.djvu/804 How can I join hanging indent paragraphs spanning several pages so that they show up properly in the main namespace? — Ineuw (talk) 20:35, 25 January 2021 (UTC)

@Ineuw: yes, you can, with {{hi/s}}, {{hi/m}} and {{hi/e}}. The /m goes in the header of the last n-1 pages. Inductiveloadtalk/contribs 20:49, 25 January 2021 (UTC)
@Inductiveload: Many thanks. Is there also a simple way to reduce the font-size of the paragraphs as well? — Ineuw (talk) 21:14, 25 January 2021 (UTC)
@Ineuw: Actually, for that page, {{plainlist}} with a hanging-indent parameter might work better. Firstly, it'll collapse the items to not have gaps, and secondly you only need one pair of templates per page and thirdly it's semantically more correct, as this is is actually a list. Inductiveloadtalk/contribs 21:22, 25 January 2021 (UTC)
@Inductiveload: Thanks again. This index list is 52 pages. It cannot be enclosed in a single {{plainlist/s}}{{plainlist/e}} tag. So, this begs the question on how to do it from the beginning to the end in a generally preferred way that would make the main namespace display correctly?— Ineuw (talk) 02:27, 26 January 2021 (UTC)
P.S: {{plainlist/s}} does not indicate how to add parameters.— Ineuw (talk) 15:55, 26 January 2021 (UTC)
@Ineuw: why can't it go inside a single plainlist? The split templates work just like any other, using the headers and footers. Unordered lists have no limit on items. However, you would probably have a list per letter, since the heading isn't a list item:
{{c|A}}
{{plainlist/s|hanging-indent=1em}}
* A Item 1...
* A Item 2...
...
{{plainlist/e}}
{{c|B}}
{{plainlist/s|hanging-indent=1em}}
* B Item 1...
...
Plainlist/s has the same parameters as plainlist. Inductiveloadtalk/contribs 16:11, 26 January 2021 (UTC)
Much thanks for the example. I never used parameters with start and end templates before, so I was confused. — Ineuw (talk) 16:22, 26 January 2021 (UTC)

Page:Motion Pictures 1912 to 1939 (IA Motionpict19121939librrich0010).djvu/18 is not separating from p. 17 properly. It is supposed to say:

"ACTION FOR SLANDER. Presented by Alexander Korda. Released through United Artists. 1937. 8 reels, sd. From the novel by Mary Borden."

Instead it says:

"ACTION FOR SLANDER. Presented by Alexander Korda. Released through United Artists. 1937. 8 reels, sd. From
the novel by Mary Borden."

Can anyone fix this? PseudoSkull (talk) 17:43, 27 January 2021 (UTC)

@PseudoSkull: It turns out you shouldn't put empty lines between the list items, or it breaks the list into hundreds of single-item lists. You can either remove the gaps, or replace with <!----> (a comment, with or without content). Inductiveloadtalk/contribs 18:34, 27 January 2021 (UTC)

Why I can't make Template:Dotted TOC page listing indent??

After I discovered the existence of {{Dotted TOC page listing}}, the template definitely helps me with my works of Hong Kong laws. However, some of them requires the indentation of some dotted TOC listings (such as Page:Prevention of Child Pornography Ordinance (Cap. 579).pdf/1, section 138A and 153Q to 153R). How can I achieve this??? I tried the most common : to methods like {{Indent}}, but nothing works? Assistance will be appreciated.廣九直通車 (talk) 13:25, 13 February 2021 (UTC)

@廣九直通車:The only way I found is to make a separate entry for "14. Section added" and separate for "138A. Use, procurement…". The problem is that Dotted TOC page listing does not enable to switch the dots off and TOC page listing is a completely different kind of template not suitable for combining with the Dotted TOC page listing within one list. So I tried to make some workaround, replacing dots in line 14. by spaces, the result of which can be seen at my sandbox. It is not ideal, it would be better if the dots in the line 14. could be simply switched off. --Jan Kameníček (talk) 14:03, 13 February 2021 (UTC)
@廣九直通車: I really cannot emphasise enough how strongly I recommend to not use {{Dotted TOC page listing}}. From a technical perspective it is a very bad solution, that creates problems (like the one you bring up here), and is really not needed. The dot leaders are purely stylistic, often exist in a very paper-page specific form, and are better omitted even if they could be added in a technically sound way. This is an issue that's up to each contributor to decide, of course, but I really strongly urge everyone to not use {{Dotted TOC page listing}} except in some hypothetical special case where having the dot leaders is extra special important. --Xover (talk) 14:25, 13 February 2021 (UTC)
Well, they are not only stylistic, their main purpose is to help the eye keep the line, and until somebody finds a better way of generating dot lines (e. g. something as easy as generating dynamic dot lines in MS Word documents), this template is the best thing we have in this regard. However, I admit that they are used also for stylistic reasons, but this stylistics is imo very important too. Without dots we would have to help the eye e.g. by creating bordered tables or something, which would interfere with the style of Contents pages of books (especially old books) too much. --Jan Kameníček (talk) 17:35, 13 February 2021 (UTC)
{{dtpl}} is specifically annoying because it produces a whole, separate, table for every single row. As well as being semantically kind of messed up, this also means it generally doesn't work brilliantly on export.
In this case, I'd say {{TOC begin}} probably produces the easiest result:

Example

{{TOC begin}}
{{TOC row 1-dot-1|spaces=0
 | 1.
 | Short title and commencement
 | A1387}}
{{TOC row 1-dot-1|spaces=0
 | 2. 
 | Interpretation
 | A1387}}
{{TOC row c|3|'''Amendments to Crimes Ordinance'''}}
{{TOC row 1-1-1
 | 14.
 |Section added}}
{{TOC row 1-dot-1|spaces=0
 | 
 | 138A. Use, procurement or offer of persons under 18 for making pornography or for live pornographic performances
 | A1405}}
{{TOC row 1-dot-1|spaces=0 
 | 15.
 | Conviction for offence other than that charged
 | A1407}}
{{TOC end}}
1.
Short title and commencement
............................................................................................................................................................................................................................................................................................................
A1387
2.
Interpretation
............................................................................................................................................................................................................................................................................................................
A1387
Amendments to Crimes Ordinance
14. Section added
138A. Use, procurement or offer of persons under 18 for making pornography or for live pornographic performances
............................................................................................................................................................................................................................................................................................................
A1405
15.
Conviction for offence other than that charged
............................................................................................................................................................................................................................................................................................................
A1407
The dot markup is still "messy" at the HTML level, but it's functional in-browser and removed on export (since very very few readers can deal with it). It might be worth investigating if we can export them only in PDF (since the PDF renderer probably can handle them).
{{TOC begin}} also handles things like vertical alignment and setting text wrapping by default. Inductiveloadtalk/contribs 19:38, 13 February 2021 (UTC)
After some cleanup, I found Inductiveload's solution is very suitable for the case: I just replaced the space between the indented TOC section number and the subtitle with {{Gap|1em}}. Definitely feels good!
By the way, what's the issue of HTML? I think previously when Billinghurst told me not to use <center></center> at here, he mentioned similar reasons.廣九直通車 (talk) 08:37, 14 February 2021 (UTC)
@廣九直通車: Great work! But is there any particular reason you have not marked the pages as Proofread? See Help:Page status for details. --Xover (talk) 10:06, 14 February 2021 (UTC)
Isn't that the pages are for other users to proofread? I'm only the guy who import them into Wikisource...廣九直通車 (talk) 11:36, 14 February 2021 (UTC)
@廣九直通車: "Importing" isn't really a concept in our model. Transcribing and formatting a page is what we call "proofreading". Proofreading can happen iteratively and collaboratively, but once a page is presumed to be "finished" (and thus ready to be transcluded for presentation) it should be marked as "Proofread". At that point it should be double-checked by a second person, and once that's done it is marked as "Validated". When you mark a page as "Not proofread" you're saying there is more work to be done before it's ready, and it should generally not be transcluded for presentation in mainspace. By my cursory look you have finished proofreading the pages and should mark them as "Proofread". I could be wrong, of course, as I only took a quick look. --Xover (talk) 13:44, 14 February 2021 (UTC)
@Jan.Kamenicek: I'll concede your point about dot leaders sometimes being used to help the eye track, but most of the time (in my experience) they are used purely stylistically and sometimes to the outright detriment of readability. And in either case I think the technical disadvantages of {{dtpl}} outweigh any benefit of the dot leaders in all but the most exceptional of cases. I very much wish the draft specification for dot leaders in CSS would materialise soon, but absent that we have no good ways to reproduce them, only various degrees of bad ways, and {{dtpl}} is the worst of the bunch. Please avoid using it whenever possible (I absolutely guarantee that at some point down the line, some poor schmuck is going to have to go through and redo every single page we use {{dtpl}} on, and it's already a bear of a task with what we have so far). --Xover (talk) 13:51, 14 February 2021 (UTC)
OK, many thanks for everyone's assistance!廣九直通車 (talk) 13:59, 14 February 2021 (UTC)

Bug in Text Downloading function

I found that the blue "Download" button on the upper-right-hand corner cannot process Chinese text, instead outputting replacement characters. Can somebody report the stuff to Phabricator? Many thanks.廣九直通車 (talk) 10:33, 17 February 2021 (UTC)

@廣九直通車: have you got a specific example? Inductiveloadtalk/contribs 10:35, 17 February 2021 (UTC)
@Inductiveload:Please try to download any of the transcribed pages on Category:Laws of Hong Kong (like this), or precisely, try to download any texts hosted on Chinese Wikisource (like this). I actually suspect that the problem is not limited to English Wikisource.廣九直通車 (talk) 10:39, 17 February 2021 (UTC)
Ah, I see, it's only in the PDFs - I was looking at the EPUBs. Inductiveloadtalk/contribs 10:41, 17 February 2021 (UTC)
Also, it seems that the bug affects Japanese and Korean as well, as reflected in those (totally scrambled) downloaded Chinese texts. Even the Japanese and Korean disambiguation are affected.廣九直通車 (talk) 10:42, 17 February 2021 (UTC)
@廣九直通車: reported at phab:T274997. CJK is generally a whole "thing", so it's unsurprising that Japanese and Korean are also affected. Inductiveloadtalk/contribs 10:50, 17 February 2021 (UTC)

Missing page 97 for Volume 24 of EB1911

As of Feb. 12, 2021: https://en.wikisource.org/wiki/Page%3AEB1911_-_Volume_24.djvu/110 gives page 96. https://en.wikisource.org/wiki/Page:EB1911_-_Volume_24.djvu/111 gives page 98. Suslindisambiguator (talk) 23:28, 12 February 2021 (UTC)

@Suslindisambiguator:The pages on the index are fine. The page number on "Page:" subpages are reference numbers to the source file's page number.廣九直通車 (talk) 11:38, 14 February 2021 (UTC)
Not really index 97->98; 98->99; and 99->97 . The index needs some cleanup and I'm not sure how to do it. Languageseeker (talk) 03:51, 25 February 2021 (UTC)

Poem tag and page wraps

How do we handle <poem>, when the verse in the source document is across two or more pages? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 10:25, 25 February 2021 (UTC)

You simply finish the page with </poem> and start the new page with <poem> again. Only when a page break coincides with stanza break, you finish the page with
…
Stanza one.
<br />
</poem>
{{nop}}

For details see Help:Poetry --Jan Kameníček (talk) 11:58, 25 February 2021 (UTC)

Alternatively, don't use poem tags at all, but use <br /> between lines and block center/s & /e. This is what many of us do because of the problems of alignment of poems across pages when using poem tags. Beeswaxcandle (talk) 03:39, 26 February 2021 (UTC)

Tooltips lost in PDF Export and in Kobo epub Reader

I noticed that Richard II (1921) Yale has the notes in tooltips. When exported to a pdf, the relevant text is underlined, but there is no pop-up note. When exported to an epub, the comments renders in Calibre, but not on a Kobo. Languageseeker (talk) 13:59, 25 February 2021 (UTC)

Whether or not PDFs can support tooltips rather depends on whether the PDF format supports such a thing. @Samwilson: is there any hope of this working in PDFs?
For EPUB, which is basically HTML, Calibre uses the Chrome engine internally, so it behaves more or less just like a browser. Kobo uses a much less capable rendering engine, based on RMSDK which presumably doesn't handle the title attribute. Koreader also does not handle the title attribute. On a touchscreen, it's tricky to handle in general, because there's no "hover" concept without a mouse, and it's down to the program in question to look for a title attribute on long press.
Something that might be possible is on-the-fly conversion of elements with tooltips to references in the export process.
There are more details about what does and does not currently work at Help:Preparing for export. Inductiveloadtalk/contribs 15:29, 25 February 2021 (UTC)
Thanks for your reply. Seems that thinks are a bit broken with Kobo. Would it be possible to automatically convert tooltips to footnotes when generating an epub? Languageseeker (talk) 04:06, 26 February 2021 (UTC)
For PDFs, could we transform tooltips into comments? Languageseeker (talk) 06:24, 26 February 2021 (UTC)
@Languageseeker, @Inductiveload: This is mainly about the {{tooltip}} template isn't it, and maybe {{SIC}}? I sort of feel like it shouldn't be up to the exporter to do the translation to footnotes; that it'd be better to implement tooltips on-wiki in a better way. Because viewing the wiki on a touch screen is as annoying as on an ereader, as far as not being able to hover goes. Maybe re-implementing them as an 'annotation' reference group would be better? That'd work in print as well, and it'd give ereaders the required structure to be able display them as popups (which my Kobo does). Open to suggestions though! :) — Sam Wilson 09:05, 26 February 2021 (UTC)
Personally speaking they are web page features only, along hte lines of Wikisource:Annotations. The exported works should not have our annotations, and they should be #noprint. — billinghurst sDrewth 12:07, 26 February 2021 (UTC)

Two Enhancement to Search

Would it be possible to create two enhancement to search that would make this site much more usable to those in university?

  1. Add a citation Box to the Mainpage of a series. For example, The Czechoslovak Review would have a box that says Volume:|_| Number: |_| Page: |_| in the top right corner. You can either type in Volume:|3| Number: |4| Page: |97| or you can select the volume and number from a drop down menu and that would take you immediately to The_Czechoslovak_Review/Volume_3/Joža_Úprka#97 or to the Page:The_Czechoslovak_Review,_vol3,_1919.djvu/131 if there is no proofread version.
  2. Allow users to perform a search within a Category. For example, in Category:Periodicals, Current affairs, there would be a search box that would enable to only search the texts in the category. You can also perform an advance search as well. The search would look through both proofread text and OCR text for the words. For instance, you can search for "soldiers" from June 12, 1912 until June 16, 1914 in Category:Periodicals, Current affairs

Languageseeker (talk) 14:50, 25 February 2021 (UTC)

  • For searching within categories, use incategory:. The problem with the first proposal would be finding what article is on that page; this would likely be a manual entry for each work. As a drop-down menu (showing volumes, issues, and then articles), it is much more feasible, and can likely be done automatically. TE(æ)A,ea. (talk) 03:04, 26 February 2021 (UTC).
Thank you for your reply. I think that it's important to be able to jump to the precise page number because there are works with tens of thousands of pages where the citation is given as a specific page number. For example, the Federal Register is routinely over 50,000 pages per year with thousands of entries. A standard reference would be 78 FR 51713. A citation tool should be able to take this and jump to the right place in the Federal Register. Perhaps, there can be a lookup table generated as part of the trasclusion process that this page is found in Consumer Product Safety Commission: Notices: Meetings; Sunshine Act.
The incategory: is a good start, but I know that when I'm teaching undergrads, we use databases that allow us to search specific subsets of works. The wiki category is the best equivalent. However, strong search capacities is essential. I'm thinking of something more along the search box in Popular Science Monthly with advanced options. For example, imagine a student wants to find all mentions of parasol by women authors between 1789 and 1815. They would be able to go to the Category "Women authors," go the search box, type in "parasol," select the advance option and limit it by date. Languageseeker (talk) 04:05, 26 February 2021 (UTC)

  Comment I have added a search box that will search within the publication of The Czechoslovak Review. We can make it more configurable, though at this time it hasn't been necessary. Otherwise Search doesn't work that way you want as the search data is not recorded that way, nor is the work set up that way. Search itself does not understand dates which need to be a special indexed component for any search engine. You can read more about WMF's search at mw:Help:CirrusSearch. These searches can also be setup with the Index: namespace, per Index:Men-at-the-Bar.djvu though we would not typically setup a main ns work that searches in Page: ns for works. Main ns searches should and will only retrieve transcluded work. — billinghurst sDrewth 12:22, 26 February 2021 (UTC)

  Comment 2. Re incategory: searches, that is going to be a little problematic with how we do things, and maybe it is something that we should rethink now that search is more expanded. We typically only categorise the top level of a work, so INCATEGORY search parameter will only search that page, not the subpages of the work which are not categorised where you identified your interest. There are means around that, though each of them has its consequences, and definite amounts of work to achieve. — billinghurst sDrewth 12:35, 26 February 2021 (UTC)
I see. It seems to me more of a limitation with CirrusSearch that anything else. To perform the kind of research searches required in universities would require a new search engine. Would it be appropriate to post a request on Phabricator? Languageseeker (talk) 16:09, 26 February 2021 (UTC)
Phabricator is probably not (yet) the right place for this. What you describe requires a base of structured data, which we store on Wikidata. But we don't have very good tools for managing that data, so what's there is and will continue to be very spotty for the forseeable future. First step, thus, is improving our tooling for adding and maintaining that data. Once we have a reasonable coverage on Wikidata, the next is allowing search and navigation based on it. That's going to require a custom search facility that understands the information model (i.e. it knows that works have authors, were published on a given date, can be collective works, are split into volumes and issues that in turn contain articles, etc.). This kind of engine is not useful outside Wikisource (i.e. Wikipedia has no direct use for it) and is a relatively complex bit of software, so it's not something that the (often volunteer) developers working on MediaWiki can just add. It's a tall order, so it needs to start with coming up with a reasonable specification and description of how this will interact across projects and how the technology will interact with the community. Then it needs wider discussion with those impacted (the various language Wikisource projects, Wikidata, and Commons). And then we can start looking for ways to actually get it implemented (probably starting by trying to find a volunteer willing to do the programming).
Don't get me wrong, I think it's a really good idea and something I wish we had; but the realism of getting there any time soon is questionable. --Xover (talk) 16:36, 26 February 2021 (UTC)
Can we start the conversation now? I think that this is the perfect time because the pandemic has devastated university budgets and they are looking for alternatives to costly databases. Most university libraries spends millions of dollars a year for databases that mostly use texts in the public domain. We can offer them a free alternative. I'm sure that if we reach out to them, some will be willing to provide technical support and grants. What about the various Marc standards as the basis of our dataset? It's the universal standard for library catalogs and will make it super easy to batch import information from university libraries to Wikidata. We could also sell Wikisource to libraries as a way to become ADA compliable. Imagine, a student with a disability requests an electronic version of an article from 1893. Then the university can pay a student to proofread the article on Wikisource and wikisource will generate the electronic version. It's a win-win. I know the the National Library of Scotland and other libraries have participated her. Maybe, we can reach out to them and see what they want. Languageseeker (talk) 18:26, 26 February 2021 (UTC)
The conversation not only could and should start now, but it is actually overdue. I am just trying to manage the expectations of what it is realistic to achieve in any relatively close time frame for a chronically resource-starved all volunteer project. We need much better Wikidata integration for all sorts of things, and what you propose here would just be one exceptionally good showcase for such functionality. The fact is, for what we have coverage for, our metadata is actually generally better than most libraries and archives' databases; we just don't do a good job of making them structured and reusable. Bulk imports of data to Wikidata are happening at an absurd pace already so that's not really a problem; it's connecting the bibliographic related work we do here to Wikidata that's the major gap. --Xover (talk) 19:23, 26 February 2021 (UTC)
Glad you agree. I look through Wikidata and, as far as I could tell, there is no MARC parser or exporter. I think that this would be the first step towards working with any library. It will enable us to import the data from libraries into Wikidata and export it out. The MARC standard is available online [1] and open source implementations exist [2]. This is a huge undertaking and will probably require fundraising, but I think that it's the first critical step towards making Wikisource a true online library. It also probably makes sense to reach out to the Open Library for possible collaboration in software development. I'm a new user here so I haven't earned my stripes, but I would love to work with you on this project. Languageseeker (talk) 20:13, 26 February 2021 (UTC)

Bot to replace long s with {{ls}}

I've noticed that there are numerous texts where there is a long s "ſ". I was wondering if it would be possible to create a bot to replace all long s "ſ" with {{ls}}. This would enable users to toggle the view between"ſ" and s. It would also help to make the texts more compatible with no drawbacks. Perhaps, we could limit the bot to just validated texts in case the {{ls}} makes proofreading more challenging. Languageseeker (talk) 17:36, 22 February 2021 (UTC)

  • Whether or not to use reproduce long s is not a settled matter and is currently treated as a matter of discretion for the initial proofreader of a work. In general, we don't mess with their formatting provided that it's consistent with policy and they've completed the work with that formatting.
  • {{ls}} is certainly not drawback-free. The switching script is a user script that's incompatible with one of our gadgets.
So I don't think this would be appropriate.
Also, since this would be a substantial change, if you want to purse this, it should be at WS:S, not this subpage. BethNaught (talk) 19:11, 22 February 2021 (UTC)
Gosh is that still used? I though it was turned off years ago, I didn't realise it was still advertised at {{ls}}. It might be fixable now I know kind of what I'm doing :-s. Inductiveloadtalk/contribs 19:24, 22 February 2021 (UTC)
I think that it's an important script. Can you fix it please. Languageseeker (talk) 21:45, 22 February 2021 (UTC)
OK, I have fixed it up a bit and it seems to work. Whatever was the problem with the alt-index gadget seems to be resolved. Perhaps this should be a Gadget? Inductiveloadtalk/contribs 09:53, 23 February 2021 (UTC)
I am against the forced use of {{ls}}. We have works where the long-s has some significance and works where its orthographic reproduction is entirely superfluous. As for proofreading, my experience is that any differences from standard modern English are a challenge for proofreaders. This includes ligatures such as æ, diacritics such as in é or ï, and archaic hyphenations as in "to-day". We have no mechanism for turning these off. Long-s is just one of many issues that proofreaders have to train their eyes for. --EncycloPetey (talk) 16:40, 23 February 2021 (UTC)
I am for using long s but paste a ſ when proofreading. I think there should be an annotated copy -sans ſ- produced when the book is transcluded for those who find it difficult to read with ſ. Old style script is actually part of the charm of reading old books so should be preserved, I think. Zoeannl (talk) 21:49, 23 February 2021 (UTC)
Thank you for all the thoughtful feedback. @EncycloPetey I'm not advocating for forcing proofreaders to use the long s, but I think that if a text already has a long s, it might be useful to have a bot that will automatically replace it with {{ls}} in the posted text. Then, readers can turn it on or off. Some enjoy the long s, others find it distracting. Languageseeker (talk) 02:24, 24 February 2021 (UTC)

@BethNaught: I would disagree that this is not a settled matter, I believe that it is a well-settled matter. The discussion was had, and the decision was to use a standard s, or to have {{long s}}, so that users could have their long s in page: and we had a modern characterisation in main. The documentation is pretty certain about the approach. For many years we used to fix this up through patrolling and stopping the use of long s in works, though it seems that today's patrollers have not been as stringent.

With regards to bots, where I see (stumble upon) works with long s in use, I do in fact replace them using my bot. I don't go hunting them specifically. — billinghurst sDrewth 03:41, 24 February 2021 (UTC)

@Billinghurst: Believe it or not, I did search the archives before making my assertion, and I could find no such definitive discussion about never showing long s in mainspace. Could you actually point us to it, rather than merely asserting it exists? Indeed, while I found several discussion in the past decade, none were definitive. For example: this one, where a variety of opinions were expressed, including that long s should be displayed in the primary version; this one, similarly; this one where you yourself describe long s as a preference.
Additionally, where is the "pretty certain" documentation? The style guide does not mention long s; Wikisource:Style guide/Orthography does talk about long s, and while it discourages literal long s, there is no language forbidding it nor mandating {{long s}}.
It seems to me that on a project where a lot of norms are uncodified, and latitude is given for contributors' editorial discretion, to an extent practice is policy. And given that we have a featured text with literal long s, you can't rely on historical patrolling practice to counter modern patrolling practice.
I disagree that it is appropriate for you to mass-replace long s with a bot, at least in a completed work where it is consistently used. BethNaught (talk) 10:46, 24 February 2021 (UTC)
@BethNaught: Orthography says don't use long s. It is not wanted in main namespace. Template:Long s was designed to allow for those who wanted long s in the page namespace to represent the work there, yet displaying properly in main ns. So _orthography_ is meant to allow users to use long s template, or use the standard letter s and not have to reproduce long s in the text as printed.

No, the guidance doesn't ban it, and as I said elsewhere in the past couple of days, there are works where it is specifically added to be long s as a more modern work posing as an older work, for example Manners and customs of ye Englyshe; so banning it would defeat those needing to display as required. Similarly there is some German use, and also old text reproduced in works where we display it as reproduced. The conversations about the use of the ligatures and long s was also contained in search discussions as long s texts do not reproduce plain English searches, works are lost. — billinghurst sDrewth 11:26, 24 February 2021 (UTC)

So you can't point to an actual discussion deciding in favour of your position? I'm looking for the authority of the community, not the dictum of an elder statesman. BethNaught (talk) 12:15, 24 February 2021 (UTC)
  Comment By the way, the internal CirrusSearch search engine and Google, Bing and Yandex treat "ſenſe" the same as "sense" (and vice versa). On the other hand, Yahoo and DuckDuckGo do not "normalise" long-s. So ſ at least doesn't totally torpedo searchability any more. Inductiveloadtalk/contribs 09:22, 25 February 2021 (UTC)
I have also noticed that search engines can cope with long s well and so I tend to agree with not preventing its usage in our main namespace. It is historical typography and it looks good in historical documents. So I would not forbid users to enter a typographical version according to their preference. If somebody wanted, they could also be allowed to enter both typographical versions (e.g. one of them as annotated) and we may think of some ways of enabling users to switch between the two versions. --Jan Kameníček (talk) 10:20, 25 February 2021 (UTC)
@Jan.Kamenicek: +1. There actually is such a system, unofficially, at least: pages using {{ls}} can be toggled using this script:
mw.loader.load("//en.wikisource.org/w/index.php?title=User:Inductiveload/Visibility.js&action=raw&ctype=text/javascript");
It also allows you to toggle external links' blue color. Inductiveloadtalk/contribs 12:42, 25 February 2021 (UTC)
@Inductiveload: Yes, I know, but that is not what I meant. I can do it after I was explained, but my parents would not do it no matter how well you would explain it to them. Similar tools make real sense only if they are accessible to ordinary users. What is more, similar scripts can be utilized only by logged in users, while vast majority of our readers do not log in. --Jan Kameníček (talk) 13:03, 25 February 2021 (UTC)
Sure, but with approval, this can be made a default gadget. Inductiveloadtalk/contribs 13:09, 25 February 2021 (UTC)
I think it would be much more user friendly to have a button to toggle long s on and off on the page than to have to make custom css. Perhaps, we could place a slider button on the top right corner of the page. Languageseeker (talk) 13:20, 25 February 2021 (UTC)
Yes, something like that would be great. --Jan Kameníček (talk) 14:49, 25 February 2021 (UTC)
"it looks good in historical documents" — no it doesn't. It looks awful and causes difficulties in reading fluently—principally because the common fonts in use don't distinguish it effectively from "f". The glyph had its hey-day in the Tudor and Stuart periods and fell into disfavour during the end of the Stuart period and was mainly used through the 18th and 19th centuries by publishers who wanted to pretend antiquity. My approach to this is to restrict use of the to works printed in the Tudor and Stuart periods up to and including the First Commonwealth. I use a plain "s" for works printed after the Restoration. Beeswaxcandle (talk) 03:59, 26 February 2021 (UTC)
I'm not a huge fan of the long-s except in facsimile reprints. But I will say that as w:long s details, use of the long-s in English plummeted between 1790 and 1810; its use in the 18th century was pretty standard.--Prosfilaes (talk) 03:26, 27 February 2021 (UTC)

UniversalLanguageSelector font list?

ꜵ̈
ꜵ̈̄

Is there anywhere that I can view a list of the fonts included in the UniversalLanguageSelector extension? Perhaps I am just being a dunce, but I can't find any obvious place that lists them.

The reason I ask is – the dictionary I'm transcribing is from a time before the IPA was standardised, and it uses a few bizarre characters that don't seem to be rendered very well with the default fonts that are used (at least on my computer). The ones that are causing the most issue are the two culprits in the infobox. The diaeresis and the macron are supposed to be centred above the ꜵ. Would anyone happen to know if any of the fonts in the UniversalLanguageSelector would render this character properly, and if not, if a suitable font could be added?— 🐗 Griceylipper (✉️) 22:22, 26 February 2021 (UTC)

@Griceylipper: All fonts included with ULS. But your two ao-ligatures seem to render just fine in my browser (Safari on macOS) so I'm not really sure any ULS shenanigans are necessary. --Xover (talk) 07:49, 27 February 2021 (UTC)
I see the error on firefox and IE on Windows. Languageseeker (talk) 15:14, 27 February 2021 (UTC)
 
Thanks for the list @Xover:. This is what I see on Windows 10 with Chrome. Charis SIL seems to render the ao ligature properly, but there seem to be characters missing in it as well such as ꬶ (renders fine for me, but another font is being substituted for this character.) Though I can live with that.
If Charis SIL is the best compromise out of all the fonts included in ULS, would it be appropriate to blanket apply this font to the whole work? Or is that not advisable?— 🐗 Griceylipper (✉️) 19:25, 27 February 2021 (UTC)

Undo a Move

I moved Index:Milton - Milton's Paradise Lost, tra il 1882 e il 1891.djvu to a different title and it seems to have broken everything. Is their anyway to undo the move? Languageseeker (talk) 06:48, 27 February 2021 (UTC)

@Languageseeker: Done. For the future, perhaps temper enthusiasm with a pinch of caution until you gain more experience with the software and our community? :)
Most things can be fairly easily fixed so mistakes are not a big deal, but we do have some relatively unique software and community features that can sometimes make assumptions based on experience elsewhere a little iffy. In this case the issue was that the way the software connects the scanned file at Commons with the Index: and Page: pages here is through the page name. If a file is named "File:The Book.djvu" then the Index: must be at "Index:The Book.djvu" in order to work. We also cannot easily rename the File / Index / Page: pages for various technical reasons and so we tend to treat those as mostly opaque (rather than human readable) strings. We prefer nice logical and accurate file names if they can be had at the time of upload (or shortly after, before dependent pages are created), but we live quite happily with all but the most misleading filenames in most instances. --Xover (talk) 07:43, 27 February 2021 (UTC)
@Xover: Thank you! I'll be more cautious in the future. I didn't want to rename the book in Commons because I didn't upload it, but it seems that that would have been the place to do it. Languageseeker (talk) 15:01, 27 February 2021 (UTC)

Table Trouble during proofreading

How to achieve that kind of custom table style as used in paragraph (c) of Page:Administration of Justice (Protection) Act 2016.pdf/35? I'm still confused with Template:Table style. Many thanks for solutions.廣九直通車 (talk) 07:47, 28 February 2021 (UTC)

Xover has beaten me to it. A slightly simpler way of doing the same thing is:

"175 If the document or electronic record is required to be produced in or delivered to a court of justice Ditto Ditto Ditto Imprisonment for 6 months, or fine*, or both Ditto";

Beeswaxcandle (talk) 08:54, 28 February 2021 (UTC)

@廣九直通車: (e/c) I had a go at it, see if it's what you had in mind.
Tables in HTML and CSS are complicated, table syntax in MediaWiki is complicated, and {{ts}} is complicated. And you need to wrap your head around all three to be able to effectively work with tables here. Add the sometimes really quirky table formatting in the works we reproduce and I would be very much surprised if most people didn't struggle with this at least some of the time (I know I do).
For this particular table the key was to turn off borders for the table overall, and then to apply a single border to each of the cells (except the last one). {{ts}} is just a shortcut for inserting style="border-top: none; text-align: left;" and similar. Each of the obscure little keywords documented for the template equates to one CSS keyword with a specific value (text-align: left, for example, is al).
And since {{ts}} inserts a style attribute, you need to use it where such attributes are valid; most often on the main HTML <table> element (created by {|), a <tr> (table row, created with |-), or a <td> (table cell, created with various combinations of | and ||). Inside a table row each || separates the cells, but you can also put attributes directly on those cells with || attributes | cell content. Attributes can be a {{ts}} (it outputs a style="…", recall) or colspan="…" or rowspan="…".
In any case, the point is that you need to keep four different models of the concept of a "table", with attendant syntax and details, in your head at once so it's really more surprising anyone ever figures it out. --Xover (talk) 09:08, 28 February 2021 (UTC)

Joining lines

Is there a way to easily join lines together? I'm working on a few texts with hanging indents and if I don't join the lines together, then the indent breaks. Languageseeker (talk) 11:42, 28 February 2021 (UTC)

@Languageseeker: See Wikisource:Tools and scripts#PageCleanUp --Jan Kameníček (talk) 13:07, 28 February 2021 (UTC)
@Jan.Kamenicek: Awesome, thank you so much. It's even better that I hoped for. Languageseeker (talk) 14:37, 1 March 2021 (UTC)

Corruption in main namespace page?

There is some corruption on the top of this main namespace page and I have no clue what caused it. Can someone please look at it. Thanks.— Ineuw (talk) 07:30, 14 February 2021 (UTC)

@Ineuw: I excluded the empty page and now it looks OK. --Jan Kameníček (talk) 08:09, 14 February 2021 (UTC)
@Jan.Kamenicek: Thanks, but the page error is still showing this morning, even though I deleted and recreated the page. This is what it looks like on the recreated page. I copied the contents to this Sandbox where it shows OK. The error disappears when I am logged out. So, I deleted the browser cookies of all wikis, logged back in and the problem re-appeared.— Ineuw (talk) 15:21, 14 February 2021 (UTC)
I am not experiencing this problem in my browsers neither logged in nor logged out, trying Chrome, Firefox and Edge, so it must be something related to your settings… Could Xover give some advice? --Jan Kameníček (talk) 17:29, 14 February 2021 (UTC)
Since the last post, it has also disappeared while I was logged in. I mostly use Firefox, and then Vivaldi as a Chromium substitute, but looked there too late. Thanks. — Ineuw (talk) 18:20, 14 February 2021 (UTC)
(e/c) Not happening for me either. However, there was something screwy with the pagelist command, which I've corrected in accordance with the policy on these. Beeswaxcandle (talk) 18:22, 14 February 2021 (UTC)
@Beeswaxcandle: Is there a policy that limits pagelist layout to numbers but not alpha letters? What about roman letters? I have done many index pages with indicating the chapters, sections, images, etc. I need it! Especially, when another policy bars me from creating wiki links to the page namespace in a book's table of contents.
In my opinion, if something is technically feasible it should be allowed. Otherwise remove the feasibility. Community policies barring the use of features is the same of telling developers not to use one programming language vs. another. Let's see how that works for Wikimedia. If I am not allowed to organize and identify the data my way to ease my work, then I don't see how I can contribute at the level I have been contributing.— Ineuw (talk) 19:07, 14 February 2021 (UTC)
@Ineuw, @Beeswaxcandle:, there is nothing technically wrong with that syntax AFAICT, it's a string like any other. It would be good to be able to have an extra "label" that doesn't affect the numbering for the purpose of splitting up the page list and making it easier to navigate during proofreading: so I created phab:T274740. Inductiveloadtalk/contribs 20:18, 14 February 2021 (UTC)
Any page that is part of a numbered flow needs to be labelled as that number. Whatever is put into the pagelist command is an anchor when the page is transcluded into the mainspace. If a page is labelled as something other than its number, then it cannot be linked to in a standardised way from other works. Pages that are interpolated (images, plates, and the like) can be labelled with what they are. In terms of roman numerals as opposed to arabic, yes, these can be done. It is because the pagelist command accepts strings, that the policy at Help:Index pages#Pages was developed to ensure that we have a standardised approach to implementing the pagelist command. Beeswaxcandle (talk) 08:40, 16 February 2021 (UTC)
@Beeswaxcandle: Yes, I understand that, which is why I'm suggesting a an extra "label" that doesn't affect the numbering. Something like this:
 
Such a thing is a pretty trivial gadget, but would be easier if it could be part of the tag markup. Inductiveloadtalk/contribs 09:10, 16 February 2021 (UTC)
Such labels would also assist with the situation of multiple works in a single pagelist? Currently some of these end up as seperate Pagelists, which confuses some Gadgets. 88.97.96.89 09:21, 16 February 2021 (UTC)
@88.97.96.89: Yes. My personal use case is delineating issues in a periodical or other collective work, which are otherwise really hard to work out. E.g. find the start of the December issue start in Index:Notes and Queries - Series 5 - Volume 10.djvu, or things like Index:Parliamentary Papers - 1857 Sess. 2 - Volume 43.pdf. Inductiveloadtalk/contribs 09:28, 16 February 2021 (UTC)

  Comment It has long been possible and acceptable to have multiple <pagelist> where it adds value to the Index, and it doesn't break transclusions. I did it at index:Men-at-the-Bar.djvu years ago, as it is a work that has been dipped into and out of, and has not ToC or easy to work out where in the work. It wouldn't be normal to do it where there is a ToC as it become superfluous. — billinghurst sDrewth 11:50, 16 February 2021 (UTC)

@Billinghurst: Thanks for the example of multiple pagelists. But, please keep in mind that what is superfluous because of your knowledge and experience, may not apply to others.— Ineuw (talk) 18:41, 17 February 2021 (UTC)
Hey what.? If a transcluded ToC on an index page says chapter 1 starts at 53, then it starts at 53. Not typically a hard concept for anyone transcribing or transcluding. I didn't say don't do it, I just said generally superfluous if there is a ToC. — billinghurst sDrewth 06:27, 18 February 2021 (UTC)
I suspect we are talking about two different things. I find your solution of using multiple pagelists in a single index page to be an excellent solution for my problem and I am working on it now.— Ineuw (talk) 07:47, 18 February 2021 (UTC)

  Comment/plug: The User:Inductiveload/index preview script can help here since you can see a page preview without leaving the index page. Actually, so can User:Inductiveload/popups reloaded.js, but that's still WIP. Inductiveloadtalk/contribs 08:01, 18 February 2021 (UTC)

Corruption in main namespace page? Redux

I have a simple question. All Index files' pagelists have characters in them. In fact, all Index pages I edited have text in the pagelist, and I never heard about this being an issue. The problem only cropped up in one Index page. I am continuing to indicate pages as before, and no problems. Could it be that the page is damaged? Otherwise, what changed? — Ineuw (talk) 01:02, 3 March 2021 (UTC)

the book Golden Treasury of English Songs and Lyrics

Hi I have this book, it is missing the first 22 pages including the introduction, also pages 455 to 466 are missing I found them someone put them back in the book but in the wrong place. As far as I can tell all the other pages are intact. It was first printed in 1861, I think the one I have was in 1903?? Just FYI, On updating the file. Jere L Wilson jerewilson66@outlook.com

Hello. Do you mean Index:Golden Treasury of English Songs and Lyrics.djvu? The file looks OK. --Jan Kameníček (talk) 21:45, 6 March 2021 (UTC)

Texas Independence Referendum act

I would like to add the Texas Independence Referendum Act which can be found here to Wikisource. How do I start? Do I upload it to commons? How do I create an index page. Also, the lines are numbered, should I keep the numbering or not? Thanks in advance, -Gifnk dlm 2020 (talk) 14:41, 10 March 2021 (UTC)

@Gifnk dlm 2020: I left our welcome template which contains all the relevant help messages. In short upload the PDF to Commons, and best to utilise their c:template:book. Then you can simply create the Index: page here the filename and indexname need to have the same page name, and if {{book}} is used properly the data will populate over. We can help once the file is uploaded to Commons. — billinghurst sDrewth 09:54, 11 March 2021 (UTC)
@Billinghurst:, thank you very much? How should I call the pdf file? Texas Independence Referendum act? or is there a convention in naming that sort of documents. Thanks in advance, -Gifnk dlm 2020 (talk) 19:29, 12 March 2021 (UTC)
@Gifnk dlm 2020: Typically follow c:Commons:naming conventions, though the one proviso that we would emphasise is that we are typically working with editions of works, so there can be multiple versions of some works, so additional identifying information can be of value, eg. year, place, etc. — billinghurst sDrewth 03:46, 13 March 2021 (UTC)

Increasing the resolution of a PDF in edit mode

I've been transcribing Frank Leslie's Illustrated Newspaper (starting with Volume 18), and because it has large pages and small type, I've found that the resolution of the preview images in edit mode is not sufficient to read the words even zoomed in (not a problem with the PDF). I suspect this is also the cause of the issues with the OCR, as neither the automatically-generated OCR, the OCR gadget, nor the Google OCR gadget does a remotely reasonable job of picking up the text on the page (and they get wildly different unreasonable results). I've tried putting very large numbers in the "Scan resolution in edit mode" field of the index, but this hasn't had any noticeable impact. I had the same problem with converting pages of the PDF to images for image extraction (Preview on Mac FWIW), and for those had to resort to zooming in on the PDF and taking a screenshot. — CalendulaAsteraceae (talk) 07:56, 11 March 2021 (UTC)

@CalendulaAsteraceae: Looking at the file (714 × 1,045) and File:Frank Leslie's Illustrated Newspaper Vol. 11–12.pdf (606 × 904) the problem is that the source is not high quality and you cannot increase it past the file quality. For images, I would be heading over to https://archive.org/download/franklesliesilluv1718lesl and peering into the "View Contents" of the zip, and you should be able to get good access to high quality jp2 images or jpg if you prefer. It may be possible to rederive the file at a higher quality as those images are (1,193px x 1,747px) though that is not an insignificant task. — billinghurst sDrewth 09:51, 11 March 2021 (UTC)
Thank you! —CalendulaAsteraceae (talk) 03:01, 12 March 2021 (UTC)
@CalendulaAsteraceae: There is also a user script that pulls the images from Internet Archive or Google Books directly, when Wikisource is in the edit mode on a Page: (assuming the PDf/DJVU has certain detectable links set up). The script is User:Inductiveload/jump_to_file.js. However the script is not fool-proof and in this instance a manual offset would need to be applied, something which @Inductiveload: was working on adding to the script concerned. ShakespeareFan00 (talk) 10:45, 11 March 2021 (UTC)
@ShakespeareFan00, @CalendulaAsteraceae: it should now allow you to set an offset (on a per-index basis), as well as turn high-res loading on and off.
For Index:Frank Leslie's Illustrated Newspaper Vol. 18.pdf, the offset appears to be 192.
It is be installed like this:
mw.loader.load('//en.wikisource.org/w/index.php?title=User:Inductiveload/jump to file/load.js&action=raw&ctype=text/javascript');
More docs at User:Inductiveload/jump_to_file. Have fun! Inductiveloadtalk/contribs 17:13, 11 March 2021 (UTC)
Thank you, that works really well! —CalendulaAsteraceae (talk) 03:01, 12 March 2021 (UTC)

A minor gadget/edit toolbar distraction

On activation of both OCR Gadgets, with the first edit page refresh, the Tesseract OCR icon is separated from the Gogle OCR icon by the "Insert template" icon. Afterwards, the two OCR icons are side by side, and it's very distracting. Would it be a major correction to keep the two separated by anything? — Ineuw (talk) 23:28, 13 March 2021 (UTC)

Asking for suggestions for a minor javascript correction with an image this time

Can someone suggest how I can correct this issue for myself? I copied User:Ineuw/Gadget-ocr.js to my namespace, hoping to modifying the script because, on the first edit, the [Template button] is positioned between the two OCR buttons (which is very helpful) and afterwards it's positioned to the left as in this image. (Having both side by side is distracting). Half my thanks in advance, many thanks for a successful solution.— Ineuw (talk) 19:29, 16 March 2021 (UTC)

Index from Image Files and Haithi Trust

@Xover, @Inductiveload, @ShakespeareFan00: Today, I experimented with creating an Index from a set of images for three reasons

  1. There is the possibility of adding JP2 support that will enable the usage of image files from IA and other sources. However, the Wikifoundation has made it clear that it will not support images in zip files. Therefore, Wikisource will need to create Index files from individual images to leverage the advantages of JP2 files.
  2. Books can be downloaded from Hathi Trust as individual images. If uploaded to the IA, then IA converts them from the original files to JP2 and then to PDF incurring serious quality loses. One file went from 2GB in JPEG files to a 100mb pdf.
  3. Understanding the problems of Index files created from images can help prepare this site for the future.

For the purpose of review, I created Index:A dictionarie of the French and English tongues based on 2gb of images of a book from 1611 and Index:Shen of the Sea based on 137 mb of a book from 1925.

Here are my major finding

Hathi Trust

  1. Images for a book are stored in a mixture of JPEG and PNG files. The images stored on Hathi Trust have no extension and the extension must be added from the minetype in the file. Therefore, most book downloaded from Haithi Trust will contain a mixture of image formats.
  2. Hathi Trust does not return an error when it goes beyond the last page of a book. It just keeps on sending the back cover.
  3. You need a program like trid to add an extension based on the minetype.
  4. Images can be downloaded sequentially from Hathi Trust.
  5. Hathi Trust throttles downloads and either automatic retrying or a 4 second delay between requests is needed.
  6. Images downloaded from Haithi Trust must be renamed into a sequential order during download.
  7. Pattypan makes it easy to upload the image files to common and generates a sequence of images that can be used to generate the list of Pages for the Index file.

Advantages of an Index File generated from Images

  1. Much faster loading time.
  2. Easier to add in missing pages.
  3. Images can be cropped directly.
  4. Text is easier to read due to higher quality images.
  5. Easier to add missing pages or replace damaged pages.
  6. No need to rely on the IA upload tool or try to figure out why your file won't upload to Commons because individual 400kb to 2.5mb are no problem for common and Pattypan makes batch uploads trivial.

Disadvantages of Index generated from Images

  1. No OCR layer
  2. Merge and Split doesn't work
  3. index preview.js does not work
  4. Preview Pagelist does not work
  5. Uploading 1,000 images to Common takes a long time.

Suggested Changes

  1. Add a third parameter page_sequence for the Pages category on Index ns
    1. Currently, when adding pages from images the code is [[Image|Page Name]]. This creates Page:Image. This is out of sync with the way Pages are created from DJVU or PDF files; namely, Page:Index_name/Sequence.
      1. Example, the same book Index:Shen of the Sea creates Page:Mdp.39015056023214 001.jpeg, Page:Mdp.39015056023214_028.png, Page:Mdp.39015056023214 280.jpg.
      2. They would make more sense as Page:Shen of the Sea/1 Page:Shen of the Sea/28 Page:Shen of the Sea/280
    2. The current approach to creating Page from separate image file will always break Merge and Split because of the possibility of mixed format images.
    3. The new approach should be [[Image|Sequence|Page Name]] creating Page:Index_name/Sequence.
    4. For Index created from Images with two parameter index ([[Image|Page Name]]), a bot should exist to automatically add a numerical sequence starting with 1. This should also be done on the creation of an Index.
  2. Fix index preview.js and Preview Pagelist to handle Index ns with images from individual files.
    1. Having a sequence number for images can help.
  3. Automatically run OCR for all the Images and created a text layer.
  4. The Scans section on Index ns does not make sense because of the possibility of multiple minetypes. Instead of asking the User to manually enter the file type, the Index ns should automatically list all minetypes present.

Here is a page created with text generated by Google OCR and an Image generated by the Crop tool.

I've probably forgotten a few things, so please ask questions. Languageseeker (talk) 03:03, 21 March 2021 (UTC)

@Languageseeker: While I appreciate your enthusiasm and zeal here, you're misunderstanding the problems and prescribing inapt solutions to the wrong problems. The above is a reasonably accurate summary of the status quo (which we're painfully aware of), and "being able to use images from Commons for an Index:" is roughly a description of the desired goal (which we have previously articulated). But getting from here to there is going to take sustained effort from the community in specifying the solution, followed by advocacy and recruitment to find the developers able to do the work, and then a significant amount of developer resources over a fairly long stretch of time (including ongoing maintenance). And due to the existing platform infrastructure, into which any solution for our needs is going to have to fit, this will not be green field development: it will involve not just "a developer" hacking together some new functionality in Proofread Page, but multiple developers from multiple teams at the WMF working on multiple components of the technology stack. It is also very probable that the features we need do not all actually exist in the stack and will need to be developed more or less from scratch (and without breaking anything in the process). And because these do not yet exist out of the box we are actually going to need some developer assistance in just coming up with a specification that is even remotely implementable. Meanwhile, we can't even get minimal ongoing maintenance of our core software components (except by the kindness of volunteers in their very limited spare time) or bug fixes for which there is a patch provided applied. So, to put it succinctly, this problem only looks simple if you ignore all the hard parts.
I wouldn't for all the world want to put a damper on your enthusiasm, but right now you're flailing about all over the place without the background to direct the energy constructively (hint: it's not in getting Commons to ban duplicate scans in PDF and DjVu because you think the issue has any similarities with VHS vs. Betamax). Slow down. Learn. Discuss. And then figure out how to direct your energies where they can do the most good. There is no simple short term solution that none of us have been able to come up with: there are only hard long-term solutions that will take all of us pulling together in a sustained effort. --Xover (talk) 09:04, 21 March 2021 (UTC)
@Languageseeker: Re Hathi Trust: FYI you can get the number of pages in the book from the Data API (along with the image data itself). You need a free UoM Friend account to get an API key. You can also find it in the HTML for a book (<span data-slot="total-seq">254</span>, but the Data API is tidier.
The file extension is easy to work out from the returned mime-type of the image data. Generally PNG is bitonal and JPG is coloured. If you do make a DJVU from the images, you should use this information as it cuts the filesize by an order of magnitude. Inductiveloadtalk/contribs 13:51, 21 March 2021 (UTC)
@Xover: Thank you both for your thoughtful replies. I agree that I probably do need to learn more and that many of these changes are far more complicated. Do you think it would be possible to change the Pages generated from an image based index to Page:Index_name/Sequence instead of Page:Image Name or would that also a deep restructuring of the platform infrastructure?
@Inductiveload: Thanks for the advice. I actually discovered that triad [3] can do this as well. For me, the question is what Source do I put down on an Index ns if I use the JPG/PNG file from Haiti Trust. Languageseeker (talk) 14:07, 21 March 2021 (UTC)
@Languageseeker: Without knowing the code intimately, so caveat etc.… I think that would be a relatively contained change only in the Proofread Page extension. I don't want to speculate about the complexity / how much work that change would be without knowing the code, but it doesn't obviously require any major surgeries anywhere. But a better question is why do it? What does it gain us? The individual page names could essentially be random strings for all it matters: it's the Index that ties it together and the software knowing what the sequence of pages is. So long as we can use the next/previous buttons on each page, and transclude sequences of pages (from/to) using <pages … /> what does the page naming matter?
The Match & Split tool doesn't support non-multipage formats, but if that's what you want to use why not pursue making it support that? It's not really the mixture of image formats that's the problem there, but rather that it assumes it's a single multi-page format file. But the source code is available and I have access to the relevant server if need be. Of the two JS tools one is developed by a long-time enWS contributor, and the other by a more recent contributor as a student Google Summer of Code project, and both of them are active and responsive to queries. Both tools should technically be able to work with an image-based Index, albeit possibly with code that is too hacky to want to implement in production (I don't think there's a clean API in place for the necessary information yet). If you're having trouble using one of those tools for a specific project that's the level at which you'll want to pursue it. --Xover (talk) 15:34, 21 March 2021 (UTC)
@Xover: The major reason would be to harmonize the way we make Pages for PDF and DJVU indexes with the way we do so for images. For PDF and DJVU, the system uses Page:Index_name/Sequence and for Images Page:Image Name. This means that every piece of code has to take into account these two systems. Also, for tools, such as merge and split, you would need to query the list of images and then match to individual images instead of a sequential range of pages. Languageseeker (talk) 16:41, 21 March 2021 (UTC)

Appendix 2 of Katherine Mayo's book, "General Washington's Dilemma"

Help requested to upload to Wikisource please

I wish to upload Appendix 2 of Katherine Mayo’s book, “General Washington’s Dilemma” to Wikisource, since it is missing from the New York (1938) edition and only available in the London versions. I wish to upload it here: https://en.wikisource.org/wiki/Help:Beginner%27s_guide_to_adding_texts but would far rather upload the text, with references and Wikilinks, which has been prepared by me here (full explanations are there too): https://en.wikipedia.org/wiki/User:Arbil44/New_sandbox4 I think this would be far more useful. The text commences: “The following is a faithful copy...”. Copyright approval has been obtained here: https://en.wikipedia.org/wiki/Wikipedia_talk:Copyright_problems#General_Washington%27s_Dilemma_by_Katherine_Mayo%2C_published_in_1938 see (Nthep (talk) 15:07, 17 March 2021 (UTC)). I am essentially looking for some kind editor who would do this for me since I’m too old to have good IT skills and this is likely to be well beyond me. Help would me massively appreciated. Once done, there are three main pages where this will need to be linked, the main one being https://en.wikipedia.org/wiki/Sir_Charles_Asgill,_2nd_Baronet but I should be able to copy the outcome to the other pages. Many thanks in advance to the person willing to help me out here. If this really can only be done with either jpeg or pdf documents, then I will have to ask my daughter to do this since she has a copy of the correct edition of the book. Arbil44 (talk) 18:24, 18 March 2021 (UTC)

Thank you for your willingness to contribute to Wikisource. Do you have a full copy of the book? I know that you say that the only the Appendix is different, but there can be subtle differences between editions. Also, we prefer to have our works scan backed. Is there any place where we can find a scan of the book? Languageseeker (talk) 19:26, 18 March 2021 (UTC)
Thank you Languageseeker. I have the book (well my daughter does now) and it is the London: Jonathan Cape, 1938 edition, pp.263-268. The New York Harcourt, Brace edition does not have an Appendix 2, and that is the reason I would like it uploaded to Wikisource here. I have six scanned pages of Appendix 2, in jpeg. format. Would that be acceptable? If you were able to send me an internal email, I could then send these scanned images to you? That said, I wish you could use my Sandbox 4 edition! It is a faithful copy and I put a great deal of work into it, including lots of Wikilinks, and of course the references needed (2 of them) as well. However, if that must go to waste, the important thing is to get it up on Wikisource please. Arbil44 (talk) 01:32, 19 March 2021 (UTC)
Just adding some information here from my sandbox 4.
The following is a faithful copy of Appendix 2 of General Washington's Dilemma by Katherine Mayo. This appendix appears in the London: Jonathan Cape, 1938 edition, pp.263-268, and in the New York/London: Kennikat Press 1970 edition, pp 263-268, but not in the New York, Harcourt, Brace & Co., 1938 edition, which is also online here:[a 1]
All references to The Hon. R. Fulke Greville, of the First Foot Guards, are now known to refer to Lieutenant and Captain The Hon. Henry Greville of the 2nd Foot Guards (now known as the Coldstream Guards[a 2] Arbil44 (talk) 01:39, 19 March 2021 (UTC)
  1. Mayo, Katherine (1938). General Washington's Dilemma. New York: Harcourt, Brace and Company. 
  2. "The Thirteen Officers and Their Regiments". The Journal of Lancaster County's Historical Society 120 (3): 100. 2019. OCLC 2297909. 
@Arbil44: None of your proofreading would not go to waste if we scan-backed this work, it would be placed next to the scan images. Otherwise no-one else can validate this book, as it appears to be not otherwise available online. For more information, Help:Beginner's guide to proofreading provides a quick intro to how this normally works.
But you can email me the complete scan images of the book (ideally including covers and blank pages—a scan of only part of the book is not very ideal at all) at my username at gmail.com and I'll make them into a file that we can use to scan back your edition of this book and set it up at Wikisource. A Google Drive/Dropbox link or similar is fine too if the images are too big to email.
Thanks for contributing to Wikisource and I look forward to helping you realise your goal. Inductiveloadtalk/contribs 22:14, 20 March 2021 (UTC)
Sorry, I'm a bit of a tech idiot and I don't really understand what you have said, but I have uploaded the Appendix 2 of the Mayo book (six pages) and you can find them here: [4]. However, the entire book is available online here: [5] with the exception of the Appendix 2. I would simply say that my daughter now has my copy of the book and I couldn't possibly ask her to scan the entire book, when it is already available at HathiTrust. If that has to happen I'm afraid I will probably have to abandon this quest! That will be a great pity as two of the most important letters written regarding The Asgill Affair are the only element comprising Appendix 2. Thanks for your email address, but now I have uploaded the 6 pages to Wikimedia I imagine you will be able to find them there? I don't know who does the proofreading, but I have typed up the entire Appendix 2 here: https://en.wikipedia.org/wiki/User:Arbil44/New_sandbox4 Please remember that my notes regarding the editions which do and don't have an appendix are important, as is also my notes regarding the different spellings of Asgill's name and the totally incorrect particulars of Greville's name. [User:Arbil44|Arbil44]] (talk) 00:44, 21 March 2021 (UTC)
Book added at Index:General_Washington’s_Dilemma_(1938). I'll try to merge and split asap. Languageseeker (talk) 01:34, 21 March 2021 (UTC)
Thank you. I think the page numbers are labelled incorrectly. I had some trouble with this during the upload process. My apologies for that. Arbil44 (talk) 08:31, 21 March 2021 (UTC)
Please note: This appendix appears in the London: Jonathan Cape, 1938 edition, pp.263-268, and in the New York/London: Kennikat Press 1970 edition, pp 263-268, but NOT in the New York, Harcourt, Brace & Co., 1938 edition, but all other pages of that edition appear in the online edition.Arbil44 (talk) 09:28, 21 March 2021 (UTC)
All the various issues (where the appendix is and is not - who Asgylle and Asgyle really is - and the bad transcription by Earl Spencer with all details regarding Greville) are all covered in my Sandbox 4. Arbil44 (talk) 09:38, 21 March 2021 (UTC)

Languageseeker please could you correct the publisher's name, because Harcourt Brace is wrong. That edition is the one which does not have an Appendix 2. Arbil44 (talk) 09:09, 23 March 2021 (UTC)

small caps names within italic text

Esme Shepherd (talk) 16:34, 20 March 2021 (UTC) I have been formatting many dramatic instructions that are in italics except the character names, which are in small caps. The usual form is therefore 'text in italics' Name 'text in italics', there being spaces between the text blocks and the name. Mostly, this works fine but sometimes the result is 'text in italics'Name'text in italics' without spaces. I don't know why this is so, and all I can do is format it as 'text in italics ' Name ' text in italics' to provide the spaces. Is there any rationale that differentiates these cases or is it just random?


@Esme Shepherd: Can you give a link to a page where it happens? --Jan Kameníček (talk) 17:46, 20 March 2021 (UTC)

Esme Shepherd Esme Shepherd (talk) 10:17, 21 March 2021 (UTC) It isn't easy to spot them retrospectively, so I will post you the next one I find.
https://en.wikisource.org/wiki/Page:Dramas_1.pdf/284 The Exeunt at the bottom needs separating from the character name.

Pardon if I have misunderstood, but there is an optical effect created by the slope of italic text. The example given 'looks' fine to me, there is a space. CYGNIS INSIGNIS 12:53, 22 March 2021 (UTC)

Esme Shepherd (talk)Yes, I have put a space where one is not usually required. It may have something to do with the italics but compare the following page 'Re-enter Leonora' (no space here). https://en.wikisource.org/wiki/Page:Dramas_1.pdf/285 There is also a longer passage on https://en.wikisource.org/wiki/Page:Dramas_1.pdf/291 without spaces, where a character name is preceded by an italic t. Also sometimes, the word following the character name needs a space before it. I haven't located an example of this yet.

Esme Shepherd (talk) 19:33, 22 March 2021 (UTC)Okay, I think we are working towards eliminating the problem. It was just a puzzle and annoyance. I still don't understand why, but at least I can overcome it! Thank you.

@Esme Shepherd: There is no need to add any extra spaces, I have removed the extra space that you added to Page:Dramas 1.pdf/284 and the result is as expected. Such extra spaces should definitely not be inserted. If you do not see the space, it can be caused by the effect described above by Cygnis insignis. This effect can be stronger in some browsers than in others, but it is not the reason to add any extra space which does not belong there. --Jan Kameníček (talk) 21:19, 22 March 2021 (UTC)

Esme Shepherd (talk) 19:31, 23 March 2021 (UTC)All spaces on proofread pages have now been removed and the rest will soon follow. The closing up still appears sometimes on transcluded pages and doesn't look good, but the spaces are confirmed by 'copy and paste', so I'm happy with that.

Importing from PGDP

The following discussion is closed:

We stepped into this one with perhaps more enthusiasm than sense, so best let this lie for now.

I was wondering if there is a way to import a project from PGDP to Wikisource. The works on PGDP have images and corrected text with formatting. I know that the formatting will need to be wikified, is there a tool to do this? Or is this a request best posted on Phabricator? Languageseeker (talk) 16:07, 26 February 2021 (UTC)

@Languageseeker: Please don't. Project Gutenberg texts are not generally of any particular edition of a work (amalgamations of multiple editions in some cases), and their transcribers sometimes "innovate" in various ways (modernised or americanised spellings, for example). Works here should generally start with uploading a scan and then proofreading against that scan; and the raw OCR in the scan will usually be a lot better for that than the PG text. If you don't care about the fidelity of the text, why not just read it on PG directly? --Xover (talk) 16:23, 26 February 2021 (UTC)
Totally with you on account of Project Gutenberg. I would advocate for a removal of all texts from Project Gutenberg. However, it's not Project Gutenberg, but Distributed Proofreaders that feed into Project Gutenberg. They have proofread texts with formatting and the original images. So, we would get a proofread text that we could compare to and validate them against the original image. See, for example, [6] (login required) Languageseeker (talk) 17:31, 26 February 2021 (UTC)
@Languageseeker: My apologies. I have obviously not been entirely clear on the distinction between PG and DP. Having a quick look at their guidelines it appears at least mostly compatible with our practices, so they could certainly be one source of text for us (provided what they actually output matches the guidelines, which I haven't checked). We'd need to find some technical way to import page by page to a scan hosted here so we can run our own Proofreading (just with a better starting point than OCR) and to make sure our texts are validatable to that scan for our readers. Possibly a mechanism akin to Help:Match and split, and it would probably require DP to have something API-ish that we could consume, but overall it should be feasible. --Xover (talk) 18:53, 26 February 2021 (UTC)
@Xover: Created a phabricator task. Hope it gets done. Languageseeker (talk) 20:27, 26 February 2021 (UTC)
@Languageseeker: this is an interesting idea, but an importer would almost certainly be done as an external tool that constructs a matching DJVU file from page images, feeds data in over the MediaWiki API, and then uploads the pages. I do wonder if it can be fully automated. The biggest worry so far, after logging in and sniffing around a bit, is that I cannot find page images for the "complete" works, nor a reliable link to something like the IA.
Also, I'm rather jealous of their velocity, even with such a huge number of review stages, they're clocking 140 works a month.
The other problem is that they do not format works to our level, for example "--" instead of "—", capitals, not small caps (the do mark this up), no centering, no sizing, etc etc.
On the subject of Phabricator, I've recently been wishing for a way to track enWS tasks, since they often have dependencies. Does anyone know if we can use Phabricator for that? Can we ask for a project? For example "move {{header}} to module logic". Inductiveloadtalk/contribs 20:53, 26 February 2021 (UTC)
(e/c)At present, the instructions for Match&Split specifically exclude DP works. However, IF a DP work is based on a single edition and the other criteria are met, then the Match & Split tool is fine. Certainly some of Laverock's contributions were done this way and the EB11 project is also utilising a version of the process. We would still require the normal enWS validation process. Beeswaxcandle (talk) 21:00, 26 February 2021 (UTC)
@Beeswaxcandle:, DP provides a file split by page, so you can in theory do better than M&S. However, you do need to figure out where the scan came from (hopefully the IA) and work out the offset (their page 1 is not the front cover) or construct a scan from the DP images, if present. The bigger challenge will be to write a parser for their markup, because it'd be a shame to junk it all. Inductiveloadtalk/contribs 21:30, 26 February 2021 (UTC)
So, I made a really shonky script to import a DP page-by-page text file: User:Inductiveload/dp_reformat.js using the magic of regex. It seems to have worked OK: Index:The ways of war - Kettle - 1917.pdf. However, the biggest issues I see is that once DP "archives" a project, the links to the marked-up source are removed from public view, as well as the page images. I'm unsure of why they do this, but it makes it all-but impossible to do a perfect match/split on the work, even if you can hook it up with a matching edition's scan. Inductiveloadtalk/contribs 10:52, 1 March 2021 (UTC)
The long and short is that DP does not appear willing to share their archived projects. So, making a tool that is specific to DP makes little sense. However, I still think that it makes sense to create a tool that would allow us to import OCR from a different source or replace the image files. So, I made a different phabricator ticket. Languageseeker (talk) 14:36, 1 March 2021 (UTC)
@Languageseeker: as long as you can massage the text into "Match and Split" format, you can already drive mass page uploads directly though the normal Wikisource interface. For the case of the User:Inductiveload/dp_reformat.js, this script will (attempt to) transform raw DP text into split-ready text with as much wikiformatting as it can. I will add some quick docs at User:Inductiveload/dp_reformat. It might not work for every type of DP project (since AIUI, different projects have different formatting standards). Inductiveloadtalk/contribs 15:06, 1 March 2021 (UTC)
@Inductiveload: Your script is utterly amazing. I'm astonished. I've used it on several books and it is great. I do have one bug and one suggention
  • Bug: If the offset is negative, you need to type the number first before you can insert a minus sign.
  • Request: Can you make the Index Menu a drop down menu so that the tool could be used on non-English Wikisource pages? For example, for French it would be "Livre" and "Page"
Also, is it possible to redo the match and split if the results are incorrect? I started one for Index:The American encyclopedia of history, biography and travel (IA americanencyclop00blak).pdf and it turns out they removed several blank pages from inside of the book, so I would need to rerun the script. In the past, when I tried to do this, it failed silently.
BTW: Everything from [7] upwards is still available on PGDP. It might be good to do a collective project to add these to Wikisource before the files are archived. Languageseeker (talk) 18:20, 2 March 2021 (UTC)
@Languageseeker: I looked at the offset and I think it's a bug in OOUI (phab:T276265).
Re the Index drop down, the namespace names "Index" and "Page" are canonical, so the script should just work at other Wikisources. E.g. s:it:Index:Peregrinaggio di tre giovani figliuoli del re di Serendippo.djvu works, even though the local namespace is "Indice". Let me know if it does not.
As for fixing a bad split, this is best fixed by a bot and admin, otherwise the redirects make a mess. Let me know the range to be moved and the offset and I'll sort it for you.
I am working on salvaging the texts at F2 levels (~1600). Inductiveloadtalk/contribs 19:26, 2 March 2021 (UTC)
Haha, you're awesome. Thanks for salvaging those texts. The ones that are posted to PG are archived first, so it probably makes sense to salvage those first. It's such a rich source.
For the problem with the merge, starting with Page:The American encyclopedia of history, biography and travel (IA americanencyclop00blak).pdf/22, the text should be moved +2 pages. So 22 has the text for 24. Languageseeker (talk) 21:15, 2 March 2021 (UTC)
I tried splitting a French book and it mostly works except for Modèle:Nop and Modèle:Ch don't work. Languageseeker (talk) 21:15, 2 March 2021 (UTC)
@Languageseeker: Move underway for the misaligned pages. In future please be cautious before splitting that the alignment is correct. It is annoying, I know, but if they mess with the pages, thems the breaks.
Re the French templates, I guess it possible to handle the other subdomains, as long as you know what to map each formatting element to. E.g. I think is their "nop". But it will take a little bit of a fiddle to do so. You can also do the replacements in a text editor if you know what you want to replace with.
Also, I wonder where to put the text files - they total over 950MB when uncompressed! Maybe the IA? Inductiveloadtalk/contribs 23:18, 2 March 2021 (UTC)
Thanks. I checked a few pages in the beginning and this one tricked me.
The IA might be a good place, or you can batch upload them to Commons. It might be good to store the original OCR for the future. You never know.
A few more markup for your script: [ae] = æ; [oe] = œ; {{...}} = … Languageseeker (talk) 02:19, 3 March 2021 (UTC)
@Languageseeker: Commons doesn't accept random zips/txt files, though. Anyway fill your boots: https://archive.org/details/dp_texts
Re the OCR, I'm not really sure about that, as long as we have the scan, we can OCR to our hearts' contents.
{{...}} is actually a WS template, it's designed to prevent a line break in the middle of ". . ." Inductiveloadtalk/contribs 09:09, 3 March 2021 (UTC)
Given DP don't have a publically available archive of their completed projects, and have stated that is intentional, I'm not sure harvesting everything that is available and posting it ... to a publically available archive ... is a great way to make friends! Nickw25 (talk) 08:24, 4 March 2021 (UTC)

All, it's important to note the position of the DP in relation to the activity in this space. In summary, DP have stated a view that WS should not use their in-progress texts in line with their community wishes. This is stated by the DP General Manager in the forum discussion on this topic at their site, which can be easily located. It was stated on the first phab ticket above that DP pointed the enquirer to in progress texts rather than archived ones. That was never stated publicly there. I don't know if it was stated privately or not, although it is no longer the position, if it ever was. It's also stated by their administrators in the same forum that the bulk harvesting of texts (presumably the same referenced above) was so disruptive it caused their server to become unresponsive for a period. Maybe unintentional, although destablising other projects servers to harvest information they don't want harvested cannot be the standard for a WMF project. Given this, I'd think it's reasonable for WS volunteers to refrain from harvesting and bulk importing content from DP given their community wishes. Disclaimer that I'm a volunteer at DP as well, and have been for many years on and off. To be clear, I'm a standard volunteer there, as here, and have no more knowledge other than what has been publically posted on their forums. Nickw25 (talk) 02:35, 6 March 2021 (UTC)

@Nickw25: I'm somewhat familiar with the situation. Inductiveload archived the project pages and concatenated texts because DP removes the images and text from DP soon after they get posted to PG. It did not cause the server crash. We asked if it was possible to just obtain the concatenated text afterwards and they said no. I'm not sure what the exact issue is. It seems to be a moral/philosophical issue rather than a legal one. They asserted no copyright claim, just a statement that downloading texts is a subversive activity that disrupts the core mission of the site.I'm sure that millions of authors who have their texts lapse into the public domain would like to restrict copying their work with a similar argument, but that is not how the PD works. If the text is an exact mechanical reproduction of a text in the PD, then the reproduction is in the PD. You cannot copyright a PD work. Languageseeker (talk) 02:55, 6 March 2021 (UTC)
The issue is not the public domain status of the material. If a physical library has a work that is in the public domain you're not entitled to photocopy it if the library says their policy is no photocopying; you are still subject to the terms and conditions of entrance to the library and any conditions they put on your access to the resource. Given those files require being a member to access; DP is entitled to set the terms and conditions for said access. I've been around there for a long time, and I understand why they have arrived at the conclusion they have -- they've arrived at it many times before this request. Either way, most websites have a fairly dim view of that kind of scraping. While I think it is a shame they aren't open to sharing their outputs more widely, nobody is being forced to volunteer there. Unfortunately such aggressive tactics don't win friends and influence people; they just result in more technical barriers being implemented and tighter T&C's than would have otherwise existed. Frankly, harvesting everything like that is a form of information colonialism in my opinion, it comes from a position of deep entitlement, shows no respect for the community that is there and damages the reputation of WMF projects along the way. If you disagree that strongly, those energies might be better directed to improving WS to learn from the attributes of DP that make it so effective despite not necessarily having all that many more volunteers than WS. Nickw25 (talk) 04:44, 6 March 2021 (UTC)
@Nickw25: I read carefully through the Code of Conduct and there is nothing that prohibits the copying of texts to other sites. It only states "Volunteers or guests must not intentionally harm or subvert DP processes or systems." If downloading a concatenated file harms or subverts their website, why do they have that option on every project page? If they want to write a stricter code of conduct, then I would welcome that. I think that volunteers should be allowed to know exactly what terms they are agreeing to. My intention was never to go over there to scrape all their files and I never did. I cannot speak for anybody else. I was hoping to be able to gain access on a case-by-case basis and not a massive batch import. For example, if we decided to add "Waterless Cooking for Better Meals, Better Health," then I hoped that we could ask DP for the concatenated text file and receive it. We might do so because the current version on PG is not disability friendly with dark text on a blue background. Even with the files on the IA, it would take weeks if not months of continuous work to match and split them all. Languageseeker (talk) 05:25, 6 March 2021 (UTC)
I agree that their code of conduct could be clearer around their expectations, I suspect that that will now be followed up. The ability to download is a little ambiguous. DP has been around since before Wikipedia; when it was put there I doubt anyone thought to use it for anything other than DP things. That said, in reality, if you quietly downloaded a few projects that were still active that you had some kind of 'personal use' for, it would have very very likely gone under the radar, especially if you were cautious to remove non public-domain elements (i.e. volunteer annotations). My interpretation of their code of conduct is that doing that at scale however, would be frowned upon as I interpret harvesting as subverting systems (they were made for human use). A bit like using the photocopier at work for personal use, the occasional page is OK but don't make a habit of it. One of those tricky to navigate grey zones, and like most small organisations, it takes getting to know the culture + what is actually written to figure out how things work--and the internet can make that more difficult.
Anyway, looking at Waterless Cooking; copyright logistics on that one aside, provided you can source a scan-set (DP do make their scans publicly available when they archive a project) my question would be what is stopping WS from working back from the final PG file? The PG file has the page numbering embedded; and the text could easily be matched up copied and pasted into WS (or a tool could be developed to extract based on the embedded page numbering). The transcribers note acknowledges a couple of silent corrections; which is what we'd have to keep an eye-out for when touching it up.
I would say, many of the 'reasons against' PG on WS seem outdated, and mainly affect earlier titles in PG's collection. Everything that goes to PG from DP for many years is expected to be from a single edition, doesn't get modernised, generally has a list of corrections made, and at worst has a few silent corrections to obvious errors that WS needs to keep an eye out for. I'd say DP processed files at PG would be the most reusable items from PG for WS given how DP work. It's certainly better than starting with raw OCR! I'd add the DP team have a point that the post processed file on PG might be better, even if a bit more tedious to work with. I've done 4.5k pages in their formatting rounds, and previously Post Processed a handful of titles there. I assure you things do slip through the 3 proofing (transcription) rounds. The PP and SR processes are quite likely to pick up on a number of those. Nickw25 (talk) 10:16, 7 March 2021 (UTC)
@Nickw25: I agree entirely, and I think you put the issue forcefully and cogently. In this particular case we've treated DP like some faceless entity rather than as a community with a culture and values of their own. While we may not have done anything wrong in any formal sense, we've acted hastily, and quite possibly brashly, without sufficient concern for a kindred project and its community. In comparable circumstances there would be outrage on the Wikimedia side.
There are probably some irreconcilable philosophical differences between our two projects (I suspect we're in The Cathedral and the Bazaar territory) that may make close cooperation effectively impossible. But we share similar enough interests and goals that we really should be cooperating wherever possible, and coexisting in an environment of mutual respect. That probably means we need to start by building some cross-cultural understanding so that we have a basis for dialogue, which in turn might let us identify what we actually agree on (which is probably a lot more than what we disagree on). --Xover (talk) 09:14, 6 March 2021 (UTC)
@Xover: I wouldn't entirely agree with this statement. The original proposal was to write a parser to allow for the importation of PGDP text files on a case-by-case basis. Then, we found out that DP remove the text files and images about three months after posting to PG. The immediate question became would it make sense to write a dedicated parser if we cannot get access to the files. So, I asked my friend, who is a DP volunteer, if she could find out if we could get access to the files. The basic answer was absolutely no under any circumstance; DP never has and never will share its in process files with anyone. PG is all you get. At that point, I believe that Inductiveload independently decided to archive the text files so that we could have more time. Since I was worried about copyright, I asked my friend to find out if the files are under copyright. DP would not give her a direct answer.
I don't think that time would help at all. DP simply believes that its sole purpose is to create ebooks for PG. When it comes to user contributions, the idea is that they belong to DP and, more specifically, the site administrators. They characterized wikisource as project that produces inferior ebook that will never meet their standard of quality. I'm not happy about the outcome or how the conversation went, but they left no room for dialogue. Languageseeker (talk) 02:31, 7 March 2021 (UTC)
I'd agree Xover that I hope at some point in the future there can at least be some mutual understanding. I'd never say never languageseeker. Anyway, a few points on all the discussion above:
  • Above Langugageseeker stated DP said they won't provide access under any circumstances. As such I think we just need to draw a line under it, respect their decision and wishes and move on, even if we disagree. As such I think it is important we don't bulk import items that were harvested. To do so risks diminishing WS's reputation in the DP community; but also at Project Gutenberg (where there is volunteer crossover) and potentially more broadly--it's a small world (even if most of us can't go anywhere right now!).
  • From what I can tell DP only shut the conversation down only after it became apparent that the bulk harvesting of texts had occurred against their wishes. I'd suggest that is really what crossed the line with them. It's an unfortunate series of events, although, regardless of who did what or how we got there, it was unsurprisingly interpreted as related and determined as a bad faith move on their part. I suspect they don't want to hear from anyone claiming to be from WS for a while.
  • As background, to understand the culture of DP, you need to consider them akin to a small local all volunteer community group in your neighrborhood and approach them as such. Anyone who has spent some time perusing their forums would be aware they are a very small organisation, overseen with a traditional governance structure (i.e. a board), run on a shoe-string budget of a couple of hundred dollars a month, and kept going by a very small group of people that have basically made it their FT job. There is no WMF on standby to take care of a whole bunch of foundationals or step in if required; and that no doubt contributes to their independent mindset. PG is similar. Even so, the community votes in the Board and has a say in the leadership and strategic direction. While some folks there might like them to be more open, it is clear, in WikiSpeak, that the Community Consensus on DP is they produce for PG and they are happy with that scope for now. There has been a lot of community upheavel at various points in their 20 year history. At the moment their community is settled and calm. As such, I suspect DP's leadership group are equally prioritising community harmony - as the leader of their community should. One good way to do that is keep the mission clear! As with small community groups change doesn't come quickly, and rarely happens because an outsider turns up. Such change takes many months, if not years, generally has a strong and trusted advocate within the organisation, and is quite complex and doesn't always succeed. It certainly doesn't happen inside of a week. Nor need it, DP and WS have both been doing their thing for the better part of 2 decades, there is no reason for haste. Nickw25 (talk) 10:16, 7 March 2021 (UTC)
They shut down the conversation before the harvesting happened. It was precisely their unwillingness to share the archived works that led to the bulk harvesting. I don't want to be specific because the conversations were private, but they claim ownership of the work of their volunteers and will not share any archived material. In the end, my own analysis is that they are afraid that Wikisource will destroy their community by attracting away volunteers. They have spent decades building the community and they don't want to lose it. Which I completely understand and accept. They want to be left alone and I'm leaving them alone. I don't think that wikisource should approach them for help in the future. Languageseeker (talk) 02:32, 8 March 2021 (UTC)
Appreciate you've come to your conclusions, although I would not agree after both being around for this long they are threatened by WS. At any rate, the issue is now not what happened, the legalese or interpretation of their T&C's, what misunderstandings took place or why they may not like WS. In the long term what WS should not do is engage DP the way that occurred on this occasion. We must expect genuine collaborations take time, be based on trust + mutual benefit and start from a place of curiosity, rather than just WS just wanting something.
The central issue now is the ethics of using that material. I've previously stated my personal expectations are WMF projects uphold the highest standards. You state they have asserted ownership and they've otherwise stated WS does not have permission to use. The raw download files, as is, contain material that is quite arguably not in the public domain. If I'm not mistaken your edit history shows you continue to import their files in bulk as recently as a few hours ago? I'm curious do you intend to continue given all the above? Given the potential damage to WS's reputation and that DP are unwilling collaborators in this instance, where is the community consensus to proceed, especially given this course of action does not align with WMF values as far as I can tell? Those values include statements like we will be "caring neighbors" and "humbly learn from our mistakes". https://wikimediafoundation.org/about/values/ . I'd respectfully submit it is time we do those two things in relation to this matter. There is much work to do on WS -- why persist with something that has the potential to be so damaging to our community? It's not a race to see who can get the most books, and the way we do things is just as, if not more, important than what we do. Nickw25 (talk) 11:13, 8 March 2021 (UTC)
@Nickw25: I think it is not overstepping if I suggest that the community at Wikisource now acknowledge that we've acted a bit of a bull in a china shop here, and that this can beneficially be conveyed to the DP community. We seem to have several philosophical differences that makes not just cooperation, but even just dialogue, a challenge. But, ultimately, I think both projects share a fundamental goal of making these works available; and both projects value accuracy and quality in the texts we produce. That, to me, seems like enough of a foundation to build on. Let's try to at least keep the lines of communication open and eyes open for any issue where our interests might align (increasingly draconian copyright, as one obvious shared concern). If someone that volunteers on both projects feels like acting as a bit of an informal liaison I think that would probably be a very good idea. --Xover (talk) 15:58, 8 March 2021 (UTC)
I am sorry for my actions in over-eagerly downloading the text. I have removed the public IA item. I didn't intend to cause such a problem, it was a "I wonder if I can" exercise that got out of hand. It was rude and thoughtless to proceed at such a rate. Inductiveloadtalk/contribs 13:36, 8 March 2021 (UTC)
I apologize as well. I never meant to cause the community any harm. I truly hoped that we could establish some form of collaboration, but they merely want to be left alone. Apologies were extended to the senior leadership of PGDP who consider the matter resolved. Languageseeker (talk) 13:49, 8 March 2021 (UTC)
  This section is considered resolved, for the purposes of archiving. If you disagree, replace this template with your comment. Xover (talk) 14:12, 30 March 2021 (UTC)

Project to Match and Split OCR from Distributed Proofreaders

The following discussion is closed:

We stepped into this one with perhaps more enthusiasm than sense, so best let this lie for now.

Distributed Proofreaders has several thousands projects to proofread and correct texts against a single-edition work. Most of their projects derive their scans from IA or Google. However, they archive they scans after posting to Google. The project is to download the proofread text, match them with their appropriate scan, and post it here. Inductiveload has created a awesome tool that makes it possible to preserve much of the formatting and easily get the text into a format ready for match-split.

Here are the requirements

  1. Install User:Inductiveload/dp_reformat.js

Project Instructions

  1. Pick a project from one that Inductiveload archived at [8].
  2. The "Concatenated Text File" in the zip file and the the Project Description is in the matching HTML.
  3. See if the Project Description gives the original location for the scans. If it doesn't, you'll need to manually match the work.
  4. Create an index file for the work and make sure to create page numbers.
  5. Go the the Sandbox. Select Edit and paste in the text from the text file inside of the zip that you downloaded from Distributed Proofreaders.
  6. Select Reformat DP text in the Tool section of the left side bar.
  7. You will need to paste in the Index name for the work and calculate the offset. Distributed Proofreaders deletes the first few pages, so you will need to do a bit of math to get the correct number.
  8. Either select Show Preview or Publish Changes. Verify the pages. Distributed Proofreaders sometimes remove blank pages from within the work and you will need to check a few pages to make sure that the offset is correct.
    1. If their are multiple offsets, you will need to do the merge and split in stages.
  9. A tab called "Split" will appear next to discussion. Select discussion. If you want to verify that your split started, visit [9]
  10. Once you have done this, record that you have imported the work at User:Languageseeker/PGDP.

Disclaimer, this is a personal project and has no official sanction. Just asking for help from the community. Languageseeker (talk) 23:25, 2 March 2021 (UTC)

Follow-up note, Inductiveload's script will work on non-English wikisources, but the markup will need to be manually updated.

Are the downloaded texts coming from Gutenberg? We have had poor success with their "proofread" texts. They often mix-and-match editions, have modernizations, or other problems that make them incompatible with Wikisource standards. --EncycloPetey (talk) 00:09, 3 March 2021 (UTC)
No, Distributed Proofreader takes a single source usually from the IA or Google Books, proofreads the books against the source images, and then processes them and posts them to PG. They are strict to match the source text to the image prior to posting to PG. Before they post to PG during post-processing, they sometimes correct errata and deviate from the source text. The files on Distributed Proofreaders match the source text. You can take a look at the couple of books that I matched-and-split from Distributed Proofreaders as an example. The major thing that is lost in the Distributed Proofreader sources is the header and footer, but this is easier to add-in that proofreading the text. Languageseeker (talk) 01:08, 3 March 2021 (UTC)
Just commenting on the concerns about PG. Many (most?) of their texts now come from DP, who expect them to be a single edition. Most of DP's final output will list the changes made, and many also state if further silent changes were made. PG has come a ways from the days of refusing to acknowledge which edition something was prepared from (as I understood they did). There is certainly a subset of projects from PG that are lower risk from a WS perspective. They'd have been processed at DP (identifiable in the PG credits line), have an easily identifiable scan set (sometimes referenced in the PG text, otherwise can be traced back via the DP project comments, or a bit of investigative work) and have a transcribers note that silent changes were not made. Nickw25 (talk) 02:41, 6 March 2021 (UTC)

Also posting here to link to my comments above, that DP do not want WS using their in-flight works for this purpose. See that post for more detail. Nickw25 (talk) 02:45, 6 March 2021 (UTC)

  This section is considered resolved, for the purposes of archiving. If you disagree, replace this template with your comment. Xover (talk) 14:12, 30 March 2021 (UTC)

3 versions of Pratt's History of Music

There is an unsourced and incomplete version in the Main The History of Music and two transcription projects: Index:Pratt - The history of music (1907).djvu and Index:Pratt - The history of music (1907 Preface Variant).djvu. If The History of Music could be moved to The History of Music (Pratt, unsourced), I can take it from the redirect. Thanks.--RaboKarbakian (talk) 17:08, 7 March 2021 (UTC)

Pratt - The history of music (1907).djvu is the original printing; Pratt - The history of music (1907 Preface Variant).djvu is a later reprint of the 1907 edition with a new preface, list of deaths after the appendix, and the removal of blank plages. The transcription is probably sourced from Pratt - The history of music (1907).djvu, but, outside of the images, I'm not sure how much value it contains as it was last edited in 2007 and the text in Pratt - The history of music (1907).djvu comes from a good source. Languageseeker (talk) 17:36, 7 March 2021 (UTC)
They have a {{versions}} here. I was told to ask an admin to move things due to clean-up of redircts being easier with some tools they have. There is a broken, unfinished version in Main that needs moving. So, I ask here for that.--RaboKarbakian (talk) 15:53, 8 March 2021 (UTC)
@RaboKarbakian: Just transcribe, and when you have the pages, just transcluded into the existing pages. No point in moving an unsourced, unfinished work, we just overwrite with something verifiable. — billinghurst sDrewth 09:58, 11 March 2021 (UTC)
@Billinghurst: I would have suggested to delete the 2007 work that has nothing to transcribe as I understand the definition of that word here. But the way of the sourcerers, usually, is to move it from the Main namespace. I might be completely confused, so perhaps you can provide a link to where it is to be transcribed at and I will work at it....--RaboKarbakian (talk) 14:21, 11 March 2021 (UTC)
@Billinghurst, @RaboKarbakian: Seconded, their is a sourced copy with good text to replace the unsourced copy. Languageseeker (talk) 14:30, 11 March 2021 (UTC)
I am presuming that the name is okay, and that the chapter structure is okay, so just transclude in place. There is no requirement to delete, just replace. Nothing is gained by deleting; and nothing is gained by moving an incomplete work that will never be completed`. — billinghurst sDrewth 07:53, 12 March 2021 (UTC)
@RaboKarbakian: Do you still need assistance with this or has it been resolved? --Xover (talk) 14:07, 30 March 2021 (UTC)
@Xover: as far as I am concerned, I left this to billinghurst's competent management....--RaboKarbakian (talk) 14:23, 30 March 2021 (UTC)

Tom Urton

The following discussion is closed:

Misplaced.

I happened to read one of your letters from Tom Urton, who lives in Norton, England with great interest. I live in Santa Rosa, Sonoma Co, California, and my great-grandmother was Kate Elizabeth Urton. It is not a common name. A few years ago we went to Norton and to St James Church, but we could not go inside. I am a genealogist and have a lot of information about the Urtons, but I am stuck about 1470. Tom, I hope you see this note! I sent a letter to you last December, but it was returned the first week of March and said the address was incorrect. I used the address you posted with your note on Wikisource. I am afraid to write my address/email here because it would be available to everyone in the world. I hope you will write again so that I can see your address or an address that you use. I live in Spring Lake Village, Santa Rosa, CA 95409. Suzanne (Tharp) Guerra

  This section is considered resolved, for the purposes of archiving. If you disagree, replace this template with your comment. Xover (talk) 13:49, 30 March 2021 (UTC)

The following discussion is closed:

Request appears to have been otherwise resolved. @ShakespeareFan00: If you still need this doctored in some way feel free to hit me up on my talk page.

Can someone trim this down? It seems there are some additional clippings in it, that aren't part of the original. ShakespeareFan00 (talk) 22:21, 25 March 2021 (UTC)

Why? Just mark them as no text and move on. Creating work for others where there is no value, and an easy solution. — billinghurst sDrewth 14:50, 27 March 2021 (UTC)
  This section is considered resolved, for the purposes of archiving. If you disagree, replace this template with your comment. Xover (talk) 11:22, 30 March 2021 (UTC)

Help: Need to move Index and its Pages because of a spelling error

The following discussion is closed:

All pages, file, and index migrated to new name.

Years ago, I copied and pasted OCR text for the Index name, and I missed a typo in the name. The scanned "g" in Egypt ended up as the letter "q". Index:On the Desert - Recent Events in Eqypt.djvu. I would like to save rather than delete and re-install, but don't know how to move the index and pages. — Ineuw (talk) 06:36, 27 March 2021 (UTC)

  Doing… --Xover (talk) 13:20, 27 March 2021 (UTC)
  Done File rename requested at Commons, and a temporary redirect established. @Ineuw: --Xover (talk) 14:33, 27 March 2021 (UTC)
I have moved the file at Commons, though I wonder why we even bothered. There is no need, and it creates a lot of work for next to no value. — billinghurst sDrewth 14:48, 27 March 2021 (UTC)
@Xover: Many many thanks for taking the time out and correcting my error. I should be able to do it by now. What tools do you use to rename the pages? — Ineuw (talk) 19:02, 27 March 2021 (UTC)
@Ineuw: Pywikibot has built-in support for moving pages, you just need to massage up a list of from->to page names. --Xover (talk) 19:04, 27 March 2021 (UTC)
Thanks.
@Ineuw: I also have a handy script for exactly this purpose: User:Inductiveload/Scripts/Page shifter. Inductiveloadtalk/contribs 20:32, 27 March 2021 (UTC)
  This section is considered resolved, for the purposes of archiving. If you disagree, replace this template with your comment. Xover (talk) 11:16, 30 March 2021 (UTC)

Index:The star in the window (1918).djvu has problematic pages, need to be replaced

The following discussion is closed:

All pages migrated to a new index.

I need some help, because pages 286 through 293 of The Star in the Window are missing for some reason, and are just a bunch of copies of page 284 and 285.. This would be the entire content of Chapter 31.

The pages on the DJVU need to be replaced without damaging the Index page. This DJVU came from Internet Archive, but there is another version at Google Books, which is located here: File:The Star in the Window (Grosset & Dunlap).pdf. The pages 286-293 are correctly shown there, and the actual book content between the two versions is exactly the same (trust me).

I have no idea how to do this, so could someone please replace the problematic pages at Index:The star in the window (1918).djvu with the same correct pages from File:The Star in the Window (Grosset & Dunlap).pdf? Thank you! PseudoSkull (talk) 17:16, 27 March 2021 (UTC)

Pages will be proofread as normal from the other scan but are marked as problematic for now until this is fixed. PseudoSkull (talk) 17:19, 27 March 2021 (UTC)
I'll upload the Harvard copy of the Stokes version later. Languageseeker (talk) 20:53, 27 March 2021 (UTC)
New version at Index:The Star in the Window.pdf] Languageseeker (talk) 03:13, 28 March 2021 (UTC)
@Inductiveload, @Xover: Could one of you move the text from Index:The star in the window (1918).djvu to Index:The Star in the Window.pdf. The offset is -1. Languageseeker (talk) 15:05, 28 March 2021 (UTC)
Yes, please. I have tried myself. PseudoSkull (talk) 15:23, 28 March 2021 (UTC)
I also made this request at Commons to convert that to a DjVu and keep the index we already made. If migrating the page contents is what we must do instead I can do that much. But it will take a damn long time... PseudoSkull (talk) 15:25, 28 March 2021 (UTC)
Migration in progress. PseudoSkull (talk) 16:04, 28 March 2021 (UTC)
I aborted my migration due to User talk:PseudoSkull#Please do not bulk move Page: namespace pages. @Xover, @Inductiveload, @Billinghurst, @Jan.Kamenicek: Any of you can migrate the rest in whatever way you wish, since you are the ones with more efficient tools. My bot got to Page:The_star_in_the_window_(1918).djvu/89. None of the pages in the transclusion (except in the front matter) have been migrated to the better scan. PseudoSkull (talk) 22:18, 28 March 2021 (UTC)
The migration needs to be from Index:The star in the window (1918).djvu to Index:The Star in the Window.pdf. PseudoSkull (talk) 22:20, 28 March 2021 (UTC)
More info: Only Chapter 1 up to Chapter 30 need to be changed to reflect the correct scan, since I'm proofreading the rest of it on the correct scan. PseudoSkull (talk) 23:29, 28 March 2021 (UTC)
@PseudoSkull:   running; pages are being moved, and the transclusions are being updated. And has been mentioned previously, please do not do titles as hard links
| title = [[The Star in the Window (Stokes)|The Star in the Window]]
please do relative links like
| title      = [[../|The Star in the Window]]
Thanks. — billinghurst sDrewth 02:28, 29 March 2021 (UTC)
@Billinghurst: Ah yes, thank you for doing that. And I apologize for repeating a previous mistake, I must have copied from elsewhere and not have seen that error. I will try and do differently in the future. PseudoSkull (talk) 02:32, 29 March 2021 (UTC)
  This section is considered resolved, for the purposes of archiving. If you disagree, replace this template with your comment. Xover (talk) 11:15, 30 March 2021 (UTC)

Paragraphs with a left margin across page-breaks

I have come across numerous instances where a paragraph that is left indented runs from one page to the next. There does not seem to be any way of running this paragraph together on transclusion. Am I correct in this? see, for example: https://en.wikisource.org/wiki/Page:A_Series_of_Plays_on_the_Passions_Volume_3.pdf/172 and the subsequent page. Esme Shepherd (talk) 10:23, 31 March 2021 (UTC)

In this case, use the "split" templates {{left margin/s}} and {{left margin/e}}, just like the {{block center}} equivalents. These are undocumented, so it's hardly surprising you didn't find them! Inductiveloadtalk/contribs 11:16, 31 March 2021 (UTC)

Thank you, that's brilliant! I had experimented with this, but I must have had the formulation wrong! Esme Shepherd (talk) 09:55, 1 April 2021 (UTC)

Black Beauty, versions and translations

No big deal as I put them on Author:Anna Sewell, but there is a translation and a film of Black Beauty (silent though). There are other versions with different illustrators, but I haven't compiled that list.

Maybe Black Beauty could be moved to Black Beauty (first edition)? Or not. Just let me know.--RaboKarbakian (talk) 16:17, 31 March 2021 (UTC)

Another perfectly good option is to tell me that I was given bad guidance and allow me to move it myself. I am annoyed to be here asking, which often means that I am being annoying.--RaboKarbakian (talk) 17:18, 31 March 2021 (UTC)
Labeling something as "first edition" can be problematic, as there can be a first book edition, first magazine edition, first paperback edition, first edition in the US, first edition in the UK, etc. It is much better to use the date and/or publisher and/or place of publication to identify the edition rather than an edition number. --EncycloPetey (talk) 17:37, 31 March 2021 (UTC)
And the items you added to Author:Anna Sewell were not written by Sewell, so they should not be placed on her Author page. Nor should you link in an Author page to a Wikipedia article about the work she didn't write from a title that would be expected to link to the work itself on Wikisource. --EncycloPetey (talk) 17:45, 31 March 2021 (UTC)
@RaboKarbakian: First, you only need to ask for help in moving pages if you are uncertain about the correct page names etc. or there are a large number of pages involved (think "more than about five" as a rule of thumb). In the latter case both because with a large number of pages any messes are also going to be large and because it is much easier to get an admin to do the move than to go back and clean up redirects etc. Black Beauty is a case where it's a good idea to ask for help for both those reasons. So I think the advice you were given that led you here in this case was very good.
Because… I'm not sure any move is warranted here. We generally don't preemptively create versions or disambiguation pages (yes, there are exceptions), and so far as I can see we currently only have one work with that name and one edition of that work. Once we have an edition of Black Beauty Retold in Words of one Syllable ready to transclude we might want to look at what to do, which might be a disambiguation page or might be to put the latter at Black Beauty Retold in Words of one Syllable. If it were concluded to use Black Beauty for both, the original would live at Black Beauty (1877) and the other at Black Beauty (1905) (or, often, disambiguating using the author's last name because we usually don't have multiple editions of the same work). --Xover (talk) 18:10, 31 March 2021 (UTC)
@EncycloPetey: Everything you said was true, although, I have been treating "one syllable" works as translations (as others are here). Everything you removed from Anna Sewell belongs at Black Beauty which is the reason I am here.--RaboKarbakian (talk) 18:29, 31 March 2021 (UTC)
@Xover: The Main space name is interesting, in that it is a pain. At commons, another sourcerer was naming cats: Title (YYYY, publisher) which I started to follow there, leaving Title open for all editions, or Title (Author) for problem titles, or Title (YYYY, Author) for the prolific and revisionists. Whatever name you (all) think works will be just fine.--RaboKarbakian (talk) 18:29, 31 March 2021 (UTC)
@EncycloPetey: What author page linking? I am confused.--RaboKarbakian (talk) 18:31, 31 March 2021 (UTC)
I am referring to the links you incorrectly added as part of the discussion you started. You placed links on a page where they should not have been placed. If you need to keep notes, you can place them in your User space. Or you could place them on the Talk page for Black Beauty. But please do not place works by one author onto the Author page of a different author. --EncycloPetey (talk) 18:42, 31 March 2021 (UTC)

Transclusion not wiki-formatting heading

Page:CTSS programmer's guide.djvu/53 is fine by itself, but not when transcluded on Compatible time-sharing system: A programmer's guide

Thanks, Phillipedison1891 (talk) 15:03, 1 April 2021 (UTC)

Nevermind, was able to fix it. Phillipedison1891 (talk) 15:04, 1 April 2021 (UTC)

Was looking through this, and found some 'bonus' images and other ehpemra in the scans..

I've marked the file as problematic, so that a further discussion can be had here.

The images look like they are of Holbien (or similar-era) paintings (so PD-art). If they can be identified it would be reasonable to retain them..

However the copyright status of the ephemera is unclear. Do I mark the ephemra for blanking given the unclear status? (it's also not clear if they are contemporaneous withe the rest of the book.)

Example : News clipping of unknown date Page:Hans_Holbein_the_younger_(Volume_2).djvu/30 ? ShakespeareFan00 (talk) 22:46, 18 March 2021 (UTC)

@ShakespeareFan00: If it's not part of the book as published then mark the pages as without text. If they are additionally of unclear or dubious copyright status then flag the specific pages and I can excise them. Just looking at the index it wasn't clear to me which pages this was concerning. --Xover (talk) 13:48, 30 March 2021 (UTC)
I think you are both overthinking this. Like I did with my failed 9 page djvu file. Look at the publication date. — Ineuw (talk) 00:24, 3 April 2021 (UTC)

Setting up Merge and Splits for The Complete Works of Geoffrey Chaucer

I just uploaded the scans for all 7 volumes of Author:Geoffrey_Chaucer#Collected_works. The first 6 volumes have text from Gutenberg done by PGDP. For that reason, they have page numbers. Is there anyway to merge-and-split these texts? Languageseeker (talk) 01:45, 2 April 2021 (UTC)

Missing page images of a linked djvu file?

I created this eight page article as a .djvu file which displays correctly in my desktop DjVu app. But here, the page images are not showing, but the text layer is re-created with the OCR. — Ineuw (talk) 04:18, 2 April 2021 (UTC)

Page images are missing and OCR error

Installed this 9 page document. The page images are missing, but the OCR succeeded, except on the last page on which OCR generates an error. Whenever someone has the time, please look at what's wrong. Thanks. — Ineuw (talk) 13:09, 2 April 2021 (UTC)

IOError: https://upload.wikimedia.org/wikipedia/commons/thumb/6/6f/The_Rise_of_the_Russian_Jew.djvu/page9-1024px-The_Rise_of_the_Russian_Jew.djvu.jpg (invalid url?)
@Ineuw: That file claims to have a resolution of 19,204 × 26,458 pixels (about 10x what's typical), but still only 9.31 MB. I'll dig a bit, but my initial guess is that this file is broken in some way. --Xover (talk) 13:33, 2 April 2021 (UTC)
Uhm. How did you extract the 9 pages? And for that matter, why? You can proofread and transclude only those 9 pages even if the file and index contain many more. --Xover (talk) 13:35, 2 April 2021 (UTC)
Definitely a funky file. It's got indirect chunks, looks to put the text layer in annotation blocks, and claims to be an insane resolution. What tool created this file? I'll try to generate a DjVu of the whole volume, but it'll have to wait until later today or tomorrow. --Xover (talk) 13:42, 2 April 2021 (UTC)
@Xover: Please don't waste your time. I will try it again in a different way to learn how to do it. These were made from 9 JP2 pages converted to PNG then uploaded to Convertio to convert to 9 separate djvu pages (I have no offline djvu conversion tool), which was stitched together with djvm in Windows. Go ahead and laugh. :-) — Ineuw (talk) 22:19, 2 April 2021 (UTC)
@Ineuw: Regardless of how roundabout that process sounds (happy to provide guidance, but tl;dr if you have djvm you should have c44, which would convert a JPG input directly to DJVU): why not just upload the entire document https://archive.org/details/worldswork14gard, which even comes with the OCR? And then it allows proofreading of the rest of The World's Work v. 14 by others. Inductiveloadtalk/contribs 22:59, 2 April 2021 (UTC)
@Inductiveload: You are absolutely right. There is no excuse for my approach, except that I was exploring (playing) to see the end results. The djvudump displayed everything that's wrong. So, went back to the drawing board, found c44.exe, as well as the scripts posted on the Wikimedia Commons. About uploading the complete volume. I try not to upload books which I have no interest to proofread, so this seemed to be an alternative and a teaching moment. Only because it's 9 pages.— Ineuw (talk) 00:18, 3 April 2021 (UTC)
@Inductiveload, @Xover: I converted the .jpg page images with c44 and then assembled them with djvm. It's about 20% of the previous uploads, but the same problem exists. The text comes through but not the page image. Could you please look at it. — Ineuw (talk) 23:06, 4 April 2021 (UTC)
@Ineuw: A big part of the issue is that you tagged the images with Internet Archive identifier: worldswork14gard on Commons which prevents the IA tool from uploading the file. Languageseeker (talk) 23:27, 4 April 2021 (UTC)
Thanks for explaining. This is not working out for me. The volume is 700 pages and is not worth uploading in my opinion. So, I will delete it here, and ask for a deletion at the commons.— Ineuw (talk) 23:59, 4 April 2021 (UTC)
@Ineuw: I checked the new version of the file and it looked just fine, including showing the images in the Page: namespace. If you're still seeing broken images it is probably a caching issue or similar. The only thing wrong with your new version is that it doesn't have a text layer in the file itself (let me know if you want instructions for adding one: it's complicated and inconvenient, but entirely doable). --Xover (talk) 00:48, 5 April 2021 (UTC)
@Languageseeker: And just what in the world does that have to do with anything? --Xover (talk) 00:48, 5 April 2021 (UTC)
@Xover: The IA tool checks if there is a file tagged with {{IA|worldswork14gard}} on Commons. Even if it's an image, then the IA tool will not allow you to upload the file stating that the file already exists. I tried uploading the entire file with the IA tool and the images that Ineuw uploaded to Commons and tagged with the IA link prevented the uploading of the actual book. Languageseeker (talk) 00:55, 5 April 2021 (UTC)
@Languageseeker: Yes, that is roughly how the ia-upload tool works. However, as ia-upload was involved nowhere in Ineuw's problem, why are you bringing it up at all, much less framing it as a causal factor for the problems they were having? --Xover (talk) 10:06, 5 April 2021 (UTC)
@Ineuw: It's actually easier to manage a single 700 page volume than managing an extracted article. You don't need to proofread the entire thing, just the part that interests you. Languageseeker (talk) 00:46, 5 April 2021 (UTC)

┌─────────────────────────────────┘
@Languageseeker: Thanks for the correction on the commons and will that with future uploads.— Ineuw (talk) 00:50, 5 April 2021 (UTC)

@Ineuw: If an uploaded image, DjVu, or PDF comes from IA then the file's information page should definitely contain the IA identifier or another link to IA. ia-upload was designed to avoid duplicate uploads based on an assumption that most works available on IA were not, and probably never would be, uploaded to Commons. That assumption has been turned inaccurate over the last couple of months thanks to a way overzealous bulk upload of as many of IA's PDFs (mostly low-quality, and with awkward autogenerated filenames and the raw IA bibliographic metadata) as the bot could get their paws on (mostly constrained by copyright). This state of affairs most likely means that the ia-upload duplicate checking in its current form is no longer feasible, and will either have to be removed or rewritten to work in a significantly different way. At which point the problem Languageseeker is talking about, and that affects one single specialised uploader tool, will disappear, but we will still need good information about the source of media files on Commons. --Xover (talk) 10:06, 5 April 2021 (UTC)

Uploading Large PDFs to Commons

I don't seem to have a lot of luck uploading large PDFs to Commons. I've tried Chunked Uploader and it does not work. Does anybody have any suggestion? For example, I want to create a PDF for [10]. Languageseeker (talk) 20:29, 2 April 2021 (UTC)

@Languageseeker: I use just Upload Wizard and imo it should be able to handle this file too. --Jan Kameníček (talk) 22:22, 2 April 2021 (UTC)
Upload Wizard refuses documents over 100MB.--Prosfilaes (talk) 23:16, 2 April 2021 (UTC)
@Prosfilaes: that's the Basic Upload - te Wizard goes up to 2GB, I think (it uses chunked uploading). Inductiveloadtalk/contribs 23:27, 2 April 2021 (UTC)
Actually, it should be up to 4GB. --Jan Kameníček (talk) 08:30, 3 April 2021 (UTC)
@Languageseeker: I've been running into phab:T278104 on and off for a while with API uploading and the upload wizard, perhaps it's that?
On the other hand, this document produces a 55 MB DJVU from the 494MB of Hathi images, so perhaps that's a better way forward? If you must have a PDF and you want to crush it down, JBIG2 encoding the PNGs produces a PDF around 12MB, but I don't have tools to combine the JPGs with the PNGs as a PDF so only the PNGs are JBIG2'd, and I don't have tools to write the OCR into PDFs. Also the PDF is mind-expandingly slow to render compared to the DjVu. Inductiveloadtalk/contribs 23:27, 2 April 2021 (UTC)
@Inductiveload: Yep, that's the exact error that I'm getting. I'll just wait until that bug get's fixed. I'm trying to preserve the image quality because of the illustrations. Thanks for your help. Languageseeker (talk) 00:48, 3 April 2021 (UTC)
@Languageseeker: The illustrations are already pretty damaged by the Google's compression, so IMO it's not particularly critical (especially as it's way easier to extract the images from the existing JPGs at Hathi rather than from a PDF that another user wouldn't know has or hasn't re-encoded the image). As was said by Nemo_bis in phab:T277921, Commons isn't attempting to compete with Hathi/IA for storage of endless terabytes of "raw" (not that is really is raw, see below) scan images. Because what's the point?
Even then, the 36 JPGs in this file total 35MB, so, on top of the ~12MB of lossless JBIG2-encoded bitonal images, you could still produce a PDF under 50MB, without a byte of data loss from the Hathi scan (except in the Google watermarks). But the PDF will render like molasses, because JBIG2 is very slow to decode. So I'd still suggest going for DjVu, and if the image quality from the default c44 encoder settings is not good enough for whatever reason, you can set that manually. For example:
$ c44 -decibel 50 mdp.39015011058198.0001.jpg page765.djvu
$ ddjvu page765.djvu -format=pnm page765_from_djvu.pnm
$ compare -metric PSNR mdp.39015011058198.0765.jpg page765_from_djvu.pnm diff.png
50.7065
Which is kind of what you expect since we asked for 50. 50dB of PSNR is really rather good (way over JPG quality=90). In fact, since 255 is ~48dB it's essentially perfect (below the quantization error of the actual 8-bit image, but since the two aren't quite identical I'm obviously missing something). This is the difference map between the input JPG and the 50dB c44 encoding. White means identical.
Which is all kind of moot, because although the Hathi JPGs may be set at Q=95, they're encoding substantial compression noise, probably from before the data ever reached HT, which implies that 95 is far from representative of the paper-to-user Q factor and using a Q=95 level of compression is mostly just a waste of bits:
 
Striving to store data that's already totally swamped by compression noise is not particularly useful (in the context of Wikisource), IMO. Sure, reducing compression damage at each step is a nice goal, but once the data is trashed to n dB (where n << 50), what are you hoping to achieve by worrying about further lossless encoding. You have to ask yourself what exactly you are trying to achieve, or it's going to turn into a classic w:XY problem. Inductiveloadtalk/contribs 16:41, 3 April 2021 (UTC)
I'm trying to make sure that users do not have to go through Help:Image_extraction to crop an images from a file. I know that Google scans are of an inferior quality, but they are often all we have. There is nothing wrong with lossless compression, but lossy compression alters the image. As you know, getting images from Haithi Trust is difficult. So why make users go through extra work?
Yes, DJVU can compress more, but DJVU is no longer being actively developed. It's one major bug away from following the fate of Lilypond phab:T257066. If the security team discovers a major security bug in the DJVU viewer, who will fix it? What about if the code become incompatible with the latest release of Debian? As for JBIG2, it's dangerous to use because it can alter the image, see JBIG2.
I'm not asking to import the entire IA or Haithi Trust, but I want to make sure that the images are of the highest quality because the quality of monitors are continuously improving. A higher quality image will last longer. If the scans come from IA, I don't care because I know that we can pull the scans at any time. For Haithi Trust, I'm not so sure because it already imposes restrictions. Downloading from Haithi Trust at this moment, places Wikisource in National Portrait Gallery and Wikimedia Foundation copyright dispute territory. Languageseeker (talk) 00:16, 4 April 2021 (UTC)
FYI, not that I'm saying JBIG2 is ideal (due to the insane decode time making them truly miserable to use on all but the most monstrous CPUs), but jbig2 operates in lossless mode by default (it's lossy if you set -s).
And even if you do just use PNG, remember to make them bitonal first, because the Hathi PNGs are only not bitonal due to the Google watermark. That will save you hundreds of MBs per file. Inductiveloadtalk/contribs 07:11, 4 April 2021 (UTC)

Transcriptions of an audio work

Hello Wikisource editors, we have been publishing (in Apple Podcasts, and the like) and also donating to Wikimedia Commons a podcast series, under the standard Creative Commons Attribution-Share Alike 4.0 International. We are considering creating a Wikisource page with the transcription of those podcast episodes. It seems that Wikisource welcomes transcripts of audio (WS:SCOPE), but more guidance, especially to confirm whether this contribution is within the scope of Wikisource, would be much appreciated. JCPod (talk) 19:52, 3 April 2021 (UTC)

Wikisource:What Wikisource includes should give you an idea of what we include. For works published after 1925, the work should meet out equivalent of "notable". Podcasts generally do not meet that criterion, as they do not pass through peer review or editorial controls. --EncycloPetey (talk) 19:58, 3 April 2021 (UTC)
@JCPod: while it's pretty unlikely a modern "self-published" work like a podcast meets WS:WWI, I think it sounds like something Wikibooks would allow, since it's essentially a book? I don't speak for them, but you could ask at wikibooks:Wikibooks:Reading room. Inductiveloadtalk/contribs 20:27, 3 April 2021 (UTC)
Thank you both for your prompt responses. JCPod (talk) 20:58, 3 April 2021 (UTC)

  Comment Aside from this specific case example, we need to better look at how we handle transcriptions of audio works, especially progressive transcriptions. Are we going to work in the Index: / Page: ns from a file at Commons, and look to go through the double process of validating. How would we get the snippets of sound into files, etc. We have done something with video, and I think that it is time we looked to better formulate these media types. Needs guidance in Help: namespaces for video and audio files. PseudoSkull would be our current lead exponent. — billinghurst sDrewth 00:13, 4 April 2021 (UTC)

Author creation requested

Can anyone help to create the author page for Bruneian sultan Hassanal Bolkiah? I'm working on his Syariah Penal Code Order, 2013, and other emergency enactments solely made by him. In particular, I'm not sure how to deal with all of those authority control scribble-scrabbles. Many thanks.廣九直通車 (talk) 13:54, 5 April 2021 (UTC)

@廣九直通車:   Done See Author:Hassanal Bolkiah. I am uncertain about the best copyright tag to use, so I've stuck EdictGov there for now. --Xover (talk) 19:45, 5 April 2021 (UTC)

Looking for help for some hebraic caracters in a french Champollion book about hieroglyphs !

Hi,

I'm active in the french Wikisource, and I'm working on a book from Jean-François Champollion about hieroglyphs... In this book, there is THIS PAGE with a text in hebraic caracters... As I'm not good in hebrew langage nor in hebraic caracters, I'm looking for some help to correct the page. Any help would be welcome. Thanks Lorlam (talk) 18:50, 5 April 2021 (UTC)

@Ineuw: Is this something you are able to help out with? --Xover (talk) 19:38, 5 April 2021 (UTC)
Done. — Ineuw (talk) 19:52, 5 April 2021 (UTC)
Many thanks for your help — Ineuw :-) --Lorlam (talk) 21:19, 5 April 2021 (UTC)

Paginated text without scan

Following from this lengthy discussion and others on English Wikipedia w:en:Talk:Sir Charles Asgill, 2nd Baronet#General Washington's Dilemma by Katherine Mayo, Anne User:Arbil44 has transcribed a hard-to-find historical letter at w:en:user:Arbil44/New_sandbox4. It's well out of copyright. Anne has retained the original pagination and headers. Would someone be able to help copy this across to Wikisource with the appropriate page structure? Or advise me how to do it? (For example, without a scan, do we still use the Index: namespace to assemble the pages?)

Note, I don’t want to ask Anne to go back and add a scan, I get the sense that she has become somewhat frustrated in her interactions with Wikipedia and I don’t want to make things worse. So I’m hoping we can accept this as a non-scan-backed text as it is.

Pelagic (talk) 01:34, 6 April 2021 (UTC)

@Pelagic: I made the 6 pages into an index: Index:General Washington's Dilemma - Mayo - 1938 - Appendix 2.djvu. I'm not quite sure how it should be transcluded to mainspace, as it's just a fragment of a complete work. Inductiveloadtalk/contribs 01:38, 7 April 2021 (UTC)
OMG, I didn't see that Anne had already posted above. Great news that she was able to provide a scan! Many thanks for your help on this, Inductiveload.

This work and subsequent volumes of the United Nations Treaty Series are in English and French. I see the first volume proofing only English, so I would like to be ask if separate indexes would have to be made in French Wikisource to proofread the French portions. If so, I am making more indexes here to encourage proofreading, but I do not have a reliable OCR.--Jusjih (talk) 05:01, 6 April 2021 (UTC)

@Jusjih: frWS would need separate index pages, yes. But they should mostly be able to just copy the data we have here if we have ones they don't already have. And, of course, they can use the same File: on Commons.
What's your problem with OCR? --Xover (talk) 07:35, 6 April 2021 (UTC)
Thanks. I wonder if reliable OCR is available online.--Jusjih (talk) 18:04, 6 April 2021 (UTC)
Perhaps this site already has OCR when creating page namespace? I just added some well formatted covers of the United Nations Treaty Series, but we will have to mark the year published since Volume 401.--Jusjih (talk) 00:47, 7 April 2021 (UTC)

Transcribing directly from webpages (Highway Code)

Hi, I believe the current Highway Code, published by the British government's Department for Transport, falls under the CC-BY-compatible Open Government Licence and thus would be eligible for inclusion (we already have a 1931 edition and parts of the 2008 Traffic Signs Manual). But how would I go about copying it here? I know scans are preferred for verifiability - would it be appropriate to print the webpages to PDF and upload them to Commons, or is a URL sufficient attribution? If so, how do I create the relevant pages without a scan? --Wodgester (talk) 17:01, 7 April 2021 (UTC)

I would "print" web pages into PDFs, upload then to Commons saying that the source web pages have been converted to PDFs, create indexes here, then proofread the pages.--Jusjih (talk) 20:49, 8 April 2021 (UTC)
Thanks for the help @Jusjih! I've started an index. --Wodgester (talk) 16:17, 9 April 2021 (UTC)
You are very welcome and I see the PDF well describing the tools used.--Jusjih (talk) 01:48, 10 April 2021 (UTC)

How to Browse? A Lot of Confusion for a Beginner User

From the Navigation Sidebar => Help - at the bottom of the Help page

I clicked: "Do you need assistance? Post a request!"

My request is to reduce the confusion for a beginner user to browse, and to improve the browsing experience overall.

I came looking for books about time travel. My objective is to browse by Fiction => Genre, and I expect to be able to choose "science fiction" in a list of genres, and narrow my search further to "time travel" in a subsequent sub-genre list to return a list of all the science fiction books in Wikisource that revolve around time travel. I also want to browse Science => Physics and find time travel in a list to return a list of all science books that discuss time travel, but I did not get that far.

The most impactful improvement would be an enhanced method to create, assign and search Categories. Here are some comments I had as I browsed:

In Help:Beginner's guide to navigation => Browsing

  • Browse by Authors - this is fine, and there is a Navigation Sidebar link to click for Authors - intuitive, consistent and useful.
  • Browse by Subjects - this is confusing. There are portals, and there are categories. Neither term appears in the Glossary of Terms on the Help page. Neither term appears on the Navigation Sidebar. There is "Subject Index" on the Navigation Sidebar that returns a Portals list. My first thought was, "What is a Portal?" Nothing in the portals list says Fiction, Popular Fiction, Genre, or Science Fiction. At a glance, this looked fruitless. Clicking on "Index" at the bottom exposes a hierarchy of portals, and with some exploration, there is a sub-genre for time travel with three books. This is a paltry list, and I do not know if the list just reflects a small collection of books in Wikisource, or poor use of categorization. or ability for a book to belong only to one portal that reflects the dominant sub-genre when there are potentially several that are appropriate.

In Search

  • I next tried to use the Search function for a browsing tool - I searched at the top for "science fiction", and I had three directions I could take - Portals for Science Fiction and also for Science Fiction Films, and for pages containing "science fiction". Are there any Categories for science fiction? I did not see any in the search results.
  • I used the Search function again for a narrower search of "time travel" - nothing bubbled up in the near-in results except for Wells' The Time Machine. The vast majority of results were for time, or travel, which were all unrelated to my search objective.

Portals and Categories

My presumption is that Portals are hierarchical, that a book belongs to only one portal at the bottom of the hierarchy, and that a portal may contain many books (or perhaps as few as one). I frankly do not like the Lib. of Congress classification system, but it is well described and freely reproducible, so why not, I guess. It works fine.

My assumption is that a book may have several categories. I would think that categories are analogous to "tagging" in metadata. A book should probably include categories for each child portal in its classification hierarchy, and I see that is done. It would be useful if a book had a variety of topical categories that the contributor assigns, but I don't see that is done here. That would be useful!

What is the basis for Categories? How are they chosen / assigned? What categorization is automatic if any? Is there a "pick-list" of Categories? unsigned comment by ‎Bccrowe (talk) .

Did you see the "Highlights" block on the Main page, with a line on "General literature" and a direct link to "Science fiction"? --EncycloPetey (talk) 21:26, 18 April 2021 (UTC)

Please stop tinkering with the mediawiki OCR!

The title says it all. It's dead in the water again.— Ineuw (talk) 21:11, 18 April 2021 (UTC)

Formatting of dot-separated left and right align

Hello! I'm a bit new here, and so I'm still getting used to the text formatting templates. Can someone let me know what to do in the case of a page like this? I assume it's a bad practice to hardcode the dots in (see example 1), but I do not know of a way to replicate this with a template. So, should it just be aligned to left and the page number to float right (see example 2)? Is there a more elegant way to do this?

Example 1:

Input: 1. Section Title .......... 1
Output: 1. Section Title .......... 1

Example 2:

Input: 1. Section Title {{float right|1}}
Output: 1. Section Title 1

Thanks, Tol | Talk | Contribs 01:37, 20 April 2021 (UTC)

There are currently two schools of practice here with respect to dot leaders: a) replicate them, using the various Dotted TOC templates; b) omit them and use a table format instead. Both are accepted practices. I lean to the second (e.g. Page:At the Fall of Port Arthur.djvu/15). An example of the first style is at Page:Ballantyne--The Pirate City.djvu/11. Beeswaxcandle (talk) 04:04, 20 April 2021 (UTC)
@Beeswaxcandle: Thank you! Tol | Talk | Contribs 04:06, 20 April 2021 (UTC)
IMO, if you're going to use dot-leaders, it's better to use {{TOC begin}} and {{TOC row 1-dot-1}}/{{TOC row 2dot-1}} rather than {{dotted TOC page listing}}, because the latter uses a complete table for every single row and this 1) exports badly, 2) is semantically highly suspect and 3) massively inflates the HTML output. {{TOC begin}} produces a single HTML table (though the markup within the row isn't very "tidy", but I don't think it can be in the current state of CSS). Inductiveloadtalk/contribs 06:19, 20 April 2021 (UTC)

Locked myself out of My Facebook Account

i dont have access to the phone number to get the login codes but still have access to my gmail account

That is some misunderstanding. This is Wikisource and we are not able to help you with your FB account. --Jan Kameníček (talk) 13:56, 21 April 2021 (UTC)

Template:PD-EdictGov and HK Commission of Inquiry Reports

I recently found the two Blair-Kerr reports commissioned by the Hong Kong government on corruption published in 1973, which resulted in the establishment of the Hong Kong ICAC. Are these reports OK for English Wikisource under Template:PD-EdictGov? I know both reports won't be accepted on Commons, can it be accepted here? Many thanks.廣九直通車 (talk) 14:22, 24 April 2021 (UTC)

TOC with braces and dotted cells

Please help to format TOC with braces and dotted cells: Page:Works of Thomas Carlyle - Volume 01.djvu/286. Thanks. Ratte (talk) 18:21, 24 April 2021 (UTC)

I've made a start for you by way of example. I don't usually bother with dotted cells, so haven't attempted to reproduce those. Beeswaxcandle (talk) 18:37, 24 April 2021 (UTC)
Thank you for your help! Ratte (talk) 18:56, 24 April 2021 (UTC)

DjVu: two missing pages

Index:Works of Thomas Carlyle - Volume 17.djvu: pages 214, 215 are missing. Could someone please add them to file from here or here? Thanks in advance! Ratte (talk) 19:29, 24 April 2021 (UTC)

Ratte, you don't need to add them to the file, simply uploading those two pages from the same edition separately can be suitable. Create the file, index page and just transclude them. It is not technical issue in transcluding works fromdifferent index pages to the same page. — billinghurst sDrewth 12:48, 25 April 2021 (UTC)
But this is not a fix for the source file, as the red message here dictates („Source file must be fixed before proofreading“). And the source file will remain incomplete. I have doubts about the correctness of such approach. Ratte (talk) 13:06, 25 April 2021 (UTC)
I suggested somewhere recently that languageseeker replace this file with a different source (NYPL, Robarts), after consulting you that it was okay, and indicated that I've had bad experiences with scans from the current source. CYGNIS INSIGNIS 14:07, 25 April 2021 (UTC)
@Languageseeker, @Ratte: probably best to coordinate with yourselves on what to do with these volumes. CYGNIS INSIGNIS 14:46, 25 April 2021 (UTC)
Ratte, it is an acceptable solution, that it is not a fix for the source file is a different situation. Been done on multiple occasions. That dropdown is generic, and guidance only. There are many ways to resolve issues. If you are solely focused on a fix for the file, then please drop the request into the appropriate section in WS:Sbillinghurst sDrewth 23:38, 25 April 2021 (UTC)
@Ratte:   Done
@Billinghurst: For DjVu files, both Inductiveload and myself can make these kinds of repairs fairly easily (we just both happened to be a bit busy right now). Since fixing the files in place is a far simpler solution than having a work transcluded from multiple indexes I would generally recommend trying that approach first. For PDF files that cannot be manipulated quite as easily as DjVu files the equation may fall out differently. And in either case, as you say, the requests are best put in the Repairs and moves section on the Scriptorium so the right people will notice them, and so we can keep track of them. Xover (talk) 09:37, 26 April 2021 (UTC)
There is next to zero difficulty in transcribing two different indices. Both ways work, either are functional. The fixation on perfect File: is a fixation, it has no necessity. — billinghurst sDrewth 09:51, 26 April 2021 (UTC)
Both ways work, sure; but all else being equal, fixing things at the source is generally the better approach. In particular, for most users that's going be easier and less confusing, and it won't create extra complexity in keeping track of extra indexes, files, and special transclusion rules that we will have to maintain indefinitely. So long as we have people available that are able and willing to patch files in this way, my strong recommendation is that we try that first and only fall back to multiple indexes and other workarounds when we have to. Xover (talk) 11:05, 26 April 2021 (UTC)
@Xover: you are awesome, thank you! Ratte (talk) 12:16, 26 April 2021 (UTC)

Dotted TOC line template: An additional column for author?

Could somebody show me how to add another column, for the authors of magazine articles, to the {{dotted TOC line}} template? See here: Page:Pacific Monthly volumes 9 and 10.djvu/13 -Pete (talk) 16:26, 26 April 2021 (UTC)

You likely would need to create a new template for that. --EncycloPetey (talk) 17:33, 26 April 2021 (UTC)
Ah, OK thanks. I don't have any strong preference for this particular template -- is there another way of approaching these pages that you'd recommend using existing templates? Would it be better, for instance, to render the whole page as one big table, rather than a bunch of individual templates for each line? -Pete (talk) 18:45, 26 April 2021 (UTC)
Typically, I have simply used a table for complicated ToCs like this one. If the ToC is on a single page or two pages, that might be the simplest approach. I am not familiar enough with the template alternative options to comment on those. --EncycloPetey (talk) 18:50, 26 April 2021 (UTC)

Songs and Sonnets (Coleman) table of contents

Why is the multi-page table of contents broken? I genuinely have no idea. I thought I carefully prepared the headers/footers for this and can't fix it... Can anyone help please? PseudoSkull (talk) 02:36, 30 April 2021 (UTC)

Inspecting the page I find that in the instances where it shows a "|-", it combines two <td>s into a single <tr> Hmm... PseudoSkull (talk) 02:39, 30 April 2021 (UTC)
@PseudoSkull: It's {{TOC row ragged}}: it wraps its contents in <div>…</div>, so when transcluded you end up with <tr><td><div></div><span><span class="pagenum ws-pagenum" ></span></span></td></tr>. That is, a <div>…</div> followed by a <span>…</span>. Since the div—by nature of being a block-level element—is followed by a new line, and the span—despite being otherwise empty—has line-height, you end up with a blank line separating each page of the toc.
This is one reason why I really don't recommend using any of the TOC templates, in favour of just using plain table markup. You'll run into weird edge cases there too, but they are rarer, much more obvious when they occur, and generally easier to fix. --Xover (talk) 18:28, 30 April 2021 (UTC)
@Xover: this was actually a missing new line in the template: diff. Though the ragged template is probably not ideal here since the right column isn't ragged.
I know what you mean re the templates, but on the other hand, manually formatting a table in the general case is actually quite a lot of direct formatting, once you have taken into account the text alignment, page position, wrapping, vertical alignment, padding and so on (most of which need setting on every single cell). Inductiveloadtalk/contribs 19:31, 30 April 2021 (UTC)
Yeah, I don't say that because raw wikitables are a perfect solution: it's a pragmatic far-lesser-of-two-evils call, and the upshot of having to do all that direct formatting is the direct control it gives you. --Xover (talk) 19:52, 30 April 2021 (UTC)

Proofing, Formatting, and Linking in The Pilgrim's Progress

I'm brand new to wikisource. The book that has my interest and that I have chosen as a first project is the original Pilgrim's Progress published in the 1600s. I have done several pages, but would like an experienced eye to look over what I have done to make sure I am doing it right before I get too far into it.

Specifically,

  1. Am I using formatting correctly (for centered text and major font size changes)?
  2. Is it appropriate to link to the wikisource KJV bible as I have done in the title page and for the footnote on page 1?
  3. This footnote was originally a sidenote. Was I wrong in changing the format, and if so, how should it be encoded?
  4. Are my comments on the relative discussion pages appropriate?

--Bountonw (talk) 01:37, 2 May 2021 (UTC)

A few notes:
  • There's a few cases where text should be centered but isn't, like on Page 8.
  • The w:long S should be kept in the transcription, using template:ls.
  • The first letter on page 13 should use template:di. On page 21, it should also use an image (see the template's documentation).
  • Bible links can be done with Template:Bibleverse.
Glad to see a new editor! Mcrsftdog (talk) 16:50, 2 May 2021 (UTC)

Welcome! On point four, comments on discussion pages often go unnoticed, central discussion pages like this one draw more eyes. I would replace the 'long s' with an 's', because given a choice most readers would prefer a clean transcript. However it is done, using the template is preferred if that character is transcribed, but that is so the labour of the proofreader to display them can be avoided. CYGNIS INSIGNIS 18:25, 2 May 2021 (UTC)

I've had a look now. On your query 3, I think what you have done is appropriate, in fact an improvement on the sidenotes they replace. On query 2 the KJV is a reasonable assumption for the quote, but another caution: if we get more than one edition of that Authorized Version then linking the relevant part of the text will present a difficulty. 18:35, 2 May 2021 (UTC)

TOC linking for a single line with multiple parts

Does anyone have a good idea for how to link the last two entries at Lippincott's Monthly Magazine/Volume 46 (namely, "Book-Talk" and "New Books"). Both "sections" actually have 6 parts (one per issue). I'm planning to omit the "issue" tier of the naming structure, since the TOC isn't done like that, so the links are probably going to be [[Lippincott's Monthly Magazine/Volume 46/Book-Talk (1)], etc.

My problem is: where does one physically put the links? Linking the page numbers is pretty non-standard and non-discoverable since people probably assume that would lead to the Page NS.

This is an issue For Lippincott's in general, but lots of periodicals do the same and combine recurring segments into a single TOC Line when they have a per-volume TOC. Inductiveloadtalk/contribs 18:18, 4 May 2021 (UTC)

Also, there were per-issue TOCs at the time (at the BL, for example), but apparently the covers were either removed for binding or the bound volumes never had them. Inductiveloadtalk/contribs 20:37, 4 May 2021 (UTC)

When I was thinking about transcluding similar series in The New Monthly I was thinking about linking to something like Book-Talk (Lippincott) from the Main TOC which then can link to the (1), (2) via an AUX-TOC to have them all together and linked, possibly with a volume or year anchor if there are a large number of them. MarkLSteadman (talk) 22:29, 4 May 2021 (UTC)
I probably would put them in a Portal (assuming they're in all issues, there could be ~600 of them in the first series). Going via another page would break basic export expectations, so it'd at least need an {{hidden export TOC}} to compensate.
Another option I though of was to (re)construct the in-order issue TOCs as an AuxTOC or similar. Inductiveloadtalk/contribs 09:26, 5 May 2021 (UTC)
@Inductiveload: This is why PSM is like it is. The other option is just transclude them all together into one section per volume, the only issue that causes is that you can only have one _/SOURCE\_ tab and the page numbering will not flow, otherwise it works well blending things. — billinghurst sDrewth 12:46, 5 May 2021 (UTC)

Moving existing pages after an index has been renamed

Index:Narrative of Henry Box Brown.pdf has been renamed, but it has a number of pages, which appear to have had some proofreading done, that have not been moved to this new index. It starts with Page:Narrative of Henry Box Brown - who escaped from slavery enclosed in a box three feet long and two wide and two and a half high (IA narrativeofhenry00brow).pdf/10 and goes up to page 94. Could someone with the ability to batch a move to the new index name do so? Thanks. —CalendulaAsteraceae (talk) 09:03, 5 May 2021 (UTC)

@CalendulaAsteraceae:   Done (it was actually p4–94). Inductiveloadtalk/contribs 09:20, 5 May 2021 (UTC)
@CalendulaAsteraceae: Please just ask an admin to move in the Index/Page namespaces. Far easier to just do it all and less chance of mistakes. — billinghurst sDrewth 12:41, 5 May 2021 (UTC)
@Billinghurst: Will do! Where's the appropriate place to do that? — CalendulaAsteraceae (talk) 09:35, 9 May 2021 (UTC)
Any of those public spaces where admins are sitting, so here, or WS:S or WS:AN. Wherever you feel most comfortable, we aren't fussed, just best to make it clear with a good subject line. Or use {{helpme}} on the index talk:. — billinghurst sDrewth 10:13, 9 May 2021 (UTC)

Rename source file

When uploading the Declaration of Change of Titles (General Adaptation) Notice 1997 as File:Bc540715cb2-2570-3c-scan.pdf, I forget to rename the source file name. Can someone help to change File:Bc540715cb2-2570-3c-scan.pdf into File:Declaration of Change of Titles (General Adaptation) Notice 1997.pdf? Many thanks.廣九直通車 (talk) 05:21, 11 May 2021 (UTC)

@廣九直通車:   Done. Xover (talk) 05:46, 11 May 2021 (UTC)

Default layout, can't get it working.

I've seen the usage of the {{default layout}} template on Immigration Restriction Act 1901 which neatly places text in the middle of the page and allows margin text to work.

I've tried replicating that using this page in my Sandbox, but it completely ignores the default layout template, placing the margin text over on the left sidebar. Does anyone know if I'm missing anything? Supertrinko (talk) 21:45, 11 May 2021 (UTC)

IIRC the layouts only work in the Mainspace: and don't apply in User: space. The quick way to tell if a namespace allows them is to look for a "display options" box on the left hand bar. For me it appears under the "navigation" box. Beeswaxcandle (talk) 22:07, 11 May 2021 (UTC)
Oh that's helpful to know, thanks very much! Yup, I only see the display options field under the mainspace. Supertrinko (talk) 22:55, 11 May 2021 (UTC)

Disambiguation for John Ward

This doesn’t seem to be working: {{similar|Author:John Ward}}. It redirects to only one of the John Ward’s. I’m not sure how it is supposed to work? Cheers, Zoeannl (talk) 10:06, 21 April 2021 (UTC)

  Done Author:John Ward had a redirect on it. — billinghurst sDrewth 12:46, 21 April 2021 (UTC)
@Billinghurst: Thanks for this. I have another disambiguation to do. I’ve figured the format now but how do I redirect links to Author:Richard Jones which should be Author:Richard Jones (1564-1602). Cheers, Zoeannl (talk) 22:58, 30 April 2021 (UTC)
Hi. How do you move?—tab or drop down at the top of the page). Or how do you fix and find the links that point to the page?—Special:WhatLinksHere/Author:Richard Jones. Call me as you need, generally I will teach rather than do. — billinghurst sDrewth 23:42, 30 April 2021 (UTC)
@Billinghurst: I’ve replaced the links I could find. The wikidata page d:Q18672102 should be updated? and some pages Special:WhatLinksHere/Author:Richard Jones links to, I couldn’t see a link to Author:Richard Jones? Cheers, Zoeannl (talk) 05:24, 13 May 2021 (UTC)
Hi Zoeannl. We move existing author pages, rather than create new pages, and that will update the wikilinks at Wikidata, and maintain the history on the page both public-facing and in the page metadata. Then we would edit/replace the redirect with a disambiguation page. I have done that. I will undertake the disambiguation of the links next. — billinghurst sDrewth 05:42, 13 May 2021 (UTC)
Oh you have already done them, there is just the legacy of the transclusion, and they will catch up themselves in time as the cache update gets to them, or until they are push purged. — billinghurst sDrewth 05:48, 13 May 2021 (UTC)

Pardon, this has probably come up before. I attempted to use the IA upload tool, which indicated that it would generate a djvu from jp2, but the result was not evident to me. So I directly uploaded the pdf from InternetArchive, which was a scan of a copy in an Indian library, but no text layer appears. There is a djvu layer text file and .xml at IA, I checked the former to see that the quality of ocr was okay. CYGNIS INSIGNIS 12:40, 14 May 2021 (UTC)

Looks like you uploaded the original PDF from the DLI which doesn't have a text layer. The IA-generated PDF is ..._text.pdf. Note that the IA PDF is JP2-encoded so takes aeons to render compared to the DLI (which is CCITT encoded).
The IA-Upload failed at the Commons upload step due to phab:T268400 (logs), but the (large at 65MB) DJVU did generate: https://ia-upload.toolforge.org/dli.ernet.476036.djvu. Inductiveloadtalk/contribs 13:02, 14 May 2021 (UTC)

Using modern editions of medieval texts

Can I use modern editions of medieval texts (Old English period 650-1100 AD so very much within the public domain!) without infringing copyright. Obviously the editors of these texts have done painstaking work to digitise the Manuscripts so I don't know whether the modern versions are now their copyright?

Also the Old English section of this wiki is really quite inconsistent. If anyone could point me to a WikiProject page or any key editors of the OE section that would be appreciated.

There is an entire corpus of Open Source Anglo-Saxon poetry out there (https://www.sacred-texts.com/neu/ascp/) which can be copied onto Wikisource and I have access to a number of modern editions of OE prose texts which may or may not be public domain worthy.

Edit / P.S. different editors have different opinions on how to read certain parts of certain manuscripts so to make Wikisource a sort of palimpsest for the various readings would be useful.

Rho9998 (talk) 12:55, 14 May 2021 (UTC)

@Rho9998: I think sacred-texts.com have just copied from others, so they're not a great source. For example, it looks like the Exeter Book was cribbed from a 1995 online version which we have some of: The Exeter Book (Jebson). In this case, all the modern content is almost certainly copyright.
In general, it's preferred if the texts have a verifiable source, which is often a scan of a book. For example https://archive.org/details/exeterbookanthol01goll.
I don't really think there is such a WikiProject (or if there is, it's kept very quiet!). A major problem with OE/Saxon works is they are very often parallel texts and we have, up to now, never come up with a really satisfactory way to handle such texts on the web (it's easier in books where the recto/verso page layout is obvious).
In general, digitisation doesn't create a new copyright (this is called w:Sweat of the brow doctrine, which generally doesn't hold water in the US) Inductiveloadtalk/contribs 13:30, 14 May 2021 (UTC)

@Inductiveload: Thank you for your answer. That answers a few questions and inspires a few. Firstly does that mean that the modern English parts of Wikisource's copy of Jebson's edition of the Exeter Book breach copyright? Secondly, isn't it fine to create a Wikibook for each Old English text (e.g. the Beowulf MS is digitised and available online (verifiable source =British Library Digitised Manuscripts)), rather than for a book for each modern edition? And if that's okay would the contributor(s) have to make decisions about how to read the MS' handwriting etc. themselves or could they copy an edition's readings, then citing the editor somewhere? Feel free to reply on my talk page if it would clog up this page. Rho9998 (talk) 14:18, 14 May 2021 (UTC)

@Rho9998: For the first part, yes, any modern English parts with some level of creative input are indeed copyright. I'm not sure how much modern English there actually is in the original. The titles probably don't qualify for copyright on their own if that's what you are wondering.
As long as the text is in scope for Wikisource you can add it. Functionally, that means one of:
  • From before 1926
  • From 1926 to now, and actually published (mostly to avoid copy-pasting of blogs and things) and either public domain or freely licensed.
    • Some non-published works are OK if they're some kind of "historically interesting", and there's quite some leeway for that, especially if the contributed text is of decent quality rather than a drive-by copydump.
  • Your own translation of one of these
So, you certainly can create a Wikisource "version", as we'd term it, for each MS.
I'm not really sure about the MS handwriting thing. If the reading is unambiguous, then I suppose there is no creative input in it and it doesn't get its own copyright. But, yes, it would still be a good idea to say where you got the reading from to help others in future. If there is creative input, it probably gets a copyright of its own (and then a grey area in the middle where you can argue over the level of input). This is probably worth asking at WS:CV with a specific example of an MS and a modern text. Inductiveloadtalk/contribs 14:51, 14 May 2021 (UTC)
Note that transcribing Beowulf from the manuscript is not going to get you the full text; it got burned in a fire and has been flaking a bit since then, so the best editions are based on the first transcription, modified by the known problems of that transcription. I'd think there's better editions pre-1926 of almost everything in Old English.--Prosfilaes (talk) 22:30, 18 May 2021 (UTC)

Dotted line lists

Hi there,

Wanting some guidance on how best to present the list given in Some Account of New Zealand/Chapter 11, I've kinda used the {{dotted TOC page listing}} template to make it work, however this looks weird with some lines that get split into multiple lines if there's a dash.

Is there a better template or way to present lists in this format? Any help would be appreciated. Supertrinko (talk) 09:33, 18 May 2021 (UTC)

If you wrap the contents of each line with {{nobr}} then it won’t break (and you won’t have to replace all the spaces with &nbsp;). — Dcsohl (talk) 15:17, 18 May 2021 (UTC)
Perfect, I see it's been redirected to {{nowrap}}, which does the same job, I've used that. Supertrinko (talk) 23:42, 18 May 2021 (UTC)
@Supertrinko: [Aside from the fact I am not a fan of the dot leaders.] I would be more inclined to wrap it in a centred block and set a max-width (set in em). No one really wants to read a two column spanning table at 100% width on a computer. — billinghurst sDrewth 03:18, 19 May 2021 (UTC)

Help to publish a book from Internet archive to Wikisource.

Can we publish the given book from internet archive to Wikisource. Book link 👇 https://archive.org/details/wonderthatwasind00alba/mode/2upTewariKamal (talk) 12:41, 19 May 2021 (UTC)

@TewariKamal: Probably not, that work appears to be in copyright in the US (published 1968, ©1967 with a copyright notice -- 1964 through 1977). I am surprised that IA has it uploaded, not certain how they are getting away with it, not certain what release or loophole they have found. — billinghurst sDrewth 12:57, 19 May 2021 (UTC)
I see a 1963 edition published in India (Arthur Llewellyn Basham, 1914-1986, UK historian) and complying with copyright, so that would make it 2068 in US or (based on w:Copyright law of India) it may be 60+PMT which is still 2047 outside of US, so still no, and I still don't see how IA is hosting that work either. — billinghurst sDrewth 13:06, 19 May 2021 (UTC)
The copyright is probably not being enforced (though that is no excuse for anyone to be hosting it, including us). PseudoSkull (talk) 05:39, 23 May 2021 (UTC)

Talking about the part that says "Abel X*his mark* Goodfriend". How do I deal with this type of subtitle/note at Wikisource, where there is xx-smaller text above and below the letter? Do I treat it as a regular footnote (with the <ref> tag)? Or do I do something else with it?

One thought I'll volunteer is that upon selecting that text, I would like at least for "X" to take precedent over the word "his". So when you select the text, I hope it doesn't say "his X mark", but rather "X his mark".

Also I've never seen any kind of subtitle in a book like this in my life, what is it even called? Is it some kind of Canadian typographical thing? (The author is Canadian) PseudoSkull (talk) 23:03, 19 May 2021 (UTC)

@PseudoSkull: One way is to allow flexbox to the rescue:

Example

Abel <span style="display:inline-flex; flex-direction:column; align-items: center; vertical-align: middle; line-height:1.1;">
<span style="order:2;">X</span>
<span style="font-size:80%; order:1;">his</span>
<span style="font-size:80%; order:3;">mark</span>
</span> Goodfriend

Abel X his mark Goodfriend

Copy pastes as "X his mark" (and will probably render as such on export, since not many e-readers handle flexbox). Vertical alignment is a hair off where you might want it, but all in all not bad, I think. Inductiveloadtalk/contribs 23:56, 19 May 2021 (UTC)
@PseudoSkull: I would have put it in a tooltip or footnote, instead of trying to reproduce a typographic effect that was awkward even on paper, in this specific case. Xover (talk) 05:09, 20 May 2021 (UTC)

Download pdf on Erotica book isn't downloading all book pages

Hi everyone! I hope you're having a nice day. If anyone has an interest in this beautiful book - the Erotica book "download" button (to download a pdf) doesn't download all the pages, but only 11 pages, not any content pages. Have a nice day. --The Eloquent Peasant (talk) 13:33, 25 May 2021 (UTC)

@The Eloquent Peasant: the use of {{AuxTOC}} overrides the main TOC. One solution is to wrap the main TOC in {{export TOC}}. There's a bit more detail (that I just added this case to) at Help:Preparing_for_export#Listing_pages_for_export. Inductiveloadtalk/contribs 13:47, 25 May 2021 (UTC)
@Inductiveload: You fixed it! Thank you! I love that book! --The Eloquent Peasant (talk) 14:00, 25 May 2021 (UTC)

Please delete Index:Aladdin-1890s.djvu and all of the pages. I found a much better scan, see Index:Aladdin and the Wonderful Lamp-1875.pdf. Thank you!--RaboKarbakian (talk) 19:44, 26 May 2021 (UTC)

A problem with table

Please help with «235|-» (The Works of Thomas Carlyle/Volume 2, table of contents). Ratte (talk) 16:31, 31 May 2021 (UTC)

@Ratte: You needed a {{nopt}} at the top to ensure the markup for the second page really starts on a new line. Help:Page_breaks#Tables_across_page_breaks has a bit more background. Inductiveloadtalk/contribs 16:37, 31 May 2021 (UTC)
Thank you! Ratte (talk) 16:45, 31 May 2021 (UTC)

Parser functions...

Template:LR sidenote/sandbox & Template:LR sidenote/sandbox/dummy

The test cases are here:-

Template:LR sidenote/testcases

Having had to abandon an entire morning's effort because of the issue shown in the above I'm not best pleased.

Can someone PLEASE explain slowly, why in the testcases the Expected and Actual results do not match, when all that's being dealt with in the relevant template parameters is simple text or numerical values?

The sandbox is a very simplified version of some template logic I was attempting to use elsewhere to make some other templates considerably more powerful, and I very nearly had the approach working. ShakespeareFan00 (talk) 17:43, 31 May 2021 (UTC)

issue is suspected to be parser function related and so I've reverted the sandbox to be a mirror of the main template.ShakespeareFan00 (talk) 18:46, 31 May 2021 (UTC)

Logic simplification?

As part of attempts to resolve an issue elsewhere, I wrote: Template:Right sidenote/sandbox/CSSline with a view to providing a standardised way of generating a CSS attribute, if and only if a non-standard value was used.

I provide a partial-specification and test-cases here: - Template:Right sidenote/sandbox/CSSline

I then used it to re-implement portions of {{Right sidenote}} in a sandbox with the goal of moving the default behaviour to a CSS class, only generating inline styles, when a non-standard value was provided, and having a situation where a template calling {{right sidenote}} did not need to know the default value for the attributes.

The relevant implementations where CSSline is called from being:- {{Right sidenote/sandbox}}, {{LR sidenote/sandbox}}, {{RL sidenote/sandbox}}.

I have concerns that the implementation here is overly complex (it uses 3 parser functions) and would appreciate another contributor using the testcases and partial-specification provided, advising on how these could be eliminated whilst retaining the robust handling this template was intended to provide. ShakespeareFan00 (talk) 05:34, 2 June 2021 (UTC)

All caps plus different size

Hello, I'm on this page. And, I am not sure how to do the "Synopsis of Events between the Battle..." line. Kindly suggest. Lightbluerain (Talk | contribs) 03:37, 3 June 2021 (UTC)

@Lightbluerain: The text in question is set in a small-caps typeface, and it looks like it's centered too. For this we have the {{c}} and {{sc}} templates. The construct above it is a "horizontal rule", for which we use the {{rule}} template. Xover (talk) 04:02, 3 June 2021 (UTC)
@Xover:, Thanks a lot. Lightbluerain (Talk | contribs) 15:44, 3 June 2021 (UTC)

Some problems about transcribing

I'm trying to work on Dictionary of the Foochow Dialect, but I am not sure about the format. Take the page Page:Dictionary of the Foochow Dialect.pdf/1902 as example, is that acceptable? How should I improve the format? --TongcyDai (talk) 12:50, 3 June 2021 (UTC)

@TongcyDai: I have made you {{DFD index}}, which should help.
I suggest to put each "section" as a new list, otherwise the columns will be extremely long and a reader will have to scroll up and down the entire index at each column break. Inductiveloadtalk/contribs 14:28, 3 June 2021 (UTC)
Thank you very much!!--TongcyDai (talk) 14:40, 3 June 2021 (UTC)
@Inductiveload: For Page:Dictionary_of_the_Foochow_Dialect.pdf/1901, the top part is divided into 9 parts, but we don't have Template:Rh/9, how should we deal with that? --TongcyDai (talk) 14:42, 3 June 2021 (UTC)
@TongcyDai: We have it now! Inductiveloadtalk/contribs 14:49, 3 June 2021 (UTC)
@Inductiveload: Really appreciate it! And I just found that the first character of the radical is slightly more to the left than other characters (please see Page:Dictionary of the Foochow Dialect.pdf/1901). Is the template able to show the difference? --TongcyDai (talk) 14:54, 3 June 2021 (UTC)
@TongcyDai: Yep, set n=0 and if should work now. Inductiveloadtalk/contribs 15:05, 3 June 2021 (UTC)
@Inductiveload: Thank you!!! Can you please help make a template for the main part of the dictionary, like, Page:Dictionary of the Foochow Dialect.pdf/177? --TongcyDai (talk) 15:10, 3 June 2021 (UTC)
@TongcyDai: I see you found it, but for the record: {{DFD entry}} (I used 一二 for the doc's chars, feel free to fix!. Fundamentally it is a table, as being too smart with CSS will kill the export possibilities. I suggest a new table for each "pronunciation" (i.e. end the tables where the gaps are). Then, when you transclude, transclude each table to its own page, which will make navigation easier for readers. With >1000 pages, even one page per letter gets very, very long. Inductiveloadtalk/contribs 15:59, 3 June 2021 (UTC)
@Inductiveload: Thank you soooo much!! I've just finished transcluding Page:Dictionary of the Foochow Dialect.pdf/29, hopefully I'm doing right! --TongcyDai (talk) 17:54, 3 June 2021 (UTC)

Chronological Table and Index of the Statutes/Chronological Table

Index:Chronological Table and Index of the Statutes.djvu contains a lengthy table representing various statutes, the years they were passed and any noted repeals up to the date it was published.

It would not be wise to transclude this as a very large single table, so it was split by Monarch, and for later portions by Regnal Year. See Chronological Table and Index of the Statutes/Chronological Table

Each of the table "Page: s" have a header, this remains the same across the whole table.

The combined table for each Monarch/Regnal year, should have the same header as the individual Page:s.

The current transclusion of the combined tables uses {{Page}} a template (which is deprecated with good reason) and the transclusion fails to fully respect dynamic layouts or Indexstyles, as the transclusion is direct. This is undesirable.

Ideally what I would like to do is to continue to using the existing "sections" via a <pages> tag instead.

What would be a "recommended" way of placing the header for each combined table, because placing a <pages> inside table syntax to achieve this is not advised? ShakespeareFan00 (talk) 16:57, 5 June 2021 (UTC)

Request for a script... Section tag Linter...

Wikisource uses labelled "sections" to aid transclusion of a portion or portions of a page.

Would it be possible to have some kind of script/linter that checks if those tags are matched up or if duplicated naming exists within a page? ShakespeareFan00 (talk) 10:48, 6 June 2021 (UTC)

Your request doesn't make sense. Sections are free labelled, so how would you make any comparison? Plus you can have multiple sections of the same name—they are independent. There is no logic to follow. Even if we did follow some logic, there is no guarantee that will even be text in the section. This where the user needs to have vigilance and check their work as they go. We do run checks on overall components of works (link on each Index: page) to see if pages are missing.

It also why I don't do s1, s2, s3, ... labelling and I match the labelling to the subpagename, it becomes very obvious immediately if things are not correct. — billinghurst sDrewth 12:15, 6 June 2021 (UTC)

If sections are generally free labelled, then I will clarify what I am looking for as "correct" with respect to the use case I had in mind, is a pair:-
<section begin\=\"([^"]+)\" \/>
<section end\=\"([^"]+)\" \/>

where $1 and $2, the inner groups matched for the section name) are the same for a 'begin' and 'end' tag pair encountered sequentially.

There may be any kind of text apart from other section tags in between a 'begin' and 'end' tag.

The following would be 'error' conditions:-

  • 2 'begin' tags placed without an intervening 'end' tag (with a name equivalent to the first begin.
  • 'begin' tag does not match next 'end' tag encountered sequentially.
  • 'end' tag encountered without corresponding 'begin' tag, or 'end' tag encountered before corresponding 'end' tag.

If such a script is impossible then fair enough.

ShakespeareFan00 (talk) 07:10, 7 June 2021 (UTC)

The following contains a complex index statutes.

It would be appreciated if someone could read User:ShakespeareFan00/Statute_index and comment on which of the 3 example layouts I've implemented would be the best option.

I have 3 example layouts in my userspace (none of them is ideal.) and I would appreciate the views of other contributors BEFORE I try to implement a consistent approach across the entire index.

The pages are:

If the intention advice is to provide a a basic list and not worry about matching the formatting in the scan, then I would prefer to use Example3 for simplicity, even though it doesn't match entirely with the nominal spec I tried to write. ShakespeareFan00 (talk) 06:52, 7 June 2021 (UTC)

Edits not patrolled yet

Hello, When I check my watchlist here, I see a red exclamatory sign before my edits. The watchlist says that that means that my edits are not patrolled yet. I don't see such things on Wikipedia. Does it mean that every editor here has to be in the Auto-patrolled user group? Or, that's only because I am new here and that red exclamatory sign would go after a particular number of edits made? Lightbluerain (Talk | contribs) 03:21, 4 June 2021 (UTC)

(See WS:APD) Autopatrolled status is done differently here at enWS than at other places. It is given when a user demonstrates that they have a good knowledge/understanding of our policies and style guide. There is no minimum number of edits. We also allow any user to patrol an edit, rather than restricting such to a special group of patrollers. Beeswaxcandle (talk) 04:52, 4 June 2021 (UTC)
Is patrolling actually paid attention to? If I filter recent changes to unpatrolled changes only, then I see there have been 50 unpatrolled changes made in the last 2 hours... and if I switch it to manually-patrolled changes only, then I can see that only 50 changes in the last week have been patrolled. It doesn’t seem like being unpatrolled means much. — Dcsohl (talk) 21:41, 8 June 2021 (UTC)

Diary of the times of Charles II Vol. I.

I have been validating proofed pages on Diary of the times of Charles II Vol. I, and have come across a page I believe might be in error. However, I don't know enough about coding here to correct this. Please see the top of this page: Page:Diary of the times of Charles II Vol. I.djvu/187. Maile66 (talk) 13:24, 14 June 2021 (UTC)

@Maile66: I think that is fixed now, but ping @Chrisguise: is this is going to work? CYGNIS INSIGNIS 13:45, 14 June 2021 (UTC)
Yes, it looks correct. Thanks. Maile66 (talk) 13:48, 14 June 2021 (UTC)
Cool, now I want to now why it happened, I think the ref system threw that syntax out. CYGNIS INSIGNIS 13:51, 14 June 2021 (UTC)
This was because ## s1 ## was not on its own line in this edit. Inductiveloadtalk/contribs 14:44, 14 June 2021 (UTC)
That should work, however, see testing of that [11] CYGNIS INSIGNIS 17:50, 14 June 2021 (UTC)

Missing pages require placeholders

Can someone tell me how I can insert two placeholder pages into this file?— Ineuw (talk) 20:27, 15 June 2021 (UTC)

@Ineuw: If you use File:Generic placeholder page.djvu, you can use djvm in "insert" mode to splice it into the file:
djvm -i "Uncle...djvu" "Generic placeholder page.djvu" 137
djvm -i "Uncle...djvu" "Generic placeholder page.djvu" 138
Then upload over the old file and adjust the pagelist. Inductiveloadtalk/contribs 20:35, 15 June 2021 (UTC)
Thanks for the reminder. I thought that it's done with some online SQL or Python voodoo. Offline Djvu is not a problem. And my blank page insert is named "blank.djvu". :-))))) — Ineuw (talk) 20:59, 15 June 2021 (UTC)
@Inductiveload: I found a replacement file which includes the missing text it's a single leaf with the two pages. Downloading it now to double check it. It is also available as an 1896, 2 volume edition of 636 pages, published by Houghton & Mifflin which seems to be a clean scan. — Ineuw (talk) 04:13, 17 June 2021 (UTC)

Transcluding a header at the top of a page

Hello! I'm having trouble transcluding a header at The Complete Lojban Language (2016)/Chapter 2. Section 2.8 (you can search for "2.8 The basic structure of longer utterances") should display as a header, but it initially displayed as follows:

such as le selbri [ku] (see Section 2.10 (p. 23)). === 2.8 The basic structure of longer utterances ===

So, I realised that I needed to add a {{nop}} to the previous page. I did so, but not it just looks like this:

such as le selbri [ku] (see Section 2.10 (p. 23)).

=== 2.8 The basic structure of longer utterances ===

How do I get the header to show up properly? Thank you! (Please ping me on reply.) Tol | talk | contribs 05:07, 18 June 2021 (UTC)

@Tol: You needed the {{nop}} at the start of the second page. Because the pages are glued together, what you actually had was {{nop}}===2.8....===. The equals has to be on a new line for MediaWiki to see it as a heading. Inductiveloadtalk/contribs 05:16, 18 June 2021 (UTC)
@Tol: Also note that while heading markup can be made to work, for this and various other reasons Wikisource does not use plain wikimarkup heading syntax in the works we reproduce. For the headings in this work we'd use something like {{xl|'''2.8 The basic structure …'''}}. Xover (talk) 05:24, 18 June 2021 (UTC)
@Inductiveload: Thank you; that makes sense. @Xover: I see; but then how would one link to the section headings? Tol | talk | contribs 16:47, 18 June 2021 (UTC)
@Tol: {{anchor}} and {{anchor+}}. We don't preemptively create link targets for every heading. Xover (talk) 17:03, 18 June 2021 (UTC)
@Xover: Ah; thanks. I think I still prefer headings, as I won't have to go back and add an anchor each time I want to link to one, but I'll try using that format in other works. Tol | talk | contribs 17:15, 18 June 2021 (UTC)
@Tol: the page numbers can be linked, without the need for wiki headings and throwing out anchors, a good enough solution that uses the works own structure. CYGNIS INSIGNIS 00:10, 19 June 2021 (UTC)
@Cygnis insignis: How would that be done? I could link directly to the page (in the Page namespace), but I don't know how I would link to a transcluded page. Tol | talk | contribs 01:21, 19 June 2021 (UTC)
The_Complete_Lojban_Language_(2016)/Chapter_2#21 will link to the number (small, blue, on the left) of the transcluded page. Near enough? CYGNIS INSIGNIS 01:40, 19 June 2021 (UTC)
@Tol: What’s the licensing on this work? I don’t see any indication in the scan or Index: of its license. — Dcsohl (talk)
(contribs)
02:53, 19 June 2021 (UTC)
@Dcsohl: See the banner at the bottom of the mainspace page. (Or, see Section 1.8). Tol | talk | contribs 04:06, 19 June 2021 (UTC)
@Tol: Thanks - knowing what I know of Lojban, I figured the license was good, I just wanted to be sure it was there, and I just missed it. — Dcsohl (talk)
(contribs)
14:36, 19 June 2021 (UTC)
You're welcome! Tol | talk | contribs 15:48, 19 June 2021 (UTC)

Check a sci-fi story import

Hello, I did a copy/paste import of a CC-By-SA licensed short story. Can someone please look at this and tell me if something is incorrect or out of place?

Also, do digital imports go through proofreading like OCR texts, or is this just good now? Thanks. Blue Rasberry (talk) 22:48, 19 June 2021 (UTC)

@Bluerasberry: Digital files do not need to go through the proofread process, though we would fall back to sourcing per {{textinfo}} on the talk page. Also note that there is a specific d:Wikidata:Badge to mark digital copies, that flows through to enWS (done it) and you will see it mentioned at Help:Text status. — billinghurst sDrewth 01:05, 20 June 2021 (UTC)
@Billinghurst: Thanks, I have never added a badge to anything, but now I see how that works. Yes, I see it in Wikidata at Inside the Clock Tower (Q107297325), and see that badges go there. Blue Rasberry (talk) 01:13, 21 June 2021 (UTC)

Text between section tags is not displayed

The sectioned text between the end tags of this page do not transclude to the end of this page This issue exists at the end of sectioned chapters, but the sectioned beginning is always good. Can someone please check what I am missing? — Ineuw (talk) 13:13, 25 June 2021 (UTC)

@Ineuw: The section had two <section end=E209 /> tags, when it should be one "begin" and one "end". Inductiveloadtalk/contribs 14:13, 25 June 2021 (UTC)
Many thanks again.— Ineuw (talk) 16:17, 25 June 2021 (UTC)

Index:Buchan - The Thirty-Nine Steps

Index:Buchan - The Thirty-Nine Steps (Grosset Dunlap, 1915).djvu - needs a small correction, and I don't know how to do it. I validated the individual Contents chapter heading pages. However, one of the pages is in error on the Contents list. Chapter X actually links to Page 202, but on the right-hand Contents listing it says "200". Thanks for your help. Maile66 (talk) 03:24, 29 June 2021 (UTC)

@Maile66: that was noticed and corrected in the page link, it seems, a sic template usually implies that a typo should ignored. The last index I did had the same problem, I do nothing when it happens. CYGNIS INSIGNIS 15:03, 29 June 2021 (UTC)
Good to know. Thanks. Maile66 (talk) 17:29, 29 June 2021 (UTC)

Cross-page pseudo-table

On this and the following two pages, there is a sort of table, with very complicated formatting. Would it be better to represent it as an image? I’ve extracted some text on the bottom of the pages, which would work with or without a table, but I am not sure about the rest of the text. TE(æ)A,ea. (talk) 00:36, 30 June 2021 (UTC)

I would do it as an image. Trying to do them as tables usually ends up with a whole lot of time and effort for large amount of compromise. I would ask important is the text, if highly relevant for search engines, then look for a way to integrate the text. — billinghurst sDrewth 14:13, 30 June 2021 (UTC)

Biographical query...

Index talk:Medicine and the church; being a series of studies on the relationship between the practice of medicine and the church's ministry to the sick (IA medicinechurchbe00rhodiala).pdf

The author concerned being :- A. W. Robinson, D.D., The author I linked provisionally is Author:Arthur William Robinson, but I can't at present confirm it's the same person as listed in the ToC as "Arthur W. Robinson, D.D., Vicar of All Hallows Barking, Examining Chaplain to the Bishop of London, and Rural Dean of the East City of London.". Is anyone here able to confirm I have the correct person? ShakespeareFan00 (talk) 19:32, 30 June 2021 (UTC)

An entry here though : http://www.airgale.com.au/robinson/d5.htm looks like very strong evidence though. ShakespeareFan00 (talk) 19:41, 30 June 2021 (UTC)
The other name I can't pin down is Ellis Roberts. ShakespeareFan00 (talk) 19:36, 30 June 2021 (UTC)

How to deal with lines/dotted lines?

How to properly format lines and dotted lines in indices, like this page or pages 14-15 of this file? In the former case you simply have the page ended up like this, while in cases like the latter one, the solid lines are not transcluded.廣九直通車 (talk) 09:07, 28 June 2021 (UTC)

Some people want to put in dot leaders, I don't bother. Take your choice. How does it affect the readability of the work? — billinghurst sDrewth 14:18, 30 June 2021 (UTC)
I mean, if the transcluded dots are left intact, they'll soon pop out of the page borders like this case, and if the lines are not transcluded, I'm also not sure how long I should extend the lines, such that they are kept in the page borders. That's the problem.廣九直通車 (talk) 05:48, 2 July 2021 (UTC)

Automatic new line

Hello, why does the OCR text give an automatic new line as in here (see in edit source)? This makes it count like a new paragraph which it is not. Is there any easier way to get rid of those? Or, we have to do that manually? Lightbluerain (Talk | contribs) 03:52, 6 July 2021 (UTC)

It just happens randomly. All non-pragraph ending line breaks should be removed anyway. They're only an artefact of the printing process. Beeswaxcandle (talk) 04:41, 6 July 2021 (UTC)
@Lightbluerain: I've seen this before, and actually suspect it's a Mediawiki bug. The actual source of that part of the page is "well-walled town; and the great</p><p>object of the besiegers"... it's like there is an invisible zero-width space or something, but even deleting and retyping the section of text doesn't fix it.
What does fix it, though, is just deleting all the newlines from the last paragraph (like you do when proofreading anyhow). If the last paragraph is 'all on the same line,' it's not an issue. Jarnsax (talk) 04:45, 6 July 2021 (UTC)
Alright, thanks. Lightbluerain (Talk | contribs) 06:42, 6 July 2021 (UTC)

Need inputs

Hello, did I format this page correctly? Kindly give inputs especially on the last part the footnote part. Lightbluerain (Talk | contribs) 19:20, 24 June 2021 (UTC)

@Lightbluerain: Hi! welcome and thanks for the edits! Some notes:
  • The first word shouldn't be capitalised because it continues the sentence from the previous page
  • Format references using <ref>The reference content</ref>, where the asterisk or dagger appears in the text. See H:REF for more. You do not need to (and actually should not) use the original marker (* or †), the automatic sequential numbering is better for collecting all the references at the end of the final page.
I can make that edit for you as a demo, or you can do it, let me know which you prefer.
Other than that, it looks sensible. Wikisource has a very steep learning curve, so 1) you are doing really well and 2) if you're not sure, that is totally normal and we'll be happy to help. Just ask, here, or in IRC (link at the top of the page). Inductiveloadtalk/contribs 19:41, 24 June 2021 (UTC)
@Inductiveload:, do you mean that even if the original text has * or similar markers for footnotes, Wikisource is supposed to use the numbering marker only ? If so, how can we (Wikisource people) say that we "digitized the original text", while, in reality, we made some changes for our own purposes? Lightbluerain (Talk | contribs) 11:38, 25 June 2021 (UTC)
And, thanks for the first point. I, actually, missed the first word. Thanks. Lightbluerain (Talk | contribs) 11:41, 25 June 2021 (UTC)
@Lightbluerain: Because using "*" makes sense when the book uses footnotes (i.e. at the end of each page) but we use endnotes because the final document is presented as a single continuous document for each section (usually a chapter). The original work can unambiguously use a "*" on each page, but we cannot, because there could be many "*"'s in a chapter.
We make a few concessions to the practicalities of the reflowable, continuous formatting of HTML (and ebook formats for that matter), as well as not making books too onerous to proofread. Not enforcing the numbering on what used to be, but not longer are, per-page elements is one of those concessions. Other concessions can include not manually formatting paragraph indentations, not replicating most fonts in body text, removing hyphens at line-breaks, not reproducing ligatures like "ct", optionally not using long-S, not hard-coding columns when it was just a space-saving device, etc.
Wikisource does not attempt to produce perfect facsimilies of the original works: for that, you can either use the original scan as-is, or there are dedicated format like TEI XML which produce "perfect" transcriptions that capture the work more exactly (and take a commensurately larger amount of effort to encode). Wikisource does aim to produce useful works that can be, more or less, read as intended (for example: did Creasy actually care the footnote was labelled *? Probably not, he just needed it to be unambiguously linked to that footnote, and the choice of the glyph "*" was likely just convention), and where possible also provide easy access the original scan so that interested users can refer to the original material for things like "what was the actual footnote number here?". Inductiveloadtalk/contribs 11:59, 25 June 2021 (UTC)
Alright, thanks a lot. Lightbluerain (Talk | contribs) 16:58, 25 June 2021 (UTC)
@Inductiveload:, please check the page now. Lightbluerain (Talk | contribs) 17:02, 25 June 2021 (UTC)
@Inductiveload:, because the final document is presented as a single continuous document for each section (usually a chapter) does this mean that I need to use some specific template to show that a chapter ends or starts here for the digitized document to be rendered correctly? Lightbluerain (Talk | contribs) 17:05, 25 June 2021 (UTC)
@Lightbluerain: Thanks! It now looks good to me.
What it means is that normally we would transclude the pages to a separate wiki page per chapter. In this case, 15 decisive battles of the world (New York)/Chapter 1 and so on. For a practical example, see Waylaid by Wireless. This means:
  • Each page is not unreasonably long and hard to scroll up and down (especially on a mobile device)
  • It's easy to find a specific chapter.
  • When the text is exported to PDF or EPUB, each chapter starts on a new page and gets and entry in the document's built-in table of contents. Inductiveloadtalk/contribs 17:12, 25 June 2021 (UTC)
Alright, thanks. Lightbluerain (Talk | contribs) 17:49, 25 June 2021 (UTC)
@Inductiveload:, am I supposed to do this page the same way? Lightbluerain (Talk | contribs) 17:54, 25 June 2021 (UTC)
@Lightbluerain: yes, that should be the same. You also do not need to try to format those footnotes on the same line.
FYI, using &nbsp; is not the way to do right alignment - it will go badly wrong if the text wraps on a smaller screen: phab:F34527379. If you really need right alignment (and here you do not), use {{right}} or {{float right}}. Inductiveloadtalk/contribs 18:37, 25 June 2021 (UTC)
@Inductiveload:, Sure. And, I used &nbsp; because right template didn't work there since the second footnote line also had another word there which was not right-aligned. Lightbluerain (Talk | contribs) 18:04, 26 June 2021 (UTC)
Anyways thanks a lot.! Lightbluerain (Talk | contribs) 18:16, 26 June 2021 (UTC)
@Lightbluerain: If I may interject here with one last suggestion … you should put the <references/> in the Footer section of the page. Here's why: when you put together the entire chapter, you are going to transclude a bunch of pages, and thus you will have a lot of <ref> tags spanning across a whole bunch of pages. Each <references/> usage is going to display all of the notes from the entire chapter, not just the ones from the page it’s on. (You also don’t want all the footnotes appearing mid-paragraph between the word "Conqueror" and the word "to" on the following page!)
The Footer section of each page is special - it only appears when users are looking at this particular page, but not when they are viewing the chapter as a whole. (In technical terminology, it is not transcluded with the body of the page.) So if you put <references/> in the Footer of each page (for viewers of the Page space), and then another <references/> at the very end of the chapter (for readers of the chapter as a single file), it will have the desired effect. — Dcsohl (talk)
(contribs)
03:24, 27 June 2021 (UTC)
@Dcsohl:, Alright, I'll put that in the footer from now on. Thanks. Lightbluerain (Talk | contribs) 16:41, 27 June 2021 (UTC)
@Dcsohl:, What about this page? I don't think it would go with <references /> in the footer. Lightbluerain (Talk | contribs) 17:57, 8 July 2021 (UTC)
@Lightbluerain: Why not? Xover (talk) 19:22, 8 July 2021 (UTC)
@Xover:, the poem below the footnotes is not in the footer in the photo I think. But, I saw you took that all as footnote. Lightbluerain (Talk | contribs) 17:02, 9 July 2021 (UTC)
@Lightbluerain: the poem seems part of the second footnote: the main text ends at "roving bands of" … CYGNIS INSIGNIS 20:11, 9 July 2021 (UTC)
Alright, thanks. Lightbluerain (Talk | contribs) 16:31, 10 July 2021 (UTC)

Need tool: check range/set of pages for redlinks

Stevenliuyi is coming to the end of proofreading a large work of 1100 pages, Eminent Chinese of the Ch'ing Period, in two volumes vol. 1 and vol. 2. I'm (much more) slowly progressing through validation.

The encyclopedia is cross-referenced by person names, where those names may be either Mongol or Mandarin or even European, with possible accented characters, and with the Mandarin using the atrocious Wade-Giles phonetization. Stevenliuyi has duly created nearly all the subpages using those names.

Within the texts there are internal within-work links for mentioned names. And those links change from redlinks to blue as Stevenliuyi finishes more and more subpages. But not all of the redlinks.

Getting the name correct, to make the link work, is hard given all the different accented letters used. An example is p. 239 where

ECCP lkpl|Lin Tse-hsü

needed to be

ECCP lkpl|Lin Tsê-hsü

Often I can see and fix redlinks as I (slowly) validate. Sometimes Steve finds broken links in vol. 1 as he works on vol. 2. But unless we make a third pass through all the pages, how will we catch and correct *all* the redlinks?

Is there a tool here at Wikisource to find any redlinks within a range or set of pages? Shenme (talk) 03:28, 29 June 2021 (UTC)

@Shenme: If we are talking internal self-references, then I have a script that I used for DNB that identified redlinks. I haven't looked at it for ages, but pretty certain that I can adapt it suitably. — billinghurst sDrewth 14:16, 30 June 2021 (UTC)
@Shenme: I have put a list of redlinks in these pages to User:Shenme/EC redlinksbillinghurst sDrewth 07:37, 10 July 2021 (UTC)
Thank you! I checked your list quickly and one of those
Page:Eminent Chinese Of The Ch’ing Period - Hummel - 1943 - Vol. 1.pdf/357 refers to missing Eminent Chinese of the Ch'ing Period/Li Tsung-wan
was one I messaged Steve about, just last night. So your list looks good.
I'll make a first pass over the list and collect the ones that are mis-capitalizations, so we can fix those all together (page renames, etc.) Most of the others look like as described, e.g. "Tulisen" -> "Tulišen", to be fixed in text.
Neat, one of these looks like a goof in the original text! Page Page:Eminent Chinese Of The Ch’ing Period - Hummel - 1943 - Vol. 1.pdf/145 has text "... serve on the staff of Mien-yü (see under Yung-yen)" but then shortly below indicates there should be a separate page for Mien-yü, but there isn't one. Text says "[qq. v.]" but it's a goof.
This will really help. Thank you. Shenme (talk) 19:49, 10 July 2021 (UTC)
Again, thank you. And for doing Vol 2. also. Fixed just under 50 within-work redlinks. Found at least three places the original work referred to pages that they never got around to doing. Such a huge work - making the internal links all work is great! Shenme (talk) 23:06, 10 July 2021 (UTC)

Book download

I tried to download His Last Bow to my Kindle, but when it downloaded, it showed a table of contents, the preface, and no chapters (the links of the table of contents linked to the website). How do I download the full thing? (Is this even the right forum for this?) 72.225.12.160 01:05, 4 July 2021 (UTC)

Whoops, forgot to sign in! MEisSCAMMER (talk) 01:12, 4 July 2021 (UTC)
@MEisSCAMMER: Unfortunately, this particular text is a very old one that is not set up correctly. I've tried to tweak it so that you now should be able to get a complete download of all the chapters that are there, but there may still be other issues that make the resulting ebook suboptimal. Xover (talk) 08:10, 4 July 2021 (UTC)
Thanks! It worked! MEisSCAMMER (talk) 17:06, 4 July 2021 (UTC)
I recently moved the pages to be subpages of the work. — billinghurst sDrewth 07:40, 10 July 2021 (UTC)

Red-linked templates (with high usage in content pages).

Is ther a list of these?

The criteria being:-

  • Template which is red-linked from Main , Page or Translation namespace.
  • Has over 25 links from those Namespaces.
  • Does not exist (currently) as a Template on English Wikisource.

High usage suggests a 'missing' or renamed template , whereas low usage suggests a typo (a different but related issue).

ShakespeareFan00 (talk) 11:56, 10 July 2021 (UTC)

@ShakespeareFan00: Special:WantedTemplates. But please don't start creating redlinked templates. These are either typos for an existing template (which should be fixed) or they're from some other problem (import, deleted without cleanup, etc.) that should be handled separately. We already have far too many templates, and most of them are essentially unmaintained, so we definitely want to think twice before creating more. Xover (talk) 17:00, 10 July 2021 (UTC)

Center allign in footnotes

Hello, look at this K2 in the end. Did I do this correctly? Lightbluerain (Talk | contribs) 05:57, 15 July 2021 (UTC)

Hi. That mark is one used by the printer and binder, it is not part of the text; those are usually removed in transcripts here. CYGNIS INSIGNIS 07:21, 15 July 2021 (UTC)
@Lightbluerain: As Cygnis says, printer's marks like this are not usually reproduced on English Wikisource. We care primarily about the parts of a book that its author was involved in, and anything that's simply an artefact of the printing process is either left out or approximated.
But if you do want to reproduce these the technical way to do it is to place it inside the footer text field. Roughly, everything in the main ("Page body") text box will be included when we combine all the pages for presentation (what we call "transcluding" them, as a hopelessly obscure and technical term), but anything in the header and footer text boxes will be left out.
The special case that makes that rule of thumb complicated is the footnotes, because when you look at an individual page in the Page: namespace (such as Page:15 decisive battles of the world (New York).djvu/231) it looks like the footnotes are in the footer. The key to keep in mind there is that the actual notes—the <ref>…</ref> bits—are in the body, not the footer. Only the {{smallrefs}} template is in the footer, and it only gathers up and displays the preceding footnotes. Once all the pages are combined for presentation, there will be a separate {{smallrefs}} template that collects all the notes from the combined pages for display.
It took me a loong time to wrap my head around this model, but once it clicks most issues like this become fairly obvious and can be reasoned about. Xover (talk) 07:51, 15 July 2021 (UTC)
Alright, thanks. Lightbluerain (Talk | contribs) 16:14, 16 July 2021 (UTC)

OTRS Text Permission Problem

I recently found The Religion of God, a text that had its permission verified by OTRS by Bookofjude in 2008, and at that time contains an old OTRS link. However, the {{PermissionOTRS}} on that page now don't have the OTRS ticket number, and is categorized under Category:Items missing OTRS ticket ID. Can any OTRS users here help to find and insert that ticket number? Regards.廣九直通車 (talk) 08:21, 18 July 2021 (UTC)

Should Template:UNTS volume title get better improvement to make an even better replica of each cover of the United Nations Treaty Series? The scan making Page:UN Treaty Series - vol 649.pdf/1 has a footnote that I have not seen in any earlier volumes. Thanks.--Jusjih (talk) 04:09, 29 July 2021 (UTC)

@Jusjih: There's no reason why not. I just made the template to cover what I could see at the time. In this case, though, it looks like some kind of printing mark that we often don't really worry about, so I would say it's not really needed to replicate anyway. Inductiveloadtalk/contribs 08:06, 29 July 2021 (UTC)
Thanks so much. Is there any way to make our replicated texts bolder in Template:UNTS volume title to better match the underlying image of Page:UN Treaty Series - vol 649.pdf/1, etc? Especially "United Nations • Nations Unies", "New York" and the year of publication since Volume 401 in our replicas seem not bold enough.--Jusjih (talk) 02:57, 30 July 2021 (UTC)
@Jusjih:   Done let me know if there are any cases it's incorrect. Inductiveloadtalk/contribs 03:08, 30 July 2021 (UTC)

Different templates for footnotes

Hello, in some pages of the book I'm doing, I used {{reflist}} and in the other, I used {{smallrefs}} for footnotes. Is it okay? or, it can cause any problem during transclusion? Lightbluerain (Talk | contribs) 18:02, 29 July 2021 (UTC)

@Lightbluerain: As long as they are in the page footer it won't matter, the header and footer are (invisibly) wrapped in noinclude tags. Jarnsax (talk) 18:23, 29 July 2021 (UTC)
Alright, thanks. Lightbluerain (Talk | contribs) 19:12, 30 July 2021 (UTC)

How to handle items missing from table of contents?

In the Works of Sir John Suckling, several items are missing from the table of contents, including two of his poems; one of these poems is also missing from an "Index to First Lines" at the end. What's the best practice for dealing with this? The poems won't be reachable from the main work page without a table-of-contents entry, but adding them would make the transcribed table of contents unfaithful to the original. Eureiachthon (talk) 01:07, 30 July 2021 (UTC)

@Eureiachthon: Use {{AuxTOC}} (see Sense and Sensibility for example). Basically 'your' TOC is a user annotation. Jarnsax (talk) 02:29, 30 July 2021 (UTC)
@Jarnsax: That makes sense; but then should I replace the original TOC entirely and not transclude it, or add the "supplemental" entries to it by way of AuxTOC (if that's even possible)? Eureiachthon (talk) 03:41, 30 July 2021 (UTC)
There's not a 'rule' about it that I know of, but I would personally just insert the template into the appropriate spots in the tablse, like I tested over there in that edit I reverted. It didn't look quite right (not wide enough), but that's because I forgot to make it colspan=2. Someone might suggest a better way, but you're perfectly right both that they shouldn't be orphaned, and that any annotations we add should be obvious. Jarnsax (talk) 04:19, 30 July 2021 (UTC)

Corrupted version of file

I have just attempted to upload a minor amendment to Romance & Reality 3.pdf but instead of the corrected file, all I have got is a file icon Fileicon-pdf.png which I cannot get rid of. I have reverted to the old version but this hasn't worked either (although I have found an indirect way to access the original file). It now shows dimensions of 0 x 0, 335 pages and none of the pages. Please can you help me to return to sanity. Esme Shepherd (talk) 16:56, 31 July 2021 (UTC)

@Esme Shepherd: The problem is over at Commons... usually when thumbnailing gets broken, it gets resolved within a few days. My understanding is it (and transcoding) can get overloaded by spikes in upload volume. Jarnsax (talk) 22:21, 31 July 2021 (UTC)
Thank you. I will be patient and wait for resolution. If it does not materialise, how should I apply to for help in Commons, please? Esme Shepherd (talk) 08:47, 1 August 2021 (UTC)
@Esme Shepherd: Unrelated to the file corruption issue as such, but please do not modify the scans you upload by inserting a table of contents you created yourself. If the work as published is missing a needed table of contents, one can be added on-wiki using the {{AuxTOC}} template. Xover (talk) 09:38, 1 August 2021 (UTC)
Okay, this suits me and I won't do it in future. I could easily remove the TOC on Romance and Reality and Francesca Carrara but the recent broken thumb-nailing makes me reluctant to upload any more revised versions at the moment. Esme Shepherd (talk) 10:04, 1 August 2021 (UTC)
I was right. I decided to revise Francesca Carrara 1 and the same corruption has occurred. This is terrible and the whole system is unusable. Please give me some help in sorting this out. Esme Shepherd (talk) 10:15, 1 August 2021 (UTC)

I have made some progress with this. I have been able to upload R&R 3 without TOC, with the correct icon! So I then uploaded R&R 1 and R&R 2 without TOCs without a problem. All that remains is Francesca Carrara 1 for which I can have either (a) with a TOC and the correct icon or (b) without TOC but with the rubbish interloper icon. I have left it at (a) for now and will try to remove the TOC later. Esme Shepherd (talk) 18:39, 1 August 2021 (UTC)

Correction, the versions without TOC appears to be blank, so it is of no use. I might as well leave the TOC in but not use it on transclusion. Esme Shepherd (talk)

Another language

Hello, how to do the second footnote here? It got another language I don't understand. Also, please check if I used the correct template formatting for the two poetic lines in the middle. Lightbluerain (Talk | contribs) 00:08, 3 August 2021 (UTC)

That's in Greek, so normally tag as {{greek missing}} and one of us who reads Greek will fix it up for you. I've done it, and also fixed the poem. Beeswaxcandle (talk) 01:12, 3 August 2021 (UTC)

I just finished proofreading this work, but there are a number of images and so many formatting problems, so I would like some other users to go over it. Also, how should this work be transcluded? I feel like it’s too long for one page, but it seems like there are an unnecessarily large number of subpages. TE(æ)A,ea. (talk) 14:17, 4 August 2021 (UTC)

Indentation

Hello, should we use indentation for this page? I read the style guide and it said that we don't indent the first line of a paragraph. So, why do we have indentation templates? Lightbluerain (Talk | contribs) 17:52, 4 August 2021 (UTC)

Neither a footnote nor a text

Hello, how to do the last paragraph of this page? It is neither a footnote nor the simple text. Lightbluerain (Talk | contribs) 17:34, 7 August 2021 (UTC)

It's a continuation of the footnote from the previous page. The way to deal with this is detailed on Help:Footnotes and endnotes#Footnotes that continue over page breaks. I've done this one for you. Beeswaxcandle (talk) 18:02, 7 August 2021 (UTC)

Periodical "The British Controversialist"

If there were (or were not) a project started or planned to publish the issues of this periodical, how would I determine its existence (or non-existence)? Klarm768 (talk) 12:26, 8 August 2021 (UTC)

@Klarm768: You would do a search for "The British Controversialist" and, finding no relevant mainspace page, portal or indexes (all of which are included in the default search), conclude that there is no such effort. And if you were still not sure, you can ask here, as you did, and someone like me will say "I'm fairly certain there is no such effort, though I technically cannot prove a negative".
Would you like assistance batch-uploading any volumes? Inductiveloadtalk/contribs 12:40, 8 August 2021 (UTC)

Page:The Masses, Volume 1, Number 1.pdf/1 overfloat image not centering

Overfloat image isn't actually centering the image itself? I've got 2 dummy images above the overfloat image I'm trying to use to show it is indeed not centering like the 2 above examples. Anyone have a fix? (Page is not otherwise proofread yet) PseudoSkull (talk) 16:13, 8 August 2021 (UTC)

@PseudoSkull:x1 is horizontal positioning.--RaboKarbakian (talk) 18:28, 8 August 2021 (UTC)
@RaboKarbakian: The image itself still isn't centered. The left side of the image is being centered by the template for some reason. Is this just happening on my end? PseudoSkull (talk) 18:57, 8 August 2021 (UTC)
@PseudoSkull: I set "width" to what it said in "[[File:" and everything changed! --19:45, 8 August 2021 (UTC)

Can anyone help me properly reproduce the table of contents at Page:The Masses, Volume 1, Number 1.pdf/3? The TOC exists on 2 columns, except with the last item being centered...

Also, most of the items use centered ditto marks ("), which I don't know how to reproduce in a TOC.

The rest of the content on the page besides the TOC is proofread. PseudoSkull (talk) 22:49, 8 August 2021 (UTC)

@PseudoSkull: I would treat this in the same way we treat multiple columns and just "don't" and do it in a single column instead. The layout isn't adding anything: it was likely basically just to save paper. A double column like this will not work well on smaller screens, and I don't think you'll be able to make it degrade gracefully.
For the ditto, check out {{ditto}}. Inductiveloadtalk/contribs 23:16, 8 August 2021 (UTC)

footnote numbers

How do I make the footnote number a "6" instead of "1" for the following page: Page:History_of_the_Reign_of_Ferdinand_and_Isabella_the_Catholic_Vol._II.djvu/74

Thanks much, Daytrivia 18:28, 10 August 2021 (UTC)

@Daytrivia: when the text is transcluded, that is displayed in the Main space, all of the footnotes from that section should be displayed at the bottom of the page and if you get the section "right" then that footnote will be "6". As it is, it is the first footnote on the proof page, to adjust it might screw up the final product.--RaboKarbakian (talk) 19:06, 10 August 2021 (UTC)

I presume, then, I did not get the section right? Daytrivia

The "section" (chapter or whatever) hasn't been made yet. Here is a "Page" Page:EB1911 - Volume 26.djvu/60 and the transclusion is here: 1911 Encyclopædia Britannica/Sugar. The footnotes 1 through 4 on the page become 5-8 in the transclusion.
@Daytrivia: I looked to see if your work had been transcluded or not by checking the "What Links Here" in the sidebar. If you check Page:EB1911 - Volume 26.djvu/60 "What Links Here" you can see what I was looking for.
"Main" as it is called in search and around is the namespace that a proofread "Page" (also a searchable namespace) eventually go to. Footnotes sometimes and even often get different numbers than how they were published, even in the Main space. My example is an encyclopedia, which to preserve the footnotes would be to have the final product appear on so many pages; so the book you are working on, if the footnotes are numbered for "Part I, Part II" etc, and the text is transcluded by chapter, then the numbers will not work out the way it was published.
The short answer to your question is this "Your footnote is fine.", although, I would consider loosing the columns, as eventually they will just stack up in the Main space.--RaboKarbakian (talk) 21:16, 10 August 2021 (UTC)

Thanks very much the examples are great. Daytrivia (talk) 15:23, 12 August 2021 (UTC)

Header space

How does one make the header space larger. I want ot highlite a phroase in a header but the size of the header space is not letting me highlite one line? Daytrivia

@User:Daytrivia Not sure what you are asking.. can you give us a link to where you are trying to do this? Also, to sign talk page messages, type four tildes (~~~~)... the wiki substitutes your signature and date stamp for that text when you save the page. Jarnsax (talk) 21:11, 11 August 2021 (UTC)

It might answer your question (not sure, again) that template boxes (like {{header}}) are generally coded to shrink themselves around their content... if you put a line break inside the template, it should cause the header to be a line larger. Jarnsax (talk) 21:14, 11 August 2021 (UTC)

A line break may help. Thanks much. Daytrivia (talk) 21:26, 11 August 2021 (UTC)

edit help please

I do not know why there is a "3" on both sides of this page:

Page:History_of_the_Reign_of_Ferdinand_and_Isabella_the_Catholic_Vol._I.djvu/147

Daytrivia (talk) 15:10, 12 August 2021 (UTC)

@Daytrivia: There was a stray {{rh|3||3}} on the page: Special:Diff/11587281. Inductiveloadtalk/contribs 16:09, 12 August 2021 (UTC)

Thanks much. Daytrivia (talk) 19:50, 12 August 2021 (UTC)

It seems I am not seeing everything in my headers. For instance the stray code mentioned above.Daytrivia (talk) 10:50, 13 August 2021 (UTC)

@Daytrivia: There were many blank lines above it - are you sure it's not hidden at the bottom of the text box? Inductiveloadtalk/contribs 11:21, 13 August 2021 (UTC)

I just don't know Page:History_of_the_Reign_of_Ferdinand_and_Isabella_the_Catholic_Vol._I.djvu/148 Daytrivia (talk) 21:39, 13 August 2021 (UTC)

Move to Commons help needed

I've found File:Constitution Act, 1902 (New South Wales).pdf eligible to be moved to Wikimedia Commons under {{PD-AustraliaGov}} (1902+50=1952) and {{PD-EdictGov}}. However, it seems that both Special:ImportFile and CommonsHelper both failed to do the thing. Can anyone examine with the problem and help with moving the file? Many thanks.廣九直通車 (talk) 13:57, 14 August 2021 (UTC)

Commons requires a compatible licence to import. Adding {{PD-US}} was needed to get it to go over. — billinghurst sDrewth 14:36, 14 August 2021 (UTC)
See mw:Help:Extension:FileImporter for the help detail — billinghurst sDrewth 14:39, 14 August 2021 (UTC)

Creating author for Jack Beames (who wrote Army without banners)

I'm trying to link up 2500 books for the Prison Library (1933)(transcription project) so you can go from the book to the authors mentioned therein. Unfortunately, on Page:2500_books_for_the_prison_library_(1933).djvu/7, I've run into a Jack Beames listed as author of Army without banners, and I can't find the author. There's a Author:Jack Beames in Wikisource, but this seems to be a different guy.--Prosfilaes (talk) 05:19, 16 August 2021 (UTC)

@Prosfilaes: Definitely not the same person. Here is a convenient (external) biography for your guy. 110.21.239.55 07:09, 16 August 2021 (UTC)
Thank you.--Prosfilaes (talk) 07:52, 16 August 2021 (UTC)

Problems with ditto template on transclusion

Here, {{ditto}} is not working as intended, instead producing the lowered quotation mark after the non-text. (This error does not happen on the page here.) TE(æ)A,ea. (talk) 14:47, 16 August 2021 (UTC)

I see what's going on, but not why. The {{ditto}} tag creates what should be hidden text, presumably for screenreaders and the like. The CSS class is __ditto_hidden, which is set to hidden and transparent. In your main space page the CSS definition is missing, so the erstwhile hidden text appears atop the ditto marks. Anybody have insight into where the CSS went? — Dcsohl (talk)
(contribs)
14:59, 16 August 2021 (UTC)

Unclear Print Problem

Hello, how to do the missing word in the last paragraph here? Lightbluerain (Talk | contribs) 15:15, 16 August 2021 (UTC)

@Lightbluerain: The missing word was "less". The way I found out was by searching the quote at Google in quotes to find exact matches. PseudoSkull (talk) 15:32, 16 August 2021 (UTC)
@PseudoSkull, alright, thanks. Lightbluerain (Talk | contribs) 16:23, 16 August 2021 (UTC)
@Lightbluerain: By the way, for future reference, there's {{illegible}} you can use instead of the underscores representing a blank line. You can also mark the page as "Problematic" instead of "Not proofread". PseudoSkull (talk) 17:56, 16 August 2021 (UTC)
Sure, thanks. I'd also try to, first, find the missing word through Google as you did. Lightbluerain (Talk | contribs) 09:19, 17 August 2021 (UTC)

Are Wikilinks necessary?

Hello, is it necessary to put wikilinks inside the text of a book? The book I am doing, I didn't put any Wikilink on any page at all. I didn't understand where (and why?) to put Wikilinks inside the text since the original text in the book has none. Lightbluerain (Talk | contribs) 09:25, 17 August 2021 (UTC)

There are no wikilinks that are "necessary"—other than links to chapters on the table of contents. Some light wikilinking is acceptable—principally to author pages and work titles for those mentioned in the text or footnotes. We have a draft policy at Wikisource:Requests for comment/Wikilinking policy, that I really must finalise and post. Beeswaxcandle (talk) 09:44, 17 August 2021 (UTC)
Alright, thanks. Lightbluerain (Talk | contribs) 16:50, 17 August 2021 (UTC)

Guidance on sic/SIC

Last night in relation to a page of potentially "duplicated words" in Page: namespace content I checked what sic and SIC currently did.

As a result of this, I found that {{sic}} doesn't display anything (meaning that any parameters supplied to it are ignored.), whereas SIC does.

In follow-up to this , I used AWB to update a considerable number of usages from {{sic}} to {{SIC}}, with the intent to review every single edit thus made against the relevant scanned page. I'm starting to review the 500 or so edits made as a result of that usage update this morning.

In reviewing the first few pages where the usage was updated , I am finding that where I am seeing the hi-res scans (thanks to a script User:inductiveload provided, that the nominal SIC did not exist in the hi-res version, (suggesting some of them are not actually present in the original scans and are an artefact of the coversion/display process for PDF /Djvu etc.)

In places has been used to add transcriber comments, I've generally attempted to avoid updating those usages in error, (and will be checking for situations where this may have happened by mistake on my part.)

However before continuing , I'd like some feedback...

  1. Is the consensus to leave words duplciated in (printer) error as printed but marked {{sic}}, or place the duplicated word inside {{sic}} to suppress their appearance?
  2. Should sic or SIC that do not appear to be present when better or higher quality scans are used, compared to when the Page: was originally proofread or validated, be updated or removed?
  3. Where a SIC has been provided without a correction, but a contributor can suggest an obvious one, should such as correction be added? ( I am aware that certain other sorts of annotations are discouraged.)?

ShakespeareFan00 (talk) 07:28, 8 July 2021 (UTC)

1–3 no. CYGNIS INSIGNIS 14:24, 8 July 2021 (UTC)
1) Words that are duplicated in the scan should be duplicated in the transcription. You should not put the second word inside {{sic}} to suppress its appearance, but you can put {{sic}} next to it or do something like {{SIC|the the|the}}. 2) if the scan is updated and there is no longer any error in the scanned text then the transcription should be updated to match the scan; since the error is thus removed, {{sic}} or {{SIC}} should also be removed to avoid confusion. 3) Yeah, go for it (while being mindful of WS:ANN and maintaining consistency throughout the transcription) —Beleg Tâl (talk) 21:38, 8 July 2021 (UTC)
Thanks, that concurs with the approach I had been taking. Of course checking over 10,000 uses of SIC against scans may take a while.ShakespeareFan00 (talk) 23:18, 8 July 2021 (UTC)
I will also note on one or two entries I converted from sic to SIC recently, that the SIC appeared to be a British vs American spelling, so NOT a SIC as such, but a correct spelling, typsetting for a UK vs US edition. :) ShakespeareFan00 (talk) 23:18, 8 July 2021 (UTC)
Note that I tried adding some SICs on duplicated words and they were undone with the message of basically "don't touch validated texts" and so I basically decided to stick away from doing anything besides obvious errors on validated texts ¯\_(ツ)_/¯. MarkLSteadman (talk) 20:41, 9 July 2021 (UTC)
I've seen {{SIC}} used on obsolete spellings as well. I personally would remove {{SIC}} from such places where it is a deliberate use of an alternate or unusual spelling, rather than a typo. But you don't need to do that if you don't want to. —Beleg Tâl (talk) 00:59, 10 July 2021 (UTC)
  Comment Further to Beleg Tâl's commentary
  • {{sic}} is the old old form that existed for when we did not have proofread page where it was clearly needed to indicate that the produced error was replicating the produced work where no scan existed.
  • With ProofreadPage, the use of {{SIC}} is more indicative than normative and it is to basically indicate "don't bother ..." We have no compulsion to its use. It should only ever be used to indicate clear errors. It should never be used when they are variations in spelling, or ye olde forms of reproduced text. It should only be used to show outliers, if a word is used repeatedly in a way in a work, then it should be bleeding obvious that it is not an error. If someone has chosen to not use it, then I would generally not then start using it. If someone is using it, then I would generally leave it in. If someone has only one parameter, I would leave it that way, if the word is obvious to you, it is obvious to the other person. Remember that it is to indicate an error for proofreading purposes.
I think that it would be an absolute waste of time to specifically go and check the usage of each. They will be checked and reviewed during proofreading, and that should be sufficient. Their use is a nicety, and of low value to prioritise. — billinghurst sDrewth 07:19, 10 July 2021 (UTC)
Re what Beleg Tal said, if it was "deliberate" but could use clarification by way of annotation, {{SIC}} is the wrong template, the comment should be in a {{tooltip}}, {{definition}}, {{abbr}}, or in a reference with {{user annotation}}. Sic is rather, by definition of the term, specifically to indicate that the text isn't a transcription error (made by us), but instead we are copying verbatim what appears to be an error in the source document being transcribed. And yeah, often {{SIC}} itself is the error, it's just a variant spelling. Jarnsax (talk) 17:00, 29 July 2021 (UTC)
IOW, if the source document says "robbed the teh bank", the reader should see "robbed the teh bank", with an indication (from {{SIC}}) that the apparent mistake is not our error. (theoretically, the same could apply if the text said something so nonsensical a reader would assume it's our mistake). Jarnsax (talk) 17:15, 29 July 2021 (UTC)
I only use the hidden {{sic}} indicating to subsequent editors that the anomaly was noted.— Ineuw (talk) 19:33, 21 August 2021 (UTC)

How to handle major printing error (swapped lines)

I was proofreading Page:Left-Wing Communism.djvu/39 and was struck by how nonsensical the sentence in the first paragraph starting "Probably some members of the Dutch Communist..." was. And then I realized there's a major printing error here - the second and third lines of this sentence are pretty obviously swapped. Is there any leeway around here for fixing this? My main concern here is that, unlike a normal misspelling typo, this error renders the sentence pretty well unintelligible as written. — Dcsohl (talk)
(contribs)
15:11, 17 August 2021 (UTC)

To clarify: The paragraph beginning "On the one hand, men …" has sentence 3 [When, in face of …] and 4 [Probably some members …]. It is suggested that this translation of [logical defying diatribe] by Vlad [Hecatombs] Lenin has inadvertently reversed sentences 3 and 4? CYGNIS INSIGNIS 17:25, 17 August 2021 (UTC)
@Dcsohl: Please see my solution at diff. I prefer solutions that reproduce what the text ideally should look like in the case of printing/typographical errors because I have a feeling it helps certain kinds of readers/scripts process the information properly. PseudoSkull (talk) 17:33, 17 August 2021 (UTC)
@PseudoSkull: Yeah, that's not bad - I just wish {{SIC}} worked better in e-readers. (In my experience you get the underline with no way of seeing what the "corrected" text is.) — Dcsohl (talk)
(contribs)
17:39, 17 August 2021 (UTC)
@Cygnis insignis: No, sorry, I can see I was unclear. I'm suggesting that the 10th and 11th lines in the scan (which start with "into traditions and conditions" and "Party who had the misfortune") should have been swapped. In other words, that the sentence as a whole should read "Probably some members of the Dutch Communist Party who had the misfortune to be born in a small country, into traditions and conditions of particularly privileged and stable legality, who have not known at all what it means ....." — Dcsohl (talk)
(contribs)
17:35, 17 August 2021 (UTC)
Page:Nytimes autosforfrenchrace 1906.png has a similar issue, where a couple lines of a sentence were inserted into the next paragraph (in the middle of a word, even!). My solution was to comment those lines out where they appear on the printed page, add comments delineating the insertion in the correct location, and add a footnote explaining the error. —CalendulaAsteraceae (talk | contribs) 18:26, 21 August 2021 (UTC)
@CalendulaAsteraceae: I'd say I disagree with this decision, because I think our general consensus here has been to include all typos or errors in a given text, unless it is due strictly to a technical limitation that was faced in the printing industry, which would make it not erroneous on the publisher's part. Technical limitations would include, for instance, the use of a black and white, low-quality facsimile version of a painting rather than presenting it in color, in order to save money, etc. (Example: Resurrection Rock (1920)) Another example would be using the letter "f" instead of "ſ" when quoting from older texts, because of a lack of typewriter that could produce said symbol. (Example: Southern Antiques)
Inserting text in the wrong place, however, isn't due to a technical limitation like the examples above—it's a legitimate mistake. But it's not really a big deal, so I digress. PseudoSkull (talk) 18:45, 21 August 2021 (UTC)
@PseudoSkull: That makes sense, and it is a meaningful distinction. With a little more thought, I figured out a way to make the {{SIC}} approach work in this case. —CalendulaAsteraceae (talk | contribs) 20:31, 21 August 2021 (UTC)
@PseudoSkull … and this is precisely the sort of thing I was thinking when I first came across this error. It’s clearly not the author’s intent, and much as we don’t strive to reproduce printer’s marks (as an artifact of printing rather than the author’s output), I’d expect one could make an argument to fix obvious mistakes that arose from printing, like here. But of course then come the blurred lines and the places where maybe the intent isn’t crystal-clear, so I can also appreciate the argument for just leaving as is and using {{SIC}}… hence my intent in starting the discussion. — Dcsohl (talk)
(contribs)
20:37, 21 August 2021 (UTC)

Greek with popup romanization?

Working from scanned text where the OCR (?) dropped the Greek characters but helpfully substituted a romanized transcription of some Greek characters. It gave me:

... and so received the title "[Books of] the Omitted Acts" ([Greek: paraleipomenôn]) or "the Omitted Acts of the Kings (or Reigns) of Judah."

I first substituted {{greek|παραλειπομένων}} to get παραλειπομένων.

Then I thought it'd be nice to somehow keep that phoneticization "paraleipomenôn". Using {{polytonic|παραλειπομένων|yes}} gets me a link to a wiktionary entry with romanization παραλειπομένων (but not a good link - wait and let it redirect from the semi-bad URL)

It would be 'nice' if there was a template that'd allow displaying the Greek with a popup with the romanization, something like

{{greektweek|Καλημέρα|Kaliméra}}

that would result in something like Καλημέρα.

Anyone know of something like this? Shenme (talk) 22:32, 23 August 2021 (UTC)

I just read that recently and messed with one today. The polytonic template recommends wrapping it into a span. <span title="romanized word">{{polytonic|greekish}}</span>, this span, however, conflicts with setting polytonic to yes for the wikitionary, and wikttionary wins.
After I saw the wiktionary link in polytonic, I started to remove polytonic from math problems. I have been also thinking about using entities &pi; for instance (or TeX's \pi because, I am pretty sure that the entity was made for math and not for the Greek alphabet and math shouldn't ever be linking to wiktionary. Is this considered to be wrong in any way?--RaboKarbakian (talk) 22:51, 23 August 2021 (UTC)
I would definitely try to dissociate math from {{polytonic}} or {{greek}}.
@RaboKarbakian: Using the HTML entities is likely best. The HTML entities like &Gamma; Γ will resolve to the Unicode range for plain Greek characters. This then is simply reusing the plain Greek characters for math.
Entirely separately there is a Unicode block for Mathematical Alphanumeric Symbols block which is assigned to one of the newer Unicode blocks. Well, new as in assigned in Unicode version 3.1 (2001). Current is version 13 so these ought to be safe to use everywhere now. (maybe)
But there are no entity names for these so...
  • &#x0393; Γ (from the original Greek char block)
  • &#x1D6AA; 𝚪
  • &#x1D6E4; 𝛤
  • &#x1D71E; 𝜞
  • &#x1D758; 𝝘
  • &#x1D792; 𝞒
That's pretty icky, so start out with plain entities &omega; ω until something else forces a change? Shenme (talk) 04:04, 24 August 2021 (UTC)
I'm not sure I understand what you mean. The plain entities reference the characters from the plain Greek text block, which are the ones to be used for plain characters in mathematics. There's no need to use the entity names instead of just the plain characters.--Prosfilaes (talk) 10:15, 24 August 2021 (UTC)
@Prosfilaes: I think it should be that unicode references the entities, at least some of them; they existed when unicode was just a good idea. And that they were first made for computers as math characters is the reason that they need to be eh, "told when to behave like letters of words". The main reason that I use entities (with the exception of the dashes) is ease. &pi or \pi can be created from a simple keyboard, reducing the "hunt and peck" nature of unicodish. And, if your browser is configured for unicode, they are unicode. If your browser is configured for something else, they are π or  . Unicode won some correctness wars and we now have a plethora of moody smilies for the win of that war. Anyways, I agree with @Shenme: to not use the {{polytonic}} for math in electronic documents, as math usage is the reason they render so badly for language usage.--RaboKarbakian (talk) 13:52, 24 August 2021 (UTC)
Unicode's first release was in 1991; HTML's was in 1993. HTML has been defined in terms of Unicode since RFC 2070 in 1997, and MediaWiki doesn't support any browser before 2011. If you can access and use Wikisource, Unicode text and the entities will appear exactly the same.--Prosfilaes (talk) 16:21, 24 August 2021 (UTC)
Initial mention was in reference to using {{polytonic}} for simple Greek characters. I'm agreeing that that shouldn't be done for math (unless you like that font). Above I was mainly intrigued with the existence of a math-specific Unicode block, thinking Unicode was wanting to indicate math usage, but they don't include/duplicate the plain Greek characters.
"Mathematical Alphanumeric Symbols is a Unicode block comprising styled forms of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles."
So you are correct that &Omicron; == "ω", so they are interchangeable. If you don't want the 'styled' variants for math symbols, ignore Unicode's Greek emojis. They've gone overboard again on Greek, like I did above. Shenme (talk) 13:13, 24 August 2021 (UTC)
A pop-up romanisation would be an annotation, so only to be done on an annotated version—which assumes that we have a complete unannotated version already. In terms of tooltips as a method, these behave badly in ebooks and thus should be used sparingly—if at all. Beeswaxcandle (talk) 23:03, 23 August 2021 (UTC)

Index:Rambles in Germany and Italy in 1840, 1842, and 1843.djvu

Hello, I need help to Create a pagelist for the source file before commencing proofreading (to verify file is correct). --Stamlou (talk) 16:56, 30 August 2021 (UTC)

Hello. I have noticed that page no. 2 is misplaced in the file (I did not check the rest). There is a file in HathiTrust which looks better. --Jan Kameníček (talk) 19:56, 30 August 2021 (UTC)

Texts/books etc

I want to create new material but i can't find anything Google books, nothing. Help? Thank you. Lutzbaer0 (talk) 08:25, 4 September 2021 (UTC)

@Lutzbaer0: what exactly are you looking for? Wikisource:Sources has a list of places you can search for a book. Or if you let me know what you're looking for I can try to find a copy. Inductiveloadtalk/contribs 08:38, 4 September 2021 (UTC)
just looking for new texts that ws will accept. I want to find book search etc, sorry if I'm confused you. Lutzbaer0 (talk) 08:40, 4 September 2021 (UTC)
@Lutzbaer0: the Internet Archive is usually the easiest source for books. However there are (literally) millions of books there, of basically every type imaginable, so it helps to have an idea of which books you are interested in. If you have a book at the IA, you can upload it with IA-Upload.
There are also books others have requested at Wikisource:Requested texts and there's an monthly collaborative text called the Monthly Challenge.
If you find a text you would like to upload scans for and need help with that, let me know, I'll be happy to help (anything to sucker in new Wikisourcerers! 😈) Inductiveloadtalk/contribs 09:13, 4 September 2021 (UTC)

Want to Delete

I carelessly created a topic (ပုသိမ်_ဗကသညီလာခံ_သခင်ကိုယ်တော်မှိုင်း_မိန့်ခွန်း) on en:s although my intention is to create on w:s. I am new in editing and don't know where is the Delete button. Could someone help me to delete that topic? P0rnytail (talk) 16:53, 9 September 2021 (UTC)

@P0rnytail:   Done (you don't have a delete button: only administrators can delete pages). Have fun at myWS, but please come back and edit with us if there are any Myanmar-related works you would like to see! We don't have many right now, and I'll be happy to help: you could even nominate some for our monthly challenge. :-). Inductiveloadtalk/contribs 17:01, 9 September 2021 (UTC)

Help with Scores

Index:The book of American Negro spirituals.pdf has about 100 pages of musical scores. Is there anyone who can transcribe them? Languageseeker (talk) 05:31, 11 September 2021 (UTC)

Just mark the pages as {{missing score}} and when and as I have time I'll deal with them. The backlog of scores waiting to be done lives in Category:Texts with missing musical scores. Beeswaxcandle (talk) 05:38, 11 September 2021 (UTC)

Is it okay?

Hello, did I do this page correctly by not including the chapter title in the header, but in the body? or not? Lightbluerain (Talk | contribs) 09:05, 11 September 2021 (UTC)

Yes, that is correct. --Jan Kameníček (talk) 09:16, 11 September 2021 (UTC)
Alright. Thanks. Lightbluerain (Talk | contribs) 12:24, 11 September 2021 (UTC)

Template {{center inline}} not showing up in category "Special effects templates"

One problem with searching for templates is when templates don't show up in categories. Just yesterday I finally noticed template {{Center inline}} is not showing up in Category:Special effects templates. Template {{Center}} does show up.

When I first looked at it {{Center inline/doc}} had

[[:Category:Special effects templates|Template:Center inline]]

which I changed to

[[:Category:Special effects templates|Center inline]]

and since have changed to

[[:Category:Special effects templates|Center]]

Just for reference {{Center/doc}} has

[[Category:Special effects templates|{{PAGENAME}}]]

In addition I've tried various purge dances, and waited half a day and still no mention of Center inline at Special effects templates. Do I need to step up from winged to four-legged sacrifices?

HMMM, this's interesting. It does show up at [[:Category:Span_based_templates]], but _only_ as "Template:Center inline/doc". Unlike almost all others in that category which *also* have the base template listed. Definitely need to add eight-leggeds to the cauldron. Shenme (talk) 18:33, 11 September 2021 (UTC)

It must have been stuck cache or something. I tried purge, hard purge and null edit one after the other and it seems it helped. --Jan Kameníček (talk) 18:51, 11 September 2021 (UTC)
On... which? the template page, the template/doc page, the category page...? The only thing I'd not tried was the null edit. (BTW: thanks!) Shenme (talk) 19:30, 11 September 2021 (UTC)
@Shenme: I tried the category page and when nothing happened I tried the template page. However, I cannot say which of them proved to be the right one: it could have been the category page too, only with some delay. --Jan Kameníček (talk) 19:56, 11 September 2021 (UTC)

Just wondering ...

Understanding there already is a page dedicated to this particular book, I had hoped to inquire how I might, or even if it were possible, to submit a different edition of the same. I am presently in the progress of compiling with a few minor edits Gaston Leroux's « Le fantôme de l'opéra » from the original newspaper serials, and had hoped to share both these, and my future translation with the site, as indeed there are some slight differences in the story between the serials and the later book-form of the story. unsigned comment by OuraniaAeternam47 (talk) 13:58, 5 August 2021 (UTC).

cstyle parameter of the Freeimg template is not working

Please see this page in edit mode. I placed {{Dhr}} above it instead. Page:Africa by Élisée Reclus, Volume 1.djvu/159— Ineuw (talk) 22:16, 12 September 2021 (UTC)

The problem is probably in some edits by Inductiveload, so I am pinging them to have a look at it. BTW, is it necessary to keep moving big parts of templates to modules in lua? The number of contributors who understand the templates gets significantly smaller by that. While I understood the original template’s code quite well, I do not understand the module at all. Are the modules’ advantages really so big? I am quite afraid that we are heading towards the situation when we will be absolutely dependent on only few people understanding the modules who we will have to ask for every triffle, not mentioning the situation when the few people stop contributing or decide to go for a wikiholiday… We are a small project and such a situation is not unimaginable. --Jan Kameníček (talk) 22:42, 12 September 2021 (UTC)
Personally, I have no problems with the template because I grew up with it from it's introduction by George Orwell III. I am sure that Inductiveload has a good reason for the changes.— Ineuw (talk) 22:53, 12 September 2021 (UTC)
I do not see a problem in the fact that you or some other specific person understands/does not understand a code of a specific template. I mentioned the trend to have too many things in lua because it makes the overall number of contributors who understand the codes of templates much smaller. --Jan Kameníček (talk) 23:04, 12 September 2021 (UTC)
I gave a fuller answer before but it got smashed out in an edit conflict. Long and short: template style {{FreedImg/styles.css}} contains existing (line 5) margin-top: 0.40em !important; (and always had going back effectively forever!) so no override may omit its own !important or it will be ignored by the rendering browser. 110.21.131.174 00:20, 13 September 2021 (UTC)
@110.21.131.174: So, what you are saying is that margin-top and bottom are useless? — Ineuw (talk) 01:44, 13 September 2021 (UTC)
@Ineuw: The question is ambiguous. If you mean in analogy to template parameters margin-left and margin-right; the template simply does not have -top or -bottom parameters and a change to this LUA module would be required to add their support.
On the other hand if you are asking how to form a valid cstyle override which will work look no further than this change.
I hope that is what you were looking for? 110.21.131.174 03:19, 13 September 2021 (UTC)

┌─────────────────────────────────┘
Thanks. That is what I was looking for. But there is an anomaly. The template has left and right margin codes without " cstyle = " as in

 | margin-right = .5em 
 | margin-left  = .5em 

But this requires "cstyle = " and which doesn't work without !important

| cstyle       = margin-top:2em; !important

Since margins are external to the container, it should not be restricted to a fixed dimension. Make the old fixed dimension the default, and let users specify any margin. — Ineuw (talk) 03:43, 13 September 2021 (UTC)

Umm. Two things:
  • Please note the keyword is exactly "!important" and as such must appear before the semicolon in CSS - so your example should read:
| cstyle       = margin-top:2em !important;
  • What you are proposing (about the container margins) makes sense to me but I am not the one you need to convince. Module:FreedImg seems to have concentrated all the logic concerned and the only person to edit that has been friend Inductiveload… Please form an orderly line: a small sacrifice may be appreciated but a humble request might be a good start? (I would not know as I am not they.) 110.21.131.174 04:05, 13 September 2021 (UTC)
I have no intention of making a request. I just thought to mention it. I can live with separate spacing until it will be corrected, when it will be corrected.
On the other hand, you seem to know what you are talking about. If you want to help, then sign up and take an active part please. I would be more than happy to test a simplified version in my namespace if it helps the community. After all, the original version was recommended to me by GO III when I had a lot of problems with floating images and seamless text wrap in PSM.— Ineuw (talk) 05:13, 13 September 2021 (UTC)
@Ineuw: as @110.21.131.174 says, the problem here is the !important on the margin-top rule. This dates right back to when the CSS came from site-wide CSS now archived at here (note that since interface admin rights have been introduced, there original page at Mediawiki/Dynimg.css was editable only interface admins).
I do not have any idea why the !important was specified, but I can only assume it was a "hack" to achieve something so I left it as-is. Thus, it is not related to the "module nature" of the template. But I will have a look and see if I can remove it without exploding anything.
As for the reason {{FI}} got "modulified": the template as-was had two major problems: over-complex HTML that worked badly in some situations, and, much more critically, it loaded the full-resolution image unconditionally, regardless of the image size. The primary "feature" of the module is that it reads the image size and then constructs a reasonable image size to load that is not larger than that. It's not a major concern in this case, as the image is small, but there are examples on Template talk:FreedImg where the images were between 10 and 300 (yes, three hundred) times the size they needed to be. This resulted in things like ~100MB book sizes (both on the web and in exports)
As far as I know, the logic to do this in pure templates is impossible. Even if there was an {{#imgsize}} magic word (or a wrapper template), the logic to derive an appropriate width would be fiendish.
In general, there are a handful of reasons a template gets a module to provide logic:
  • The logic is (or would be) very overwrought in template-mode. For example
  • {{FI}}'s width logic
  • massively complex (and, in places, wrong) constructions in the old {{header}} (which actually still defers to a template {{Header/main block}} for the main layout code which is clearer as a template). {{article link}} is another example where the logic was very complex in template-mode (and I should know, since I wrote the template in the first place!).
  • Another example, that I just fixed: Special:Diff/11686847 is in a template that's almost entirely impenetrable and has thus been hiding a typo in the braces for 6 years.
  • And while fixing that, I discovered {{Is year}}, which is...interesting.
  • when the template has an undefined number of arguments. There is no "for-loop" construct in template-mode, so you have to copy-paste entire blocks n times and hope no-one needs more.
  • The template needs access to Wikidata and also needs to make decisions based on what it finds (so {{#statement}} will not cut it. E.g. {{authority control}}
  • Template mode is substantially more expensive to render and starts hitting limits: E.g. {{ts}}
  • The template needs access to Lua or JSON data (e.g. much of the Monthly Challenge infrastructure reads from data tables to construct the statistics).
  • The template duplicates logic that a template provides - a template can call a module easily with invoke, but a module has to pass a frame object to a call to expandTemplate, so it's sometimes much easier if the logic lives in a module that can just be imported.
There are actually not that many general templates that have module logic. Certainly, if a template is simple enough, it should indeed stay as a template if it can. Inductiveloadtalk/contribs 06:30, 13 September 2021 (UTC)
@Ineuw: I have removed the !important from Template:FreedImg/styles.css. I don't see any obvious breakage (or a way it could obviously break), but please do let me know if you see anything wrong.
Also note that if you want the same formatting for all {{FI}}s in a work, you can now also use:
.wst-freedimg {
    margin-top:4em;
}
in the index styles.css subpage and it will automatically apply to all {{FI}}s therein. Inductiveloadtalk/contribs 08:30, 13 September 2021 (UTC)

Again, thanks. I noticed it. — Ineuw (talk) 17:37, 15 September 2021 (UTC)

"tag" is not "it" ?

So I'm trying to make sense of the WS templates, a "right of passage" I suppose. I came across a misuse of {{ref}} and so looked up that and {{note}} and then {{tag}} to see what they did. And found that {{tag}} was referenced from a bunch of template docs, which did not make sense.

It seems that {{tag}} is used in multiple docs in an attempt at 'quoting' HTML-like 'tags' like <nowiki> or <code> . Like {{tl}} and {{tlx}} are used for templates. But since {{tag}} is for something else the effect is a strange and unhelpful.

An example of problem converting to using {{tag}} is tlx/doc (look for "{{tag"), where for instance

<code><nowiki><code><nowiki>&lbr;{Anytemplate|arg1=23|size=250px|</nowiki><var>other parameters...</var><nowiki>}}</nowiki></code></nowiki></code>

was changed to

{{tag|code|content={{tag|nowiki|content=<nowiki>{{Anytemplate|arg1=23|size=250px|</nowiki><var>other parameters...</var><nowiki>}}</nowiki>}}}}.

with new display being

<code><nowiki>{{Anytemplate|arg1=23|size=250px|other parameters...}}</nowiki></code>


Oh, a mystery solved: w:Template:Tag, see docs there.   These usages assumed enWiki's template.

I'm wondering:

  • could {{tag}} have been changed in some mysterious way after George starting using it?
  • is there a template _here_ that will 'quote' HTML-like tags?
  • if not, should we copy enWiki's template for use here, and under what name?
  • should I go clean up these wrong usages so that the docs look correct again?

Shenme (talk) 19:34, 12 September 2021 (UTC)

First clutch of "bad eggs" done. Shenme (talk) 07:42, 18 September 2021 (UTC)
 
Person who is unrelated to discussion

I've finished proofreading and transcluding this, my first complete text here. I'd appreciate some feedback, to make sure things look good, before I add it to the "New Texts" block on the main page. — Dcsohl (talk)
(contribs)
13:11, 21 September 2021 (UTC)

@Dcsohl: It's gorgeous?! I wondered about this, as I am want to do, but can you identify what or where a concern might arise? That is, 'I wonder what I should do here?' Cygnis insignis (talk) 14:03, 21 September 2021 (UTC)
@Cygnis insignis: Mostly I just wanted to be sure that this work lived up to site standards. I think it does, but I'm hardly unbiased here. I do like your suggested change, since Eleanor Eden was also a novelist and no doubt, eventually, we will have works by her on here. (Though the picture to the right of this block is a different Eleanor Eden, which I think you know based on the caption, but I wanted to be sure.) — Dcsohl (talk)
(contribs)
15:58, 21 September 2021 (UTC)
@Dcsohl: Cheers. I'm with the guy who said they are an 'eventualist', the prefatory bit is a start to their works. There is a view that any work that says 'volume' ought to be sub-paged, I'm not so sure because 'we' try to adhere to the text/index/physical_volume as far as possible, but clearly it is helpful for some works. Cygnis insignis (talk) 16:18, 21 September 2021 (UTC)
@Cygnis insignis: I was thinking, if I did Volume II (which I may or may not), then I might create a "Letters from India" page that pointed to both of them (so you could download a combined epub), and then if we really wanted to move this work to "Letters from India/Volume I", that's easily enough done by a bot. — Dcsohl (talk)
(contribs)
16:25, 21 September 2021 (UTC)
Seems reasonable. If you say it is worth reading I will force the issue by starting the second volume ;) Cygnis insignis (talk) 16:28, 21 September 2021 (UTC)
@Cygnis insignis: The scan of the second volume is missing a ton of pages. There's what looks to be a good PDF at Hathi, but I'm not 100% confident of my abilities in this department... — Dcsohl (talk)
(contribs)
17:22, 21 September 2021 (UTC)
@Dcsohl: I can make and post a scan. Languageseeker (talk) 17:25, 21 September 2021 (UTC)
@Dcsohl, @Cygnis insignis: New Vol 2 posted. Languageseeker (talk) 19:04, 21 September 2021 (UTC)

I have plans to complete proofreading of the Index:A Collection of Esoteric Writings.djvu, in order to be able to put the complete work A Collection of Esoteric Writings of T. Subba Row to the front page of the English Wikisource — to the section "New texts". As some time ago one experienced user (User:Beleg Tâl) explained to me when I asked for clarification (now, when I seeked that question in the archive, I remembered that a year has passed already since that time!), a book cannot be put to "New Texts" until all its pages have been proofread, including those pages which are problematic. Could someone who is good in Indic scripts (may be, some user fron India?), help me and process the problematic pages containing Indic scripts, in order I become able to bring this interesting book to completion? --Nigmont (talk) 16:10, 2 October 2021 (UTC)

Transfer of a page that is in German, a law into this Wikisource

Hi, there is a page [12] that is a Nazi German law that brought back hanging. Would it normal to try and import this into this wikisource? Or do a translation? I have wikipedia article here: [13], an English article on the law and I would like to put a Wikisource template into the article that links to here somehow, with the english version of the law. Thanks. Scope creep (talk) 16:36, 2 October 2021 (UTC)

Can only be here in the Translation: namespace. See WS:TRANS#Wikisource original translations. Beeswaxcandle (talk) 18:29, 2 October 2021 (UTC)

Continuing footnote

Hello, how to do the first footnote in this page? It is in continuation from the previous page. Lightbluerain (Talk | contribs) 17:13, 2 October 2021 (UTC)

Needs to be named on the previous page, and then on this page. See Help:Footnotes and endnotes#Footnotes that continue over page breaks. I've done this one for you. Beeswaxcandle (talk) 18:22, 2 October 2021 (UTC)

What is this?

Hello, what does the "## s1 ##" , "## s2 ##" here in the edit source mean? Lightbluerain (Talk | contribs) 17:41, 4 October 2021 (UTC)

That is the result of using <section />. The wiki software converts it into that. "section" is used for dividing the pages for transcription.--RaboKarbakian (talk) 18:11, 4 October 2021 (UTC)
@RaboKarbakian, is it to be used in all the pages? I didn't know about it so didn't use in any. Lightbluerain (Talk | contribs) 18:08, 5 October 2021 (UTC)
@Lightbluerain, it's definitely not needed in all pages. It's only needed in places where a new chapter/section of the document starts partway down the page, and it's so the tranclusion process doesn't transclude the entire page if the chapter ends partway down. You can find full details of when, why, and how at H:LST. — Dcsohl (talk)
(contribs)
21:02, 5 October 2021 (UTC)
Alright, thanks. Lightbluerain (Talk | contribs) 03:32, 6 October 2021 (UTC)

Do town hall meeting transcripts count as "edicts of a local government"?

In any case I've seen, a city hall meeting consists of a vote for the city government to carry out a certain motion. Can these be counted as edicts of a government? Wikipedia says that town hall meetings "are a way for local and national politicians to meet with their constituents either to hear from them on topics of interest or to discuss specific upcoming legislation or regulation." So does this mean that only some meeting transcripts are edicts, and some are not? I find the edict rule a bit confusing, so forgive my legal ignorance on the subject. PseudoSkull (talk) 17:18, 5 October 2021 (UTC)

I'm pretty sure the answer is no - if the document does not have the force of law, then I don't think it can be considered an edict of government under US Copyright. Records of discussions, like parliamentary debates, might be available under PD-USGov or OGL or something, but not under PD-EdictGov. —Beleg Tâl (talk) 18:17, 5 October 2021 (UTC)
Scratch that. apparently w:Georgia v. Public.Resource.Org, Inc. specifically addressed "whether the government edicts doctrine extends to—and thus renders uncopyrightable—works that lack the force of law" and decided that the answer is yes:

A careful examination of our government edicts precedents reveals a straightforward rule based on the identity of the author. Under the government edicts doctrine, judges—and, we now confirm, legislators—may not be considered the “authors” of the works they produce in the course of their official duties as judges and legislators. That rule applies regardless of whether a given material carries the force of law.

[...]

Moreover, just as the doctrine applies to “whatever work [judges] perform in their capacity as judges,” Banks, 128 U. S., at 253, it applies to whatever work legislators perform in their capacity as legislators. That of course includes final legislation, but it also includes explanatory and procedural materials legislators create in the discharge of their legislative duties. In the same way that judges cannot be the authors of their headnotes and syllabi, legislators cannot be the authors of (for example) their floor statements, committee reports, and proposed bills. These materials are part of the “whole work done by [legislators],” so they must be “free for publication to all.” Ibid.

Under our precedents, therefore, copyright does not vest in works that are (1) created by judges and legislators (2) in the course of their judicial and legislative duties.

Beleg Tâl (talk) 18:32, 5 October 2021 (UTC)
Does it mean that {{PD-EdictGov}} also extends to all works of legislators performed in their capacity as legislators, i.e. not only laws? --Jan Kameníček (talk) 19:27, 5 October 2021 (UTC)
@Beleg Tâl: Should this bit of info be summarized in {{PD-EdictGov}}, to clarify to readers that even these types of works count by extension (to avoid too many redundant discussions at CV, for example)? PseudoSkull (talk) 17:34, 6 October 2021 (UTC)
@Jan.Kamenicek: Yes, though I don't know what the ramifications are for non-legislative work done by legislators (or non-judicial work done by judges). I mean, it clearly doesn't extend to the works of legislators that they do outside their legislative role (like an autobiography written on their own time), but there are a lot of things that politicians do on the job besides write laws. We might have to make our own policy on the subject. I'd be inclined to say that yes, anything that a politician or other legislator does on the job is included, legislative or not, and I think the court's arguments support this view, but I'd poll the rest of the community first. —Beleg Tâl (talk) 14:03, 7 October 2021 (UTC)
@PseudoSkull: Yes. I think that the documentation for {{PD-EdictGov}} should be updated for sure, but I would post a proposal on WS:S before modifying the license tag itself. And we should probably add Georgia v. Public.Resource.Org, Inc. (transcription project) to Wikisource as well, for reference. And make sure Commons is updated too, since they host our scans. —Beleg Tâl (talk) 13:51, 7 October 2021 (UTC)

Spaced-out wikimedia provided helper links don't work

Trying to track down 'who' on a template that got smashed, and at bottom of  Special:Contributions/Great_Brightstar   I click "Xtools counter", which then says "who?" - "The requested user does not exist". Manually type in user name in the dialog and it says "oh... 'them".

But notice that the first link is

https://xtools.wmflabs.org/ec/en.wikisource.org/Great%2BBrightstar

and one that works is

https://xtools.wmflabs.org/ec/en.wikisource.org/Great%20Brightstar

or, over as seen over at en.wikipedia, the link is

https://xtools.wmflabs.org/ec/en.wikipedia.org/Great_Brightstar


Similarly at page bottom the "Global contributions" link we have here is

https://guc.toolforge.org/?user=Great%2BBrightstar

which fails, versus en.wikipedia's

https://guc.toolforge.org/?user=Great+Brightstar&blocks=true


So... where are these helpful links being generated, but that don't understand how to handle user names with embedded spaces? I hit this the other day on another user with spaces in their name. Shenme (talk) 23:34, 6 October 2021 (UTC)

I would say here: {{Sp-contributions-footer}}. The why still needs to be looked into. The "urlencode" seems a good candidate. Mpaa (talk) 17:32, 7 October 2021 (UTC)
I attempted a fix diff. @Billinghurst: and @Inductiveload:, a review is appreciated. Mpaa (talk) 21:42, 8 October 2021 (UTC)
Looks good. @Shenme: My fault, there was a link conversion as there is now a interwiki map for numbers of targets and some updated following server naming changes. I didn't fully account for mw:Help:Magic words {{urlencode}} differences in the those changes and needed to go from PAGENAME to PAGENAMEE (per mw:Manual:PAGENAMEE encoding). Links were tested, but clearly not for all possible variations. [Note to self we need a permanent sandbox testcases for this] — billinghurst sDrewth 23:09, 8 October 2021 (UTC)
Thank you, and hope I'm not being too much a bother. When I'm 'awake' I notice _everything_ (except glass windows - hurts when I try to fly through 'em). Shenme (talk) 23:32, 8 October 2021 (UTC)
  This section is considered resolved, for the purposes of archiving. If you disagree, replace this template with your comment. — billinghurst sDrewth 02:46, 13 October 2021 (UTC)

RTL characters rearrange following non-alpha strings - wondrous weirdness warning

If you will never deal with right-to-left characters like Hebrew א or Arabic ش then skip over this madness - All's well in your world.


Inserting a character from a right-to-left (RTL) character set may rearrange following left-to-right words in a line. It may display delimiters replaced by their partners. But any following (Western?) alphabetic char stops the madness.

The first line below has no magic. The characters are seen in same order as entered. The next two lines have a Hebrew or an Arabic RTL character inserted just after the "ABC".

ABC 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
ABCא 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
ABCش 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67

How confounding. The space-delimited 'words' are order reversed. The "(" has become a ")" in appearance. Everything has become RTL in order, but only up to the next alphabetic character.

This shows up in this non-Page: edit box, and it shows up here for your viewing pleasure. But in Page: space it shows up only in the edit box, but not when displaying the Page: itself! Very irregular.

This was seen in page Page:The New Testament in the original Greek - Introduction and Appendix (1882).pdf/280, which if you'll view and then locate "adopted by" you'll see the view matches the scan - "13-69-124-346 209" is seen. Now go into edit mode and locate "adopted by". That string now is displayed as "209 13-69-124-346". Because there was a א preceding. And only non-alphabetics are rearranged. Rather disconcerting and definitely not WYSIWYG.

I've tried this with Chrome, Firefox and newEdge, on Win10. All show the problem syndrome.


Now here is a good question. If the problem shows up here in this WS section while viewing and then while editing, and in User: space also, and it shows up on the Page: while editing, why doesn't it show up as a problem on the Page: display? A 'normal' that changes depending on location sounds more like politics than proofreading.

Have y'all heard of anything like this? Shenme (talk) 02:23, 8 October 2021 (UTC)

It's standard behavior; https://unicode.org/reports/tr9/ . That page shows some HTML workarounds; <span dir="ltr"></span> around the numbers will keep them in line. Otherwise it will try and arrange the logical order of the sentence around the strongly directional characters, like the letters, flipping certain punctuation and whatnot. I assume Hebrew and Arabic speakers have gotten used to it by now. To properly wordwrap mixed English and Hebrew text requires some sort of rules, and these are the rules we got.--Prosfilaes (talk) 03:02, 8 October 2021 (UTC)
@Shenme, to elaborate on @Prosfilaes's comment: as explained in the Unicode standard he linked to, it is standard behaviour for numbers and punctuation to follow the dominant directionality of the preceding text. Since that was Hebrew, a right-to-left writing system, the numbers get displayed RTL. However, they are still stored with the "13" first and the "209" last; they're just displayed RTL in the editor. But the {{hebrew}} template has an &lrm; entity at the end of it, so that when the text is rendered (i.e. outside the editor), the text immediately resets to English-standard LTR for all characters after {{hebrew}} rather than "waiting" for an alphabetic character.
So what can you do? Not a whole lot, I'm afraid; this is a problem with the editor or rather the fact that the {{hebrew}} template and its embedded LTR marker isn't expanded while editing. I suppose you could always place your own LTR marker (which would duplicate the {{hebrew}} one, but no harm there. Here's one between these parentheses (‎) so you can copy and paste those parens as a pair and then delete them individually, leaving only the LTR marker. This does, though, run the risk that future editors might not know it was there, which could lead to confusion down the line... — Dcsohl (talk)
(contribs)
16:12, 8 October 2021 (UTC)
Thank you for surfacing &lrm; &#8206; (and &rlm; &#8207;) as a specific workaround for correct display.
If I reuse my examples above, and insert &lrm; directly after the RTL chars א and ش below, we now see the correct display:
ABC 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
ABCא‎ 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
ABCش‎ 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
To be explicit, those two lines were changed to
ABCא&lrm; 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
ABCش&lrm; 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
But notice that the HTML entity names contain LTR alphabetic characters. By inserting those in the edit text, we've now switched back to LTR direction during editing. Both edit and display now appear the same!Win-win!
But... if I instead use the numeric-only form of &lrm; - &#8206; - we get correct results when displayed, but not during editing. On display we see correctly:
ABCא‎ 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
ABCش‎ 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
But during editing we still see the out-of-order result:
ABCא&#8206; 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67
ABCش&#8206; 01 - 12 + 23 * 34 % 45 ( 56 XYZ 67


So... to use a mix of LTR and RTL characters when desiring LTR layout, do one or both of these:
  • use the templates, e.g. {{hebrew|א}} or {{arabic|ش}} for correct display (and other benefits)
  • use &lrm; following RTL characters for correct display and editing (and being explicit)
Thank you Χ, Eliyak and others for the use of &lrm; in the templates. Shenme (talk) 21:02, 8 October 2021 (UTC)

New Licence for Singaporean Legislation

From August 2021, the Singaporean Attorney-General’s Chambers announced that reproduction of Singaporean legislation published on Singapore Statutes Online is allowed subject to their revised term of use:

(13) AGC has granted permission to the users of SSO to reproduce any Singapore legislation in any print or electronic material or platform (collectively, “the relevant material/platform”), subject to these Terms of Use and the following conditions:

(a) the following information must be included in the relevant material/platform:
(i) that the Singapore legislation is subject to copyright of the Singapore Government and is reproduced in the relevant material/platform with the permission of AGC;
(ii) that the users of the relevant material/platform may check SSO for the latest version of the Singapore legislation;
(b) the producers* of the relevant material/platform are responsible for ensuring the accuracy of the Singapore legislation reproduced in the relevant material/platform;
“producers of the relevant material/platform” means—
(a) where the relevant material/platform is a printed or electronic material (such as a book) — the authors and publishers of the printed or electronic material; or
(b) where the relevant material/platform is an electronic platform — the owners and operators of the electronic platform.
(c) the reproduction of the Singapore legislation on the relevant material/platform must not create any suggestion that AGC or SSO is associated or affiliated in any manner with the relevant material/platform, the producers of the relevant material/platform or the producers’ affiliates

. . . (The remaining one is related to automated extraction)

If these conditions are allowable on Commons, then most of the files in Category:Laws of Singapore can be promoted to there. Do you think such permission is allowable on Commons? Please advise.廣九直通車 (talk) 09:49, 16 October 2021 (UTC)

  • I think this would count as Commons-appropriate. The requirements listed above require “accuracy,” but I don’t think that equates to a no-derivative-works restriction. There is a requirement for including information, but that seems to be a specific form of attribution. The third item is a standard restriction, independent of copyright law. 廣九直通車: You should also start a discussion on this topic at Wikimedia Commons, if you have not done so already. TE(æ)A,ea. (talk) 14:31, 16 October 2021 (UTC)
    • @TE(æ)A,ea.: Thanks, but after looking further into the terms of use, they stated that Modification of any of the Contents, or use of any of the Contents for any unauthorised purpose, will be a violation of AGC’s copyright and other intellectual property rights, and "Contents" include legislation, so it seems that it's a non-derivative license, and Commons won't accept ND licenses.廣九直通車 (talk) 08:36, 17 October 2021 (UTC)

Not an alphabet

Hello, the special characters like "a" with "e" (not very sure what do we call it) in the second line in "Epipol_" in the image here and other characters like acute "i" how can we type them on mobile and computer both?
I often use Wikisource on Mobile (but sometimes on computer as well), is there no other wayout than to install a new keyboard? Lightbluerain (Talk | contribs) 18:10, 17 October 2021 (UTC)

æ and œ have templates. {{ae}} and {{oe}}. There are others Category:Ligature templates. If you know "i acute" then you can use the escape character &iacute; which is í and kin. They are all named fairly simply.--RaboKarbakian (talk) 18:53, 17 October 2021 (UTC) Also Category:Diacritic templates
And another thing I forgot, there is a radio selector on every edit window that defaults at "wiki markup" or something, but the selector will change the character set. "Symbols", "Math & Logic", "Ligatures", "Accents", etc.--RaboKarbakian (talk) 19:01, 17 October 2021 (UTC)
On a mac / iphone you can hold down a key for a selection of characters: c has ç ć č and so on, and there are for keystrokes for combining diacritical marks ´ ¨ ` etc. which speeds up input. Cygnis insignis (talk) 20:13, 17 October 2021 (UTC)
You can also choose them from the "Special characters" above the edit window. --Jan Kameníček (talk) 20:26, 17 October 2021 (UTC)
On the standard Android keyboard (for a recent version of Android) you can hold a key down on the on-screen keyboard and get a selection of special characters.--Prosfilaes (talk) 22:40, 17 October 2021 (UTC)
Thanks a lot everyone. I'll try them.
Lightbluerain (Talk | contribs) 18:17, 18 October 2021 (UTC)

Help needed: transwiki import and help for transcription

Hi,

I'm a Wikimedian in Residence in Clermont library. I'm contributing myself on Wikimedia project (including Wikisource) and I also train people to contribute by themselves. One person created an index on the French Wikisource but it is actually a book in English, could an admin do a transwiki import of fr:Livre:Costello - A pilgrimage to Auvergne from Picardy to Velay - A 30154 1.pdf here?

Also, since this book is a bit long, any help for transcribing it is more than welcome.

Finally, I'm not sure how to translate fr:Catégorie:Facsimilés issus de Clermont Auvergne Métropole in english for an equivalent category here. Isn't there a "index by source" category here? (@Gweduni: didn't you categorised files from and for the NLS project?)

Cheers, VIGNERON en résidence (talk) 07:41, 18 October 2021 (UTC)

@VIGNERON en résidence: We don't have transwiki import enabled for frWS, so the easiest solution is just to create a new index here and cut&paste over the pagelist (there are also some potential technical issues, related to the dynamic data model, with transwiki import of Index: pages so those are best handled manually in any case).
On categorising the index I'll punt the issue to Billinghurst, but I'm pretty sure we established Category:WikiProject NLS for this purpose. Xover (talk) 09:29, 18 October 2021 (UTC)
@Xover: thanks and true, I did the manual usual creation: Index:Costello - A pilgrimage to Auvergne from Picardy to Velay - A 30154 1.pdf. Thanks, hidden categories are hidden by default (logical ;) ), that's why I missed it, thanks for the pointer. Cheers, VIGNERON en résidence (talk) 09:51, 18 October 2021 (UTC)

  Comment Yes, we a stream of things under category:WikiProjects. Seems that things are otherwise sorted. — billinghurst sDrewth 12:10, 18 October 2021 (UTC)

Interwiki to Translation namespace not appearing

According to https://www.wikidata.org/wiki/Q2898441, the page Translation:Tikunei_Zohar is linked to its source language he:s: page https://he.wikipedia.org/wiki/%D7%AA%D7%99%D7%A7%D7%95%D7%A0%D7%99_%D7%94%D7%96%D7%95%D7%94%D7%A8 However, althought the link appears in the he: page, it does not appear on the en: page. unsigned comment by Nissimnanach (talk) 16:18, 12 February 2021.

@Nissimnanach: This probably happened because the English page hadn't updated from the Wikidata item yet. If this is what's going on, you can fix it by purging the page. —CalendulaAsteraceae (talkcontribs) 06:41, 17 October 2021 (UTC)
Looks like they linked already.. Thanks Nissimnanach (talk) 12:45, 20 October 2021 (UTC)Nissimnanach Nissimnanach (talk) 12:45, 20 October 2021 (UTC)

Who's the author?

I've been validating a work fostered by User:Kwehner and am trying to figure out what would be put in the New Text's entry. The work is Index:Annual report of the superintendent of Negro Affairs in North Carolina, 1864.djvu and the title page is at Page:Annual report of the superintendent of Negro Affairs in North Carolina, 1864.djvu/3.

The index page currently has:

Author   United States. Army. Dept. of Virginia and North Carolina. Dept. of Negro Affairs

That would make the New Text's entry:

{{new texts/item|Annual report of the superintendent of Negro Affairs in North Carolina, 1864|United States. Army. Dept. of Virginia and North Carolina. Dept. of Negro Affairs|1865}}

Tain't such an author, and unwieldy at best. What should be used as author? Shenme (talk) 13:24, 20 October 2021 (UTC)

Thank you, and there he is w:Horace James (minister) and there it is at archive.org]. I'll create the author entry and see about making a link from WP to the work here, with their {{wikisource-inline}} ? Shenme (talk) 16:41, 20 October 2021 (UTC)

Reference errors

I've found some pages with reference errors I don't know how to correct.

First, Page:The philosophy of beards (electronic resource) - a lecture - physiological, artistic & historical (IA b20425272).pdf/75 has a footnote inside a poem inside a {{quote}}, and the footnote itself contains {{quote}}s. Something about this is making the <ref> tags not work properly. At the moment, the footnote is not in <ref> tags, but at the bottom of the page.

The other issue is {{errata}} inside footnotes, which seems to be a problem with nesting footnotes. Pages with this issue are

What can be done with regards to either of these issues? —CalendulaAsteraceae (talkcontribs) 18:07, 20 October 2021 (UTC)

This is the second time for this page. The suggestion was to not use <poem>. See Wikisource:Scriptorium/Archives/2021-09#Nested_footnotes_with_nested_poems.--RaboKarbakian (talk) 22:57, 20 October 2021 (UTC)
@RaboKarbakian: Thank you, that resolved the <poem> issue! —CalendulaAsteraceae (talkcontribs) 23:47, 20 October 2021 (UTC)

Help need with Organizational Chart

Does anyone know how to transcribe the organizational chart here Page:The Chaldean Account of Genesis (1876).djvu/82. Languageseeker (talk) 01:04, 21 October 2021 (UTC)

Use the {{familytree}} suite of templates. Beeswaxcandle (talk) 01:30, 21 October 2021 (UTC)
I'll mention also the {{chart2}} templates, but they have a problem that may also happen with {{familytree}}? They construct a diagram from fixed size units. If the number of units across the page is too many, you get a monstrosity as seen here Page:The New Testament in the original Greek - Introduction and Appendix (1882).pdf/92. We'll probably publish with this, but I won't be happy until I've redone it in SVG, much less wide. Shenme (talk) 01:41, 21 October 2021 (UTC)
And also {{tree chart}} templates. I strongly recommend using a sandbox in your user space to work on any family tree chart regardless of which set of templates you choose to use. Being able to save staging points in my user space was equivalent to saving my sanity.

@Shenme: Yours might not need to be that wide. A recent one I did is Page:Hawaiki The Original Home of the Maori.djvu/187 and the messiest is probably Page:A Dictionary of Music and Musicians vol 2.djvu/265, which is surely more complex than your one. Beeswaxcandle (talk) 02:50, 21 October 2021 (UTC)

  Comment @Languageseeker: Personally I would just use a cleaned-up cropped image, and putting the names into the alt description. I don't see value in trying to transcribe it. What value does it bring? The FT template is not customised for mobile, it is just going to be ugly. — billinghurst sDrewth 13:26, 21 October 2021 (UTC)

How to split strings

I'm trying to figure out how to split strings in a template, specifically dates in yyyy-mm-dd format, with the goal of simplifying the date handling in {{archive header}}. Doing e.g. {{#explode:yyyy-mm-dd|-|0}} to get the year didn't work, and Module:String doesn't appear to have a function to do this. —CalendulaAsteraceae (talkcontribs) 18:32, 13 October 2021 (UTC)

@CalendulaAsteraceae: I am not sure if this would do it, but in Parser functions there is a #time. See mw:Help:Extension:ParserFunctions##time --RaboKarbakian (talk) 18:47, 13 October 2021 (UTC)
@CalendulaAsteraceae We have also Module:String2, which is a clone of the Wikipedia one. I have also now cloned {{string split}}, which is a wrapper around {{#invoke:String2|split}}.
But, depending on exactly what you are trying to achieve, genuine data manipulation might indeed be better. Inductiveloadtalk/contribs 18:52, 13 October 2021 (UTC)
@RaboKarbakian, @Inductiveload: Thank you! {{Archive header}} gets the creation date from the name of the subpage. It was doing this with a switch statement; being able to split strings let me simplify the code a lot, and also let people set the date manually for archives with a different page structure without rewriting the whole description. Diff here. —CalendulaAsteraceae (talkcontribs) 23:05, 13 October 2021 (UTC)
@CalendulaAsteraceae: Every time you see that programming pattern in a template you should start thinking of rewriting it in Lua / a Module. Templates just simply aren't designed for that kind of logic. Xover (talk) 07:18, 14 October 2021 (UTC)
@Inductiveload: Do we really need two different string modules? And a template that exposes a single function from that module that you can just as easily get at with #invoke? Xover (talk) 07:15, 14 October 2021 (UTC)
String2 is an enWP import, since they've done all the hard work. We could reimplement the whole thing in a WS-local module, but then we'd also have to test it and maintain it a lot more.
I'm ambivalent on wrapper templates, but at least they provide a consistent and familiar interface for people who just want a simple template to do a specific task. Also in theory, it insulates "innocent" template users from Lua implementations (e.g. if we moved the split function into Module:String). Inductiveloadtalk/contribs 07:42, 14 October 2021 (UTC)
@Inductiveload: No, I mean: do we really need both Module:String and Module:String2? How many string handling modules do we need? Are they really different enough to merit that? Could we upstream changes to one of them to make the other one obsolete? We're a small enough project that the maintenance costs of a proliferation of implementations of the same thing is something we can ill afford. And wrapper templates don't really insulate users, they just add complexity, add to the maintenance burden, and encourage doing things in the template layer that really should be moved to Lua. Xover (talk) 08:25, 24 October 2021 (UTC)

Difference between Automatic Header and Manual Header

When transcluding The Chaldean Account of Genesis, I noticed that there is a difference in the formatting of Automatic Headers and Manual Headers:

The manual header displays
The Chaldean Account of Genesis (1876) by George Smith
While the Automatic header displays
The Chaldean Account of Genesis (1876)
George Smith

Is there anyway to make the two headers match? Languageseeker (talk) 15:23, 22 October 2021 (UTC)

  • Languageseeker: The only way I can think of is to mess with the formatting of the manual header, so that it matches the automatic header; but that is not an ideal solution. The reason the automatic header displays in such a way is because of the use of override_author to allow whatever content is in the Author field in the Index page to populate that part of the header. Also, the pages with manual headers you have created are incorrect, as the author is not “George Smith” but “George Smith (1840-1876).” TE(æ)A,ea. (talk) 17:15, 26 October 2021 (UTC)

Main Page nl.wikisource

 

Hello,

This is a request from the Dutch Wikisource. To this date our main page is not yet adapted for viewing on a mobile device. I understand this can be done by editing MediaWiki:Common.css. However, that's as far as I get. Could someone help us with this? The content of the main page doesn't have to be changed, just its mobile view. I have already requested help on nl.wikipedia (see w:nl:Help:Helpdesk/Archief/okt_2021#Hoofdpagina_Wikisource), but to no avail.

Thank you.

Regards, Vincent Steenberg (talk) 11:19, 1 November 2021 (UTC)

@Vincent Steenberg   Doing… will update when I have something for you. Inductiveloadtalk/contribs 12:03, 1 November 2021 (UTC)
@Vincent Steenberg have a look at s:nl:Gebruiker:Inductiveload/sandbox and see if that works for you. The CSS comes from s:nl:Sjabloon:Hoofdpagina/styles.css and is largely modelled after the enWS Template:Main page/styles.css. Inductiveloadtalk/contribs 12:29, 1 November 2021 (UTC)
@Inductiveload:. Thanks very much. That works perfectly. Only the tables are a little too wide, but that can be fixed, maybe by not using tables at all, like on en.wikisource. So yes, this is something I can work with. Thanks again. Regards, Vincent Steenberg (talk) 16:10, 1 November 2021 (UTC)
Glad to be able to help. Let me know if you need anything else! Inductiveloadtalk/contribs 16:14, 1 November 2021 (UTC)
Yes, definately helpful. I'm trying to find out why the different containers (featured, welcome, etc.) are too wide. Should I look at this code:
#nlws-mainpage-content {
  /* keep the mw-indicators above the main content
   * normally, they go next to the H1 title, but there isn't one here */
  clear: both;
  display: grid;
  grid-gap: 0.8em;
  grid-template-areas: 
  "header"
  "featured"
  "welcome";
  margin-right: 1em;
}
I'm not familiar with css as you can tell, but maybe (some of) the images are too big? Vincent Steenberg (talk) 17:11, 1 November 2021 (UTC)
@Vincent Steenberg I don't think that is it: it looks to me like one of the elements cannot shrink any further: probably the main image, the "J'accuse" box, or the sister projects table. Inductiveloadtalk/contribs 17:22, 1 November 2021 (UTC)
Yes, I also believe Sjabloon:Hoofdpagina - wikimediaprojecten1 is the culprit. Should be easy to fix. Thanks. Vincent Steenberg (talk) 17:25, 1 November 2021 (UTC)
s:nl:Sjabloon:Uitgelichtbox is also fixed at 24em. I tried to add a max-width but it didn't help much. Inductiveloadtalk/contribs 17:33, 1 November 2021 (UTC)
@Inductiveload: That's strange, on my screen it worked fine. I hope you don't mind if I have a go. I don't want to take up your time. Vincent Steenberg (talk) 20:12, 1 November 2021 (UTC)
Sure, go ahead, let me know if you have issues! Inductiveloadtalk/contribs 20:21, 1 November 2021 (UTC)
@Inductiveload: I think I did it. As far as I can see, the problem wasn't the Uitgelichtbox, but the width of the inputbox in s:nl:Sjabloon:Hoofdpagina - indexen1, which was set at 35. Now 25. So, thank you very much for your help. You really made my day! Regards, Vincent Steenberg (talk) 20:46, 1 November 2021 (UTC)
Great news! Good luck in the brave new world of mobile-friendly Wikis! Inductiveloadtalk/contribs 21:22, 1 November 2021 (UTC)

renaming index name needed

Index talk:A classified collection of Tamil proverbs (IA classifiedcollec00jensiala).pdf . --Info-farmer (talk) 03:31, 3 November 2021 (UTC)

I'm not entirely sure what you're after here. The name of an Index here is the same as that as the File as held on Commons. They match at present, and so there is nothing for us to do. Beeswaxcandle (talk) 03:44, 3 November 2021 (UTC)
I can rename the file as described in the source page at Commons, if i do, any one of you agree to rename here also in all the index pages? becoz i think only sysop can move all the pages without redirection.--Info-farmer (talk) 04:11, 3 November 2021 (UTC)
But, why? The file name, while a bit long, is fine. We don't use it in the mainspace here and there are thousands of files uploaded by Fae that have the IA chunk in their name. Why this one? Beeswaxcandle (talk) 04:23, 3 November 2021 (UTC)
You know that this is the original book name, keeping the original name is best practice. I am not convinced with the existing name. becoz nobody is having right to change the original name for his/her conveniences. As one of the sysop in ta.WS, usally i/we do practice to keep the original file name which will help till the end of the transclution by pycode. Okay it is up to your community consensus. bye.--Info-farmer (talk) 05:02, 3 November 2021 (UTC)
@Info-farmer we "normally" don't move the files not because of any policy as such, but because it would be a huge time-suck for a few users who can perform such actions if we were to move every imperfectly-named file. The IA suffix is untidy (personally, I would prefer a name like A classified collection of Tamil proverbs - Jensen - 1897.pdf, and normally I will upload the DjVu instead of the PDF anyway, and make a better name at that time), but it's not really an issue as such. "Advertising" the IA isn't really a problem IMO. This is just as well, because there are north of a million such files on Commons now. Inductiveloadtalk/contribs 09:26, 3 November 2021 (UTC)
Inductiveload! thanks for your details. A new experience.--Info-farmer (talk) 09:31, 3 November 2021 (UTC)

Unicode: mediawiki too helpful? (Unicode character normalization)

I'm working on a non-mediawiki-type IME for polytonic ancient Greek because I hate hunt-n-copy-n-paste-n-oops (Mediawiki only considers modern Greek). (Doc started at User:Shenme/BetaCodeIME)

The IME works nicely but needs more work. But when I grab a current copy of code from User:Shenme/BetaCodeIME.js it has *many* changes from the previous offline working source.

It seems that Mediawiki is forcing Unicode character normalization on every form of page output, or perhaps at time of saving page text?

Example gruesome details:
  • ' Ώ ' \u1FFB precomposed w:Greek Extended Unicode block
  • ' Ώ ' \u03A9\u0301 decomposed base and combining Unicode chars
  • ' Ώ ' \u038F precomposed w:Greek and Coptic Unicode block
  • so \u1FFB is converted to \u038F because that is preferred by Unicode NFC?
NFC - Canonical Decomposition, followed by Canonical Composition


Ah, oh, next searches found mw:Unicode_normalization_considerations and following that around gets a statement that "MediaWiki applies normalization form C (NFC) to Unicode text input."

So... when your page output doesn't match the input characters, that may be why. Shenme (talk) 22:58, 3 November 2021 (UTC)

Linking to different sections of a periodical

I'm validating some pages for the Popular Science Monthly Magazine Vol 19 and I've come across an article on page 324 (printed pagination) with a footnote "Continued from page 197." So my questions: 1) Would it appropriate in such a case to embed a helpful link in the "[To be continued.]" sign on page 197? 2) If so, how would I best go about doing this: perhaps with the anchor template in the page namespace? Thanks Helrasincke (talk) 10:48, 8 November 2021 (UTC)

naming of items without obvious name

i recently started adding the royal warrant granting the coat of arms of Australia see Index:Royal warrant of coat of arms of Australia.png. how should i go about naming sources like this with no obvious name?Serprinss (talk) 06:17, 10 November 2021 (UTC)

@Serprinss If there's truly no known "parent" work that it could be a subpage of, then any sensible name like Royal Warrant Granting of Coat of Arms of Australia (1912) would do fine (I only added the 1912 because I see there's another warrant from 1908, if there were just one, you would not need this).
Often these things are extracted from some collective work like w:Hansard or some other government gazette, in which case they should theoretically be subpages of the work as published, but if you don't know the parent work, then it's OK. Inductiveloadtalk/contribs 09:00, 10 November 2021 (UTC)

Headers and footers

Esme Shepherd (talk) 15:36, 12 November 2021 (UTC) Why have headers and footers suddenly disappeared? I can't put a header on the main page because it remains visable on transclusion. Are headers being dispensed with?

Esme Shepherd (talk) 15:42, 12 November 2021 (UTC) Ok. I have found the tool under Proofread tools. I was just surprised at the sudden disappearance.

Esme Shepherd (talk) 15:50, 12 November 2021 (UTC) I note also that the summary box no longer remembers previous usage, so the single letter 'h' will have to suffice for 'header'.

The prefill list for that field is stored (I think) in your browser, not in Wikisource. So if you've changed browser or computer or otherwise reset something, that could be it. Inductiveloadtalk/contribs 17:16, 12 November 2021 (UTC)

Esme Shepherd (talk) 20:59, 12 November 2021 (UTC) Thank you. All is well again. I am about to change computer, so I'll bear that in mind when I do.

Poetry in the main: conventions, recommendations, good practices?

I encountered a fully transcribed work that was not yet transcluded: Index:Memoir and poems of Phillis Wheatley, a native African and a slave.djvu and thought I would transclude it Memoir and Poems of Phillis Wheatley, a Native African and Slave. I checked around (a bit) and saw that the "chapters" for works of poetry are typically the name of the poem.

The poems in this book have really long names and the title of the work is also really long. If I just do this with those names, the pages will be like: Memoir and Poems of Phillis Wheatley, a Native African and Slave/To a Lady, on her coming to North America with her Son, for the Recovery of her Health and I am not happy with this.

I would like to shorten everything, like Memoir and Poems of Phillis Wheatley and Memoir and Poems of Phillis Wheatley/To a Lady, on her coming to North America with her Son (which is still long) as there is another "To a Lady" in the collection. But I will happily do whatever the conventional thing is, once I know what that is.

Also, with this question and in regards to my last comment here about poetry: If you have something snarky to say, please 1) say it (it will be appreciated by me, really) 2) say it on my talk page and not here. And if this suggestion causes snark to be, all the better, but on my talk page.  :) Thank you.--RaboKarbakian (talk) 16:01, 14 November 2021 (UTC)

  • RaboKarbakian: “To a Lady (1)” and “To a Lady (2)” is what I would do. A few of the other poem names should be shortened, as well: “Lines, on hearing of the Intention of a Gentleman to purchase the Poet's Freedom” can be shortened to “Lines,” &c. TE(æ)A,ea. (talk) 16:15, 14 November 2021 (UTC)
  • IMO, far too much effort and worry is expended on page titles in general. They're strings that identify the work a little more readably than the page ID numbers, and that's about the measure of them. The only actually functional part of page titles is the slashes that organise pages into a subpage hierarchy, everything else is window dressing. Inductiveloadtalk/contribs 16:27, 14 November 2021 (UTC)
Phshew and thank you!--RaboKarbakian (talk) 17:08, 14 November 2021 (UTC)

Transcribing Valuable Data from Biodiversity Field Notes? Is this possible?

Dear Wikisource Experts,

I'm investigating the use of Wikisource as a citizen science platform for scientific data transcription. This transcription activity is similar in nature to the Old Weather project but with a focus on the extraction of biodiversity data. Currently, there is a lot of data that is buried in these digitized historical texts. Here is just one example of the content:

https://archive.org/details/sea19631966196800natib/page/2/mode/2up

Is there anyone in the Wikisource community that has experience transcribing tables and if so would it be possible to chat with you to understand your process? I have three main questions:

Is this transcription work of interest to the Wikisource volunteer community? Can/Could Wikisource import and export data formats? e.g. csv, xls, tsv, json, xml? If not, do you know of any other transcription platforms that can transcribe data?

Many thanks,

--JJFord BHL (talk) 22:59, 12 November 2021 (UTC)

@User:JJFord BHL The answer is that, yes, Wikisource can be used for such data. However, it doesn't represent such data in any inherently structured format that would map directly to something like CSV. At best you could use a carefully arranged HTML table and hope that the format is consistent enough for handling. And for importation, you'd need some kind of script to generate the markup for display.
At a technical level, an extension or sufficiently advanced scripts might be able to provide an editor and exporter specifically for this data, but it's very doubtful that such a niche thing would find traction at a general purpose project like WS. Inductiveloadtalk/contribs 23:19, 12 November 2021 (UTC)
I made a separate page for some type species so that they could be used at wikidata (wikidata doesn't accept id links) and even though they were connected to wikidata, they were deleted. I see the dead links for them at wikispecies sometimes. JJFord BHL this is a very good place for all biodiversity poetry.--RaboKarbakian (talk) 02:58, 13 November 2021 (UTC)
@Inductiveload and @RaboKarbakian would either of you be interested in being interviewed about Wikisource in general? The BHL Secretariat has sponsored the writing of a whitepaper on the Wikimedia Ecosystem and they would like to understand the Wikisource community and platform a bit more and how it can assist with their OCR challenges. (273,000 OCR files to date in BHL)
If an virtual meet-up is of interest please feel free to add yourself to my calendar here. Many thanks for your helpful responses regarding tabular data -- I understand that this content is challenging to handle from my talks with others who are active transcribers.
Hope to speak with you soon, warmly,
JJFord BHL (talk) JJFord BHL (talk) 20:04, 24 November 2021 (UTC)
@User:JJFord BHL You picked a good time to try out such a project because of two major upcoming changes that will make this easier. First, Wikisource is getting a new imageviewer for the Proofreader Page extension baesd on Seadragon that makes magnification and panning much easier. Second, their is a new generator coming thanks to Inductiveload that will make it easier to export all the pages from an Index. One of the biggest challenge will be getting images of sufficient resolution so that they can be legible. You can do an image based Index which generally run better than a massive PDF. Also, think about how you want to structure the data. A consistent structure will be easier to parse than a hodgepodge. My hunch is that raw wikicode will be easier to import than processeded HTML. Languageseeker (talk) 00:11, 25 November 2021 (UTC)

Namibian Acts: Move to En:WS and Formatting Issues

I recently nominated the Promulgation of National Symbols of the Republic of Namibia Act, 2018 and National Anthem of the Republic of Namibia Act, 1991 on Commons for deletion, as Namibian copyright law protects government works for 50 year since publication, plus there are no exemptions for edicts. Can someone move the two files to English Wikisource locally under {{PD-EdictGov}}? Many thanks.廣九直通車 (talk) 10:15, 25 November 2021 (UTC)

Besides, the format adopted by MarkLSteadman seems to be quite inconvenient for searching and categorization. Will it be better to cleanup the format and titles based on South African legislation, as Namibian legislation share similar format and structure with their South African counterparts?廣九直通車 (talk) 10:15, 25 November 2021 (UTC)

Best practice for missing words in source scan?

In The Poetical Works of John Keats, two lines in book II of "Endymion" are printed defectively in the original scan:

Gold dome, and crystal wall, and turquoise <floor>, (II.596)
High with excessive love. "And now," thought <he,> (II.904)

Can/should anything be done about this? If so, what? (A SIC template, perhaps?) Eureiachthon (talk) 19:47, 14 November 2021 (UTC)

I usually use the {{SIC}} template in those situations. --EncycloPetey (talk) 19:50, 14 November 2021 (UTC)
Thanks very much; that's what I'll do, then. Eureiachthon (talk) 20:13, 14 November 2021 (UTC)
There is also {{reconstruct}} which uses the appropriate brackets for words inserted by a transcriber/translator to ensure accuracy or sense. ShakespeareFan00 (talk) 12:08, 27 November 2021 (UTC)
I also had this very issue in the book I am doing. An editor suggested searching for the nearby words of the missing/unclear word on Google with quotes. This helped me two times so far on my book. You too can try it for yours. Lightbluerain (Talk | contribs) 18:21, 27 November 2021 (UTC)

technical issue

Not very sure if this is some technical issue or what? I use Wikisource through mobile in desktop mode. Everytime I go to the edit tab, while matching the OCR-ed text with the text in the photo by enlarging in the photo in the edit window, I cannot get out of the photo: if I try to zoom out, the photo gets zooned out; if I try to swipe through the screen, the photo gets swiped not the window. I only see the option to reload the tab, which sometimes looses the unsaved edits. Anyone knows the reason behind it? And, is there any solution other than matching the text before entering the edit tab? Lightbluerain (Talk | contribs) 18:37, 25 November 2021 (UTC)

They've changed the way Zoom function works on the scanned page. To avoid this problem, tap/swipe/select in a location other than the photo. It will take some getting used to the new mechanics. I still try to scroll using my mouse, but end up zooming in/out on the photo. --EncycloPetey (talk) 19:16, 27 November 2021 (UTC)

Please delete these:

I am trying to get rid of schizophretic software. Thank you.--RaboKarbakian (talk) 15:01, 28 November 2021 (UTC)

  Done --Jan Kameníček (talk) 16:35, 28 November 2021 (UTC)

Move Constitution of Brazil and related pages

There's no "move" option at the top of the page for regular editors, unfortunately. These are clearer titles, especially when more parts/versions are translated and uploaded here.

If anyone with moving priviledges could move these that would be great, thanks. Linshee (talk) 00:18, 29 November 2021 (UTC)

Linshee look at the menu under "More" at the top of the page.--RaboKarbakian (talk) 02:10, 29 November 2021 (UTC)
RaboKarbakian I'm only seeing "Purge", "Hard Purge", and "Null edit" Linshee (talk) 12:46, 29 November 2021 (UTC)
Linshee Ah, I am sorry. There is an admin here who doesn't like to delete redirects. I can move them, but I cannot delete the redirects and was told not to move things when I left a lot of those not wanted redirects. Hmm, I think that I even tagged them with sdelete.... Apparently, the admin have tools that can make these moves and also delete the leftovers but it also puts a lot of work on them either way. Whether or not this is due to the recent admin who doesn't like to delete things that have been moved, I am sorry that I got involved and pointed you in the wrong direction. That "Move" would have been removed due to a vote of one, iirc.--RaboKarbakian (talk) 16:56, 29 November 2021 (UTC)
There was no vote, and this page is not protected. Move is not available for User:Linshee because they are not yet autoconfirmed here at English Wikisource, as they have only 4 edits (10 are needed). You must be autoconfirmed to be able to move pages at all. This is not special for enWS, it's a standard right that comes with autoconfirmation. Indeed, you must be a sysop to move pages without leaving redirects (since this is functionally a move-then-delete).
@Linshee I will move the pages for you, but if you make 6 more edits, you will be able to move things as well. Inductiveloadtalk/contribs 17:19, 29 November 2021 (UTC)
  Done (but the pages are numbered in Arabic numerals in line with wider convention, so the links are "Title 3", etc). Inductiveloadtalk/contribs 17:38, 29 November 2021 (UTC)

magnifying toolbar tool is messed up

First the magification group was moved from the toolbar to the top right of the editing box and the vertical scrolling is broken. It only magnifies or reduces the image but cannot be scrolled no matter how many times it's clicked. — Ineuw (talk) 02:18, 19 November 2021 (UTC)

The top right is the new place for the viewport controls for the OpenSeadragon-based viewer (there will be more once the OSD patch queue is addressed: rotation and marker lines, which you can see in action here)
As for the scroll behaviour, it's currently got the default OSD configuration, which is click to zoom in, drag to pan and scroll to zoom. I set up phab:T296079 to evaluate if we should re-implement the old system as-was, or find out if there's a better option. Inductiveloadtalk/contribs 12:20, 19 November 2021 (UTC)
@Inductiveload: Thanks for the technical explanation. My post was about not working, not that it was moved to a different location. I just want to be able to scroll the page. But, I see that it was reverted. Thanks.— Ineuw (talk) 17:28, 19 November 2021 (UTC)
@Ineuw it's not necessarily gone forever. it's just not yet implemented in exactly the same way in the first iteration of the new viewer. It's only been deployed for a day, ironing out usability bugs will happen.
If you (and everyone else!) could comment on phab:T296079 (or here, I guess) about what your ideal interface would be, OSD is probably flexible enough to make it happen. For example "scroll = scroll up/down, Ctrl-scroll = zoom, Shift-scroll = sideways scroll" is a very common way to do it. We can (probably) make it feel exactly like it used to, but perhaps we can do even better. Inductiveloadtalk/contribs 17:39, 19 November 2021 (UTC)
Sorry for the delay. I had to re-register my phabricator account access - it's been so long. I am in the midst of a long reply to this post, but first thing first. Don't introduce two handed operation like a keyboard and a mouse, where the mouse middle click, or an imitation of it, was sufficient until now.— Ineuw (talk) 18:40, 19 November 2021 (UTC)
@Ineuw That's a valid point. Personally (bear in mind, I did not implement the PRP OSD viewer, I have just been adding on top of someone else's implementation, so I didn't "do" anything here in terms of the UX) I almost always have my left hand on they keyboard so Ctrl-scroll is much faster for me (especially as the click-to-toggle was always very buggy for me), but it's important to hear how everyone uses the tools.
Also more things to bear in mind:
  • this weeks release train has been rolled back for unrelated technical reasons (a memory leak), so OSD is reverted too. I am not sure if it will be un-reverted, so OSD might be nixed until -11 (not -10, because...)
  • there is no release next week, so we have nearly two weeks to come up with and merge an "ideal" solution anyway
  • if it turn out we have multiple different "ideals" for different people, we can actually have it user configurable (it's already going to be configurable for things like animation speed and zoom rate) Inductiveloadtalk/contribs 19:16, 19 November 2021 (UTC)

┌─────────────────────────────────┘
This problem is back. It's the same problem. Cannot scroll the scan. It changes the magnification instead. Aren't these changes tested??????— Ineuw (talk) 23:30, 29 November 2021 (UTC)

Perhaps I can copy this script to my namespace and remove the magnification? All I would need is the script's name.— Ineuw (talk) 01:01, 30 November 2021 (UTC)

Merging Two Indexes

A Roster of General Officers has been proofread from Index:Southern Historical Society Papers volume 03.djvu and Index:Roster of General Officers (1872) by Charles C Jones Jr.djvu. It's not really missing pages, but two duplicate scans that need to be merged together. Languageseeker (talk) 22:19, 30 November 2021 (UTC)

If I understand corectly, these are not duplicate scans. Rather, one scan includes the same material as part of the scan which makes up the entirety of the other scan. "Merging" these would involve moving all of the pages from the shorter scan by both renaming and renumbering them. Is this correct? --EncycloPetey (talk) 22:32, 30 November 2021 (UTC)
Yes, that is correct. Languageseeker (talk) 23:07, 30 November 2021 (UTC)
The other potential issue I see is that many pages are page-spreads in the shorter scan, but are individual pages in the longer scan. This isn't simply a matter of transfer of pages, because the page-spreads are tables of information. Splitting a table vertically across two pages is not a simple thing to do. It might be better to simply keep the shorter scan separate. --EncycloPetey (talk) 23:22, 30 November 2021 (UTC)
I fine with transferring the pages from the longer scan to the shorter scan. It just feels like A Roster of General Officers should be transcluded from one source. Languageseeker (talk) 23:42, 30 November 2021 (UTC)

Table Trouble Again

I'm currently dealing with Page:Bribery Act 2010 (UKPGA 2010-23 qp).pdf/18, in which there is a minor problem relating to the column header of the text. As this will be used as a basis for other British laws, I'd like to ask how to achieve the desired effect (i.e. no straight line between "Short title and chapter" and "Extent of repeal or revocation") as generated in the index? Many thanks.廣九直通車 (talk) 13:05, 12 November 2021 (UTC)

@廣九直通車 the easiest thing to do is to style it with CSS using Index:Bribery Act 2010 (UKPGA 2010-23 qp).pdf/styles.css and then use ! syntax to define the headings as actual table headings, and give those cell a special style by selecting tr.
The HTML attributes rule and frame are deprecated in modern web standards. Inductiveloadtalk/contribs 13:31, 12 November 2021 (UTC)
@Inductiveload: Many thanks, but what did you just did actually? Or more precisely, what is the CSS stuff?廣九直通車 (talk) 13:35, 12 November 2021 (UTC)
It depends. If the formatting is re-usable across multiple documents, the best thing will be to create a template (e.g. named {{UKPGA schedule table start}}, but the exact name will rather depend on where this kind of table is used). Then you put the CSS at Template:UKPGA schedule table start/styles.css and the template would be something like:
<templatestyles src="UKPGA schedule table start/styles.css" />
{| class="wst-ukpga-schedule-table"
and the rest of the table is as normal.
If the formatting varies between documents, it might be easier to just use custom CSS in the index's own CSS each time. Inductiveloadtalk/contribs 13:40, 12 November 2021 (UTC)
Just another humble request, it seems that the bottom line is not added. I think when all the pages are complied, the bottom line between "Scotland Act 1998 (c. 46)" (in p. 18 bottom) and "Anti-terrorism, Crime and Security Act 2001 (c. 24)" (in p. 19 top) should not appear (i.e. using {{nopt}} and placing the table end for p. 18 at the footer, and table beginning for p. 19 at the header). Can CSS do this?廣九直通車 (talk) 13:46, 12 November 2021 (UTC)
@廣九直通車, yes, you can just set border-bottom: 1px solid black; on the table's class. It will have a bottom border at the end of the page in page namespace, but at the end of the whole table in the transclusion. Inductiveloadtalk/contribs 13:57, 12 November 2021 (UTC)

Two types of tables used in UK Acts

I've take a look on the styles, and in general, it seems that the Stationary Office used two type of tables when printing British Acts:

Can someone familiar with CSS consider create corresponding templates (like, Template:UKPGA schedule format 1/styles.css and so)? Many thanks.廣九直通車 (talk) 08:40, 5 December 2021 (UTC)

@Inductiveload: Sorry to interrupt you again, I've tried to mimick that kind of table format on Index:Criminal Damage Act 1971 (UKPGA 1971-48 qp).pdf/styles.css, but it seems don't working. Can you help with the table style? Many thanks.廣九直通車 (talk) 13:43, 5 December 2021 (UTC)
@廣九直通車 perhaps, but I can't see any table that actually uses that class, so I can't test anything without also having to transcribe the table itself. Inductiveloadtalk/contribs 13:53, 5 December 2021 (UTC)
@Inductiveload: Thanks for your reminder. Based on the tested style on Non-Fatal Offences Against the Person Act, 1997, I've made a test version on Page:Criminal Damage Act 1971 (UKPGA 1971-48 qp).pdf/9 and Page:Criminal Damage Act 1971 (UKPGA 1971-48 qp).pdf/10. It looks almost the same, except that for unfinished tables, there should be no bottom line at the ending of the ended table.廣九直通車 (talk) 14:12, 5 December 2021 (UTC)
@廣九直通車 I don't think the presence or not of that bottom line in page namespace matters much. If you wanted to, you could override manually with something like {{page other|1=style="border-bottom:none;"}} to suppress it in page namespace only. You can also use CSS to target page NS only by targetting .ns-104. But, IMO it's a waste of time to worry too much about Page NS presentation. Inductiveloadtalk/contribs 14:21, 5 December 2021 (UTC)
Many thanks for your help on table styles, (and sorry for the request — I originally thought about using the kind of CSS templates plus my previous experience to do standardization). Now that I think I have a better understanding on table formatting, I'll probably go forward to deal with the CSS templates based on your advice, thanks!廣九直通車 (talk) 01:20, 6 December 2021 (UTC)

How to transclude pages missing in DjVu?

Hi all, Please have a look at s:nl:Index:Frederik van Eeden-Johannes Viator(1895).djvu. Page 123 and some others are missing from the DjVu file. Therefore, they do not show up in the readable version, e.g. , note 123 missing. How to fix this? TIA, henne (talk) 21:48, 30 November 2021 (UTC)

@Hamaryns what you can do here is proofread the page from an alternative source (which appears to have been done). The text will transclude as normal, even if the scan image is just a placeholder. Normally, you should leave a note somewhere, perhaps on the index talk page and a comment <!-- like this--> on the page to inform future editors where the text came from.
You can also optionally repair the scan file by inserting pages from an alternative source. WS:LAB can help you with that: to request such a repair you would just need to indicate where the alternative pages can be found and which pages need to be replaced. Inductiveloadtalk/contribs 22:12, 30 November 2021 (UTC)
@Inductiveload: I’m not sure I understand you correctly. Pages 123 and 127 are proofread, but if you go to the reading version ([14]), they are not there.
I will have look at repairing the file, but I do not have a scan of the missing pages. Hamaryns (talk) 19:29, 1 December 2021 (UTC)
@Hamaryns: -- Those two pages have the format png, not djvu. They are not actually part of the file. That's why they are not showing up on transclusion. Hrishikes (talk) 01:23, 2 December 2021 (UTC)
@Hamaryns: -- As for the missing pages, they are available in the HathiTrust scan. -- Hrishikes (talk) 01:45, 2 December 2021 (UTC)

Racist text in old books

I am in the process of transcluding a book of poetry to Cy Wikisource. It's a book from the turn of the 19/20 C. I have some dilemmas regarding one of the poems which is extremely racist. The translated title would be "S***o and the pipe - he wants to be white". The poem is a moral lesson in the evils of smoking. It talks about a boy of African descent smoking a pipe in order to be as good as a white man, but the tobacco makes him ill. The moral is that smoking never improves your status. What should I do with the poem 1) Publish it, to be true to the original work, 2) Publish it with a note of explanation of the aspects of the period, with a disclaimer on the part of Wikisource, 3) Completely omit it?

If I do publish it, with or without comment, would I (and / or Wikisource) be liable for publishing racially hateful materials under the laws of the US; where the poem would be hosted, or England and Wales from whence it would be published? AlwynapHuw (talk) 03:30, 26 November 2021 (UTC)

@AlwynapHuw: The goal of Wikisource is to accurately digitize and preserve texts. Such texts, while deeply problematic, provide crucial evidence of past values and beliefs. Altering such texts would destroy evidence of past racism. Their transcription does not reflect either that of the transcriber or that of enWS. Languageseeker (talk) 03:38, 26 November 2021 (UTC)
@Languageseeker: Thank you very much for your prompt reply. This is the first time I have encountered the issue so your guidance on how to deal with it is very much appreciated. AlwynapHuw (talk) 18:50, 26 November 2021 (UTC)
Add a template as a disclaimer?--Jusjih (talk) 00:48, 7 December 2021 (UTC)

Transcription Section Help

I transcluded some poetry, but when sections like The Battle of Waterloo (Neilson)/Lawrie O'Broom's Rambles from Ireland to Scotland are transcluded, the beginning part of the section starts at the right point. But the end skips past the end of the section and includes the rest of the page. DoublePendulumAttractor (talk) 23:43, 6 December 2021 (UTC)

User:DoublePendulumAttractor you didn't have a <section end= for the software to know where to break it at. I put one on it and (I think) it seems to be working now, but you might want to check.--RaboKarbakian (talk) 00:04, 7 December 2021 (UTC)
Oh! That seems to have done the trick. Thanks! DoublePendulumAttractor (talk) 00:32, 7 December 2021 (UTC)

Charinsert help

Hello, I'm having a problem with my custom charinsert. I'd like to add <div style="line-height:1.3;"> but the decimal in 1.3 is interpereted as a space. Is there a way fix to this? Jpez (talk) 06:03, 3 December 2021 (UTC)

@Jpez: Where are you trying this? When I input
{|
|style="vertical-align:top;"|Testing<br/>
Testing
|style="vertical-align:top; line-height:1;"|Testing<br/>
Testing
|style="vertical-align:top; line-height:1.3;"|Testing<br/>
Testing
|style="vertical-align:top; line-height:1.5;"|Testing<br/>
Testing
|style="vertical-align:top; line-height:2;"|Testing<br/>
Testing
|}
the result is
Testing

Testing

Testing

Testing

Testing

Testing

Testing

Testing

Testing

Testing

which looks like the 1.3 line height is working. —CalendulaAsteraceae (talkcontribs) 01:38, 4 December 2021 (UTC)

It's in the charinsert tool where you can enter your own custom entries. Line 17 here
Jpez (talk) 05:09, 4 December 2021 (UTC)
@Jpez: Please don't use raw HTML markup in works. If you need to adjust the line height (and think carefully about whether you really need to do that; on, e.g., Page:Heaven Revealed.djvu/216 replicating the tighter line spacing is not strictly needed), use the {{lh}} or {{lh2/s}} templates. There is now also the possibility to apply CSS per work using the "Styles" tab shown on the Index: page, which can target the containers provided by other templates. Xover (talk) 09:24, 4 December 2021 (UTC)
@Jpez: My two cents worth, the standard line height for 100% font-size is 1.4 or 140% but please listen to User:Xover. The {{ts}} has a variety of line height definitions. The lh is automatically and proportionally increased for > 100% font-size. It's the < 100% font-sizes that don't reduce the lh. Here is my default definition with which I start building a table, or define a <div>
Testing
Testing
100%
Testing
Testing
100% line height
Testing
Testing
130% line height
Testing
Testing
150% line height
Testing
Testing
200% line height

— Ineuw (talk) 00:19, 7 December 2021 (UTC)

P.S: One cannot define different line heights for a single row.— Ineuw (talk) 00:25, 7 December 2021 (UTC)

Thanks guys, I didn't know a template for this existed or else I would have surely used it. The reduced line height is used as a form of quotation in some of the works I'm working on and I'd like to keep it to also preserve the character of the book. But if there is a serious reason not to use the reduced line height I have no problem forfeiting it in place of some other form of quotation. Jpez (talk) 06:30, 9 December 2021 (UTC)

Smallrefs

Hello,

So, I have been working on Historic Highways of America/Volume 1, and have finished proofreading and transcluding. When I look at the proofread pages (one by one in the Monthly Challenge workspace, or whatever it is called), the references appear small in the footnotes. However, when I head to any of the actual completed pages on Wikisource, the references are no longer small. Does anyone know what I am doing wrong? Do I have to put smallrefs in the footer of every page, even those that don't actually have any in text references on that page? Or is it to do with how I have performed the transclusion (which was my first attempt, and I could easily have missed something, or many somethings)?

Thanks,

Hi, thank you for your work on this book. You would need to add {{smallrefs}} to each page in the main namespace, see Historic Highways of America/Volume 1/Chapter 3 Languageseeker (talk) 02:44, 9 December 2021 (UTC)
Hi, Thanks for the help. Out of curiosity, what is the point of using smallrefs in the footers of each of the individual pages when proofreading (that actually have references on them)? Or does adding smallrefs to each page of the main namespace only work if you have already used smallrefs in the footers of each of the individual pages? Thanks, TeysaKarlov (talk) 03:08, 9 December 2021 (UTC)

Changed file name

I uploaded Lady Anne Granard 1.pdf some time ago and have now added all the text. The title is Lady Anne Granard, or Keeping up Appearances. Volume 1. More recently I uploaded Lady Anne Granard 2.pdf, this being the second volume with the title Lady Anne Granard, or Keeping up Appearances. Volume 2. For some reason the name of this file has been changed. The first is correctly shown as Index:Lady Granard 1.pdf and the second should have been Index:Lady Anne Granard 2.pdf, instead of which is has been changed without a by your leave to Index:Lady Anne Granard, or Keeping up Appearances.pdf. I cannot re-upload Lady Anne Granard 2.pdf because I am told that file is already uploaded. I can, I suppose work around this incorrect Index name but I am concerned about what might happen when I come to upload Lady Anne Granard 3.pdf, the final volume. Have you any explanation as to why the file name was changed? Esme Shepherd (talk) 19:57, 10 December 2021 (UTC)

Whilst I would prefer my original file name of Index:Lady Anne Granard 2.pdf, I have put in a request that the name be changed to Index:Lady Anne Granard, or Keeping up Appearances Volume 2.pdf. Without the volume number it is obviously wrong as this is a three volume work. I don't mind uploading the third volume as Lady Anne Granard, or Keeping up Appearances Volume 3.pdf, if that is what you prefer but including alternative titles seems a bit pointless when nobody ever refers to them. Many famous 19th century works had them and few know what they are. In any case, why include both titles when it should be either one or the other?Esme Shepherd (talk) 11:41, 11 December 2021 (UTC)

The file is now Lady Anne Granard, or Keeping up Appearances Volume 2.pdf with all my work up to date in place, so I can go on from here. No comment has been made as to what went on here, or what the expectations were.Esme Shepherd (talk) 17:12, 11 December 2021 (UTC)

@Esme Shepherd: So far as I can tell from the logs, you uploaded this as File:Lady Anne Granard, or Keeping up Appearances.pdf on December 6 at 17:02. Then today at 12:15 you requested file be renamed to File:Lady Anne Granard, or Keeping up Appearances Volume 2.pdf, which request a Commons file mover executed at 13:32. There is now a file redirect in place at the former name, which is why Index:Lady Anne Granard, or Keeping up Appearances.pdf still works. There has never been a file at File:Lady Anne Granard 2.pdf. The "Index:" pages here at Wikisource are immaterial to the issue: they simply match the file name at Commons and moving an Index: page has no effect on the File: over on Commons. Xover (talk) 17:48, 11 December 2021 (UTC)

From what you say, there can only remain a mystery, which I suppose doesn't matter now. I do not have a file called Lady Anne Granard, or Keeping up Appearances.pdf, nor have I ever had one. The only file for this volume on my computer is Lady Anne Granard 2.pdf. When I did try later to upload this, again, as I remarked, my upload was rejected because this file had already been uploaded. Furthermore, up until 10th December, I thought I was adding text to Index:Lady Anne Granard 2.pdf (pages 1-43). It was only on the 10th December that I noticed the name Index:Lady Anne Granard, or Keeping up Appearances.pdf but posted pages 44-53 anyway before raising this issue. Anyway, I would like to thank the Commons file mover for the rename from which I hope I can proceed without further confusion.Esme Shepherd (talk) 20:40, 11 December 2021 (UTC)

Searching for pages with transcluded content

Is there a way to do this? I'm trying to find works which have transcluded content and a text quality indicator. I've successfully used PetScan to search for pages which have a text quality indicator and use the {{page}} template, but can I search for works that transclude using the <pages /> tag, using direct transclusion with the {{NS:PAGENAME}} format, or using labeled section transclusion? —CalendulaAsteraceae (talkcontribs) 22:22, 14 December 2021 (UTC)

@CalendulaAsteraceae I'm not really sure about a general solution. But if you just want to find "some" instances of dodgy transclusions to whittle down, you can do something quick and dirty like a regex search: insource:/\{\{Page:/. I'm not sure that will find all of them (and I'm not sure it can do LST), but it gives something! Inductiveloadtalk/contribs 23:01, 14 December 2021 (UTC)
@Inductiveload: Thank you! Combining that with searching for pages with {{TextQuality}} got me some results. I'll poke at the search to see if it can find other transclusion methods. —CalendulaAsteraceae (talkcontribs) 05:39, 15 December 2021 (UTC)
Searching for pages and LST/section also got results! —CalendulaAsteraceae (talkcontribs) 07:40, 15 December 2021 (UTC)

Force a Table left

Hello,

I was working on Jack London's the People of the Abyss in the Monthly Challenge, and almost all the pages are now validated. Only the final page (admittedly an advertisment) is not. Technically, there are two issues. One, some strange symbols in the title, which given that it is an advertisment, I am not too fazed about missing. The second, is that when trying to insert a brace, I used a table, which seems to be automatically centreing, rather than being left aligned/floated left. Is there some way to force the table to the left (just curious in general, unless the default is being overriden by the brace parameters), or equally, to insert a brace in some other manner such that by default the text is left aligned. If this can be done, and the page validated, the entire work should then be complete.

Thanks,

Table of contents with "chapter descriptions"

In the table of contents to The Angels of Mons, the second-to-last entry comes with a "chapter description" hanging below it whose formatting I can't figure out how to reproduce. I've tried any number of permutations of the various TOC templates, and the closest I can get it is as it appears now (the description is properly placed, but the page number is vertically aligned with it instead of with the main entry). Does anyone have any idea how to fix this? Eureiachthon (talk) 01:51, 17 December 2021 (UTC)

Eureiachthon Put it on a separate line. Not sure which toc template to use, but that line doesn't have dots so it should be not a big problem to just give it a separate (and the next) line.--RaboKarbakian (talk) 02:31, 17 December 2021 (UTC)
All right, I think I've got it now; thanks for the help. Eureiachthon (talk) 03:09, 17 December 2021 (UTC)

Monthly Challenge : Middlemarch

Hello, I am trying to proofread Index:Middlemarch_(Second_Edition).djvu but the quality of the original text provided is very poor. Can someone please give me a better text ? I have downloaded on my PC a PDF copy which is excellent. --Stamlou (talk) 16:27, 18 December 2021 (UTC)

@Inductiveload: Do you think that it's possible to replace the entire scan with [15]? Languageseeker (talk) 16:51, 18 December 2021 (UTC)
@Stamlou: Heads up that your link is malformed. —Justin (koavf)TCM 17:35, 18 December 2021 (UTC)
Oups ! Thanks. Stamlou (talk) 17:56, 18 December 2021 (UTC)
No worries, mon sœur. —Justin (koavf)TCM 18:01, 18 December 2021 (UTC)

Something strange going on with The Professor, Volume 1

Hello,

Earlier today, the pages for the Professor, Volume 1 were transcluded using, e.g. pages index=The Professor (1857 Volume 1).djvu from=XXX to=XXX. Now, as I was proofreading/validating at the time (with Chrisguise and DoublePendulumAttractor), I had a look at the pages created for chapters 1-12, and everything looked fine (on my end at any rate), where by fine, I mean they matched the individually proofread/validated pages. However, checking back now (a couple of hours later), the transcluded chapters look very different. The 'pages index=The Professor (1857 Volume 1).djvu from=202 to=233' are no longer a part of the wikisource pages, with what looks like the entire chapters' text somehow copied in. My question is, how/why has this happened, and can it be reverted? Or is this just happening to me, i.e. could someone else please check what the finished work (for volume 1 anyway) looks like?

Thanks,

Looks fine to me. Could you possible mix up The Professor: a Tale/Volume 1/Chapter 12 (transcluded) with The Professor/Chapter XII (unsourced)? Languageseeker (talk) 05:48, 19 December 2021 (UTC)
I think that is what has happened. Why does the other exist? The formatting on the unsourced version is displeasing, to say the least. Is the new version being proofread such that the old can be deleted? TeysaKarlov (talk) 05:54, 19 December 2021 (UTC)
@TeysaKarlov Along time ago, from the 1970s until the mid-2000s, people were excited about having the possibility of posting texts on line and began creating digital texts. However, they did not have the means to use advance formatting or even proofread from the original texts. Still, they wanted to make the great books available for free for everyone. So, they created basic texts copies from an amalgam of modern reprinting. These were posted on whole myriad of sites. In the early 2000s, Wikisource came about to gather all the texts scattered across the internet in one corner. Quickly, the realization came that many of these texts were quite problematic. So, ProofreaderPage extension came into being to allow for the direct comparison of transcribed text and original text. However, many felt that why create a new, scan-backed copy of an existing text when so many texts didn't even have a single digital edition. So most of the major works remain stuck in their 1990s form riddled with errors and subsequent alterations. The Monthly Challenge is a way to replace these 1990s texts with scan-backed versions so that these texts can finally be available in a high-quality version faithful to their original pubilication. Languageseeker (talk) 06:27, 19 December 2021 (UTC)

Is this document good to upload?

Hi, I'm a few months new to Wikipedia and a few minutes new to here and so I want to make sure I'm getting this right.

Is it good to upload this https://emrlibrary.gov.yk.ca/gsc/economic_geology_series/16-1962/egs_16.pdf It's a 1960's government scientific publication from Canada, in the public domain. CT55555 (talk) 20:02, 20 December 2021 (UTC)

If it's in the public domain (in both the US and Canada), then you would not upload it here, but at Commons. A transcription from the PDF would then be made here. --EncycloPetey (talk) 21:08, 20 December 2021 (UTC)
@CT55555: We generally host our media at Commons, which then allows us to use it on several other projects, such as our encyclopedia, Wikipedia and this collection of original sources, Wikisource. Have you seen c:Commons:Project scope or c:Commons:Licensing? Let me know if you need any help. —Justin (koavf)TCM 22:12, 20 December 2021 (UTC)
@CT55555: The key lies in that "…in the public domain" bit. The publication as such is within scope here, but the question is whether it is compatibly licensed (public domain being one compatible licensing status). From a quick look, and presuming the Economic Geology Series is a series of reports or similar published by the Geological Survey of Canada and written by employees of the Geological Survey, that would make it Crown Copyright in Canada. Canadian Crown Copyright seems to mostly expire 50 years after publication, so this 1962 work's Canadian Copyright expired at the end of 2012. That means it was still in copyright in Canada on the URAA date (1 January 1996), which in turn means its US copyright was restored to a publication + 95 years term. For both enWS and Commons the issue is then that on the face of it, the work is in copyright in the US until 2058 even if its Canadian copyright has expired and can't be hosted on either project. Xover (talk) 07:11, 21 December 2021 (UTC)

Formatting help with the title page

Could someone please look at Page:The Federalist, on the new Constitution.djvu/7 and add the appropriate formatting? It's the title page, so it's a bit more complicated than most pages. WhatamIdoing (talk) 17:05, 22 December 2021 (UTC)

I think this is a start. The book's CSS could be created to define a particular serif font and then some span used to display a different font for the slab serif section but even if that never happens, that's not a tragedy. —Justin (koavf)TCM 18:06, 22 December 2021 (UTC)
Thanks, Justin. This is a big improvement. WhatamIdoing (talk) 07:45, 23 December 2021 (UTC)
It's a team effort, What. I can't be what I am without you being what you are. —Justin (koavf)TCM 07:54, 23 December 2021 (UTC)

How do make a custom frame?

Hello. Basically, I'm now proofreading the scan version of "The Life and Adventures of Santa Claus" and found page 13 as the title page. Somehow I kinda remember seeing somewhere, where you can make a custom frame based on the scan, but I couldn't find it anymore. So, how do make a custom frame? Thanks. Mnafisalmukhdi1 (talk) 00:03, 25 December 2021 (UTC)

Mnafisalmukhdi1 that book is very format heavy. I don't know if templates exist, but divs with negative margins work nicely. Well, templates do exist. There is {{flow under}} and {{overfloat image}} but putting just the plain wiki image first and wrapping the text in a negative margin-top div works really well. And, if you don't add some margin to the bottom, the whole thing will end up in the editing area. The book has beautiful images, do you plan to do the images up nicely?--RaboKarbakian (talk) 01:31, 26 December 2021 (UTC)
First of all, I don't think to add images from other page too. It is beautiful, but there's too many of them. However, I think for such chapter heading, {{flow under}} will useful.
For now, I just want page 13 and I think {{overfloat image}} is the answer. Thanks, and I'm gonna think for the chapter heading that I've already mentioned. Mnafisalmukhdi1 (talk) 01:44, 26 December 2021 (UTC)

Unfamiliar symbol in text.

THIS PAGE uses an unknown symbol (to me), the closest I found was the symbol of Jupiter " ". But I have serious doubts. Can someone familiar with old texts could please look at it? Ineuw (talk) 16:16, 26 December 2021 (UTC)

Pretty sure it's the Cuatrillo Ꜭ. MarkLSteadman (talk) 16:31, 26 December 2021 (UTC)
Thanks. You can be more than just pretty sure. :-) Ineuw (talk) 17:11, 26 December 2021 (UTC)

Transferring Index Files Deleted from Commons

With reference to this unreplied help request, the two related index files (at here and here) have been deleted from Commons. Now that I have requested for temporary undeletion at there to facilitate transferring these files to here under {{PD-EdictGov}}, but it seems that the upload form don't work when a file exists on Commons. Can someone help with dealing with the transferal process? It seems that adding local description also don't work.廣九直通車 (talk) 07:02, 26 December 2021 (UTC)

廣九直通車: I have uploaded those files here. I changed the names slightly File:Government Gazette of the Republic of Namibia-No 321.pdf and File:Government Gazette of the Republic of Namibia-No 6807.pdf. They are just uploads, meaning, no info template and no categorization. There is an upload link in the navigation links. "Ignore warnings" has to be toggled to upload duplicate files. Possibly, I could have left the names the same also, by toggling that ignore warnings button, but that is for me to learn on another day.--RaboKarbakian (talk) 14:09, 26 December 2021 (UTC)
Thanks for your assistance. I was later found mislead by insufficient information provided on c:COM:Namibia, as it is found that Namibian copyright law did have provision that releases laws into their local public domain. I think once the corresponding copyright templates are completed on Commons, the local versions can be deleted (plus some other index files of Namibian Acts can be moved to there as well), regards.廣九直通車 (talk) 09:59, 27 December 2021 (UTC)
@廣九直通車: If c:COM:Namibia also gets updated with the new information I'd say the net effect is very positive. Thanks for looking into this and following up! Xover (talk) 10:47, 27 December 2021 (UTC)

side to side images

how do you make images go side to side like shown with the first 2 images on this page? my images are File:Fig. 2. (A complete course in dressmaking, (Vol. 2, Aprons and House Dresses)).png File:Fig. 3. (A complete course in dressmaking, (Vol. 2, Aprons and House Dresses)).png

Place two single images in a table with two rows and two columns. The first row is for the images side by side, the 2nd is row for the side by side captions. Ineuw (talk) 14:36, 29 December 2021 (UTC)