Commons:Village pump/Technical

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Shortcuts: COM:VP/T • COM:VPT

Welcome to the Village pump technical section
Technical discussion
Village pump/Technical
 Bug reports
 Code review
Tools
 Tools/Directory
 Idea Lab



This page is used for technical questions relating to the tools, gadgets, or other technical issues about Commons; it is distinguished from the main Village pump, which handles community-wide discussion of all kinds. The page may also be used to advertise significant discussions taking place elsewhere, such as on the talk page of a Commons policy. Recent sections with no replies for 30 days and sections tagged with {{Section resolved|1=--~~~~}} may be archived; for old discussions, see the archives; recent archives: /Archive/2024/08 /Archive/2024/09.

Please note
 
SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 1 day and sections whose most recent comment is older than 30 days.

Special:UncategorizedCategories says it runs once a month. It is now five days beyond that. Even once a month is a long time for this report, which used to run every three days. No doubt partly as a result of that longer interval, we have gone from having about 100-200 such categories early this year to over 1500 in the latest (July 7) report. I've dealt with several hundred of those, and would really like to have a more updated report to work from. As far as I can tell, there is no reasonable way to find uncategorized categories other than this report, and except for the rare false positive, every single uncategorized category genuinely represents a task to be done. - Jmabel ! talk 19:31, 12 August 2024 (UTC)[reply]

Why was it changed to much less frequent runs? I think it would be better if like with Suggested Edits, categories were suggested for these (and also categories that already have categories) – for example using the categories of the Wikipedia items if the category is linked to (these could have cats with matching cats on WMC or cats that have parent cats with matching cats on WMC). Prototyperspective (talk) 23:20, 19 August 2024 (UTC)[reply]
It's updated. Count of 11:11, 22 August 2024 is 1534. Enhancing999 (talk) 12:25, 22 August 2024 (UTC)[reply]
phab:T369024: Seems the outcome is Special:UncategorizedPages got deactivated instead.
@Ladsgroup can't we keep both?
A way to speed it up could be to limit Special:UncategorizedPages to namespace 0. Pages don't need to appear there and on Special:UncategorizedFiles.
FYI: @Bawolff, @Mdaniels5757, @JopkeB. Enhancing999 (talk) 14:00, 22 August 2024 (UTC)[reply]
MediaWiki currently does not support setting Special:UncategorizedPages to use namespaces different than the Content Namespaces Bawolff (talk) 09:27, 23 August 2024 (UTC)[reply]
The query for Uncategorized pages is really strange. I wonder what usecase it's meant for. It's much quicker when it's limited to namespace 0. Adding namespace 6 and filtering that for pages without files is just strange. It seems to be me that the bug needing fixing is to write correct SQL. Enhancing999 (talk) 09:42, 23 August 2024 (UTC)[reply]
Thanks for pinging me. My questions:
  1. What exactly is the problem? What is the frequence of runs yet? @Jmabel: What would be a good frequence for you?
  2. What is the purpose of phab:T369024 exactly? Please explain in plain English. I have not enough knowledge of this tool to understand what the consequences of this task might be. Anyway: Special:UncategorizedPages should still run now and then, it should not be deactivated. If the frequence now is a problem, I think it might run weekly or perhaps even less frequent, but we cannot do without it.
  3. What does "limit Special:UncategorizedPages to namespace 0" mean? And in the next sentence: what does "there" mean? Now it looks like referring twice to the same list.
JopkeB (talk) 09:34, 23 August 2024 (UTC)[reply]
As you noted elsewhere Special:UncategorizedPages included both galleries (namespace 0) and file description pages that are empty (namespace 6). These empty file description pages appear also on Special:UncategorizedFiles.
The query for namespace 6 in Special:UncategorizedPages is really resource intensive at Commons (this lead to the bug above). I can't really think of a cases where it's needed though.
As the bug report isn't very clear, it lead to the wrong "fixes" and Special:UncategorizedPages is currently deactivated. Fixing it correctly should make updating Special:UncategorizedPages much faster. Enhancing999 (talk) 09:51, 23 August 2024 (UTC)[reply]
@Enhancing999: Thanks for your explanation. How would Special:UncategorizedPages get correctly updated, what should be done? Is phab:T369024 making that happen or is it the other way around: is that the ticket implementing the deactivating and should we ask for a new ticket/task? JopkeB (talk) 11:18, 23 August 2024 (UTC)[reply]
BTW, currently there are few pages on Special:UncategorizedPages and Special:UncategorizedFiles, but this is due to several people cleaning them up fairly thoroughly since JopeBe's report above. This does not mean the reports aren't needed or should be run less frequently. Enhancing999 (talk) 09:59, 23 August 2024 (UTC)[reply]
Hi, Page table of Wikimedia Commons currently has more than 110M rows, its categorylinks table has more than 800M rows. Joining these two tables with such conditions is too expensive in our production and can cause issues and bring down our databases. That's why we had to reduce its frequency to once a month. We will eventually migrate these reports to hadoop and bring back its previous frequency but that's far in the future (phab:T309738). In the meantime, you can improve the condition by excluding File namespace in the query (which would make it faster), and then just run it against wikireplicas. Similar to how English Wikipedia builds reports for their needs (en:WP:DBR). Sorry for the inconvenience but we don't really have a choice. ASarabadani (WMF) (talk) 11:38, 23 August 2024 (UTC)[reply]

@JopkeB: I guess I could live with Special:UncategorizedCategories being monthly, though certainly more often is better. Before the most recent report a couple of days ago, we'd gone 6 weeks. When you say "What exactly is the problem?" I'm not sure what you are asking. Are you asking how the page is used? Why it's a problem when it is far out of date? or something else? - Jmabel ! talk 18:55, 23 August 2024 (UTC)[reply]

I have already get an answer to this question: there are technical problems to run lists as frequent as we would like to. JopkeB (talk) 05:14, 24 August 2024 (UTC)[reply]

Could translated pages be hidden from categories?

[edit]

For example see Category:Commons video resources – all those translated pages in that category make it cluttered, hard to go through the pages, and bury pages on the next page.

Would it be possible to hide translated pages so that only one is shown?

  • One could have the links to the translated pages at the top of that page
  • One could automatically open the respective translated page when opening the page depending on one's language settings
  • (and there are more alternatives)

Prototyperspective (talk) 22:00, 16 August 2024 (UTC)[reply]

Much less useful but still useful would be if the translated pages were also hidden or hidable in the File uses on commons section of file pages because it makes it cluttered and hard to see where a file is used; example. Prototyperspective (talk) 10:08, 19 August 2024 (UTC)[reply]
It's possible to place them in a subcategory, but I don't think status quo is necessarily an issue. They should be categorized in any case. Enhancing999 (talk) 12:17, 22 August 2024 (UTC)[reply]
It's currently not a big problem but it makes things far less overseeable and buries things on other pages or beneath the cluttered page. Imagine if there were 300 translated versions of a page which is just roughly the number of languages with a Wikipedia and not even all notable languages, it would make the category barely usable to find and organize things. Manually subcategorizing wouldn't be a good solution because it requires people to spend time manually doing so and new translations will be directly in the category again. Maybe there could be some multilingual redirect page that redirects to whatever language the user has configured if that language version exists and English otherwise? Prototyperspective (talk) 12:35, 22 August 2024 (UTC)[reply]
Another example: Category:Digitisation help. Prototyperspective (talk) 10:52, 18 September 2024 (UTC)[reply]

Vandals often move captions to other languages – detection needed

[edit]

What's going on with people somewhat apparently systematically moving captions from one language to a false one? This is happening frequently and often, if not usually, not detected & reverted by editors.

Could a detection of this please be developed? Compared to other vandalism that is well-detected automatically on Wikipedia by for example ClueBot NG (acc) I think it would be easy to detect if a caption got moved from the original language to another one, if possible with language detection so it also checks if the language it was moved indeed does not match the caption text language. Maybe a better place to ask about this would be the Bot requests or the ClueBot NG talk page / code repo.


Examples

Previously I was wondering how to search my contributions (edit summaries). I was trying to use standardized phrases or terms in edit summaries so I can easily look them up later, for example to replace texts I previously added with templates. I found the following useful tool and because I try to always use term "rvv" when reverting edits that are or seem to most likely be vandalism many examples of such edits can be found here. These are the most recent ones: 1 2 3 4 5 6 7. By the way, I think captions are more a problem or redundant than anything else since there already is the machine-translatable description field (it's useful sometimes when descriptions are long but these could also instead have a short version at the top or be shortened). Another problem is that when undoing changes to captions there is no prefilled edit summary so one has to tediously copy the contributions link of the user and write the edit summary anew. Prototyperspective (talk) 13:04, 17 August 2024 (UTC)[reply]

Given how often this happens (I was able to find a couple of recent instances in a few minutes of looking through RecentChanges) and how weirdly specific of an action it is (it only changes the language of the caption, never its content), I suspect this is a UI/UX issue, not deliberate vandalism, and I suspect it can be addressed by making some changes, like inserting a confirmation dialog when changing the language of an existing caption.
Does anyone know how/where the caption editing interface is implemented, and who's responsible for it? (I also have a couple of gripes about the language picker used in the caption interface - especially its appearance on mobile.) Omphalographer (talk) 05:16, 18 August 2024 (UTC)[reply]
you're right. try File:JPG Test.jpg.
given existing captions, users can change the language and then click publish. that results in the aforementioned problematic edits. RZuo (talk) 05:30, 18 August 2024 (UTC)[reply]
There is the new feature to require captchas for action defined in an abuse filter. I thought about requiring captchas for all IP edits on captions. This might reduce these king of accidental edits. GPSLeo (talk) 05:36, 18 August 2024 (UTC)[reply]
  • Could be but sometimes they do change the text or change multiple languages at once and it seems like it's always only done by new or unregistered users who sometimes did some other different problematic changes.
  • Another thing that could be done is automatic detection of the language and displaying at least a warning or adding the file to a maintenance cat if it doesn't match the specified language – this would also be useful since often people specify the wrong language even at upload.
  • @RZuo: What do you mean? Why would Omphalographer be right in that this is a UI/UX issue? What you described is exactly the expected behavior: changing the language and then clicking publish, how does this suggest it's not vandalism and a UI issue? I don't see why you and GPSLeo think it would be accidental / an UI issue if one has to deliberately click "publish".
  • Even if both captchas are added and they indeed reduce these changes, I still think there should be automatic detection of these changes as well as other likely vandalism. Why is ClueBot_NG not active on WMC? Does ORES work with WMC? There's lots of vandalism here (not just in the captions and the relatively hidden structured data) and I've come across multiple cases where it stayed on a relatively large page for a year or so. Bots/tools could build a queue of edits to check as well as automatically revert edits that are very likely to be vandalism. Moreover, they could learn from edits that specify that the reverted edit was likely vandalism or similarly nonconstructive (it doesn't matter if deliberate or not) if terms like RVV are used.
Prototyperspective (talk) 10:07, 18 August 2024 (UTC)[reply]
People are on a website an they see a language selection. They want to change the language of the text. They do not expect that they are able to change the content on a website where they do not even have an account. The button says publish but how is this translated into different languages and are these translated terms always that clear? GPSLeo (talk) 10:37, 18 August 2024 (UTC)[reply]
This type of "vandalism" is so specific that it would basically have to be a very dedicated LTA who utilizes IPs from all around the world; it's disruptive but I don't think it's deliberate. Gnomingstuff (talk) 04:49, 26 August 2024 (UTC)[reply]
Yes I think the explanation by GPSLeo is quite plausible. The main subject of this thread however is detecting such edits and auto-reverting them (also useful for other unconstructive edits / reducing maintenance workload) regardless of whether or not they are intentional or not. I don't know what you mean by "LTA" but it could have also been many people that found this to be an effective type of vandalism as it's often not detected and reverted but I already think inadvertent edits may be more likely. Maybe there could be some special confirmation box asking if the user really wants to publish that to the file data without using the word "publish" because maybe those users didn't understand that word. In any case, detecting if the text in the caption matches the languages seems useful and needed in any case, for example because many users add captions in English to other languages at upload or similar things. Prototyperspective (talk) 09:52, 26 August 2024 (UTC)[reply]
Something in the GUI needs improvement. It's plausibel that changing the language and saving it actually adds an additional language rather than deleting one as well: [1] Enhancing999 (talk) 10:10, 26 August 2024 (UTC)[reply]
LTA = long-term abuse, people who vandalize in their specific identifiable way for months or years.
The main tells on these are edits by infrequent editors with 2 changes per file. Not much help though once it's out of recent changes. I've found some by searching for captions with mismatched languages e.g. "Spanish the," but obviously that only works with certain patterns.
The problem though with any kind of auto-reverting is that it would have to not catch people fixing this stuff, especially when it's undetected. Gnomingstuff (talk) 11:29, 27 August 2024 (UTC)[reply]
Many don't have 2 changes per file. I think the main indicators are 1. language does not match specified caption language (no other indicator is needed; check language auto-detection of Google Translate or DeepL to see what I mean with language detection) 2. user isn't an editor with many unreverted edits (would only use this indicator early on as language mismatching is a general problem).
I don't understand what you mean with The problem though with any kind of auto-reverting is that it would have to not catch people fixing this stuff. People fixing this stuff would move the caption back to its matching language or remove the flawed captions so they wouldn't be detected. Prototyperspective (talk) 11:51, 27 August 2024 (UTC)[reply]
The solution may be mw:Moderator Tools/Automoderator but I don't know if it also checks changes to captions, does it? Prototyperspective (talk) 10:51, 18 September 2024 (UTC)[reply]

Geolocate nepal photos

[edit]
Date Name Thumbnail Size Description
09:31, 23 March 2019 Chandragiri Cable Car, 2018-04-21 (2).jpg (file) 5.93 MB User created page with UploadWizard
07:18, 23 March 2019 Chandragiri Cable Car, 2018-04-21 (1).jpg (file) 6.31 MB User created page with UploadWizard

plz take a look at User:Simasuru's photos. i located one of them to 27.685, 85.2138. do you think that's right? once it's confirmed, i will rename the files. RZuo (talk) 05:46, 18 August 2024 (UTC)[reply]

I'd tend to have these tests deleted. Enhancing999 (talk) 12:21, 18 August 2024 (UTC)[reply]
Rename yes, but meets COM:SCOPE, right? --PantheraLeo1359531 😺 (talk) 11:55, 19 August 2024 (UTC)[reply]
@RZuo: it seems to match Google Street View. Of course locating an image retroactively might result in some minor error margins, but it should not be a huge deal. I've moved them to a more appropriate file name now. —Matrix(!) {user - talk? - uselesscontributions} 15:13, 27 August 2024 (UTC)[reply]
[edit]

With videos it is possible to specify a start (and end) time of the video – see Commons:Video#Temporal media fragments.

  • Is it possible to link to times of the videos in the file description?
  • Is that already possible somehow for audio files?

I think it would be very useful if in the file description of Spoken Wikipedia audio files, like those that I just uploaded, there were links to the different sections of the article. If one is only interested in a particular section of the article one could jump to it directly and listen to only that. It also gives some orientation where one is currently at when listening to articles. Linking to chapters is possible on YouTube and probably many users have already noticed some ways this can be useful. For example, I'd like to link the timestamps in the description here and add timestamp links to the different exercises here.

This doesn't seem to work with videos either as one would have to append ?start=00:26 but can only append things like #start=00:26. Is there some issue about these things? Prototyperspective (talk) 12:46, 22 August 2024 (UTC)[reply]

There isn't really a universal chapter methodology in HTML5 video. The HTML5 idea is that you create your own VTT file (which we don't yet support). These could refer to Commons link (they can be anything you like, as all support for it is completely custom work). Then you write custom Javascript to listen to the timedtext events, read the 'text' (a description and link in this case) and go do something with that (wrapped inside a videojs custom plugin).
Related tickets about this in phabricator. phab:T116154, phab:T301826. —TheDJ (talkcontribs) 18:08, 25 August 2024 (UTC)[reply]
You can link to start points in the file description page using start= and end= query params like:
File:Using_a_Power_Rack_in_bodybuilding,_powerlifting,_strength_training,_resistance_exercise.webm?start=00:10&end=00:15, but it is not that useful, as there is no autoplay etc. —TheDJ (talkcontribs) 18:14, 25 August 2024 (UTC)[reply]
  • The two issues you linked don't seem to be related to this at all. What you wrote is about annotations like a link or a balloon message informing about an inaccuracy in the video at specific times of the video. In contrast this post here is about video chapters and linking to different times of the video in the file description.
  • Yes, I know that one can link to start points that way and explicitly said that in the last paragraph. As said, one would need to link to these from there like #start=00:26 because when including a link like your example, it opens the video in a new tab at that starting point instead of directly jumping to it.
  • This is really important for Spoken Wikipedia where you may like to jump to a particular section. There also needs to be some proper audio player, with the current one only being the fallback, that for example is wider so you can jump to some timing better and with a -10 seconds feature.
Prototyperspective (talk) 10:49, 26 August 2024 (UTC)[reply]
It generally needs a modern improved audio-player that has not only chapter/timestamp-link functionality, but also several other widely established standard features (one could use an existing open source audio player). Prototyperspective (talk) 10:54, 18 September 2024 (UTC)[reply]

Bot no longer working to warn Wikivoyage about nominations for deletion?

[edit]

As I stated in User talk:IronGargoyle#No notifications to Wikivoyage anymore? (edited for brevity and relevance):

Commons delinker delinked an image on Wikivoyage that was deleted in the Commons:Deletion requests/Files in Category:Patuxai thread. Why was there never a notification on the voy:Talk:Indochina Wars page that this file was nominated for deletion? We're supposed to get such notifications.

Was there a malfunction of the relevant bot, or was a unilateral decision made on Commons to stop giving notice to other sites such as Wikivoyage that use thumbnails of images on Commons in our articles but can choose to locally upload images that we consider important and are problematic merely due to a lack of commercial freedom of panorama? If the bot malfunctioned, please try to find out why and ensure the problem does not recur. However, if a unilateral decision was made to stop giving sister sites such as Wikivoyage the chance to make our own decisions on affected files before they are deleted on this site, I cannot state too strongly that that is absolutely unacceptable! We cannot return to the days when slews of images were deleted from Wikivoyage articles without notice. I'll look forward to your response and explanation of how you will prevent this problem from recurring. Thanks, everyone! -- Ikan Kekek (talk) 04:30, 25 August 2024 (UTC)[reply]

I do not think this is specific to Wikivoyage, there were also bot notifications about imminent deletions in the English Wikipedia, but I do not see them anymore. May the the bot got abandoned, lost the flag or smth else happened. Ymblanter (talk) 18:23, 27 August 2024 (UTC)[reply]
I'm not entirely sure if that wikivoyage was on the bot, however I do know that the bot was recently broken for quite a while: https://phabricator.wikimedia.org/T339145 There was a surprising small amount of communities that adopted/approved that bot when the community tech team finally rewrote it in 2018 btw. Really sad, if you realize how much money the foundation poured into reworking that bot. —TheDJ (talkcontribs) 20:17, 27 August 2024 (UTC)[reply]
We need to be notified when images we use as thumbnails are nominated for deletion. If the bot isn't working, what's the solution? -- Ikan Kekek (talk) 23:03, 27 August 2024 (UTC)[reply]

Croptool connections/authorization

[edit]

Hey guys, I use croptool to correctly size images from Wikipedia commons for use as pagebanner in wikivoyage. Unfortunately, connections have been consistently failing the last couple days. Anyone seen this? Any tips on what I might need to do to get it to work? Mrkstvns (talk) 21:13, 25 August 2024 (UTC)[reply]

See COM:CropTool, the tool should be up now, but it seems like Toolforge had issues over the weekend. Sohom (talk) 14:53, 26 August 2024 (UTC)[reply]

Dark mode fix on Main Page?

[edit]

I've pushed some dark mode fixes on Template:Main Page Template, but it's not being reflected in Main Page for some reason? However it is being reflected in other languages like Tamil (முதற் பக்கம்). Is there a reason why? —Matrix(!) {user - talk? - uselesscontributions} 05:21, 26 August 2024 (UTC)[reply]

Upland Wizard bug

[edit]

I uncovered a bug in the Upload Wizard. Can it be reported somewhere locally, or should I use Phabricator? The Upload Wizard main page and Upload Wizard FAQ are both silent on bug reporting. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:49, 26 August 2024 (UTC)[reply]

The correct place on Wiki would be Commons talk:WMF support for Commons/Upload Wizard Improvements. But in this case this is not an UploadWizard problem. We have an template with the same name as a language code. That is an error that should never happen and we need to delete the template redirect. GPSLeo (talk) 15:07, 26 August 2024 (UTC)[reply]
I moved all uses of the template redirect to the actual template and deleted the redirect. But I am unsure if the language code handling will work without any additional action. GPSLeo (talk) 15:29, 26 August 2024 (UTC)[reply]
Thank you. Did you check whether any of those were genuine attempts at using the language? Looks like we need to replicate {{En}}, {{De}} etc., for the abr language code. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 15:34, 26 August 2024 (UTC)[reply]
Yes, I checked the files before changing the template. 6 files wanted to use the language code. I create the {{Abr}} and it seems to work correct. GPSLeo (talk) 15:44, 26 August 2024 (UTC)[reply]

Subcategory not seen in a category

[edit]

The Category:Books from the United States is categorized in Category:Books by country, but it is nowhere to be seen in that category. What is the reason? -- Jan Kameníček (talk) 16:53, 26 August 2024 (UTC)[reply]

Tech News: 2024-35

[edit]

MediaWiki message delivery 20:28, 26 August 2024 (UTC)[reply]

Addition of Pattypan Userguide on Commons

[edit]

I and a colleague are curating a complete guide for using Pattypan for batch uploads for Wikimedia Commons, from image quality to upload processes to license and creations of hyperlinks copyright approval. I need help understanding how we can upload here on wikimedia for use by everyone Lutarchitecture (talk) 00:44, 27 August 2024 (UTC)[reply]

You could complete Commons:Pattypan?
 ∞∞ Enhancing999 (talk) 15:19, 27 August 2024 (UTC)[reply]
What do you mean by complete Pattypan, please throw more light thanks, like give it more context so I effectively understand you Lutarchitecture (talk) 21:14, 27 August 2024 (UTC)[reply]
That page and its subpage are the guide of Commons for the use of Pattypan.
 ∞∞ Enhancing999 (talk) 21:17, 27 August 2024 (UTC)[reply]

Search operators for audio file size and duration?

[edit]

Because the gallery of new files (linked in the left panel that is shown on all pages under "Latest files") is broken for audio files since it's very cluttered with lots of pronunciation files and not having any way to filter files out within the page, I created this search that to some extent filters out pronunciation files: -intitle:/LL\-Q/ -Pronunciation -deepcategory:"Tamil pronunciation" -deepcategory:"Dutch pronunciation" -deepcategory:"English pronunciation" filetype:audio

There are two problems with it: deepcategory search operator does not work properly so the parent category can't be used (phab:T369808) and it's quite slow because the "intitle" search operator can't deal with "LL-Q" because of the hyphen needing this regex which slows the search down (phab:T371195). It's still good enough that it can be used in practice to see recent audio files that aren't pronunciation files so e.g. audiobooks, spoken Wikipedia, soundscapes, and music tracks as well as lots and lots of copyvios that no human or bot seems to check. To improve it further, I'd like to add a search operators to exclude files with a duration of only a few seconds or a filesize of just a few kB because these are usually also just pronunciation files. It seems like petscan can do so so is there a way to add this to the WMC search? If such search operators don't exist is there some phabricator issue about adding these? They would also be useful in finding for example subcategorize Short films videos in Films categories and many other things. Prototyperspective (talk) 10:16, 28 August 2024 (UTC)[reply]

@Prototyperspective: You can search by file size in kilobytes using filesize:. See mw:Help:CirrusSearch#File properties search. But there doesn't seem to be a property for duration. You can see what the search system can use by appending ?action=cirrusdump to a page URL, and trying that on a random audio file I can't see anything that looks like the duration. It seems to me that duration would be an obvious measure alongside height and width, but I'm not sure where I'd suggest that. --bjh21 (talk) 11:22, 28 August 2024 (UTC)[reply]
Ups, thanks apparently I only ctrl+F searched for "file size" with a space but not "filesize" on that help page. I added filesize:>100 to the search string.
I'll make a phabricator issue asking about a search operator for duration then because even if that's not needed for filtering out pronunciation audio files it would still be useful for finding short films and many other use-cases I haven't thought of. Prototyperspective (talk) 12:54, 28 August 2024 (UTC)[reply]
This ?action=cirrusdump thing seems very useful so I think it should be included in the help page properly, could you add it there (currently it's only buried in the notes of some reference). Another issue is that I can't find out how to specify the sort order with the search operators (asked about it at Template talk:Search link) – how could one specify sort=create_timestamp_desc as sort order? Prototyperspective (talk) 13:00, 28 August 2024 (UTC)[reply]

How to distinguish categories set by the Infobox from other cats?

[edit]

It would be great if somebody could create a report that lists categories that don't have any categories except for meta-categories set by the Wikidata Infobox like Category:Uses of Wikidata Infobox with no image as proposed here. Example

Is there some way to distinguish categories set by the Infobox from other categories?

It would be best if one could also distinguish between meta-categories set by the WD Infobox like the one above and other categories set by the WD Infobox like Category:Lonnie (given name) (from example) because some categories that only have categories set by the WD Infobox don't need any other categories so would best show up in some separate less important report at some point (and it may even be desirable to increase cats that only have cats set by their WD Infobox). Prototyperspective (talk) 19:47, 30 August 2024 (UTC)[reply]

Any other place to ask about this? Prototyperspective (talk) 10:45, 18 September 2024 (UTC)[reply]

Cat-a-lot performance, maintenance

[edit]

Cat-a-lot seems very slow, since a few days. For example It takes 8 mins to edit a batch of 500 files (locking the tab). Can this be confirmed to be a server or a scripting issue and checked and fixed for speed. rollback ? It should not be my bandwith, but maybe advice on a local setting? Thank you Peli (talk) 12:21, 1 September 2024 (UTC)[reply]

I have the same experience. Very slow. Wouter (talk) 12:51, 1 September 2024 (UTC)[reply]
Please see this thread. Prototyperspective (talk) 21:59, 1 September 2024 (UTC)[reply]

Tech News: 2024-36

[edit]

MediaWiki message delivery 01:02, 3 September 2024 (UTC)[reply]

upload wizard for books

[edit]

Is there a campaign interface (Upload Wizard configuration) that fills in {{Book}} instead of {{Information}}?

It could make it easier for people to understand files like the ones in the Chinese categories.
 ∞∞ Enhancing999 (talk) 12:30, 3 September 2024 (UTC)[reply]

Audio of music contain copyvio thumbnails

[edit]

The thumbnails are not showing up at the audio file but the thumbnail is embedded in them. However, they are embedded in the file and when downloading the file one can see or extract them. Example.

  1. Many of these thumbnails are copyrighted. This means usually the thumbnail would need to be removed. video2commons already imports audio files without the thumbnails. Could there be some script or bot that categorized all audio files with a thumbnail set into e.g. Category:Audio files with embedded thumbnail?
  2. Then as a next step one could remove all of them at scale and efficiently using some metadata removal tool, for example similar to command eyeD3 --remove-all-images **/*.opus (applied to all audio files in some category). I guess it would be best to not remove the thumbnail for identified cases where the thumbnail is CCBY as well, these could e.g. be moved to another category or audio files whose thumbnails should be removed to a subcategory of the category above. (A more sophisticated method would be to reverse image search each thumbnail for finds via tineye so only non-original works are deleted and thumbnails created by the person licensing the work under CCBY kept (if the CCBY license also applies to the thumbnail) but I don't think this would be necessary as it would cause a lot of manual work of checking whether it's indeed a copyvio and whether thumbnails without reverse search result are indeed not copyvios.)

Just as a note: the audio files of the example display 0:00 as duration instead of the duration which only shows after one has clicked play. Prototyperspective (talk) 00:07, 4 September 2024 (UTC)[reply]

When removing the thumbnail one could replace it with a link that enables people to easily download the thumbnail again from some metadata provider. So they should just contain a link or an ID with which to fetch the thumbnail but not a thumbnail image. Prototyperspective (talk) 15:12, 6 September 2024 (UTC)[reply]
Maybe this should be put into bot requests. I think thumbnails should be fetchable via e.g. MusicBrainz. Prototyperspective (talk) 10:45, 18 September 2024 (UTC)[reply]

A bot that moves categories to the bottom of the page

[edit]

Many of my files have the categories in the description. There’s too many to manually move, but is there a way to move them to the bottom with a bot as is done on Wikipedia? Immanuelle ❤️💚💙 (please tag me) 07:37, 8 September 2024 (UTC)[reply]

Example? Prototyperspective (talk) 09:53, 8 September 2024 (UTC)[reply]
@Immanuelle: You can almost certainly use COM:VFC to do it, but I'd suggest not bothering. The categories will work properly wherever they are, so no-one will care unless they're actually reading the wikitext. And if they're reading the wikitext they can fix it themselves. --bjh21 (talk) 11:11, 15 September 2024 (UTC)[reply]

Tech News: 2024-37

[edit]

MediaWiki message delivery 18:48, 9 September 2024 (UTC)[reply]

Troubleshooting needed for File:AMD Zen.svg

[edit]

Can someone investigate why the SVG graphic image File:AMD Zen.svg suddenly stopped working? On the Wikipedia pages where it's used, it's just a blank grey image, and if I click on it, it says "Sorry, the file cannot be displayed - There seems to be a technical issue. You can retry if it persists. Error: could not load image from https://upload.wikimedia.org/wikipedia/commons/thumb/9/9f/AMD_Zen.svg/800px-AMD_Zen.svg.png". Then when I go to Commons where it's hosted, I just see a link to "File:AMD Zen.svg" in place of where the image should be. Opening the link brings me to a page with the error "XML Parsing Error: prefix not bound to a namespace", and trying to open a lower-resolution render just results in a random WMF error like "server technical issue" or "Too many requests, try again later".

Obviously I've had a look at the file upload and page history for this item and there doesn't seem to be any recent changes (or vandalism) that could have caused this to happen. AP 499D25 (talk) 07:53, 13 September 2024 (UTC)[reply]

no xlink namespace declaration. We will need to wait for Commons image scalars to quiet down. Glrx (talk) 20:03, 13 September 2024 (UTC)[reply]
Now it's working again. Kinda bizarre that the other similar-looking files File:AMD Threadripper.svg and File:AMD Epyc.svg were still working at the time, which led me to think that perhaps there was a programming error or some code change that broke File:AMD Zen.svg. AP 499D25 (talk) 03:12, 14 September 2024 (UTC)[reply]

For some reason when I attempted to open File:Typhoon-Yagi 5.jpg, I get nothing but File not found: /v1/AUTH_mw/wikipedia-commons-local-public.d5/d/d5/Typhoon-Yagi_5.jpg. Neither switching browser nor clearing cache help the problem. Initially I thought the file was broken, but Túrelio informed me that he can access the file without problem.

From the archives there appears that there are 2 similar problems. One in February 2022 was resolved by clearing cache, while another in August 2022 ended up in Phabricator. I'd like to ask are there anyone having similar problems, and should I report the matter to Phabricator? Many thanks.廣九直通車 (talk) 13:02, 14 September 2024 (UTC)[reply]

Works perfectly well to me. Are you still getting the error message? — Alien  3
3 3
14:01, 14 September 2024 (UTC)[reply]
Well I just clicked on it and I'm getting the same error myself too! It must be related to my posting about File:AMD Zen.svg above. AP 499D25 (talk) 14:22, 14 September 2024 (UTC)[reply]
Maybe a browser issue? I'm on Firefox, and you? — Alien  3
3 3
14:30, 14 September 2024 (UTC)[reply]
i can also access it, both file page and original file https://upload.wikimedia.org/wikipedia/commons/d/d5/Typhoon-Yagi_5.jpg . using firefox on windows 11. RZuo (talk) 15:46, 14 September 2024 (UTC)[reply]
  • I'm on Win 11. File page does not display image with Chrome or Edge. Loading directly in Chrome, I get "File not found: /v1/AUTH_mw/wikipedia-commons-local-public.d5/d/d5/Typhoon-Yagi_5.jpg" out of the cache.
access-control-allow-origin: *
access-control-expose-headers: Age, Date, Content-Length, Content-Range, X-Content-Duration, X-Cache
age: 480
content-length: 85
content-type: text/html; charset=UTF-8
date: Sat, 14 Sep 2024 17:58:37 GMT
nel: { "report_to": "wm_nel", "max_age": 604800, "failure_fraction": 0.05, "success_fraction": 0.0}
report-to: { "group": "wm_nel", "max_age": 604800, "endpoints": [{ "url": "https://intake-logging.wikimedia.org/v1/events?stream=w3c.reportingapi.network_error&schema_uri=/w3c/reportingapi/network_error/1.0.0" }] }
server: envoy
server-timing: cache;desc="hit-front", host;desc="cp4052"
strict-transport-security: max-age=106384710; includeSubDomains; preload
timing-allow-origin: *
x-cache: cp4052 miss, cp4052 hit/4
x-cache-status: hit-front
x-content-type-options: nosniff
File page and JPEG display with Firefox 130.0 (64-bit).
Glrx (talk) 18:12, 14 September 2024 (UTC)[reply]
Different users get (or don't get) the same file from different servers?
 ∞∞ Enhancing999 (talk) 23:58, 14 September 2024 (UTC)[reply]
Update: tried Safari on iOS, also failed. Probably best to be dealt on Phabricator?廣九直通車 (talk) 06:58, 15 September 2024 (UTC)[reply]
Phabricator bug report filed at phab:T374773, FYI.廣九直通車 (talk) 07:19, 15 September 2024 (UTC)[reply]
Johannnes89 on Phabricator reported that he has no problem in accessing the file with Chrome and Safari, presumably in his home in Germany. Like to ask where did you access the file?廣九直通車 (talk) 09:24, 15 September 2024 (UTC)[reply]
Yes I accessed it from Germany indeed, so the issue might be about accessing it from different servers. Johannnes89 (talk) 09:37, 15 September 2024 (UTC)[reply]
Just tried it from Edge, and it worked.
accept-ranges: bytes
access-control-allow-origin: *
access-control-expose-headers: Age, Date, Content-Length, Content-Range, X-Content-Duration, X-Cache
age: 0
content-length: 7434740
content-type: image/jpeg
date: Sun, 15 Sep 2024 14:55:18 GMT
etag: fe68fa2d2c9fb9101db078cb263815cb
last-modified: Fri, 13 Sep 2024 09:57:44 GMT
nel: { "report_to": "wm_nel", "max_age": 604800, "failure_fraction": 0.05, "success_fraction": 0.0}
report-to: { "group": "wm_nel", "max_age": 604800, "endpoints": [{ "url": "https://intake-logging.wikimedia.org/v1/events?stream=w3c.reportingapi.network_error&schema_uri=/w3c/reportingapi/network_error/1.0.0" }] }
server: envoy
server-timing: cache;desc="miss", host;desc="cp1115"
strict-transport-security: max-age=106384710; includeSubDomains; preload
timing-allow-origin: *
x-cache: cp1115 miss, cp1115 miss
x-cache-status: miss
x-content-type-options: nosniff
x-object-meta-sha1base36: l1h10jxvtd5o73z4q51fcqsot4fy2wu
Glrx (talk) 14:59, 15 September 2024 (UTC)[reply]
server cp1115 seems to have the file, but not cp4052
 ∞∞ Enhancing999 (talk) 15:07, 15 September 2024 (UTC)[reply]
As of 16:00 UTC+8, I can now access the file without problem in Hong Kong. Will like to hear if anyone elsewhere still has trouble in accessing the file?廣九直通車 (talk) 08:14, 16 September 2024 (UTC)[reply]
Now that the task has been resolved on Phabricator, I think it's time to resolve and archive this thread. Thanks for all of your comments.廣九直通車 (talk) 10:33, 16 September 2024 (UTC)[reply]
Curious how often this happens. Apparently there is a weekly process to fix it, see phab:T374773#10147831.
 ∞∞ Enhancing999 (talk) 10:43, 16 September 2024 (UTC)[reply]

Reupload crashed midway

[edit]

On Poems Betham p9.jpg. I redid the colors, tried to reupload it, it lagged for a few minutes then crashed. A new version of the file has been added to the upload history, but the file itself is still exactly the same (including after purge), and when I try to reupload the corrected version, it gets refused as a duplicate of the "current version" of the file, which it is not. What should I do? — Alien  3
3 3
13:59, 14 September 2024 (UTC)[reply]

Undid the upload, redid the upload, all good now. Whatever... — Alien  3
3 3
14:54, 14 September 2024 (UTC)[reply]
Caching issue? I did see two different files when there were just two versions.
 ∞∞ Enhancing999 (talk) 15:43, 14 September 2024 (UTC)[reply]
No, purged everything twice, still didn't work. Once I undid and redid it, though, it started pretending that the first try worked. — Alien  3
3 3
15:52, 14 September 2024 (UTC)[reply]
Could be the typhoon problem mentioned above.
 ∞∞ Enhancing999 (talk) 13:40, 17 September 2024 (UTC)[reply]
Don't think so, it wasn't the same problem, as the file was not updating, but I didn't get a 404 error. — Alien  3
3 3
16:31, 17 September 2024 (UTC)[reply]

Tech News: 2024-38

[edit]

MediaWiki message delivery 23:58, 16 September 2024 (UTC)[reply]

Best way to batch upload from Youtube?

[edit]

I found on Youtube some interesting video collections under Creative Commons, mainly [16]https://www.youtube.com/@AnimadosICAIC Cartoons from Cuba]. Some of those are really good. But there are a ton of those cartoons under CC. Is there any way of batch uploading the collection?

I use video2commons to upload one per one, but it is a slow method. TaronjaSatsuma (talk) 12:55, 17 September 2024 (UTC)[reply]

I think asking at the talk page of video2commons would be more appropriate. There already is a thread about this: Commons talk:Video2commons#API for this tool albeit probably not easy to see due to its title. Some info on that there. Batch upload of all videos from a channel would be great. Your example isn't really good however, there are far better examples. I think it would be best be added directly into video2commons so you can simply enter a channel URL and it guides you through importing all CCBY videos where you can deselect some videos to not import, adjust the titles, and so on. Alternatively a separate tool could make use of V2C via some API or fork it for specifically this functionality. However it's implemented it should not lead to blocking others from using the tool so there would need to be some measures like some pause between every 5 videos or so. Another thing that is needed is that video2commons needs to check if a video with that youtube ID has already been imported so things don't get imported multiple times which can more easily happen once such functionality is there. Since nobody seems to yet developed such a tool according to the thread at V2C it may now indeed be good to ask here. Prototyperspective (talk) 15:21, 17 September 2024 (UTC)[reply]
Thanks. Very good reply. TaronjaSatsuma (talk) 15:30, 17 September 2024 (UTC)[reply]

Can we create a tracking category for Galleries not connected to Wikidata items?

[edit]

Can we create a tracking category for Galleries not connected on Wikidata items?

I would like to be able to see which Gallery pages are not being used in Wikidata's Gallery' Property Commons gallery (P935), but I have no idea what the best strategy is for a crosswiki tracking category like that would be. A bot? @Multichill: Anyone you think who would be good at this? Sadads (talk) 20:41, 19 September 2024 (UTC)[reply]