Commons:Village pump
This page is used for discussions of the operations, technical issues, and policies of Wikimedia Commons. Recent sections with no replies for 7 days and sections tagged with {{Section resolved|1=--~~~~}} may be archived; for old discussions, see the archives; the latest archive is Commons:Village pump/Archive/2024/09. Please note:
Purposes which do not meet the scope of this page:
Search archives: |
Legend |
---|
|
|
|
|
|
Manual settings |
When exceptions occur, please check the setting first. |
|
SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 1 day and sections whose most recent comment is older than 7 days. | |
August 22
Is renaming categories with an English name to local language names a good idea?
Category:Heroes' Cemetery in the Philippines has been renamed to Category:Libingan ng mga Bayani, to "match Wikipedia and Wikidata", see history of this category. Though there are exceptions for English category names: "some proper names, ... and names for which the non-English name is most commonly used in the English language" (see Commons:Categories#Category names), I wonder whether such an exception applies to this category and whether this would be a good development for other category names that are now in English but might have other names in Wikipedia and Wikidata. Because I can understand the English name without a translation program, but not the name in Philippine (or the majority of other languages). @Seav: Can you give your opinion as well? JopkeB (talk) 05:43, 22 August 2024 (UTC)
- If there is a Category Redirect, I see no problem. There is a problem with English names that are meaningless to local people. Like Our Lady of the Forsaken, that is a very common thing to find around my place, that means absolutely nothing to the Valencian local people, who are enterly familiar with either la Virgen de los Desamparados or la Mare de Déu dels Desemparats. I do use English category names, but I have to explain my wife once and again what is Our Lady of Good Health (la Virgen de la Salud) or Saint Anthony the Great (Sant Antoni del Porquet).
I think a more multilingual approach is required. B25es (talk) 05:55, 22 August 2024 (UTC)- Then I would prefer to have the redirect the other way around: let the English name be the category name and let the local name have the redirect. JopkeB (talk) 06:11, 22 August 2024 (UTC)
- That's perfectly fine for us B25es (talk) 16:32, 23 August 2024 (UTC)
- Then I would prefer to have the redirect the other way around: let the English name be the category name and let the local name have the redirect. JopkeB (talk) 06:11, 22 August 2024 (UTC)
- Hi! Thank you for bringing this topic up. I think the relevant policy is Commons:Language policy, which defers to a section: Commons:Categories#Category names. As you have stated, the important passage is:
Category names should generally be in English (see Commons:Language policy). However, there are exceptions such as some proper names, biological taxa and names for which the non-English name is most commonly used in the English language (or there is no evidence of usage of an English-language version).
- I think my rename follows the "proper names [...] for which the non-English name is most commonly used in the English language". The English Wikipedia article title, Libingan ng mga Bayani, had been discussed and renamed a few times since 2009 between the English-translated name and the official native-language name (see the header on Talk:Libingan ng mga Bayani), with the latest move request in 2020 resolving to the native-language title. There is plenty of evidence that the native-language name is used in English-language sources (albeit sources in the Philippines, but then again, English is an official language of the country). Some examples of English reliable sources within the past year that use the native-language name: [1][2][3][4].
- I consider this situation similar to categories like Category:Taoisigh which could be reasonably be actually named Category:Prime ministers of Ireland, but we're using the Irish name here in Commons (so far without any argument, I think?) and also in the English Wikipedia (w:Taoiseach). —seav (talk) 06:14, 22 August 2024 (UTC)
- "Taoisigh" is apparently not a proper name of a person, like Mary Johnson, but the name of a function, in the rest of the world known as "Prime minister". So this is not part of the exception. AND it is clearly not conform the Universiality principle, which says that "Identical items should have identical names for all countries and at all levels of categorization." So, please make the redirect the other way around. JopkeB (talk) 08:43, 22 August 2024 (UTC)
- "Taoisigh" seems a bit odd to me, I don't recall seeing that spelling (or hearing a pronunciation that matches that spelling), but as a native English-speaker, I'd consider "Taoiseach" entirely appropriate. The BBC, for example, pretty consistently use "Taoiseach". FWIW, Google gives 239,000 results for "Taoisigh", 5,820,000 for "Taoiseach", 545,000 for the phrase "prime minister of Ireland" and 419,000 for "Irish prime minister", which seems to confirm my instinct. - Jmabel ! talk 18:41, 22 August 2024 (UTC)
- The plural of taoiseach is taoisigh, according to Wikipedia. --Geohakkeri (talk) 19:00, 22 August 2024 (UTC)
- Interesting, I guess I'd never heard it in the plural, and certainly have no idea how to form Gaelic plurals. - Jmabel ! talk 04:06, 23 August 2024 (UTC)
- So it is also OK to rename Category:Prime ministers of France to Category:Premiers ministres? And for all other countries alike? Then we might delete the Universiality principle as well. I am not a native English-speaker and I consider "Taoisigh" not appropriate because I have never heard it before, I am familiar with the name "Prime ministers" and I guess the same applies to the best part of non-native English-speakers. Why would there be an exception for Gaelic? JopkeB (talk) 04:30, 24 August 2024 (UTC)
- Because in English when we talk about this person/office, we use the word "Taoiseach". Similar issue to Category:Tsars of Russia.- Jmabel ! talk 17:40, 24 August 2024 (UTC)
- Is that true for other English-speaking countries as well, like the USA, Canada, Australia, New Zealand? And the word "tsar" is well known, also in other languages, and is about heads of state, not prime ministers. So I think this is not a good comparison. JopkeB (talk) 04:08, 25 August 2024 (UTC)
- Definitely true for UK; for U.S. I've usually heard radio news reporters first use the word "Taoiseach", then define it once (e.g. "equivalent of a prime minister"), then use "Taoiseach". Again, I think that Google count speaks volumes: over ten times as many hits for "Taoiseach" as for "prime minister of Ireland". - Jmabel ! talk 04:51, 25 August 2024 (UTC)
- "Chancellor" is also used for the German head of state. And "Teno" for the Japanese "king". Nakonana (talk) 11:48, 25 August 2024 (UTC)
- So, it looks a little bit like a mess, for Germany even more than for Ireland (for Ireland there is at least a redirect, for Germany you have to figure out yourself what the category for federal prime ministers is). But luckily prime ministers of Japan are still in Category:Prime ministers of Japan (I could not find teno and japanese "king"). My concern are about (1) non-native English (and German) speakers AND (2) creators of templates and other technical solutions. Both groups need clear category names, with the same category name throughout the category tree. I think the Universiality principle is made for both groups. How can we apply this principle to categories for prime ministers of Ireland and the federal state of Germany (and perhaps other countries?)? JopkeB (talk) 16:27, 25 August 2024 (UTC)
- I typed it wrong, of course, Tenno is a redirect. Maybe "Head of State" would work? Nakonana (talk) 17:16, 25 August 2024 (UTC)
- Head of state is not the same as prime minister. Germany and Ireland both have a president as well as a prime minister of the (federal) government. JopkeB (talk) 05:02, 26 August 2024 (UTC)
- Agreed that a prime minister is normally "head of government" but not "head of state". I can't think of a country with a prime ministerial system where the prime minister is considered head of state. - Jmabel ! talk 00:25, 27 August 2024 (UTC)
- Head of state is not the same as prime minister. Germany and Ireland both have a president as well as a prime minister of the (federal) government. JopkeB (talk) 05:02, 26 August 2024 (UTC)
- I typed it wrong, of course, Tenno is a redirect. Maybe "Head of State" would work? Nakonana (talk) 17:16, 25 August 2024 (UTC)
- So, it looks a little bit like a mess, for Germany even more than for Ireland (for Ireland there is at least a redirect, for Germany you have to figure out yourself what the category for federal prime ministers is). But luckily prime ministers of Japan are still in Category:Prime ministers of Japan (I could not find teno and japanese "king"). My concern are about (1) non-native English (and German) speakers AND (2) creators of templates and other technical solutions. Both groups need clear category names, with the same category name throughout the category tree. I think the Universiality principle is made for both groups. How can we apply this principle to categories for prime ministers of Ireland and the federal state of Germany (and perhaps other countries?)? JopkeB (talk) 16:27, 25 August 2024 (UTC)
- Is that true for other English-speaking countries as well, like the USA, Canada, Australia, New Zealand? And the word "tsar" is well known, also in other languages, and is about heads of state, not prime ministers. So I think this is not a good comparison. JopkeB (talk) 04:08, 25 August 2024 (UTC)
- Because in English when we talk about this person/office, we use the word "Taoiseach". Similar issue to Category:Tsars of Russia.- Jmabel ! talk 17:40, 24 August 2024 (UTC)
- The plural of taoiseach is taoisigh, according to Wikipedia. --Geohakkeri (talk) 19:00, 22 August 2024 (UTC)
- "Taoisigh" seems a bit odd to me, I don't recall seeing that spelling (or hearing a pronunciation that matches that spelling), but as a native English-speaker, I'd consider "Taoiseach" entirely appropriate. The BBC, for example, pretty consistently use "Taoiseach". FWIW, Google gives 239,000 results for "Taoisigh", 5,820,000 for "Taoiseach", 545,000 for the phrase "prime minister of Ireland" and 419,000 for "Irish prime minister", which seems to confirm my instinct. - Jmabel ! talk 18:41, 22 August 2024 (UTC)
- "Taoisigh" is apparently not a proper name of a person, like Mary Johnson, but the name of a function, in the rest of the world known as "Prime minister". So this is not part of the exception. AND it is clearly not conform the Universiality principle, which says that "Identical items should have identical names for all countries and at all levels of categorization." So, please make the redirect the other way around. JopkeB (talk) 08:43, 22 August 2024 (UTC)
- Oppose Changing the name to the "native-language" for two reasons
- 1. The idea that it's in the "native-language" now is totally ridiculous to begin with. According to Google 22.5 million people in the Philippines speak Tagalog. While 39.4 million speak English. Further the place being discussed here is a national cemetery within Fort Bonifacio. The national headquarters of the Philippine Army. So it's just more likely due to how many speak English then Tagalog nationally that they will speak English. Meaning the change will clearly reduce the number of people who will be able to find the category. I don't think it's simply enough to simply have a redirect in this case either. Otherwise you could justify renaming every category to the "native language" simply because redirects exists. That's not their purpose. Per the guidelines we have to go with the name whatever language has the most chance of being searched for and it's pretty clear that's English in this case.
- 2. There's been multiple CfDs having to do with this exact issue in the last couple years and there was clearly no consensus from them to change the names of the categories to the "native-language" at the time. I highly doubt if Seav had pf started a CfD for this before changing the name that it would have gone anywhere. That's what they should have done instead of just unliterally changing the name based purely on how the place is named on Wikipedia. Regardless, it's pretty clear that there is no consensus to use "native-language" names for categories in cases like this one. I'm not sure what the circumstances around Category:Taoisigh versus Category:Prime ministers of Ireland, but "other stuff" isn't really a valid reason to make the change. Again, especially considering the outcome of prior CfDs, guideline, and fact that clearly more people speak English in the Philippines then Tagalog and this is the national headquarters of Philippine Army. It would be ridiculous to say that shouldn't matter "because Wikipedia article." --Adamant1 (talk) 06:37, 22 August 2024 (UTC)
- To be fair, your first argument only works for this special case of a country where a lot of people speak English. It doesn't work for other categories. For example, sometimes I go through the Category:Media needing categories (Cyrillic names), and there I find a decent number of files that actually are properly categorized, except that all the categories are written in Russian. The Russian community has a bot that addresses the issue to some degree but automatically searching and replacing Russian-language categories for their English counterparts, but the process is not perfect and you still find a lot of "uncategorized" Russian media. The other issue with translated categories is that there are various ways to translate one and the same thing, and it's a pain to figure out what the English name of an already existing category is. Oftentimes I go to Ru Wiki, find the subject there, follow the link to the Wikidata item from the article, and then look up the Commons category on Wikidata. There are just too many ways to "translate" even something as simple as "площадь мира" (transc.: ploshad mira, lit.: square [of] peace). You can go by Category:Peace square, Kazan or Category:Peace Square, Krasnoyarsk with a capital S, or Category:Mira Square (Kaluga) (like Google does), or Category:Mir Square (the word "peace" = "mir" without the genetive suffix "-a"; Mira Square is actually equivalent to "Peace's Square", and if you want to have "Peace Square", you need to drop the genetive in Russian too "Mir Square"*), or you could translate it as Category:Square of Peace, or you use the Russian word order Category:Square Mira, which is a construction that you can find in translations of "проспект мира" (prospekt mira / Peace Avenue), such as Category:Prospekt Mira (Kaliningrad) — why even bother translating when you can transcribe instead? (Now we only have to agree whether it's prospekt or prospect — k vs. c, see Category:Prospect Mira in Lipetsk.) Or you can just translate it like in Category:Peace Avenue, Krasnoyarsk, or you can be like Moscow and create a grammatical language monster by keeping the Russian words while using English word order: Category:Mira Prospekt in Moscow — this one raises eye brows in English speakers and Russian speakers alike.
- * This is done with "площадь Ленина" (transc.: ploshad Lenina, lit.: square [of] Lenin). While the suffixed "-a" is kept in "Mir-a" for some reason, it is dropped in case of "Lenin-a": Category:Lenin Square (Ufa) instead of "Lenina Square". However if it's a street (улица Ленина / ulitsa Lenina), then Lenin can keep his suffixed "-a": Category:Lenina street (Irkutsk)... or not Category:Lenin Street (Gdov). Nakonana (talk) 20:51, 22 August 2024 (UTC)
- * typo: "It doesn't work for other countries." Nakonana (talk) 20:54, 22 August 2024 (UTC)
- It is a fine balance line between translating native names into English ones or keeping them in the original language (or as a transcription for languages that have other scripts than Latin). For me names of streets and squares may be kept in the original language, except perhaps for very well known ones, like Red Square in Moscow. So the same rule as for place names (where only names known in English should be in English in Commons categories). JopkeB (talk) 09:22, 24 August 2024 (UTC)
- Do we have a reference for the English name or is it just a literal translation? File:2551Taguig City Landmarks 17.jpg is in English. Enhancing999 (talk) 06:43, 22 August 2024 (UTC)
- In English the name literally translates to "Cemetery of (the) Heroes" (libingan = cemetery; mga bayani = heroes). I've seen that translation (with or without the definite article) and I've seen the variation "Heroes' Cemetery" as well. A couple more points: the official website of the cemetery is [5], which is in English but still uses the native-language name, and the law that established the cemetery is Proclamation No. 208, s. 1967, also in English, but the name is again the native-language name. —seav (talk) 07:03, 22 August 2024 (UTC)
- Oppose renaming but see meta:Community Wishlist/Wishes/Add machine translated category titles on WMC. Prototyperspective (talk) 09:19, 22 August 2024 (UTC)
- I think in this case the Tagalog name is appropriate, with soft redirects from the maybe two most likely English-language names.
- I'd also add: in practice, this varies a lot from country to country, and sometimes by region within a country. For example, for Catalonia, we pretty consistently favor Catalan names; for Romania, I've seen English-language names for things that it is hard to imagine anyone referring to that way, e.g. "Roman Square" for Piața Romană; it's like a Spanish-speaker calling New York's Times Square "La Plaza del Tiempo" or (even worse) "La Plaza de los Tiempos". - Jmabel ! talk 18:50, 22 August 2024 (UTC)
- One example of that is how they call pizza "pie" in the New York City area. I'd prefer keeping Category:Pies for images of actual pies though. --Adamant1 (talk) 15:04, 25 August 2024 (UTC)
- Oppose I think it is a bad idea in most cases. It is categorized in Category:Military cemeteries in the Philippines. Should that also be renamed to Category:Mga sementeryo ng militar sa Pilipinas? And Category:Sydney Opera House was designed by a Danish guy so should it be named in Danish? I think local names should be reserved for when there is no English name. --MGA73 (talk) 15:17, 23 August 2024 (UTC)
- @MGA73 how about Category:Cristo Redentor (Rio de Janeiro)? Should it be moved also? It has been under Portuguese name since the creation of the category. JWilz12345 (Talk|Contrib's.) 22:45, 23 August 2024 (UTC)
- There is zero possibility that category name should be changed to something fully English. I'm a native English-speaker; I've been aware of that statue for over 60 years, and literally never heard it referred to in a fully translated mode (usually just "the Cristo" or if context is unclear "the Cristo in Rio", but of course "Cristo Redentor" is more correct). You would as well translate "Rio de Janeiro" to "January River" (or "Las Vegas, Nevada" to "the low places, snowy"). - Jmabel ! talk 04:02, 24 August 2024 (UTC)
- @MGA73 how about Category:Cristo Redentor (Rio de Janeiro)? Should it be moved also? It has been under Portuguese name since the creation of the category. JWilz12345 (Talk|Contrib's.) 22:45, 23 August 2024 (UTC)
- I think in English the name is w:Rio de Janeiro so it would not make sense to translate the name of the city. I do not know if there is an official or established name in English for this statue but Category:Statue of the Little Mermaid (Copenhagen) is in English and so is Category:Eiffel Tower. And Category:Denmark and Category:Brazil are also English. Should that be changed to local names too? I think the rule on Wikipedia is that articles is named by the most used name. So I think it would make sense to do the same on Commons. Just like it is Category:Bill Clinton and not William Jefferson Clinton. --MGA73 (talk) 07:08, 24 August 2024 (UTC)
- @MGA73 the statue is "Christ the Redeemer", many know that. Unsure if using English language as the precedence will lead to the category being moved to "Category:Christ the Redeemer (Rio de Janeiro)". JWilz12345 (Talk|Contrib's.) 07:39, 24 August 2024 (UTC)
- JWilz12345 I think that if there is no clear name for something then just leave the name as it is and make some redirects if needed. I would just hate to see if someone get the idea to rename hundreds of thousands of categories just to use local names in categories. --MGA73 (talk) 07:52, 24 August 2024 (UTC)
- "Rio de Janeiro" also has a literal translation to English. Enhancing999 (talk) 07:55, 24 August 2024 (UTC)
- @Enhancing999@MGA73 we are talking about the statue's name in English. Everyone in English-speaking world calls it "Christ the Redeemer". Little to none use "Cristo Redentor". It is understood, but I am surprised about claims here that there is no clear name of the statue in English. It is crystal clear: Christ the Redeemer. Of course, English-speaking world calls the city "Rio de Janeiro"! So: "Category:Christ the Redeemer (Rio de Janeiro)" is the most-fitting name. JWilz12345 (Talk|Contrib's.) 09:04, 24 August 2024 (UTC)
- The concern is not so much this statue, but that people use the equivalent of "River of January" as it occasionally happens. Enhancing999 (talk) 09:09, 24 August 2024 (UTC)
- @Enhancing999@MGA73 we are talking about the statue's name in English. Everyone in English-speaking world calls it "Christ the Redeemer". Little to none use "Cristo Redentor". It is understood, but I am surprised about claims here that there is no clear name of the statue in English. It is crystal clear: Christ the Redeemer. Of course, English-speaking world calls the city "Rio de Janeiro"! So: "Category:Christ the Redeemer (Rio de Janeiro)" is the most-fitting name. JWilz12345 (Talk|Contrib's.) 09:04, 24 August 2024 (UTC)
- "Rio de Janeiro" also has a literal translation to English. Enhancing999 (talk) 07:55, 24 August 2024 (UTC)
- JWilz12345 I think that if there is no clear name for something then just leave the name as it is and make some redirects if needed. I would just hate to see if someone get the idea to rename hundreds of thousands of categories just to use local names in categories. --MGA73 (talk) 07:52, 24 August 2024 (UTC)
- @MGA73 the statue is "Christ the Redeemer", many know that. Unsure if using English language as the precedence will lead to the category being moved to "Category:Christ the Redeemer (Rio de Janeiro)". JWilz12345 (Talk|Contrib's.) 07:39, 24 August 2024 (UTC)
- I think in English the name is w:Rio de Janeiro so it would not make sense to translate the name of the city. I do not know if there is an official or established name in English for this statue but Category:Statue of the Little Mermaid (Copenhagen) is in English and so is Category:Eiffel Tower. And Category:Denmark and Category:Brazil are also English. Should that be changed to local names too? I think the rule on Wikipedia is that articles is named by the most used name. So I think it would make sense to do the same on Commons. Just like it is Category:Bill Clinton and not William Jefferson Clinton. --MGA73 (talk) 07:08, 24 August 2024 (UTC)
- Should that also be renamed to Category:Mga sementeryo ng militar sa Pilipinas? I think this is a strawman argument. That category name is not the official name or proper name of an actual entity unlike "Libingan ng mga Bayani" and so would run afoul of Commons:Language policy. —seav (talk) 08:47, 24 August 2024 (UTC)
- The argument from MGA73 seems strange, but it's true that "Heroes' Cemetery in the Philippines" could suggest that it isn't a specific cemetery, but a class.
- "in the Philippines" as a disambiguator is unusual, possibly incorrect. Enhancing999 (talk) 08:55, 24 August 2024 (UTC)
- The question was "Is renaming categories with an English name to local language names a good idea?" And I think not. I think in general it is best to use English names if there is one. For example "Denmark" not "Danmark" and "Statue of the Little Mermaid (Copenhagen)" not "Statuen af Den Lille Havfrue (København)". You can always find cases where the name can be discussed and in these cases just leave the name the creator have chosen. --MGA73 (talk) 09:14, 24 August 2024 (UTC)
- (1) The problem was the sample didn't quite match the question. If there is an actual name that can be referenced and that is in use, not a "river of january"-thing. (2) Your argument is about classes of objects, not specific objects. (3) there are casses where we don't use English names even for classes: Category:Betula pendula etc. Enhancing999 (talk) 09:23, 24 August 2024 (UTC)
- The question was "Is renaming categories with an English name to local language names a good idea?" And I think not. I think in general it is best to use English names if there is one. For example "Denmark" not "Danmark" and "Statue of the Little Mermaid (Copenhagen)" not "Statuen af Den Lille Havfrue (København)". You can always find cases where the name can be discussed and in these cases just leave the name the creator have chosen. --MGA73 (talk) 09:14, 24 August 2024 (UTC)
- Oppose The default language for Commons is English for good reasons, some of them well outlined by others here. The best idea, is to put alternative names, in however many languages into Wikidata. Broichmore (talk) 11:07, 24 August 2024 (UTC)
- Oppose There are many situations where the "local language" may not reflect the majority of people talking about it. Take Category:Vatican City for instance. We could call it Category:Città del Vaticano or Category:Civitas Vaticana but the problem is that the Vatican city is a location with many international visitors and discussion. This would add a lot to confusion without adding much. We don't need to translate the name, but in more international locations this could cause major issues. Immanuelle ❤️💚💙 (please tag me) 16:38, 25 August 2024 (UTC)
Conclusions + proposals + questions
- The question is: Is it allowed to rename categories with an English name to local language names?
- Answers:
- Local language names are OK for category names if:
- there is no English name, or
- there is a redirect from the local language name to the English name or
- it meets the criteria of Commons:Categories#Category names (names for which the non-English name is most commonly used in the English language).
- Categories with local language names are doubtful if:
- it is not about a proper name, but about something else, like a function: Category:Taoisigh (prime ministers of Ireland) and Category:Chancellors of Germany (prime ministers of Germany) are in the rest of the world known as prime ministers. Though these local words may be well known in the UK, they are not in most of the rest of the world and they certainly do not meet the Universiality principle, which says that "Identical items should have identical names for all countries and at all levels of categorization."
- names are hard to translate into English (then they should at least be transcribed to Latin script).
- It is not OK if:
- there was no CfD for (there was no CfD for this case)
- the name is not in the language that has the most chance of being searched for (in this case that is English)
- the name is not in Latin script.
- Another solution is to put alternative names, in however many languages, into Wikidata.
- Local language names are OK for category names if:
- Category:Heroes' Cemetery in the Philippines should not have been renamed to Category:Libingan ng mga Bayani.
Proposals
- Revert the renaming of Category:Heroes' Cemetery in the Philippines and let the local category have a redirect to the English one.
- Try to include the conclusions of this discussion in Commons:Categories.
@B25es, Seav, Geohakkeri, Jmabel, Nakonana, Adamant1, Enhancing999, Prototyperspective, MGA73, JWilz12345, Broichmore, and Immanuelle:
- Do you agree with the conclusions?
- Do you agree with the proposals?
--JopkeB (talk) 15:29, 30 August 2024 (UTC)
- I thought we don't want "River of January" type of translations (easy to do)? The question for a reference for the past name of the category seems still unanswered. We do have it for the current category name (even in the category). We do seem to agree the "Category:Cemeteries in the Philippines"-category should remain in English.
∞∞ Enhancing999 (talk) 15:35, 30 August 2024 (UTC)
- I think there is a big difference between "allowed" to rename and "a good idea" to rename. I understand "a good idea" as "we should" rename. I'm from Denmark and we have Category:Copenhagen and I think it should be left alone because there are more readers that know the city as Copenhagen than København. But if someone had named the category "Buy a harbour" then I would ofcourse agree that the original name was better just like I think "River of January" should be renamed.
- I think both the conclusions and the proposals are fine. But I also live happily with the new name. Just don't start a mass rename of thousands and thousands of categories just to get local names. --MGA73 (talk) 16:01, 30 August 2024 (UTC)
- Both conclusions and proposals make sense.
Some understanding has to be used regarding those of us that not being proficient enough in English ignore how to say some things in English. Some categories written the best we can will pop up.
Also when things may have an English name but it is not widely known even among native English speakers themselves, category names may be not as perfect as desired.
I do appreciate when categories I have created are properly Anglisized, but category redirects from other languages are of great help.
A long term plan/idea/intention to have something like Wikidata's Qs, where somebody writes iron, my wife reads hierro, and someone in Dodoma reads chuma, would be great. B25es (talk) 17:25, 30 August 2024 (UTC)
- I agree Immanuelle ❤️💚💙 (please tag me) 10:22, 5 September 2024 (UTC)
I certainly disagree with the conclusion that we should call the Chancellor of Germany the "prime minister of Germany". The term "Chancellor" dates back at least to the 1870s (and may have been used in Prussia before that, I'm not sure). It is simply the correct word in English (the German being Kanzler). The Germans do not use the word Kanzler to refer to comparable positions in other countries (e.g. they would not refer to the Prime Minister of the UK as a Kanzler, they would use Premierminister. It is simply not the same title, even if the role is similar. - 19:11, 30 August 2024 (UTC) — Preceding unsigned comment added by Jmabel (talk • contribs) (UTC)
- Maybe chancellor would be an appropriate term for historical figures who traditionally called that, but I don't anyone outside of Germany uses the term these days. Like I know Hitler was called a Chancellor in Germany during World War 2 but if I do a search for him Google there's plenty of sources, including Google's information panel that calls him a Prime Minister. But then conversely the Wikipedia article for the role is called "Chancellor of Germany." So...There should be some standard and at least IMO Prime Minister makes more sense for our purposes if for no other reason then universality. I don't think it would make sense or work if he had diifferent terms for political appointees based purely what they are called locally regardless of if there's a universally recognized term for the role or not. Categories mainly (if not completely) exist as a way to organize images and naming them based on universality is clearly the best to do that regardless of if its 100% acccurate in every single instance. --Adamant1 (talk) 20:41, 30 August 2024 (UTC)
- "Chancellor Angela Merkel" (in quotes) gets 1,880,000 Google hits. "Prime Minister Angela Merkel" gets a mere 5,940. Rather more recent than Adolph Hitler (also held the office rather longer), I'd say a 30:1 ratio suggests that the term is quite current. And since both "Chancellor" and "Prime minister" are specifically English-language, this should be specific to English. - Jmabel ! talk 21:59, 30 August 2024 (UTC)
- @Jmabel: With Hitler it's 60,600 results for "Chancellor Hitler" versus 6,550,000 for "Prime Minister Hitler" and as I've already said that's even with him being called Chancellor in Germany at the time. So it clearly depends. You'd have to agree it would be pointless to argue about which term should be used used based on who the particular head of state was at the time. "Well, my German head of state has 6 million results on Google for Chancellor and yours only has 60 thousand for Prime Minister. So clearly the category should be named 'Chancellor'!" Not to say you'd do that, but I do think it's a case where it would just be better to go with the university principle instead of trying to determine how to name the category based on a game of one upmanship over who's favorite term for a head of state gets more results on Google Search. --Adamant1 (talk) 22:47, 30 August 2024 (UTC)
- @Adamant1: I'm pretty sure you didn't use the quotation marks in doing that Google search. Obviously, Hitler would often be mentioned on the same page as some prime minister. But with quotation marks I get and 59,700 for "Chancellor Hitler" and only 9,440 for "prime minister Hitler", and even the latter include some more-or-less false positives: right near the top I see things like, "To the British Prime Minister, Hitler complained…" or "On May 13, 1940, Churchill gave his first speech as. Prime Minister. Hitler invaded France…" - Jmabel ! talk 23:23, 30 August 2024 (UTC)
- @Jmabel: Weird. I thought I had. It looks like I probably left out the leading quotation mark though. My bad. --Adamant1 (talk) 23:26, 30 August 2024 (UTC)
- @Adamant1: I'm pretty sure you didn't use the quotation marks in doing that Google search. Obviously, Hitler would often be mentioned on the same page as some prime minister. But with quotation marks I get and 59,700 for "Chancellor Hitler" and only 9,440 for "prime minister Hitler", and even the latter include some more-or-less false positives: right near the top I see things like, "To the British Prime Minister, Hitler complained…" or "On May 13, 1940, Churchill gave his first speech as. Prime Minister. Hitler invaded France…" - Jmabel ! talk 23:23, 30 August 2024 (UTC)
- @Jmabel: With Hitler it's 60,600 results for "Chancellor Hitler" versus 6,550,000 for "Prime Minister Hitler" and as I've already said that's even with him being called Chancellor in Germany at the time. So it clearly depends. You'd have to agree it would be pointless to argue about which term should be used used based on who the particular head of state was at the time. "Well, my German head of state has 6 million results on Google for Chancellor and yours only has 60 thousand for Prime Minister. So clearly the category should be named 'Chancellor'!" Not to say you'd do that, but I do think it's a case where it would just be better to go with the university principle instead of trying to determine how to name the category based on a game of one upmanship over who's favorite term for a head of state gets more results on Google Search. --Adamant1 (talk) 22:47, 30 August 2024 (UTC)
- "Chancellor Angela Merkel" (in quotes) gets 1,880,000 Google hits. "Prime Minister Angela Merkel" gets a mere 5,940. Rather more recent than Adolph Hitler (also held the office rather longer), I'd say a 30:1 ratio suggests that the term is quite current. And since both "Chancellor" and "Prime minister" are specifically English-language, this should be specific to English. - Jmabel ! talk 21:59, 30 August 2024 (UTC)
- (Reaction to the disagreement of Jmabel, 19:11, 30 August 2024) The point is not "that we should call the Chancellor of Germany the "prime minister of Germany""; of coarse you may call anyone the way you or the other person likes it. Nor whether what has more hits in Google or any other search machine. The point is: how do we name (sub)categories in a structural and consistent way. The real question here may be: What is more important: the consistancy of the category structure (like the Universality principle prescribes) or the titles of the prime ministers of Germany and Ireland? And if you choose the latter: what other exceptions exist to the Universality principle? --JopkeB (talk) 09:12, 9 September 2024 (UTC)
- @JopkeB: as long as there are appropriate redirects and appropriate descriptions on the category page, ultimately I can deal with it. In the specific case of German chancellors, though, perhaps then we want a generic Category:Prime ministers of Germany that exists only to be a parent category of Category:Chancellors of Germany, much as Category:Monarchs of France includes (directly) the Kings of France and has subcategories for the emperors, the monarchs of areas within present-day France, etc.? - Jmabel ! talk 06:20, 10 September 2024 (UTC)
- @Jmabel: I would like that Category:Chancellors of Germany only has a redirect to Category:Prime ministers of Germany and that the latter becomes the main category. Only then the Universality principle principle will be done justice. The current subcategories of Category:Prime ministers of Germany should get a parent like Category:Prime ministers of states of Germany. JopkeB (talk) 04:27, 11 September 2024 (UTC)
- Wouldn't it be better the other way around? Have Category:Prime ministers of Germany redirect to Category:Chancellors of Germany? The former category, to ensure that templates and categories like "Prime ministers of Europe" will work, and the latter, for the actual name. Nakonana (talk) 15:14, 11 September 2024 (UTC)
- @Jmabel: I would like that Category:Chancellors of Germany only has a redirect to Category:Prime ministers of Germany and that the latter becomes the main category. Only then the Universality principle principle will be done justice. The current subcategories of Category:Prime ministers of Germany should get a parent like Category:Prime ministers of states of Germany. JopkeB (talk) 04:27, 11 September 2024 (UTC)
- If we are insisting that we use strict English for the cemetery in question (despite the Tagalog name being extensively used in English sources [see above]), then there is still the question of which English translation should be used. Should it be the original "Heroes' Cemetery", or "Cemetery of Heroes" or "Cemetery of the Heroes"? All three are acceptable English translations of the official name, but I prefer any of the two latter names as these more closely resemble the official name (this is like preferring "River of January" instead of "January River" for Rio de Janeiro). As for disambiguation, do we append "in the Philippines" as with the original category name, or use a parenthetical "(Philippines)" which seems to be the convention? —seav (talk) 22:32, 30 August 2024 (UTC)
- At least in regards to the first part of your message, probably the best way forward at this point would be to revert the edit that changed the name and then someone can open a CfD to figure out what the best option is from there. It's pretty clear from this that the name shouldn't have been changed without there being a CfD first though and at least IMO figuring out what is to go with instead is probably out of scope of this conversation. --Adamant1 (talk) 22:55, 30 August 2024 (UTC)
- I agree with Adamant1 about reverting. And, @seav, I intended "River of January" as a reductio ad absurdum, not a serious proposal. In English, the city in question is universally called either "Rio de Janeiro" or just "Rio". This is never turned into English words. - Jmabel ! talk 23:29, 30 August 2024 (UTC)
- Yes, I know that "River of January" wasn't a serious suggestion, but I simply used it as an example of which of several possible English translations ought to be used if we are translating a name. — seav (talk) 00:39, 31 August 2024 (UTC)
- I don't think anybody tried to do a literal translation of "Rio do J.", but occasionally I come across similar ones, where it's likely that Commons is the sole source for that name. I could name one or the other, but I don't want to embarrass the user who did try to do so. If you find a bad translation of mine, feel free to cite is as a negative example.
- Obviously, I don't think there is an issue to provide a literal translations in the description.
- For the cemetery, I agree we should have a proper discussion on CfD, especially as the previous name was the result of one.
∞∞ Enhancing999 (talk) 11:51, 31 August 2024 (UTC)- @Enhancing999 I've seen countless literal "translations" to English of Brazilian buildings - generally produced by Brazilian editors themselves - which come out as nothing but butchered English names with no existence out of Commons. Every time I find one of those I move it to the building proper name in Portuguese, without leaving any redirect - there's already a lot of UG crap around, we don't need more. Darwin Ahoy! 11:10, 18 September 2024 (UTC)
- I wonder if the same should be done with other languages, too, like translating the names of churches (e.g. in Eastern Europe) that are only known regionally but unknown on an international level. I've been translating them, too, but it's a pain to come up with translations of partially old-Slavonic (or church-Slavonic) names, which is sort of a different language from the modern Eastern Slavic languages. And ones the category name is translated, it becomes a pain to categorize files that don't have categories yet but mention the name of the church in the native language; so you go to the "Category:Churches of City X" and find 20 sub-categories and now have to click through all of them to figure out the right one because the English name does not ring a bell, so to speak. Nakonana (talk) 12:50, 18 September 2024 (UTC)
- @Enhancing999 I've seen countless literal "translations" to English of Brazilian buildings - generally produced by Brazilian editors themselves - which come out as nothing but butchered English names with no existence out of Commons. Every time I find one of those I move it to the building proper name in Portuguese, without leaving any redirect - there's already a lot of UG crap around, we don't need more. Darwin Ahoy! 11:10, 18 September 2024 (UTC)
- Yes, I know that "River of January" wasn't a serious suggestion, but I simply used it as an example of which of several possible English translations ought to be used if we are translating a name. — seav (talk) 00:39, 31 August 2024 (UTC)
- I agree with Adamant1 about reverting. And, @seav, I intended "River of January" as a reductio ad absurdum, not a serious proposal. In English, the city in question is universally called either "Rio de Janeiro" or just "Rio". This is never turned into English words. - Jmabel ! talk 23:29, 30 August 2024 (UTC)
- At least in regards to the first part of your message, probably the best way forward at this point would be to revert the edit that changed the name and then someone can open a CfD to figure out what the best option is from there. It's pretty clear from this that the name shouldn't have been changed without there being a CfD first though and at least IMO figuring out what is to go with instead is probably out of scope of this conversation. --Adamant1 (talk) 22:55, 30 August 2024 (UTC)
- In the UK, the German prime minister is always referred to as the chancellor, as is his Irish equivalent the Taoiseach. Broichmore (talk) 13:18, 31 August 2024 (UTC)
- In passing, I've just looked at Category:General Wali Kothi, who would believe this is actually a building aka Star house, Tara Khothe, Tara Khothee, andd Lucknow Observatory. There will be other names and spellings too. Never mind the Hindi variants, and counting. You can bet your bottom dollar, that the locals call it Star House, or the old Observatory, terms not officially used since 1947.
- Almost every Indian category suffers from the same malaise, and exemplify reasons why, English is the standard here. Broichmore (talk) 13:26, 31 August 2024 (UTC)
- Comment, personally, I think that the solution is really simple, we need to reform the way category redirects and title displays work on a technical level. We simply change Wikidata's notability guidelines to include any and all Wikimedia Commons categories, then integrate each and every Commonswiki category with Structured Data on Wikimedia Commons (SDC), then have a robot automatically create redirects based on alternative names in different languages based on what is listed in Wikidata, then make title display names based on the settings of the viewer. Congratulations, now the Wikimedia Commons is truly a multilingual project rather than an Anglocentric one.
- How do we prevent vandals from deliberately adding wrong terms and / or insulting words and phrases in translations? Simple, we limit who can add these translations, it's a special but very easily acquired new right at Wikidata that any administrator could bestow upon someone based on either a local (at Wikidata) or Commonswiki endorsement.
- How do we integrate this at every level? Simply upgrade the MediaWiki Upload Wizard to properly integrate category redirects like Cat-a-lot already does, this way redirects become useful and someone can theoretically only ever contribute using French or Japanese without ever seeing a single word in English and not even notice that the Wikimedia Commons is in a language that they don't understand.
- These changes would solve a lot of issues overnight, all we need is for Wikidata to slightly adjust its notability guidelines (something user "Mike Peel" and I have been advocating for for years), then the rest can be 90% (ninety percent) done by robots, and humans can simply do quality checks and focus on more important things like finding educational content and checking if that educational content is licensed in a free way. --Donald Trung 『徵國單』 (No Fake News 💬) (WikiProject Numismatics 💴) (Articles 📚) 07:40, 5 September 2024 (UTC)
- This is probably a hot take but I've been a fan of how people use redirects for essentially everything on here. There's a point where it just kind of defeats their purpose and usefulness if everything is redirected multiple times and with multiple different variations of the word, concept, or whatever. Like create a redirect for something that clearly needs one, but do we really need redirects for every random misspelling of a name or redirects for every name in every niche language on the planet? Probably not. --Adamant1 (talk) 07:58, 5 September 2024 (UTC)
- Every misspelling rather not, but "reasonable" alternative yes. That would definitely solve most category issues with "Lenin Street", "Lenina Street", "Ulitsa Lenina" and "улица Ленина". Wikidata entries don't usually have all that many alternative spellings listed. Even if we would just use an English category name and one native language name for a category, that would already help a lot. Nakonana (talk) 18:47, 5 September 2024 (UTC)
- Insofar as additional spellings are valid, adding them to Wikidata is helpful all around. - Jmabel ! talk 18:55, 5 September 2024 (UTC)
- Every misspelling rather not, but "reasonable" alternative yes. That would definitely solve most category issues with "Lenin Street", "Lenina Street", "Ulitsa Lenina" and "улица Ленина". Wikidata entries don't usually have all that many alternative spellings listed. Even if we would just use an English category name and one native language name for a category, that would already help a lot. Nakonana (talk) 18:47, 5 September 2024 (UTC)
- Where do I sign? Nakonana (talk) 18:48, 5 September 2024 (UTC)
- Donald Trung: That looks like a fine solution and proposal. Would you please add it to the Community Wishlist? JopkeB (talk) 08:48, 9 September 2024 (UTC)
- This is probably a hot take but I've been a fan of how people use redirects for essentially everything on here. There's a point where it just kind of defeats their purpose and usefulness if everything is redirected multiple times and with multiple different variations of the word, concept, or whatever. Like create a redirect for something that clearly needs one, but do we really need redirects for every random misspelling of a name or redirects for every name in every niche language on the planet? Probably not. --Adamant1 (talk) 07:58, 5 September 2024 (UTC)
Overall conclusions
We agree on:
- The renaming of Category:Heroes' Cemetery in the Philippines to Category:Libingan ng mga Bayani should be reverted. Action JopkeB Done
- After the reverting, whoever wants a different name for this category can start a CfD (discussion).
- (For future cases:) Whoever wants to rename an English category name to a local name should start a CfD before the renaming.
- The proposal of Donald Trung to solve the problem with category names in different languages with a technical solution, like in Wikidata. Action Donald Trung (to bring it to the Community Wishlist)
We do not agree on: Whether it is allowed to have exceptions to the Universality principle, for example: Should category names of the German and Irish prime ministers (now in German and Irish) be renamed to the general (English) name for categories about prime ministers? I suggest to move this discussion to Commons talk:Categories. If you agree: Action (perhaps Jmabel?)
--JopkeB (talk) 09:44, 9 September 2024 (UTC)
- Why should we start a category discussion to fix "River of January" problems?
∞∞ Enhancing999 (talk) 09:55, 9 September 2024 (UTC)- Because there are cases that might be obvious for you and some others, but not for everyone. JopkeB (talk) 10:18, 9 September 2024 (UTC)
- Well, that's just your POV. I think everybody agreed we shouldn't have "River of January".
∞∞ Enhancing999 (talk) 10:20, 9 September 2024 (UTC)- Started the discussion now for "Cristo Redentor" (which the English world has a fitting name: "Statue of Christ the Redeemer"). Commons:Categories for discussion/2024/09/Category:Cristo Redentor (Rio de Janeiro). JWilz12345 (Talk|Contrib's.) 11:20, 9 September 2024 (UTC)
- Well, that's just your POV. I think everybody agreed we shouldn't have "River of January".
- Because there are cases that might be obvious for you and some others, but not for everyone. JopkeB (talk) 10:18, 9 September 2024 (UTC)
- I would differentiate between category names for object classes and for individual objects. Class names, descriptive names and categories for universal concepts should always be in English. Category names for specific things (persons, official titles, places, institutions, etc.), on the other hand, should be permitted in the local language. There may be exceptions where the English names are well established and immediately understood in the language area concerned, but as a rule, arbitrary translations bring few advantages for international users and significant disadvantages not only for local users: For example, unambiguity is often lost in the case of political parties or parliamentary names; in the case of buildings the alphabetical sorting may change; in the case of squares, streets or churches, the assignment in the city map becomes a guessing game. And the systematic information is provided in English by the parent categories anyway, so this does not need to be repeated in the individual object category. Rudolph Buch (talk) 16:27, 9 September 2024 (UTC)
- IMO (and that is for years my guideline) is to name institutions, place names (including all othe geographic features and building names) as far as possible accourding to their local name, the so called endemic name. It does not make sense to name München Munich becaus in Czech it is Mnichov, in Italian it's Monaco, and so on. So why an Italian should look/search for an English name? So it's the White House, the Tour Eiffel, the Karlsplatz, the Hlavní třída.
- Off course, they're conflicts, as in Burj Khalifa, which is transcripted following English rules; following German rules it would be Burdsch Khalifa, or there are cases in which the original language uses English names, 101 Tower, or there exists an international name as in Japan Meteorological Organization.
- There is no precedense for english names (Why there should it. This is not an english project!) Matthiasb (talk) 07:37, 17 September 2024 (UTC)
- @Matthiasb: So you'd be totally cool with someone creating a category name for something having to do with Guangdong, China in Kejia instead of Mandarin because Kejia is the "local language" there? Of course I'm being a little sarcastic, but people seem to take issue with category names being translated into English when whatever language they prefer to instead isn't the "local language" to begin with either. I don't see anyone on here complaining about how this isn't a Mandarin language project though. It only comes up with English for some reason. As if there's something uniquely bad about it compared to other languages that people aren't speaking locally anyway. Chinese, English, or whatever. Your still putting out locals regardless, but there's a point where it's necessary to go with the next language up for the project to be usable. Whatever language that might be. Again though, in a lot of cases it isn't "local" either. But let's all complain about English like it's unique. --Adamant1 (talk) 08:11, 17 September 2024 (UTC)
- People are complaining about English because it's the language that is being used here, not because it's a bad language or something. If most category names were in Chinese, then people would complain about the names being in Chinese instead of the local name. Translating things always comes with problems and some things being lost in translation. Nakonana (talk) 12:31, 17 September 2024 (UTC)
- To be more specific, yes, as a matter of lingua franca, categorie names are expected to be in English. But we're talking also n proper names or proper nouns. So churches in München is okay, but its Hofbräuhaus (München). Its Staromĕstské namĕsti (Praha). It's not about translating. No we do not translate names. It's George Walker Bush not Georg Geher Busch And yes, I am aware, that before and even with Elizabeth II. it was common in Germany and the German press to use the German name variants. It is not anymore. It's Charles III. (or simply Charles) in any news outlet, even in the yellow press. In the German WP thera are some prominent exceptions, f.ex. Weißes Haus instead of White House (but it is Flat Iron Building or Chrysler Building (and not "Bügeleisengebäude" or "Chryslergebäude"). There are some intriguing rules to that which are not needed to be discussed here. I don't know about the differences between Mandarin and other chinese dialects but the issue here is another: we need the common romanization which certainly is Guangdong. As I wrote above, there are languages in which romanization into German texts differs from romanization in English or French texts, for example Arab or Ivrit, and there are languages where are no differences. Matthiasb (talk) 21:38, 19 September 2024 (UTC)
- People are complaining about English because it's the language that is being used here, not because it's a bad language or something. If most category names were in Chinese, then people would complain about the names being in Chinese instead of the local name. Translating things always comes with problems and some things being lost in translation. Nakonana (talk) 12:31, 17 September 2024 (UTC)
- @Matthiasb: So you'd be totally cool with someone creating a category name for something having to do with Guangdong, China in Kejia instead of Mandarin because Kejia is the "local language" there? Of course I'm being a little sarcastic, but people seem to take issue with category names being translated into English when whatever language they prefer to instead isn't the "local language" to begin with either. I don't see anyone on here complaining about how this isn't a Mandarin language project though. It only comes up with English for some reason. As if there's something uniquely bad about it compared to other languages that people aren't speaking locally anyway. Chinese, English, or whatever. Your still putting out locals regardless, but there's a point where it's necessary to go with the next language up for the project to be usable. Whatever language that might be. Again though, in a lot of cases it isn't "local" either. But let's all complain about English like it's unique. --Adamant1 (talk) 08:11, 17 September 2024 (UTC)
August 27
Massive backlogs are now the norm
Looking at Commons:Deletion requests, there are currently nearly three thousand deletion discussions that are past due for closure. Commons is teetering on being a failed project due in part to the lack of administrators. There simply is too much work for the available pool of administrators. This problem isn't getting better. It's getting worse.
So, why are we allowing new uploads to the project from anyone which just exacerbates this problem? The status quo is failing. --Hammersoft (talk) 14:03, 27 August 2024 (UTC)
- Category:Items with disputed copyright information is another example of backlogs. And it's hard to tell how well new uploads are reviewed. --EugeneZelenko (talk) 15:14, 27 August 2024 (UTC)
- This is a real problem. Aside from more administrators, what do you propose? What should be done to limit bad uploads (for copyrights or other reasons)? JopkeB (talk) 15:37, 27 August 2024 (UTC)
- Maybe more agressive and pro-active use of CheckUser? Trade (talk) 17:18, 27 August 2024 (UTC)
- I forgot where I read it, but I think CheckUser has to be extremely rare because of the nature of the thing. --Adamant1 (talk) 18:03, 27 August 2024 (UTC)
- Then almost every other Wikiproject should have the same high rejection rate. Except they dont. Its just Commons Trade (talk) 20:44, 27 August 2024 (UTC)
- I haven't participated in Wikipedia for at least a few years but from what I remember they have pretty high standards to. But then they also have way more users and bad actors because of that. Plus it's just seems easier to come up with evidence of socking on there. So it just makes sense that they would have a higher CheckUser rate. I know from my own experience that there's been at least a couple of times where I wanted to file a CheckUser report on here but ended up not doing one because there just wasn't enough of a paper trail in both cases to justify it. I feel like there probably would have been if it had happened on Wikipedia either time though. --Adamant1 (talk) 20:50, 27 August 2024 (UTC)
- The larger Wikipedia projects are in my experience much more strict in almost all areas compared to Commons. ReneeWrites (talk) 18:12, 3 September 2024 (UTC)
- Then almost every other Wikiproject should have the same high rejection rate. Except they dont. Its just Commons Trade (talk) 20:44, 27 August 2024 (UTC)
- I forgot where I read it, but I think CheckUser has to be extremely rare because of the nature of the thing. --Adamant1 (talk) 18:03, 27 August 2024 (UTC)
- Maybe more agressive and pro-active use of CheckUser? Trade (talk) 17:18, 27 August 2024 (UTC)
- There's been at least a couple of discussions lately about expanding and/or improving the criteria for speedy deletion. It seemed like none of them went anywhere at the time, but I think that would a lot with the backlog. As would making it easier for people to nominate images for speedy deletion to begin. As I guess its not an option on the side panel right now outside of clear copyright violations when it really should be. We could also use more admins, but I the standards are currently to high and I doubt they would work in that area anyway. Really you could argue current admins not dealing with deletion request is as much of or more of an issue then us just not having enough of them. There's no point in having more admins if they aren't going to work in the areas where they are most needed to begin with though. --Adamant1 (talk) 15:54, 27 August 2024 (UTC)
- The backlog was much lager some month ago. If we need an urgent action we should block IP edits to prevent their spam and get time to work on deletion requests. GPSLeo (talk) 15:56, 27 August 2024 (UTC)
- I had thought about that but it doesn't seem like there's any support to block IP edits on here. I totally agree that they should be blocked though. IP editors are a massive time suck in general. --Adamant1 (talk) 16:23, 27 August 2024 (UTC)
- How exactly is blocking IP's supposed to help in any way? The issue is with people uploading copyvio. IP's obviously cannot upload anything in the first place Trade (talk) 17:16, 27 August 2024 (UTC)
- But if we need 6 admin hours per day to check and clean up IP edits we can not use these 6 hours to handle deletion requests. It is simply a question of admin time availability. GPSLeo (talk) 17:50, 27 August 2024 (UTC)
- Aside from what GPSLeo says I work in the area a lot and IP editors tend to either create a lot of meritless deletion requests or troll DR discussions. Obviously it would be a lot easier to deal with the back log if there weren't as many clearly pointless deletion requests being created to begin with. --Adamant1 (talk) 17:53, 27 August 2024 (UTC)
- It's a bit odd for random IPs new to Commons to be so familiar with the DR procedure Trade (talk) 20:46, 27 August 2024 (UTC)
- Most of the time their comments amount to something along the lines of "not own work?" Otherwise I'd agree with you. There are some times where there's clearly socking involved or the IP editor is probably a previously blocked user though. Both of which is all the more reason to block them IMO. It's purely conjecture on my part, but I highly doubt most IP editors on here are participating in the project that way for legitimate reasons. --Adamant1 (talk) 21:13, 27 August 2024 (UTC)
- Agree with Adamant1. I also see too often violation and other non-constructive contributions by IP adresses, also in discussions. It takes a lot of time to fix them. Why do IP's have nearly the same rights on Commons as accounts have? My proposal: Restrict their rights. They should just be allowed to read things on Commons and have no possibility for violation. They are already excluded from uploading, why not from editing as well? If they want to start or join discussions, nominate files for deletion and so on, then they should use an account, like the rest of us. JopkeB (talk) 06:36, 28 August 2024 (UTC)
- It's a bit odd for random IPs new to Commons to be so familiar with the DR procedure Trade (talk) 20:46, 27 August 2024 (UTC)
- How exactly is blocking IP's supposed to help in any way? The issue is with people uploading copyvio. IP's obviously cannot upload anything in the first place Trade (talk) 17:16, 27 August 2024 (UTC)
- I had thought about that but it doesn't seem like there's any support to block IP edits on here. I totally agree that they should be blocked though. IP editors are a massive time suck in general. --Adamant1 (talk) 16:23, 27 August 2024 (UTC)
- The deletion requests would be much easier to close if more people participated. Trade (talk) 17:19, 27 August 2024 (UTC)
- The backlog was much lager some month ago. If we need an urgent action we should block IP edits to prevent their spam and get time to work on deletion requests. GPSLeo (talk) 15:56, 27 August 2024 (UTC)
- Less than 3.000? Good job then, just a few years ago it used to be over 6.000. --Rosenzweig τ 15:59, 27 August 2024 (UTC)
- 3000 sounds like what @Yann does in a blink. ;) Thanks btw.
∞∞ Enhancing999 (talk) 20:48, 27 August 2024 (UTC)
- 3000 sounds like what @Yann does in a blink. ;) Thanks btw.
- Is the problem really getting worse? As in, are the various backlogs growing faster than they're shrinking? This argument came up on enwiki with regard to unsourced articles recently and it turns out that though there are an enormous number of those, they are being taken care of faster than new ones are being created.
- I suspect something similar is going on here, especially since there is over a decade's worth of files to go through. To use another backlog as a comparison point, there are roughly 128,000 uncategorized files uploaded in 2024. Which is bad, but more than 128,000 files have been removed from the uncategorized backlog from 2021 and 2023 alone. Gnomingstuff (talk) 00:24, 28 August 2024 (UTC)
" Commons is teetering on being a failed project" if backlogs mean a project is teetering on the edge of failure, the virtually all Wikimedia projects have been teetering on the edge of failure since their inception. Yes, it would be good to do better. No, this is not a crisis, let alone a death knell.
- I opened this CFD 1,983 days ago. Just sayin'. Just Step Sideways (talk) 19:45, 27 August 2024 (UTC)
- Do not forget you need a native English speaker who is at the same time a railway fan (not necessarily a railway expert, but has knowledge above the average) to close this discussion. I just looked at it and I can not close it. I can not say a difference between "close" and "closed" in this case. Ymblanter (talk) 21:02, 27 August 2024 (UTC)
- More admins would help, the issue is really active admins. Among the 183 admins, only 129 have done any admin action during the last month, and only 64 have done more than 20 admin actions during the same period. Yann (talk) 21:10, 27 August 2024 (UTC)
- A CFD is not a Deletion request, so I think it is out of scope of this discussion. CFD's cannot be closed just by administrators, but every experienced editor can do so. JopkeB (talk) 06:07, 28 August 2024 (UTC)
- When deletion discussions remain open essentially forever, when backlogs remain in the thousands across a number of types, when copyright violations are allowed to sit on the project indefinitely, then yes I would call this a failed project. We are not able to effectively police ourselves, and the incoming level of problematic files overwhelms existing ability to deal with it. --Hammersoft (talk) 17:36, 1 September 2024 (UTC)
- maybe you can become a com:sysop and help? would you accept a nomination? RZuo (talk) 18:55, 1 September 2024 (UTC)
- Frankly, I'm not active enough. I've seen various requests here, and noted concerns about activity levels. I doubt I'd meet expectations. Regardless, adding another administrator to the mix, especially at my activity levels, isn't going to help. I have other reasons as well, but this is really getting out of the scope of this discussion. --Hammersoft (talk) 19:18, 1 September 2024 (UTC)
- maybe you can become a com:sysop and help? would you accept a nomination? RZuo (talk) 18:55, 1 September 2024 (UTC)
I don't think we're teetering on anything, but we should all be able to agree the backlog is too big. Here's another reason it exists: DRs are often hard. Most admins aren't IP lawyers in one country, let alone every country, and there are a lot of really rough calls. Scope-based and people-based debates are also often hotly contested and difficult to assess. Beyond that, AfD on enwp is contentious and results in a lot of grief for closers, especially from newer users who don't understand the process. On commons, even many of the experienced users (or experienced users from other projects) routinely misunderstand the basics and make life sadder for commons admins. Back to the main point, however, there are definitely a lot of DRs that aren't that complicated which wait a long time to be closed, too, so more admins would help, at least. — Rhododendrites talk | 17:59, 1 September 2024 (UTC)
- The same thing applies to CfDs. It just took me 2 days to deal with 4 of them because they all involved moving thousands of files and deleting hundreds of categories. Then there's ones that are extremely simple and can be closed in a couple of minutes. But its near impossible to deal with the later to any meaningful degree because of the time it takes deal with the former. Then there's a bunch that are clearly just complex to deal with for whatever reason. But its such a big mishmosh. Anyway, these things don't really scale. For them to be managable you'd need to have everyone on here participate and inforce clear time limits on how long the discussions can be open for. As well as cnsistantly keep the back logs down to a week or two. Otherwise its just going to be an unworkable mess. Its hard to say something should be closed a certain way after a week or two when it has a low turn out and an unclear outcome though. Sometimes it takes years just for someone to comment. So I don't know. There really is no single, easy solution. --Adamant1 (talk) 20:46, 1 September 2024 (UTC)
- Just adding an example to go with my "DRs are often hard" comment above. I started looking through the oldest DRs and saw Commons:Deletion requests/Files in public domain featuring various Indian film artists post-1945. Taking action on that DR would involve a few dozen images and an understanding not just Indian copyright or US copyright but the interplay between the two. Even if we get more admins, the number of people who would feel confident weighing in on something like that are very few. Let's not have a discussion here about the correct outcome, though -- I'm just highlighting that these are hard, and while the act of closing a discussion may not take much time, attaining the level of understanding required to parse such a case would be very time consuming. It's no wonder nobody wants to do it. I wonder if there's a potential project for the WMF to undertake that would effectively create charts like COM:HIRTLE for media published outside the US and uploaded to Commons. Maybe even an interactive tool. — Rhododendrites talk | 20:56, 1 September 2024 (UTC)
- I think for reusers and uploaders the open request signals there is a complex issue. Highly problematic uploads get nuked within hours.
∞∞ Enhancing999 (talk) 22:15, 1 September 2024 (UTC) - Just want to ping Sannita (WMF) to this discussion. No response require -- just something where the WMF could help with a backlog indirectly. — Rhododendrites talk | 21:41, 16 September 2024 (UTC)
- @Rhododendrites I know no response was required, but I figured it would be nice to do it anyway. :) I'm not entirely sure the Legal department can help with this, because it would take quite some time to do research, but I can try to make the case for it. Sannita (WMF) (talk) 17:17, 17 September 2024 (UTC)
- I think for reusers and uploaders the open request signals there is a complex issue. Highly problematic uploads get nuked within hours.
- I'll again bring up my suggestion that first uploads by new users should not be publicly visible until reviewed. -- Infrogmation of New Orleans (talk) 02:26, 9 September 2024 (UTC)
- Maybe, but wouldn't that create a different kind of backlog? — Rhododendrites talk | 12:29, 12 September 2024 (UTC)
- Strong oppose there is at least 5000(five thousand) new files every day, we cannot handle it. modern_primat ඞඞඞ ----TALK 21:41, 18 September 2024 (UTC)
- 5000 new files that are first uploads by new users every day? -- DaxServer (talk) 07:50, 19 September 2024 (UTC)
- uhm... no, thats probably 500-1000 modern_primat ඞඞඞ ----TALK 08:16, 19 September 2024 (UTC)
- 5000 new files that are first uploads by new users every day? -- DaxServer (talk) 07:50, 19 September 2024 (UTC)
- Oppose as well, would create just another backlog. On the more general question, as Rosenzweig points out, there were times when the DR backlog was even considerably larger, so I don't see a dramatic issue right now. Of course it's not great that some discussions are open for months, and I think we need more admins, but this hardly makes Commons "teetering on being a failed project". Regarding the categories for discussion, which sometimes are open for years, this IMHO just shows that there isn't that much interest in specialist topics such as the mentioned example of Commons:Categories for discussion/2019/03/Category:Store railway wagons. I'd suggest that the few people that are interested in such a category just do what they think is best. Gestumblindi (talk) 08:10, 19 September 2024 (UTC)
- IMO this request is not decideable. That is not a question of backlog but of three different opinions by two users in the discussion. On what base an sysop should close this (or any user). BUt I will another opinion which won't make it easier either. Matthiasb (talk) 21:45, 19 September 2024 (UTC)
- I often encounter uploads from new users and tag them with our various tags - speedy deletion, DR, No-source, No-permission, among others. How I decide [at the moment] if I want to tag: If the uploads have no Exif data and no Resolution data, and if the uploader, may or may not be new, has very few uploads. From my experience so far, I observed that these files are usually copyright violations, for various reasons. If I can find which violation, I'd tag speedy deletion. If not I'd probably tag with the other tags depending on the file. Does it help if there's a tool that can list out exactly these files: "have no Exif data, no Resolution data, uploader has very few uploads". This could narrow the effort to track files further down rather than to patrol a broad range of files? -- DaxServer (talk) 08:38, 19 September 2024 (UTC)
August 29
Category descriptions
I'd like to establish some community consensus on what constitutes an appropriate amount/use of category descriptions. Categories that have to do with North Brabant often have large, self-referential descriptions with all manner of interwiki linking and use of external linkage that is not appropriate, for example Category:Geertruidenberg or Category:Heusden, North Brabant, or any of the places linked in the template above. Compare that to famous cities like Category:London or Category:Chicago, which have minimal descriptions by comparison.
Then there's subcategories. Category:Van Goghkerkje mentions it's at the Vincent van Goghplein 1, 4881 DG Zundert in the municipality of Zundert in the province of North Brabant in the south of the Netherlands. This information is then repeated in almost every single subcategory, including the metacat Category:Van Goghkerkje by year.
Category:Streets in Geertruidenberg mentions in the category description that this subcategory is for pictures of streets in the municipality Geertruidenberg in the province of North Brabant in the south of in the Netherlands. Category:Nature of Bergen op Zoom mentions in the category description that this subcategory is for pictures of nature and nature reserves in and around Bergen op Zoom in the province of North Brabant in the south of in the Netherlands. The category Category:Geography of Moerdijk mentions in the category description that this subcategory is for pictures of the geography in de gemeente Moerdijk in the province of North Brabant in the south of in the Netherlands.
Thousands of categories are like this, almost exclusively those in North Brabant. I'd like to start cleaning these up, but before I do I want to establish community consensus on where the line is drawn when something turns excessive so I know what to trim and what to keep.
My opinion is this:
- No repeating of information that can be found in the category name ("The category "Streets of Geertruidenberg" is for pictures of streets of the municipality of Geertruidenberg...")
- No repeating of information that can be found in an infobox
- No repeating of information that can be found in a parent category
- No external linking to personal websites, or other sites that would not be considered reliable sources on other Wikiprojects
ReneeWrites (talk) 07:36, 29 August 2024 (UTC)
- I guess most categories where the description would be relevant have {{Wikidata Infobox}}, which has all necessary information. Ymblanter (talk) 09:11, 29 August 2024 (UTC)
- Maybe we miss something, but both Category:London and Category:Chicago currently have tons of category description, most can be found elsewhere, compared to Category:Van Goghkerkje. The later mentions an essential point that seems odd to mention without including a basic description before including stating the category name in the description itself.
- Not sure how Category:Geography of Moerdijk can be compared negatively to Category:Chicago except when there is some bias involved. Stating the topic in local language(s) is fairly important.
- Category:Chicago seems relatively bad as it has a large seal image in the center. It mentions "largest city in Illinois," which is marginally helpful.
- A nice thing about Category:London is that it has that collapsed list that helps sorting. I find such Commons specific pointers fairly important.
∞∞ Enhancing999 (talk) 12:02, 29 August 2024 (UTC)- I was comparing the categories of Geertruidenberg and Heusden with those of London and Chicago, the latter of which have very succinct descriptions. London's is just one line of text. This is not about the navigation templates.
- My critique of Category:Van Goghkerkje is that this information is repeated in most subcategories like Category:Van Goghkerkje by year and Category:Van Goghkerkje in 1969, etc.
- And I was not comparing Category:Geography of Moerdijk to any of the above categories. I was making a stand-alone critique (that applies to Category:Streets in Geertruidenberg and others) that the category description just says what's in the category name, and that this is superfluous. ReneeWrites (talk) 12:26, 29 August 2024 (UTC)
- Sorry about that then. Contentwise Category:Geertruidenberg or Category:Heusden, North Brabant don't compare that badly, though I don't think there should be two descriptions in Dutch and English.
- If there are Wikipedia articles on the topic, references would generally not be needed. There was some discussion about this at Commons:Categories_for_discussion/2024/07/Category:Harp_Guitar_Form_3a.
- The repetition at Category:Van Goghkerkje by year seems suboptimal, but that applies to the entire "by year" tree as well.
∞∞ Enhancing999 (talk) 12:52, 29 August 2024 (UTC) - I think that at least the street description makes sense, because I often treat subcategories of streets as "address-like" categories, so if there's an image of a house at "Peace Avenue 24" I might put the image of that into the "Category:Peace Avenue". But the description makes it sound as if only images of the actual road would belong into that category. Nakonana (talk) 20:55, 16 September 2024 (UTC)
- @Nakonana: Buildings on a street definitely belong in the category for the street. Keep in mind: this is not about ontology, it is about helping users find things. - Jmabel ! talk 19:26, 17 September 2024 (UTC)
- There seem to be two types of descriptions (maybe one more popular than others), for a sample Category: "ABC":
- 1. "ABC" is a ..
- 2. This category is about media/photos/images related to/about "ABC", a ..
- Personally, I prefer (1.), but 2. is not uncommon.
- Content-wise, there are three types of descriptions about "ABC":
- 1. no description
- 2. Just repeat the title literally ("ABC")
- 3. State something about ABC
- Personally, I prefer 1 or 3.
∞∞ Enhancing999 (talk) 12:38, 29 August 2024 (UTC)- I agree with Ymblanter, wikidata supplies this info.
- In the case, of all things North Brabant wikidata may be insufficient. I more than suspect, what you have here is an editor set in their ways since 2008, who finds Wikidata impenetrable, or even superfluous and persists without it. The cats are cluttered, I agree, and few (if any), will use these links. However, there’s no harm done here.
- After all, this is supposed to be a project that anyone can edit.
- I've beaten this drum about over complication, before, and I contribute to Wikidata (4000 edits), and it falls on deaf ears.
- There's too much of high tech people here, making subject matter complicated and difficult to edit. I'm still waiting for Category:Gartenlaube (Magazine)'s open architecture to be restored. All new uploads there, are incompatible with the established closed off format.
- Even today I discovered a source template for a major library that is no longer viable, because the library changed its catalog system rendering our source template into a link to an enormous pile of link rot. I fail to understand how people, make templates or such, then not put in place the mechanisms for continuous maintenance, or just walk away from them. Broichmore (talk) 13:21, 29 August 2024 (UTC)
- I think I disagree with some premises here. Respectively, to the bullet list of opinions given by User:ReneeWrites:
- Don't forget the multilingual aspect of this. Repeating the category name in other languages can be very useful.
- I get what you say about the infobox, but sometimes a tight piece of prose is easier to skim quickly.
- Not everyone will be navigating down the hierarchy. I think it would be ridiculous, for example, for someone navigating up from a village to have to navigate many, many layers up the hierarchy to find out what country it is in.
- I could perfectly well accept the last point, with one possible exception: on topics where there is a serious limitation on commercially available material, it can be useful to have a link to a trove of NC content on (for example) Flickr. Not sure whether that is even an exception, maybe more of a clarification.
- I'd also add: I think somewhat longer than usual descriptions are often useful for topics not likely to get a Wikipedia article. Example: Category:Court in the Square.
- Further: the main problem with the subcats of Category:Van Goghkerkje is that there are this big batch of "by year" categories, mostly with one or two photos, and with little or no differences from year to year to suggest that is a useful way to subcat this. - Jmabel ! talk 14:03, 29 August 2024 (UTC)
- Re. "the multilingual aspect" - surely that should be handled by Wikidata? Wikidata already has extensive support for multilingual desriptions of entities, and those descriptions should get pulled in by the Wikidata infobox. Writing those descriptions locally on Commons shouldn't be necessary. Omphalographer (talk) 19:16, 29 August 2024 (UTC)
- Maybe you could explain how it should be happening for the samples where Renee writes that [Dutch] is super-flous.
∞∞ Enhancing999 (talk) 19:25, 29 August 2024 (UTC)
- Maybe you could explain how it should be happening for the samples where Renee writes that [Dutch] is super-flous.
- Regarding 4, I think this is a good point and a good way to add nuance to the topic. I'm not against removing all external links, though I didn't really clarify what I meant with "personal websites", and I think the exception you mention is reasonable. The website in particular I was thinking of was https://breda-en-omgeving.nl/ which you can find linked numerous times in the categories above (6 times in Geertruidenberg, 7 times in Heusden, as well as many other categories linked in the navigation template at the top). I'll also ping @Prototyperspective: because this is relevant to a thread below. ReneeWrites (talk) 11:45, 30 August 2024 (UTC)
- Re. "the multilingual aspect" - surely that should be handled by Wikidata? Wikidata already has extensive support for multilingual desriptions of entities, and those descriptions should get pulled in by the Wikidata infobox. Writing those descriptions locally on Commons shouldn't be necessary. Omphalographer (talk) 19:16, 29 August 2024 (UTC)
- Support ReneeWrites opinion about it. Although I think Jmabel makes some good points, but I don't see them as mutually exclusive or applicable in every situation. Just to give an example from Jmabel's comment, don't forget the multilingual aspect of this, I'd say it depends on the topic of the category and what level down it is. For instance if we're talking about shoes and the person is browsing through Category:Shoes to find images of them then I don't think it's necessary or helpful to have the word "shoes" translated into every single language possible just because this is a multilingual project or whatever. It's a pretty good bet that most users know what a shoe looks like and has heard the word in English before. So there's zero reason to translate it or to have a description, in English or any other language. "Shoes are things you wear on your feet." No really? We don't need that in English, let alone 10 other languages.
- Maybe you'll say that's a bad example since the category for shoes doesn't have a description to begin with though. But there's plenty of categories for extremely obvious universally known subjects out there that have totally pointless descriptions in multiple languages. So essentially I agree with ReneeWrites opinion about it. Although I think Jmabel makes a few good points, but multilingual thing shouldn't be a free pass to create descriptions that are otherwise totally pointless for whatever reason. --Adamant1 (talk) 17:22, 29 August 2024 (UTC)
- @Adamant1: Again, though, you are assuming they got there by following the category tree from a more abstract category. They could just as easily have arrived from a category on an individual photo. - Jmabel ! talk 18:51, 29 August 2024 (UTC)
- @Jmabel: Sure, but if someone gets to Category:Shoes from an individual photo of something that's not a shoe then there's other problems involved that categories having descriptions or not has nothing to do with. The only way to deal with people confused about where they are in the category structure at any given time is to categorize images properly. You can't make up for or fix that by putting "chaussure" at the top of a category. --Adamant1 (talk) 18:59, 29 August 2024 (UTC)
- @Adamant1: "other problems involved" Well, yes. And if they are, for example, a Chinese-speaking editor, it would be useful if they don't have to to running way up the category hierarchy to work out that it was an error.
- Some of this can be solved if there is a corresponding Wikidata item, but the more narrow the category, the less likely to have a Wikidata item, so the more important mere translation of the category name may be. - Jmabel ! talk 19:53, 29 August 2024 (UTC)
- @Jmabel: I don't even disagree with that, which is why the used an extremely general topic like shoes as an exmple. It's certainly a different thing altogether the further down in the categories some goes. I'm not sure what the solution to that is but I still think ReneeWrites' proposal still makes sense with broader topics. Maybe there could be a cavit to it about how to handle categories for more obscure topics though. --Adamant1 (talk) 20:00, 29 August 2024 (UTC)
- More so, they could be a logged-out user who doesn't see any categories displayed at the bottom of the page, because the display of categories is something you need to activate in your settings after you've created an account. Or they could be inexperienced in navigation the Wikimedias. They might not know how to access Wikidata for more information on the subject (because how would a non-Wikipedian know that you need to click that number that starts with the letter Q to get to a page from which you can access the relevant Wiki article in different languages?). Or the non-Wikipedia is someone who's accessing Commons via mobile browser where the Wikidata Infobox is almost completely useless because it only lists the information that is in Wikidata's parameter "instance of" or "occupation" but is otherwise just empty. Nakonana (talk) 21:07, 16 September 2024 (UTC)
- @Jmabel: Sure, but if someone gets to Category:Shoes from an individual photo of something that's not a shoe then there's other problems involved that categories having descriptions or not has nothing to do with. The only way to deal with people confused about where they are in the category structure at any given time is to categorize images properly. You can't make up for or fix that by putting "chaussure" at the top of a category. --Adamant1 (talk) 18:59, 29 August 2024 (UTC)
- @Adamant1: Again, though, you are assuming they got there by following the category tree from a more abstract category. They could just as easily have arrived from a category on an individual photo. - Jmabel ! talk 18:51, 29 August 2024 (UTC)
- Support what ReneeWrites wrote. In the vast majority of cases, categories either have a Wikidata infobox that makes the subject clear (and can support multiple languages far more cleanly than Commons), or the subject is obvious from the category name. Descriptions should be reserved for categories where additional clarification is useful (eg Category:Granby station and Category:Trolley Brook Trail) and where descriptions provide important context for subjects without Wikidata items (eg Category:Central Subway shuttle trains and Category:South Ashburnham switchback). Pi.1415926535 (talk) 19:13, 29 August 2024 (UTC)
- Comment Generally agree except for the point 4. For example a link to a github repo may not be a reliable source at WP but still relevant and useful in a category about the software without a Wikidata infobox. Another problem is that several links in the Wikidata item don't show up in the infobox and something should be done about it there instead of adding a link also to official website (example: links to mediawiki.org in the "Multilingual sites"). Another caveat is that I don't know how Web search engine indexing works and that "No repeating of information that can be found in a parent category" is ambiguous – if you mean useful info that is not self-explanatory should be excluded in a category just because it's already in the parent category then I disagree (e.g. people may have gone directly to that cat from search results or a file instead of having seen the parent cat earlier). Prototyperspective (talk) 19:46, 29 August 2024 (UTC)
- I don't think Github is a very good example. A Github page isn't a personal website, and it's perfectly acceptable to use as a source on Wikipedia (there's even a template for it: Template:GitHub). Github can't be used as a source to establish notability however. ReneeWrites (talk) 08:03, 30 August 2024 (UTC)
- I think it's a good example because it illustrates well how websites considered nonreliable can be very appropriate and useful to have there. You agree that it can be appropriate so it's evidently a good example. That Wikipedia template is about external links, not references. I think it's used sometimes as a reference when the article subject is the software but it's not generally considered reliable. There are more possible examples like a category about some small organization linking to the organization's website and so on. There are many more cases where some website that would be considered unreliable would be very useful so at a minimum one would need to change that part, especially be considered reliable sources on other Wikiprojects, but I think it may be best if this point was removed or turned into a recommendation about what is usually (instead of always) the case. I'm still unsure what is meant with repeating info already in parent cat. However, I haven't seen any cases of such anyway but it doesn't seem like a good requirement. Prototyperspective (talk) 10:22, 30 August 2024 (UTC)
- I don't think Github is a very good example. A Github page isn't a personal website, and it's perfectly acceptable to use as a source on Wikipedia (there's even a template for it: Template:GitHub). Github can't be used as a source to establish notability however. ReneeWrites (talk) 08:03, 30 August 2024 (UTC)
- Support Most of what has been said before. Thanks for starting this discussion. Maybe the four points of ReneeWrites just need some nuance. My remarks:
- This does not concern only categories about the province of North Brabant, but for many location categories of the Netherlands. I have seen many of those descriptions elsewhere too.
- For me they are redundant and annoying, but I am not the target group: perhaps to occasional visitors they might give useful information, especially if they are foreigners.
- To limit obvious redundant information: No repeating of information that can be found in an infobox. But then there should be a Wikidata infobox. For subcategories like "in" categories (for countries, "streets in" and so on) and years, there usually are no, and should not be either (except for when there is a Wikipedia article).
- I prefer "ABC" is a .. as well. The shorter, the better.
- I agree with Jmabel: Repeating the category name in other languages can be very useful. Ever since I discovered in an English discussion about a Dutch subject that a Dutch participant had trouble discussing in English, I translate categories about the Netherlands into Dutch (unless there is a Wikidata item or it is a "by" category). And sometimes there is more explaning to do because the English word has no equivalent in Dutch (see for instance Category:Estates in the Netherlands) or the English word looks like a Dutch one, but means something else.
- So in general:
- Every topic category should have either a Wikidata item or a description. Exceptions: if the category name is about a universally known subject (like "shoes"). In case of doubt: add a description in English, other languages are allowed too.
- No repeating of information that can be found in a corresponding Wikidata infobox.
- If there is no Wikidata item: Add descriptions to categories for countries that are not native-English speaking countries, preferably in their native language.
- Keep descriptions as short as possible. Add a link to the corresponding Wikipedia article in the native language if that is appropriate (especially in case of country categories). You may add links to other relevant websites that you trust if there is no Wikidata item and the website is within the scope of Commons (no avertising, and so on).
- Repeating of information that can be found in the category name ("The category "Streets of Geertruidenberg" is for pictures of streets of the municipality of Geertruidenberg...") should be avoided as much as possible. But sometimes it can be useful, even in this case: Geertruidenberg is a municipality AND a populated place. It is good to know for which one the category is (though it would be better to have it in the category name).
- --JopkeB (talk) 07:41, 30 August 2024 (UTC)
- What do you think of the samples Chicago and London?
∞∞ Enhancing999 (talk) 10:21, 30 August 2024 (UTC)
- What do you think of the samples Chicago and London?
BTW, if anyone wants to see a good example of major overkill when it comes to a category description check out Category:Louvre Museum - Room 185 and scroll down past the infobox. I assume everyone here would agree that something like that is way to excessive. --Adamant1 (talk) 05:49, 31 August 2024 (UTC)
- Not only is that wildly excessive, it seems like an unreliable basis for categorization. The museum can reorganize their collection; Commons shouldn't have to shuffle a bunch of categories around (or argue about what objects used to be where) when that happens. Omphalographer (talk) 06:15, 31 August 2024 (UTC)
Category instructions
There are descriptions, but sometimes instructions are also usefull. Examples: Category:Rail vehicle doors: Please place images of closed train doors only if they are prominent. All passenger trains photografed from the side have train doors. There are subcategories for open doors. Other example: Category:Rail vehicle sliding-plug doors: The door slides outside the vehicle by closing is pulled in at the last moment inside (plug). There is no supporting structure outside the dooropening and a closed door is flush with the vehicle surface. animation and rail door systems. This category is sorted by country. (Spain and Sweden under the letter 'S'). Train material door types can be determined as soon as there are enough good examples of 'open' doors to classify. In the Commons the definition of train doors is broader than in the NL article nl:Treindeur. Outside doors not used by passengers are excluded (for example drivers or loading doors) or aisle doors.Smiley.toerist (talk) 13:45, 6 September 2024 (UTC)
Coordinates
I just want to add one aspect or make it clear as it is already contained in No repeating of information that can be found in an infobox. Object coordinates should not be added to categories / should be consolidated and removed, if any. --Herzi Pinki (talk) 22:32, 6 September 2024 (UTC)
- @Herzi Pinki: can you point me to anywhere that was decided as policy? Last I remember this being discussed (a few years ago) the opposite conclusion was reached: that it was useful to have Commons and Wikidata both contain location information, because the duplication was very useful in identifying possible vandalism, since a bot would notice the discrepancy, allowing for easy review. There could well have been a change, but if so I missed it. - Jmabel ! talk 14:04, 7 September 2024 (UTC)
- No, nothing has changed. We don't trust. But I just wanted to point out that as a consequence of the above No repeating of information that can be found in an infobox, this will also affect the policy on coordinates. best --Herzi Pinki (talk) 14:40, 7 September 2024 (UTC)
- The above is discussion is a bit odd, because it says that and people agree that including samples that do exactly the opposite. Go figure. Except when it's translated into Dutch. So local languages are ok, except Dutch.
- Actually, it's a mistaken assumption about coordinates that the infobox offers the same. Some users might find some parts of it, most don't that see that. I think we discussed it before. Obviously, we could integrate coordinates better in the interface.
∞∞ Enhancing999 (talk) 20:03, 7 September 2024 (UTC)
- No, nothing has changed. We don't trust. But I just wanted to point out that as a consequence of the above No repeating of information that can be found in an infobox, this will also affect the policy on coordinates. best --Herzi Pinki (talk) 14:40, 7 September 2024 (UTC)
- The Wikidata Infobox information is not visible to mobile users. The coordinates from an object location template, however, are visible in mobile browser. The whole Wikidata Infobox things just doesn't work for mobile users, and if I remember correctly, most users are mobile users? So we shouldn't rely too much on that infobox. Nakonana (talk) 21:17, 16 September 2024 (UTC)
- If indeed Wikidata Infobox doesn't work for mobile users, that would be a clincher to keep the geocoordinates explicitly on the category page. - Jmabel ! talk 19:28, 17 September 2024 (UTC)
- If you want to check it out, here is a category that has the object location template in addition to the Wikidata Infobox template with coordinates. I checked with Firefox mobile browser and a Chromium based mobile browser, and the only visible coordinates are the one of the object location template. Nakonana (talk) 21:37, 17 September 2024 (UTC)
- It looks as though this is a deliberate behavior - Module:Wikidata Infobox#L-218--L-220 intentionally hides infobox rows on mobile devices unless it's specifically directed to show them. Want to start a discussion on its talk page about changing that?
- As an aside, the combination of the Wikidata infobox and the object location template is really unpleasant on desktop. The object location creates a horizontal "wall" across the page which pushes all of the category contents below the infobox, which can make the page look empty if the infobox is large. Omphalographer (talk) 23:35, 17 September 2024 (UTC)
- How does one specifically direct it to show the rows as a user? The infobox only has a "collapse" button, but no "expand" button. It would certainly be helpful if one would at least have the option to expand the infobox on mobile. Nakonana (talk) 00:22, 18 September 2024 (UTC)
- As far as I'm aware, you can't. And that's the problem. Omphalographer (talk) 01:46, 18 September 2024 (UTC)
- Just discovered that there's already a somewhat related discussion on the talk page.: Template talk:Wikidata Infobox#Auto-collapse the Wikidata Infobox when browsing Commons on Mobile. Nakonana (talk) 00:37, 18 September 2024 (UTC)
- How does one specifically direct it to show the rows as a user? The infobox only has a "collapse" button, but no "expand" button. It would certainly be helpful if one would at least have the option to expand the infobox on mobile. Nakonana (talk) 00:22, 18 September 2024 (UTC)
- If you want to check it out, here is a category that has the object location template in addition to the Wikidata Infobox template with coordinates. I checked with Firefox mobile browser and a Chromium based mobile browser, and the only visible coordinates are the one of the object location template. Nakonana (talk) 21:37, 17 September 2024 (UTC)
- If indeed Wikidata Infobox doesn't work for mobile users, that would be a clincher to keep the geocoordinates explicitly on the category page. - Jmabel ! talk 19:28, 17 September 2024 (UTC)
Family trees (see below )
There is a discussion below if family trees should cover up the category pages or not, see #Familytree.
∞∞ Enhancing999 (talk) 20:03, 7 September 2024 (UTC)
Example images for subcategories
I would appreciate a typical sample image for each subcategory to ease correct assignment to subcategories. Of course not for standard subcategories like 'Views of', 'Nature of', 'Buildings in', etc. This should be done by marking an image of the subcategory in a suitable way (by a dedicated cat, by QI?) and done automatically when generating the category pages. best --Herzi Pinki (talk) 18:13, 9 September 2024 (UTC)
- Maybe there could be a gadget that shows an image when hovering the subcat (for longer than 0.2 s). However, I think showing four images instead of just one would be best. For categories that are linked to a Wikidata item, selecting a typical representative fairly high-quality is simple: the one(s) that are set in the image property of the item. However there are many categories without Wikidata item (or with item but no image set there). In regards to your use-case however, the category title should be fairly self-explanatory so I think the use of that would be quite limited (but this and other potential usecases could still be useful enough for having some gadget that enables that). Prototyperspective (talk) 20:25, 9 September 2024 (UTC)
- Don't take this the wrong way, but what's the ultimate end game here and how exactly does it relate to the topic of the discussion, "category descriptions"? Because I really fail to see the connection between that and your suggestion for a gadget that shows an image when hovering the subcat. I don't think anything ReneeWrites said in their original comment would be negated or effected by your idea. --Adamant1 (talk) 00:58, 10 September 2024 (UTC)
- Could be interesting. Ideally without too much need for configuration.
∞∞ Enhancing999 (talk) 03:08, 13 September 2024 (UTC)
Creator template
At Category:Alexander Hamilton, there is the Creator:Alexander Hamilton in addition to the Wikidata infobox. Is this needed? I suggest we remove or collapse them on categories with an infobox filled from Wikidata.
∞∞ Enhancing999 (talk) 03:08, 13 September 2024 (UTC)
- Open the category in a mobile browser and see the difference between how helpful the Creator template is in contrast to the quite useless Wikidata Infobox. Nakonana (talk) 21:22, 16 September 2024 (UTC)
- That seems like something that should be resolved regardless of if there's creator templates in categories or not, but at least IMO creator templates are totally pointless and just get in the way when they are being used in categories. I don't think that's what they were created for either. So should really just be gotten rid of. Sucks for mobile users sure, but if I were to guess most mobile users don't find images through categories anyway since they don't show up on mobile to begin with. There's no point in having a feature for a non-exiting costumer. Especially if its screwing up everyone else's experiences in the meantime. --Adamant1 (talk) 21:40, 16 September 2024 (UTC)
- Categories do show up for mobile users who are logged in. And the Creator templates does show up for mobile users who are not logged in, too. The infobox doesn't show up no matter whether logged in or not. Ultimately, it's just that the Wikidata Infobox template should be adjusted to work for mobile users. That would definitely render the Creator template redundant, and also solve other problems. Nakonana (talk) 22:14, 16 September 2024 (UTC)
- Have you checked Phabricator to see if there's a ticket about it? It might be worth opening one if no one else has yet. --Adamant1 (talk) 22:42, 16 September 2024 (UTC)
- No, I haven't, I'm not familiar with Wikimedia venues to report stuff. But I think that they are already aware of the issue because the infobox you currently see is actually the improved one. IIrc, a few months ago, there was no infobox at all when using mobile browser. Now there's at least an incomplete one. Nakonana (talk) 23:31, 16 September 2024 (UTC)
- Well at least it sounds like their working on it then. It would suck if we decided something now based on the infoboxes not showing up if their just going to fix it at some point. --Adamant1 (talk) 00:50, 18 September 2024 (UTC)
- No, I haven't, I'm not familiar with Wikimedia venues to report stuff. But I think that they are already aware of the issue because the infobox you currently see is actually the improved one. IIrc, a few months ago, there was no infobox at all when using mobile browser. Now there's at least an incomplete one. Nakonana (talk) 23:31, 16 September 2024 (UTC)
- Have you checked Phabricator to see if there's a ticket about it? It might be worth opening one if no one else has yet. --Adamant1 (talk) 22:42, 16 September 2024 (UTC)
- Categories do show up for mobile users who are logged in. And the Creator templates does show up for mobile users who are not logged in, too. The infobox doesn't show up no matter whether logged in or not. Ultimately, it's just that the Wikidata Infobox template should be adjusted to work for mobile users. That would definitely render the Creator template redundant, and also solve other problems. Nakonana (talk) 22:14, 16 September 2024 (UTC)
- That seems like something that should be resolved regardless of if there's creator templates in categories or not, but at least IMO creator templates are totally pointless and just get in the way when they are being used in categories. I don't think that's what they were created for either. So should really just be gotten rid of. Sucks for mobile users sure, but if I were to guess most mobile users don't find images through categories anyway since they don't show up on mobile to begin with. There's no point in having a feature for a non-exiting costumer. Especially if its screwing up everyone else's experiences in the meantime. --Adamant1 (talk) 21:40, 16 September 2024 (UTC)
September 01
Abusefilter to prevent uncategorised pages?
are all pages in main and category namespaces supposed to be categorised? if so then perhaps an abusefilter can be created to prevent creation of such pages without any "Category" or "{{" in source? (pages are categorised either directly or by transcluding a template.) RZuo (talk) 11:12, 1 September 2024 (UTC)
- The ones on Commons:Report UncategorizedCategories with infobox all have "{{".
- To update the list I have to purge recent creations to ensure those connected to Wikidata, but without edits since don't end up there.
- For files, everything just ends up in Category:All_media_needing_categories_as_of_2024. Do users still get notified when they don't categorize?
∞∞ Enhancing999 (talk) 11:38, 1 September 2024 (UTC)
- My experience is that especially some bulk bot uploads from external sources (like flickr) are not properly categorized or not categorized at all. While inexperienced users fail to categorize quite often, they do not upload such huge numbers of media to commons. So coping with lacking categories by inexperienced users is manageable. But bulk uploaders can work around such an abusefilter by adding some not helpful category like Category:Files from xxxx stream on flickr needing categories and not taking care for proper categorization. Even if bulk uploaders do proper categorization, they will do this in a second step after uploading - an abusefilter will just disable their usual procedures.
- for the cat mentioned above, https://wikimap.toolforge.org/?cat=All_media_needing_categories_as_of_2024 will show (after a while) images on the map and then it is easy to care for your area of interest. best --Herzi Pinki (talk) 21:54, 6 September 2024 (UTC)
- "pages in main and category namespaces". not files. RZuo (talk) 09:29, 9 September 2024 (UTC)
- Yes, but the approach used for files could be applied to categories as well. There already is a (mostly empty) Category:Uncategorized categories.
∞∞ Enhancing999 (talk) 09:35, 9 September 2024 (UTC)- Category:Uncategorized categories is "mostly empty" at the moment, but only because it was recently winnowed down from over 1500 categories. A year ago, it had ballooned to about 8000, and those past 5000 in alphabetical order couldn't even be seen in the list. - Jmabel ! talk 06:27, 10 September 2024 (UTC)
- You mean Special:UncategorizedCategories, not Category:Uncategorized categories, isn't it?
∞∞ Enhancing999 (talk) 06:42, 10 September 2024 (UTC)- @Enhancing999: Oops, you are correct, I should have clicked before assuming I knew what you meant. - Jmabel ! talk 21:30, 13 September 2024 (UTC)
- You mean Special:UncategorizedCategories, not Category:Uncategorized categories, isn't it?
- Category:Uncategorized categories is "mostly empty" at the moment, but only because it was recently winnowed down from over 1500 categories. A year ago, it had ballooned to about 8000, and those past 5000 in alphabetical order couldn't even be seen in the list. - Jmabel ! talk 06:27, 10 September 2024 (UTC)
- Yes, but the approach used for files could be applied to categories as well. There already is a (mostly empty) Category:Uncategorized categories.
September 05
Familytree
@Enhancing999: See: Category:Byington Ford A change was just made to collapse the family tree function in categories as the default. Now it isn't obvious that the navigation device is there. What are people's preferences? Which is the most useful? RAN (talk) 13:14, 5 September 2024 (UTC)
- I don't recall ever seeing this before and don't see the purpose of it here. If I want to know genealogical information, Wikidata and Wikipedia are appropriate for that. I guess that it's useful for navigating within the same family, but that is pretty niche and probably not something I would ever use. That said, I generally don't like auto-collapsing any actual content and the default should be for these to be uncollapsed. —Justin (koavf)❤T☮C☺M☯ 13:16, 5 September 2024 (UTC)
- There was a bug in the classes used by the template, see Template_talk:Wikidata/FamilyTree#class="collapsible_autocollapse"_not_working. This was fixed today.
∞∞ Enhancing999 (talk) 13:19, 5 September 2024 (UTC)- I don't see how that responds to what I wrote, but it does seem to respond to the original comment. How is what you wrote relevant to what I wrote? —Justin (koavf)❤T☮C☺M☯ 13:47, 5 September 2024 (UTC)
- You write that you prefer it in an uncollapsed form. If you read my comment and its link to additional discussion then you discover that your preference leads to category pages being replaced by family trees. I think this response both to the original comment and yours.
∞∞ Enhancing999 (talk) 13:50, 5 September 2024 (UTC)- Thanks. —Justin (koavf)❤T☮C☺M☯ 14:06, 5 September 2024 (UTC)
- You write that you prefer it in an uncollapsed form. If you read my comment and its link to additional discussion then you discover that your preference leads to category pages being replaced by family trees. I think this response both to the original comment and yours.
- I don't see how that responds to what I wrote, but it does seem to respond to the original comment. How is what you wrote relevant to what I wrote? —Justin (koavf)❤T☮C☺M☯ 13:47, 5 September 2024 (UTC)
- There was a bug in the classes used by the template, see Template_talk:Wikidata/FamilyTree#class="collapsible_autocollapse"_not_working. This was fixed today.
- I think it should be collapsed if it is to be included in category pages. That's not really what category pages are for, not how or where people look for such data and it hides category contents (mainly subcats and files) as well as cause UI issues. Another example. Prototyperspective (talk) 13:44, 5 September 2024 (UTC)
- The new template has been around since 2019, replacing the hand built trees that been around for a decade or more. You only see them where the person is a member of a large family such as in royal/noble lines. It isn't decorative, it is a navigation device. It sometimes is the only way to see that there is a disconnection in the family, caused by incorrect merges or the deletion of someone in the family when someone doesn't realize it has a structural need. The problem is that in the current collapsed state it is not obvious that a tree is present. --RAN (talk) 16:59, 5 September 2024 (UTC)
- The problem is that in the current collapsed state it is not obvious that a tree is present. Because the way it's wrapped into a collapsed box is having it show only the text "Byington Ford (Q5004096) [Expand]" (in this example) instead of e.g. "Show family tree [Expand]". The name is already in the cat so redundant and the Q number is not useful to the reader and also already on the page via the infobox. Prototyperspective (talk) 17:07, 5 September 2024 (UTC)
- I think "autocollapse" is the old version for collapsed state. It must have been designed to be collapsed.
- The header of the template should be improved. Merely using the name of a person and a quid doesn't really explain what it's meant to be. This isn't really better when it's expanded and covers the entire screen.
∞∞ Enhancing999 (talk) 17:08, 5 September 2024 (UTC)
- See: Category:Thomas_Jefferson where consensus was to have the tree collapsed so a collapse function was wrapped around the tree, now it is doubly collapsed. --RAN (talk) 17:15, 5 September 2024 (UTC)
- Seems the problem was known, but incorrectly fixed.
∞∞ Enhancing999 (talk) 17:19, 5 September 2024 (UTC)
- Seems the problem was known, but incorrectly fixed.
- One thing I like about the wrapper on Category:Thomas Jefferson is that it actually says "Family Tree". If that text got added in the template header it would already tell people a lot more on what the information is about before/without having to expand it. ReneeWrites (talk) 18:20, 5 September 2024 (UTC)
- Having it say "Family Tree" is great, having it doubly collapsed is probably something we should fix. - Jmabel ! talk 18:58, 5 September 2024 (UTC)
- Looking at the Thomas Jefferson example: do I understand correctly that the tree reads right-to-left, and that for the central figure there is no indication who is the other parent of their various children, nor is there any indication of legitimate vs. illegitimate offspring? - Jmabel ! talk 19:03, 5 September 2024 (UTC)
- Oh my goodness, this is heinous: now there are two collapsed boxes and you have to vertically scroll within a div?!?!? This is inaccessible and poor web design, folks. —Justin (koavf)❤T☮C☺M☯ 19:11, 5 September 2024 (UTC)
- About the label issue, it's seems that the default text is "Family Tree (tree of ancestors and descendants)", but somehow this is overwritten sometimes by the QID. It seems that the template is just not used as designed: Sample at Category:Ulrica Christina Wellingk, the default is overwritten with |title={{Q|Q104549892}}.
∞∞ Enhancing999 (talk) 19:15, 5 September 2024 (UTC)- Jeez. At least that could be
|title={{Q|Q104549892}} family tree
. - Jmabel ! talk 21:14, 5 September 2024 (UTC)- For translation purposes, it would be better if that label came directly from the template. In the sample, |entityId=Q104549892|title={{Q|Q104549892}} could be limited to |entityId=Q104549892, if the QID isn't determined directly from the category.
- To sum it up: the following maintenance seems to be needed:
- (1) fix the label in the template
- (2) remove |title={{Q|Qnnnn}}
- (3) remove the duplicate collapsing (as on Category:Thomas Jefferson)
∞∞ Enhancing999 (talk) 08:28, 6 September 2024 (UTC)
- Jeez. At least that could be
- About the label issue, it's seems that the default text is "Family Tree (tree of ancestors and descendants)", but somehow this is overwritten sometimes by the QID. It seems that the template is just not used as designed: Sample at Category:Ulrica Christina Wellingk, the default is overwritten with |title={{Q|Q104549892}}.
- I think Jmabel has the fix which solves 1 and 2. --RAN (talk) 04:55, 7 September 2024 (UTC)
- That might work for English, but not for any other language.
∞∞ Enhancing999 (talk) 10:20, 7 September 2024 (UTC)
- That might work for English, but not for any other language.
- Now Category:Henderikus Cashow has the title: "Category:Neighbourhoods in Heerenveen" --RAN (talk) 07:00, 7 September 2024 (UTC) fixed
- Changes I would like to see:
- 1 only have to add in Q number once, instead of twice and have title be "Q family tree" appended. --RAN (talk) 15:58, 7 September 2024 (UTC)
- 2 have collapsed be default but allow that to be changed manually to open. We have Categories called X family or X noble family, here you expect to see the tree by default. --RAN (talk) 15:58, 7 September 2024 (UTC)
- 3 if using the year function, take out the line space between the two so it is more compact and put the years in parentheses to match our style guide. --RAN (talk) 15:58, 7 September 2024 (UTC)
- I would say always collapsed. This is a media repository; family trees are metadata that we may keep here for convenience, but they should not dominate the page. It's easy enough to open it if that is what you want. - Jmabel ! talk 23:03, 7 September 2024 (UTC)
- I don't see how to make any individual tree not-collapsed for the display when a category is opened. When I am working on a noble family, it would nice to have the tree open while I am working on it. RAN (talk) 02:09, 9 September 2024 (UTC)
- @Richard Arthur Norton (1958- ): So for your particular purpose, you can click once to open it. But most people go to a category looking for media. - Jmabel ! talk 06:40, 10 September 2024 (UTC)
- I don't see how to make any individual tree not-collapsed for the display when a category is opened. When I am working on a noble family, it would nice to have the tree open while I am working on it. RAN (talk) 02:09, 9 September 2024 (UTC)
- A. I did some cleanup on the duplicated "collapsing". This should be mostly fixed.
- B. Also, titles now behave as described at Template:Wikidata/FamilyTree#Title_of_family_tree_box. It also mentions how to improve localization for a given language. A few cases with other titles can be found at Special:Search/Category: hastemplate:"Wikidata/FamilyTree" -"Family tree of", notably when the "entityID" doesn't match the item in the title.
- C. I noticed a few persons where the corresponding Wikidata item doesn't exist or was deleted: such as Category:Marion Boisot, Category:Jacob Ford (1759-1845). Supposedly the template should report some error there.
∞∞ Enhancing999 (talk) 13:13, 15 September 2024 (UTC)
- I would say always collapsed. This is a media repository; family trees are metadata that we may keep here for convenience, but they should not dominate the page. It's easy enough to open it if that is what you want. - Jmabel ! talk 23:03, 7 September 2024 (UTC)
September 06
Categorizing Newspapers by date (YYYY-MM-DD)
Hi, there are a few newspaper collections here on Commons that are categorized by date, for example the Abilene Daily Reporter. Other newspapers are not categorized this way, which makes it harder to find contemporary news. With some newspapers, the amount of files is large enough that I think this work could be delegated to a bot. For starters, there is The Ohio Sentinel, with conveniently labelled files like File:Ohio Sentinel 1952-06-28 - DPLA.... Older files of the same newspaper from 1951 are less convenient: File:Ohio Sentinel August 11-August 18, 1951 - DPLA.... Then there is the Galveston Tribune with file names like File:Galveston Tribune. (Galveston, Tex.), Vol. 17, No. 100, Ed. 1 Tuesday, March 16, 1897 - DPLA.... More examples: File:Gazeta săteanului 1886-05-05, nr. 07.pdf, File:Epoca 1886-05-18, nr. 146.pdf, File:Bukarester Tagblatt 1886-05-02, nr. 095.pdf, File:Томские губернские ведомости, 1886 № 19 (1886-05-15).pdf, Journal de La Haye 03-01-1847 (IA ddd 010258178 mpeg21).pdf.
Is bot-processing feasible? --Enyavar (talk) 16:50, 6 September 2024 (UTC)
- There seems to be {{Book}} (most likely) with
{{{publication date}}}
set to the date. Assuming it is correctly done, or corrected beforehand, a bot can easily categorize that info -- DaxServer (talk) 17:22, 6 September 2024 (UTC)- Why not use Wikidata en SD? Newspaper issues can be itemized in Wikidata with publication date (P577). Examples d:Q45747062 (Category:Journal de Bruxelles nr 76), d:Q46834135 (Category:Journal de Bruxelles nr 83), d:Q62015763 (Category:Journal de Bruxelles nr 90) Smiley.toerist (talk) 08:55, 9 September 2024 (UTC)
- Hi, whatever "en SD" is, it seems to not help at all with search results. When I search for "1799-12-17", the Journal de Bruxelles does not appear in the search results (with quote marks not at all, without quote marks not before literally tens of thousands of other manuscripts from different dates, some of them decades/centuries earlier/later than 1799. Furthermore, the Category:1799-12-17 is so empty that it appears as if Commons does not have any media regarding that specific date. Only your link proves the contrary. So I think that WD-en-SD could also benefit from some categorization.
- What good are the vast newspaper archives of Commons if they are buried and inaccessible unless you know the title of an accessible newspaper? Please tell me what was in the news around the world, on 1844-02-20 or another random date. --Enyavar (talk) 05:38, 10 September 2024 (UTC)
- Why not use Wikidata en SD? Newspaper issues can be itemized in Wikidata with publication date (P577). Examples d:Q45747062 (Category:Journal de Bruxelles nr 76), d:Q46834135 (Category:Journal de Bruxelles nr 83), d:Q62015763 (Category:Journal de Bruxelles nr 90) Smiley.toerist (talk) 08:55, 9 September 2024 (UTC)
- It is in the category 1799 newspapers. I have my doubts that a specific publishing date is of any use in researching a specific event. At that time there was no instant communication, and all news travelled slowly. So in practice the newspaper got dispatches from different places and times. In the issue of 76 (12 december 1799), there are many dates and places: al in the year 1799:
- Paris: 3-12 (2X), 29-11, 27-11
- 20-11 (Marseille), 25-11 (Nice)
- Genua 8-11, Florence 18-11
- Lausanne 23-11, Bern 23-11, Basel 25-11, Zurich 25-11
- Leiden 30-11
- London 21-11
- Frankfurt 24-11, Rastadt 29-11
- Stockholm 15-11
- Sint Petersburg (Russia) 9-11
- Istanbul (Constantinople) 29-10
- Ratisbonne 28-11 (latest incoming news in the morning)
- If you looking for a specific event, you must look at all issues, months after. News across the Atlantic could even take more than a month. Smiley.toerist (talk) 09:13, 10 September 2024 (UTC)
- It is in the category 1799 newspapers. I have my doubts that a specific publishing date is of any use in researching a specific event. At that time there was no instant communication, and all news travelled slowly. So in practice the newspaper got dispatches from different places and times. In the issue of 76 (12 december 1799), there are many dates and places: al in the year 1799:
- That's true, but who expects pre-telegraph newspapers to print news from the other side of the globe? That is why I strongly suspect that the issue of long news-travel times stops being this noteworthy after the 1870s. And even the weird French Revolutionary Calendar dates you give above, are not that problematic.
- The thing is that Wikidata is not working either: Yes, Q46834135 = [Journal de Bruxelles (1790-1800)/83-1799] exists and has a publishing date. Now what? There are no links to any other media of the same publishing date, nor are there links to previous or later issues of the JdB itself. The publishing date (P577) is apparently not even meant to be a searchable item on wikidata: "P577 = 17 December 1799" --> No match found. "P577 = 1799-12-17" --> No match found either.
- The search function here on Commons actually works: My search for "date = 1897-03-16" immediately spits out a few newspapers as a search result - thankfully - but that still does not mean that the search results would match my request (1897-03-04 and 1897-07-16 are among the results), and newspapers without proper description attributes ("date = 1897" and March 16 only in the file title) are again lost to the search filter. The search function can only help that much, after all.
- Furthermore when relying on search, it remains impossible to browse between different newspapers that were released on the same date. Nor is it easily possible to go a few days forward or backward. Take for example The Halletsville Herald, The Abilene Reporter and The Daily Hesperian (I found all of them with the query above): Neither of them is meaningfully categorized at all; while the Galveston Tribune is at least categorized by year. And the categories are certainly not sorted by date either. --Enyavar (talk) 14:23, 10 September 2024 (UTC)
The Journal de Bruxelles is a nice and precious project of nine preserved issues. Categorizing less than 100 categories/files is a task that a dedicated editor can quickly tackle. Sorting all available newspapers on Commons by publication date is a different thing, and I would not suggest doing so if we only had a few isolated copies of each paper.
- Many big and small newspapers have their own categories, and files with the newspaper issues often follow standardized patterns with regards to descriptions/filenames, like the ones I linked above. Some of them have hundreds and thousands of digitized issues.
- Many thousands of files already have the attribute of "date = YYYY-MM-DD", right here either in the description on Commons, or in the filename like those I pointed out in the initial question.
It is these that I'd like to be more accessible. Category:Newspapers published 1900-01-01 and basically thousands of days before and after this date, have a huge potential. If a bot can be tasked, one newspaper-archive-category after the other, I believe no day between 1850 and 1950 will not have at least a few files, and that is before Commons editors manually categorize the many "very incomplete" archives (like Category:Le Gaulois du dimanche with merely 27 files).
And sure, I am aware that his is casually asking to create 36'525 categories and more.
And sure, I have more concurrent projects and to-dos than I will ever have time.
And sure, I don't know the first thing about bots.
But this would be a really nice upgrade to the newspaper section of Commons. --Enyavar (talk) 14:23, 10 September 2024 (UTC)
- Lets start with creating the missing YYYY-MM-DD categories. I just created Category:1799-12-21 and Category:1799-12-31 for the Journal de Bruxelles issues. (And 1800-01-31, 1800-02-05, 1800-02-11, 1800-03-08) If the date gets crowded, we can always add the more specific Newspapers published xxxx categories.Smiley.toerist (talk) 12:09, 15 September 2024 (UTC)
- Agree this could be useful. IMO YYYY-MM-DD should have been the preferred date format for Commons from the beginning, and we should eventually transition to using it (and get rid of categorizing by English language month names). -- Infrogmation of New Orleans (talk) 15:07, 10 September 2024 (UTC)
September 07
20@ Wikimedia COMMONS
🎉 HAPPY #20 BIRTHDAY WIKIMEDIA COMMONS & COMMUNITY !
(not as widely popular or loved as Wikipedia or Wikidata,
but hopefully worth fixing, updating and making sustainable,
if not advancing where no media server has gone before)
What are you doing today to celebrate Commons talk:Wikimedia Commons 20th anniversary?
Zblace (talk) 06:14, 7 September 2024 (UTC)
- We really should mention this on the homepage and we should have had birthday-related media spotlighted. This reminds me of when we hit 100 million files and no one really seemed to care. :/ —Justin (koavf)❤T☮C☺M☯ 15:35, 7 September 2024 (UTC)
- Is it too late to temporarily change the logo and maybe have it link to somewhere? Is there any press reporting about this like they do for Wikipedia anniversaries? Prototyperspective (talk) 21:41, 13 September 2024 (UTC)
- It would be nice to celebrate a few anniversary days or weeks, even if starting after the actual anniversary day.
- not as widely popular or loved as Wikipedia or Wikidata: I think it's because most people don't realize that the vast majority of images in Wikipedia actually belong to Commons. Wikipedia wouldn't be what it is without its media. If all projects were viewed as a single whole (almost all media in Commons, even if not used in any article, can be viewed as an extension of some Wikipedia article, and the same for any citation in Wikiquote, any book in Wikisource, any contributor-made book in Wikibooks, any travel guide in Wikivoyage, etc), Commons and other projects would probably be much more noticed. I am not against each project's own personality, but I am against any kind of rivalry between them. If it was better for the Wikimedia ecosystem and the dissemination of knowledge, I'd have no problem if Commons was renamed to "Wikipedia Media Repository" and Wikisource to "Wikipedia Library", for example. MGeog2022 (talk) 11:06, 14 September 2024 (UTC)
- Don't think there's any kind of rivalry between them. Zblace I think WM Commons is far more popular and used way more than Wikidata. I'm interested in real usecases of Wikidata beyond linking Wikipedia articles and the infoboxes but for WMC there's lots of use-cases such as people coming here to find images for their videos (often starting on Google Images due to which addressing it not properly indexing most media here is a key problem and I wonder why noone is addressing it except for a user writing this proposal). Prototyperspective (talk) 11:47, 14 September 2024 (UTC)
- Yes, I was surprised when reading that Wikidata was more popular than Commons. I doubt Wikidata has many direct human readers, but maybe it's popular to get machine-readable information with automatic systems. MGeog2022 (talk) 12:22, 14 September 2024 (UTC)
- (and, if we consider images in Wikipedia articles, I think Commons is as popular and loved as Wikipedia, even if unconsciously) MGeog2022 (talk) 12:25, 14 September 2024 (UTC)
- If you or somebody else knows of any applications (not for demo purposes but actually used), please let me know. It can't be used to get data about studies (far more incomplete than OpenAlex & ScienceOpen) or books (Anna's Archive) or foods (see this and OpenFoodFacts or propriety MyFitnessPal) or films (eg IMDB) or music (MusicBrainz etc) or chemical ingredients (CodeCheck) or anything else where data repositories are currently used in society. Prototyperspective (talk) 12:27, 14 September 2024 (UTC)
- No, I don't really know any real Wikidata use case (that's why I said maybe). I haven't searched anything about it, though. MGeog2022 (talk) 12:36, 14 September 2024 (UTC)
- Yes, I was surprised when reading that Wikidata was more popular than Commons. I doubt Wikidata has many direct human readers, but maybe it's popular to get machine-readable information with automatic systems. MGeog2022 (talk) 12:22, 14 September 2024 (UTC)
- Don't think there's any kind of rivalry between them. Zblace I think WM Commons is far more popular and used way more than Wikidata. I'm interested in real usecases of Wikidata beyond linking Wikipedia articles and the infoboxes but for WMC there's lots of use-cases such as people coming here to find images for their videos (often starting on Google Images due to which addressing it not properly indexing most media here is a key problem and I wonder why noone is addressing it except for a user writing this proposal). Prototyperspective (talk) 11:47, 14 September 2024 (UTC)
- So: no temporary logo change and no post reviewing WMC on diff or anything alike? Prototyperspective (talk) 09:31, 17 September 2024 (UTC)
Many images of book scans
Is there some policy about how book scans are to be handled? I think it would be best if books and similar literature were uploaded as one PDF document rather than as many scattered separate image files.
This way one can easily download or read them / go through pages and they don't clutter search results or require subcategories for just one literature item. For scans already uploaded, a bot could convert them into PDF files. One can also embed single PDF pages like an image on Wikipedia so there's not really any disadvantage to converting them to document files. Example example. --Prototyperspective (talk) 17:22, 7 September 2024 (UTC)
- How about we leave them as the projects that are using them are uploading them? One may be able to embed single PDF pages, but I don't know how, and it's hard to find the appropriate illustration pages. Not to mention that PDFs use compression, and if the originals were PNGs, converted to JPEG2000 for the PDF, and then converted to JPG for Commons, that makes worse final images than directly working with the original scans.--Prosfilaes (talk) 18:19, 7 September 2024 (UTC)
- I described several reasons why not to keep them as people and projects upload them. Moreover, they could change how they upload them. Scans of one page of a book are nearly never used on a Wikimedia project and when they are used often having the full book available at the page would be much better and useful. How to embed a specific page of a PDF is described here: Help:PDF#Page. As for quality they could be converted without loss of quality and it doesn't have to be PDF files if there is something better. Prototyperspective (talk) 18:24, 7 September 2024 (UTC)
- They could be converted without loss of quality how? You're adding another level of conversion.--Prosfilaes (talk) 06:04, 8 September 2024 (UTC)
- With a command that converts losslessly. Maybe ImageMagick's
convert ./page*.png ./output.pdf
already converts losslessly and somebody should check if that is the case. Prototyperspective (talk) 13:20, 8 September 2024 (UTC)
- With a command that converts losslessly. Maybe ImageMagick's
- They could be converted without loss of quality how? You're adding another level of conversion.--Prosfilaes (talk) 06:04, 8 September 2024 (UTC)
- I described several reasons why not to keep them as people and projects upload them. Moreover, they could change how they upload them. Scans of one page of a book are nearly never used on a Wikimedia project and when they are used often having the full book available at the page would be much better and useful. How to embed a specific page of a PDF is described here: Help:PDF#Page. As for quality they could be converted without loss of quality and it doesn't have to be PDF files if there is something better. Prototyperspective (talk) 18:24, 7 September 2024 (UTC)
- I am not against merging into a pdf or djvu file, but as pointed out previously book illustrations can be used by other projects as well as the title page to illustrate the Wikidata entry. --RAN (talk) 20:51, 7 September 2024 (UTC)
- Can one not specify the page of a PDF document on Wikidata for the image/cover property? Maybe that functionality can be added. Then people could also upload the cover or a particular image separately rather than uploading a whopping 300 files for one book. That is also useful because the title can be more descriptive and here is an example of an image from a PDF file that is used on Wikidata...it's much better than having the image just as some whole-page generically titled document scan image. Prototyperspective (talk) 21:04, 7 September 2024 (UTC)
- As far as I'm aware, the default thumbnail for PDF/DjVu documents is always a thumbnail of the first page. Which is a shame; there are a lot of books where that page is a Google Books notice, or an image of a badly damaged cover. Being able to override that with a thumbnail of the title page would be a huge improvement. Omphalographer (talk) 21:33, 7 September 2024 (UTC)
- Can one not specify the page of a PDF document on Wikidata for the image/cover property? Maybe that functionality can be added. Then people could also upload the cover or a particular image separately rather than uploading a whopping 300 files for one book. That is also useful because the title can be more descriptive and here is an example of an image from a PDF file that is used on Wikidata...it's much better than having the image just as some whole-page generically titled document scan image. Prototyperspective (talk) 21:04, 7 September 2024 (UTC)
- @Prototyperspective: I feel like the benefits of something like this would depend on the subject of the book. Like if its mainly or only text, cool. Convert it into a PDF and get rid of the JPEGs. There's of books with photographs or illustrations of historical subjects and places where it makes sense to have individual images. And not just for other projects but in general. Its kind of redundant to have individual images for books that are purelu text though but I think they are easier to transcribe for other projects that way. But I'd still argue its pointless (if not against the guidelines and /or goals of the project) to have jpegs of individual pages that are purely text on our end. --Adamant1 (talk) 21:22, 7 September 2024 (UTC)
- Strong Support towards converting multi-page documents into multi-page document formats - PDF. (I also posted some thoughts about this once) ~TheImaCow (talk) 21:28, 7 September 2024 (UTC)
- I tend to agree with the idea that it should just one file, but PDF support hasn't really gotten better. It's actually somewhat broken and even the WMF person in charge of Commons has no idea if or when it's going to be fixed.
∞∞ Enhancing999 (talk) 21:49, 7 September 2024 (UTC) - Weak oppose Maybe for type-written works it makes sense to only have a PDF, but for hand-written it's often really useful to have higher-resolution files and if they were combined into PDFs the PDFs could be quite large. A small example could be something like this where it's often useful to be able to zoom right in on a letter to better see the pen strokes etc. It's also more likely that people preparing PDFs will end up with trusting whatever their scanner software gives them and by doing so be throwing away information that we'd rather keep (e.g. postmarks on letters). I wonder if more could be done with the search system and structured data to make it easier to exclude individual scans? Sam Wilson 04:47, 8 September 2024 (UTC)
- Maybe a Wikidata property for scan quality or something that could be used to push bad quality PDFs and images down in the search results? I've been wanting something like that for awhile now to suppress crappy scans of postcards from the search results. --Adamant1 (talk) 05:02, 8 September 2024 (UTC)
- @Adamant1: Yes, good point. Maybe DPI for original size (P10300) would suffice? I've not ever bothered adding that, but it would totally make sense. Sam Wilson 05:12, 8 September 2024 (UTC)
- Good call. I didn't know that existed. Although the "for original size" thing seems a little needless, but whatever. I don't see why it wouldn't work. --Adamant1 (talk) 05:26, 8 September 2024 (UTC)
- If you have poorly printed pages, where OCR does not work, pdf is not an usefull format. For the Category:Journal de Bruxelles, I scan most pages, two at a time folded open. I have a standard naming principle for the files. In Wikisource the pages have to be manualy typed over as the OCR does not work. A lot of work, but a usefull result.Smiley.toerist (talk) 11:16, 8 September 2024 (UTC)
- @Adamant1: Yes, good point. Maybe DPI for original size (P10300) would suffice? I've not ever bothered adding that, but it would totally make sense. Sam Wilson 05:12, 8 September 2024 (UTC)
- Maybe a Wikidata property for scan quality or something that could be used to push bad quality PDFs and images down in the search results? I've been wanting something like that for awhile now to suppress crappy scans of postcards from the search results. --Adamant1 (talk) 05:02, 8 September 2024 (UTC)
- Comment Commons doesn't have an official policy on this, but I like to keep in mind what other projects might need these files for and how. For regular books, Wikisource needs an original .pdf or .djvu to be the source/reference material, and illustrations exported separately, usually retouched. See for example here: https://en.wikisource.org/wiki/Page:The_Osteology_of_the_Reptiles.pdf/23 ReneeWrites (talk) 10:39, 8 September 2024 (UTC)
- Note that Wikisource can also work with individual image files as well as PDFs/DjVus. Sam Wilson 12:52, 8 September 2024 (UTC)
- Comment I've been working on Category:Helix (newspaper). If these had been PDFs, I'd have had to extract files for almost every page to document them decently. Yes, there are probably times whe PDFs are best, but it's not a universal. - Jmabel ! talk 12:15, 8 September 2024 (UTC)
- Comment something I find suboptimal with individual pages is that {{Book}} doesn't really seem to be adapted for it. Also, when actually using elements from a page, one generally needs to do another crop just for that part. In the end, that part and the original page might end up categorized for that content.
∞∞ Enhancing999 (talk) 13:23, 8 September 2024 (UTC)- "when actually using elements from a page, one generally needs to do another crop just for that part": there's a template for that. See the syntax used for the image at en:Native American civil rights#Fishing and hunting rights. - Jmabel ! talk 06:46, 10 September 2024 (UTC)
- Interesting, but most users would probably still want to crop the photo as a separate file if they want to use that part.
∞∞ Enhancing999 (talk) 06:50, 10 September 2024 (UTC)
- Interesting, but most users would probably still want to crop the photo as a separate file if they want to use that part.
- "when actually using elements from a page, one generally needs to do another crop just for that part": there's a template for that. See the syntax used for the image at en:Native American civil rights#Fishing and hunting rights. - Jmabel ! talk 06:46, 10 September 2024 (UTC)
- Remember nothing actually gets deleted, so we are not saving any space. We also have had the case where a page was missing, a page was out of order, or a page was upside down in the pdf, and we had to recompile the file. Perhaps if all the individual pages were in a folder there would be less clutter once the pdf was created. --RAN (talk) 18:01, 8 September 2024 (UTC)
- It's not about space but about clutter. It also causes misleading upload stats and clutter the UploadedFiles page. The main issue is the clutter in the search results and from views that e.g. combine categories. The many subcategories are also clutter. Then it's also hard to download the file and a main problem is that it's hard to read a document if it's in dispersed pages instead of in one document. And I think this also applies to old 1800s drawings of organisms which are not really useful anymore, not what people look for, and for which there are now high-quality photos & illustrations. If an image is needed one can specify the page of the PDF or extract an image (that is different from a whole page) from it. Prototyperspective (talk) 18:21, 8 September 2024 (UTC)
- This also applies to old 1800s drawings of organisms which are not really useful anymore, not what people look for, and for which there are now high-quality photos & illustrations. I strongly disagree with this. You have made the common mistake of assuming that what you find useful or interesting is what other people also find useful or interesting, and what you find irrelevant or annoying is what other people also find irrelevant or annoying. For some users of the Commons, "old 1800s drawings of organisms" are in fact more useful than modern photographs, and individual page scans are more useful than PDFs, especially if PDFs of the same public domain documents are already widely available from Google Books, Internet Archive, Hathi Trust, and other repositories. We all use the resources here differently, and the specific resources you like to use and the way you like to use them are not universally shared. There may be good reasons to consider changing the way that multipage documents are handled, but generalizations about "what people look for" based solely on your own preferences are not among them. Crawdad Blues (talk) 20:00, 8 September 2024 (UTC)
- No, I did not make this mistake – maybe I have made this mistake of not clarifying which of my statements are a statement of opinion/argumentation. It's not surprising that there are users who think so since there are users who upload such images particularly as separate pages and with that sentence I was addressing an earlier comment arguing it would be good to do so for images that contain images. These users are very few and I have yet to see any actual usefulness case / application of such images. It's great that we have them on WMC, but I don't see why having them in separate scans would be reasonable. Please make a survey or look at pageviews or explain specific use-cases with examples if you think what I said is wrong. Prototyperspective (talk) 20:17, 8 September 2024 (UTC)
- (With such images I was referring to separate pages as scanned page image files, not whether or not these files are on WMC.) This proposal is mainly about documents with text that include images, not documents which contain only drawings as a page. I think an example of a WMC search that shows mostly scans of a few books and burying what most users are looking for is needed to illustrate what I mean. Here's a bad but maybe still useful example: if you search for "animal" there is an undue number of scans from Category:The animal kingdom, arranged after its organization, forming a natural history of animals, and an introduction to comparative anatomy (1834) at the top instead of higher-quality drawings/artworks and photos (like the collages at the very top). They can be useful but often inaccurate unreliable 1834 drawings are usually not what the user can use for any of the purpose such as adding them to a WP article or illustrating the species. These images (the book) are still useful and it's worse for other searches. Prototyperspective (talk) 20:35, 8 September 2024 (UTC)
- I agree with Crawdad Blues: images of images from books (and other documents), like drawings, prints and photographs, should stay and be images: one in a file. Then they are easy to reuse and then end users do not have to go through the upload process, but can directly use the file. For a type-written, printed book that primarely is about text: yes, then a PDF file is easier, for searching within the text and to leafing through it. JopkeB (talk) 05:17, 9 September 2024 (UTC)
- No, I did not make this mistake – maybe I have made this mistake of not clarifying which of my statements are a statement of opinion/argumentation. It's not surprising that there are users who think so since there are users who upload such images particularly as separate pages and with that sentence I was addressing an earlier comment arguing it would be good to do so for images that contain images. These users are very few and I have yet to see any actual usefulness case / application of such images. It's great that we have them on WMC, but I don't see why having them in separate scans would be reasonable. Please make a survey or look at pageviews or explain specific use-cases with examples if you think what I said is wrong. Prototyperspective (talk) 20:17, 8 September 2024 (UTC)
- This also applies to old 1800s drawings of organisms which are not really useful anymore, not what people look for, and for which there are now high-quality photos & illustrations. I strongly disagree with this. You have made the common mistake of assuming that what you find useful or interesting is what other people also find useful or interesting, and what you find irrelevant or annoying is what other people also find irrelevant or annoying. For some users of the Commons, "old 1800s drawings of organisms" are in fact more useful than modern photographs, and individual page scans are more useful than PDFs, especially if PDFs of the same public domain documents are already widely available from Google Books, Internet Archive, Hathi Trust, and other repositories. We all use the resources here differently, and the specific resources you like to use and the way you like to use them are not universally shared. There may be good reasons to consider changing the way that multipage documents are handled, but generalizations about "what people look for" based solely on your own preferences are not among them. Crawdad Blues (talk) 20:00, 8 September 2024 (UTC)
- It's not about space but about clutter. It also causes misleading upload stats and clutter the UploadedFiles page. The main issue is the clutter in the search results and from views that e.g. combine categories. The many subcategories are also clutter. Then it's also hard to download the file and a main problem is that it's hard to read a document if it's in dispersed pages instead of in one document. And I think this also applies to old 1800s drawings of organisms which are not really useful anymore, not what people look for, and for which there are now high-quality photos & illustrations. If an image is needed one can specify the page of the PDF or extract an image (that is different from a whole page) from it. Prototyperspective (talk) 18:21, 8 September 2024 (UTC)
- It depends a lot on each specific case, but, if uploading a new book, I think PDF is the best option. If the book contains images that are interesting in their own right, they should be uploaded as separate files (or even be left for another user who is specially interested to upload them, if you don't have time for it). For already uploaded books, I think that existing images should not be deleted, but creating a new PDF book can be a good idea.
- I think Commons, for some cases, can be a good book deposit/library, in addition to Wikisource (with OCR or some freely licensed or public domain original PDF books, you can even have machine-readable PDF books in Commons with relatively little effort, while migrating their content to Wikisource can be a really big work in many cases). MGeog2022 (talk) 10:51, 10 September 2024 (UTC)
- You cannot migrate PDFs to Wikisource for the reason that you can't upload any files at Wikisource. (As in: there is not an upload page at all). Wikisource does not host any files. It's Commons what is intended to be used. Matthiasb (talk) 07:12, 17 September 2024 (UTC)
- @Matthiasb, yes, I know that.
- while migrating their content to Wikisource can be a really big work in many cases: by this, I'm talking about converting them to wiki format. In those cases that converting them to wiki at Wikisource is too much work, the full book could be hosted at Commons as a single PDF file, even if it (its content, not the PDF file as such) is never migrated to Wikisource. MGeog2022 (talk) 13:15, 17 September 2024 (UTC)
- Oh, I understand. For this case the different language versions perhaps have different policies. I don't know about other than the German WS. In the German WS we differ between texts which are digitized available. Some users collect any link to GB, IA, (university) libraries, Haithi Trust where a scanned work by a German language author is available wether it's a PDF or some other file format. When a user wants to transcribe a text a copy of the original at commons is requested so that other can verify and correct the transcription. A reader-only would not use commons but the original link to Google Books, Internet Archive and the like, of if the transcription is complete, rather to the transcprited version at WS since it might be better to read than a possibly bad scan. See for ex. s:de:Justinus Kerner showing both cases. Matthiasb (talk) 22:44, 19 September 2024 (UTC)
- Well, I was talking about the case that the user wants to upload the public domain or freely licensed book to Wikimedia, but without converting its full content to wiki format. Maybe we aren't a host for PDF books if they aren't to be converted to wiki format at Wikisource, and Internet Archive is the right place for uploading and viewing PDF books. I don't know the exact Commons policy on this. MGeog2022 (talk) 12:58, 20 September 2024 (UTC)
- Oh, I understand. For this case the different language versions perhaps have different policies. I don't know about other than the German WS. In the German WS we differ between texts which are digitized available. Some users collect any link to GB, IA, (university) libraries, Haithi Trust where a scanned work by a German language author is available wether it's a PDF or some other file format. When a user wants to transcribe a text a copy of the original at commons is requested so that other can verify and correct the transcription. A reader-only would not use commons but the original link to Google Books, Internet Archive and the like, of if the transcription is complete, rather to the transcprited version at WS since it might be better to read than a possibly bad scan. See for ex. s:de:Justinus Kerner showing both cases. Matthiasb (talk) 22:44, 19 September 2024 (UTC)
- You cannot migrate PDFs to Wikisource for the reason that you can't upload any files at Wikisource. (As in: there is not an upload page at all). Wikisource does not host any files. It's Commons what is intended to be used. Matthiasb (talk) 07:12, 17 September 2024 (UTC)
- Oppose Strongly opposing. If such PDFs would be converted one (the person converting it) would have to make sure that each and every Wikisource page would work as before. If we have 300 pages in the PDF that makes 300 pages on WS to be changed. I am not sure about wether they are needed to be modified only on one place; this Wikisource techniques/syntaxes are quiet obscure to my though I am taking part in the project. So what would be achieved? Editing tenthousands of Wikisource pages to reduce the number of PDF files for several ten thousands whereas we are hosting 120 million files on Commons? Produces lot of server load for nothing. --Matthiasb (talk) 07:02, 17 September 2024 (UTC)
- Not this proposal. I don't think you're voting on what is proposed here – at least it seems like that from your rationale. This is about people uploading at times hundreds of images instead of one PDF file to WMC. It doesn't have to do with Wikisource. Those images are not used in Wikisource if that's what you mean. Prototyperspective (talk) 09:52, 17 September 2024 (UTC)
- I think so. If one finds a PDF file on GB or in internet archive or a haithi trust he will keep this PDF together. We not need to diskuss that. There is a difference if the user does the scan himself. Fox example my scanner is capable to scan several pages in a row but never ever I would scan 300 pages in a row. The phone might ring or another interruption would occur. There is no chance to get more than 20 pages or so in a row in one PDF with my Canon. And after the scanning, I don't have the software to combine it and if I had my hardware won't make it due to hardware restrictions. My machine is about ten years old, you understand. Or I even have another image file format, jpg or tiff. Matthiasb (talk) 22:01, 19 September 2024 (UTC)
- Nonsensical. Really absurd at this point. Just use some software to combine images into a PDF file which also works on very old PCs. Takes a minute maybe and you don't necessarily have to do it yourself. That's better than cluttering the search results here for example. Prototyperspective (talk) 10:38, 20 September 2024 (UTC)
- I think so. If one finds a PDF file on GB or in internet archive or a haithi trust he will keep this PDF together. We not need to diskuss that. There is a difference if the user does the scan himself. Fox example my scanner is capable to scan several pages in a row but never ever I would scan 300 pages in a row. The phone might ring or another interruption would occur. There is no chance to get more than 20 pages or so in a row in one PDF with my Canon. And after the scanning, I don't have the software to combine it and if I had my hardware won't make it due to hardware restrictions. My machine is about ten years old, you understand. Or I even have another image file format, jpg or tiff. Matthiasb (talk) 22:01, 19 September 2024 (UTC)
- Not this proposal. I don't think you're voting on what is proposed here – at least it seems like that from your rationale. This is about people uploading at times hundreds of images instead of one PDF file to WMC. It doesn't have to do with Wikisource. Those images are not used in Wikisource if that's what you mean. Prototyperspective (talk) 09:52, 17 September 2024 (UTC)
- Weak support If a complete book scan of x-dozen up to some x-hundred single text pages gets uploaded, and is not in use for Wikisource (!), then I think that all the pages can be collated into a single PDF, which can then replace the single-page scans. Note that my opinion states "can", not "should". It depends on the intended use case and the interest that gets placed on a book. If all single-page-scans remain contained in their intended scan-category, I have no problem with them being loose, not bound. Also, I checked the first example in the OP again: yes, these are single-page scans, but only of the 37 illustrated pages from a 200-page book. For a PDF, we need to have all pages; while the ones in the category should have the text clipped away, to make the images usable as illustrations (Edit, upon rereading, this point was also made by Adamant1, so I support that opinion). On another hand, if PDF-conversion means OCR treatment is simpler, then I'll strongly support this. --Enyavar (talk) 13:45, 20 September 2024 (UTC)
September 10
Repeated file renaming requests, how to react to their occurence
Hello,
I am a filemover and came across this request. The requested move seems to be for that the new name is similarly styled to the other 2 in Category:DC candidates TSE portraits, but it does not really convince me as it would only be "a bit better looking" (maybe! capital lettering is not nice to read), per COM:FRNOT. Hence, I declined the request. Now, it was back again in the queue (denied again by me). Is there any procedure on how to deal with repeated renaming requests after a denial, or any established way of appeals? Regards, Grand-Duc (talk) 19:11, 10 September 2024 (UTC)
- @Grand-Duc: As far as I know there's no formal review process for file moving. In my experience what usually happens when I decline a request and someone disagrees is that they request it again and either I change my mind or I leave it for another filemover to deal with. I don't think I've seen anyone try to get a fourth opinion after two declines from different filemovers. In this case, while criterion 4 obviously doesn't apply I did wonder if Erick Soares3 and KakuLogia+ might be connected in a way that would allow criterion 1 to apply, since they've been making very similar requests. And anyway the overwriting of that file was bad under COM:OVERWRITE and maybe it needs its history splitting. At which point I decided it was all too complicated and I should do something less confusing instead. --bjh21 (talk) 21:31, 10 September 2024 (UTC)
- @Bjh21: and the specific dataset is for Category:TSE electoral portraits of 2024 (the general category for this year) or even Category:Files from Portal de Dados Abertos do TSE (about 200 styled the same way in the first page) with almost everything from 2004-2024, while the DC is only for members of this specific party. Criterion 1 is if we were the same uploader, and no, this isn't the case: we are only from the same country, and we are working with this dataset (specially because we are less than a month before the elections and those images are useful in the Wikipédia/Wikidata). Erick Soares3 (talk) 21:38, 10 September 2024 (UTC)
- So the rename is from:
- to
- ? This because other files had been renamed?
- I DON'T THINK FILENAMES SHOULD BE IN ALL CAPS AND ONE SHOULD NOT RENAME CORRECTLY CAPITALIZED FILENAMES TO ALL CAPS. RENAMES ARE NOT HERE FOR SORTING FILES IN CATEGORIES NOR FOR ACHIEVING SOME COSMETIC GOAL.
- Another problem with the file is that an initial photo was overwritten.
∞∞ Enhancing999 (talk) 01:53, 13 September 2024 (UTC)- I would retain the number in brackets (which might or might not originat in the Flickr feed and enables or facilates identification of photographs) but no ALL CAPS. Matthiasb (talk) 07:17, 17 September 2024 (UTC)
Proposal for a path forward for bringing together folks with a stake in Commons’ future
While at Wikimania, I was able to spend time with folks who participate in and contribute to Commons. I attended the Photowalk community session, where we had a lovely and rainy meander along a brick lined city walkway, where we saw lots of iconic buildings and neighborhoods in Katowice, as well as a lovely park and views of trains and city life. I also was able to sit down with 10 people attending Wikimania who I’d previously met and talked with me about Commons, or who had discussed my recent comments on-wiki about the future of Commons and some of the big challenges and questions we face and need to tackle together.
In that meeting, I spent a lot of time listening to perspectives and the challenges that technical contributors, photographers, organizers, GLAM advocates and individuals just representing themselves each see as we try to determine what next steps to take.
I asked the group to help me think about what comes next, and how to work together to bring the many parts of Commons together in a meaningful way, so that we can start to make some important decisions together. Any choice we make will involve tradeoffs – of attention, time and resources. And I’d like us to all be able to do that with as much information as we can bring together and put on a shared table as possible.
Some of the perspectives I heard included:
- The Wikimedia Commons Query Service doesn’t work very well and needs focus.
- Wiki Loves contests attract a lot of attention, but haven’t been optimized for getting new contributors. Things like licenses, templates and other complexities might be significant barriers to new contributors. Other sites don’t have such complex workflows and maybe attract new folks much more easily.
- Each time a project starts and ends, we lose important knowledge because we don’t have continuity across projects or anyone really working to ensure that we have continuity over time.
- Some projects focus on small batch uploads, and other projects focus on doing very large batch uploads. The disconnect in approach and goals may be problematic when trying to prioritize where we focus attention and effort, particularly from the WMF.
- We don’t have clarity about which tools should be used for producing, reusing, finding and reporting photos, and some tools that have been made don’t have enough functionality to be truly useful and to replace older tools.
- There's a tension between admins and patrollers asking for more things to prevent vandalism, but also campaign creators who want lower barriers to entry. We also have the tech communities with needs for APIs, Commons embedders who would like different features on other projects, and yet more needs from photographers user groups, GLAM professionals, and external reusers. We asked: how could we go about helping these folks see each other, and help everyone understand some of the tradeoffs involved in supporting such a complex project?
In listening to each other, people remarked that they could see how different people’s perspectives were, and also that there were perspectives not represented in the room. We spent some time trying to name them, and discussed what a good next step might be to try and help one another come to some conclusions about how to best prioritize the limited resources of the Foundation, and how best to ask for help from the whole of our movement.
As a result of this discussion, several people suggested that regular meetings that helped to surface and then explore the challenges we now face among Commons contributors and users would be really helpful, with the intention that this will lead us to identify key strategic trade offs.
We have a wide variety of tools ranging from very specific to very general, and we have many sub-communities with different needs and agendas. The Foundation and contributors all need to understand this landscape in order to plot a way forward together.
So my proposal for the next year or so is to start having monthly forums, where we pose some of the big questions around Commons, and come together as a community. I think this will involve a live discussion as well as on-wiki discussions to get to the heart of some of the big tradeoffs and questions that we all need to face together.
I am committed to finding a better way of supporting Commons, and I see the need to get to a shared goal and a timeframe for achieving a clear set of things. And I will need help from all of you to identify the most important problems to solve, which ones not to solve right now, and then the right kinds of solutions for supporting a vibrant Commons into the future.
Thanks for all the input so far, and I’m looking forward to connecting with more of you in the near future! SDeckelmann-WMF (talk) 22:01, 10 September 2024 (UTC)
- Thanks Selena. Strange that video has not been discussed. Ymblanter (talk) 18:31, 11 September 2024 (UTC)
- @SDeckelmann-WMF, maybe some things I said in my comment here are of interest to you when requesting resources for Commons at WMF. Maybe some improvements to Commons can be seen as secondary, but Commons as a whole in no way is so. Unlike other WMF projects (all of them important as they are), Commons content is continuously shown in Wikipedia articles themselves, so Commons is, in some way, truly part of Wikipedia, without question. MGeog2022 (talk) 11:18, 14 September 2024 (UTC)
- I'm definitely interested in participating in monthly forums. I would suggest that rather than have them at the same time of day each time, we might want have them at different times of day from one month to another and post the schedule very publicly, because Commons is such a globe-spanning project, and any time of day we picked is going to be very inconvenient for a significant number of people who might like to participate. - Jmabel ! talk 15:56, 16 September 2024 (UTC)
- Hi, My main general worry is that Commons workload is increasing, and there are new legal, technical, and social challenges, but there is no increase in the number of people to maintain this all. Specially the number of administrators is decreasing. The challenges include AI-generated images, new kinds of media (Commons:Project scope/Allowable file types: 3D rendering, video was marginal, but is now mainstream, etc.), the huge complexity to understand, document and maintain the legal information, etc. Commons is in a way victim of its success. People confuse Commons with social media. It was a small project nobody really cared, and it seems to have become the place to advertise companies and people. We now delete as many out-of-scope content and selfies as copyright violations. The formers were just a small percentage of deletions a few years back. I have suggested several times a recruitment campaign for admins and patrollers. The WMF could help organize crash courses on copyright. Moreover I think more people would volunteer if they feel supported by the WMF in one way or another (better tools, less bugs, fast answer to technical issues, etc.). Yann (talk) 17:41, 16 September 2024 (UTC)
- Thank you, @SDeckelmann-WMF for putting forward this proposal. The Wikimedia communities, and not the least the Wikimedia Commons community, have plenty of different workflows, priorities and purposes. Putting all our brains together we'll probably find common ground, identify recurring needs and come up with well-informed proposals for improvements. In my mind, a clarity on the different positions of the different stakeholders is already a useful first step. – Susanna Ånäs (Susannaanas) (talk) 19:00, 18 September 2024 (UTC)
I was in the Wikimania meeting and just wanted to add one of my takeaways. Part of the challenge for the WMF are, as alluded to above, the diverse stakeholders who rely on Commons and often have different or even conflicting priorities. Selena said that, once we can identify stakeholder groups, if those groups come together to produce a list of priorities (features, fixes, etc.), someone at the WMF will go through the lists and respond to each item -- what can be done, what needs more information, and what won't be done for reasons xyz. To me, that commitment was the most noteworthy action item for the community. So we have to figure out these stakeholders and then those stakeholders need to self-organize and produce a list. Maybe a good use of the first monthly meeting is to brainstorm stakeholder groups. — Rhododendrites talk | 21:46, 16 September 2024 (UTC)
- I think this is a little bit short-sighted. "Stakeholders" or even hypothetical organized groups thereof often don't have much insights into and experience with Wikimedia software. They don't have an inside view of what could be possible, what may be very useful and so on. I'm not saying that this wouldn't be valuable, just that this would only be half of the equation if at all. Be aware that with "stakeholders" that's random people somewhere in the world using Wikimedia Commons to find media for their educational YouTube video, some teacher looking for materials, a Wikimedia contributor looking for some image to use in some poster, a Wikipedian looking for media to include in a new article, and some journalist looking for an image to use in low-budget news article. Nearly none of these are organized and most would probably not provide well thought through proposals.
- Furthermore, there actually already is a list of priority features and fixes – there could be more and some are included in the current Community Wishlist but I don't see any justification for it not getting considered deeply by the WMF: Commons:Requests for comment/Technical needs survey. I don't think the key problem or bottleneck is 'unclear wishes', 'conflicting priorities' or any of that sort. The problem is a lack of will in increasing capacity for implementing any of widely requested features such as via more funding for development and/or a banner campaign to get more volunteer developers engaged. Monthly talks may be useful but I suspect it would likely overall be a waste of time with little productive things coming out of it – in the end more time spent on things other than actually implementing things. Once there is some ongoing work on implementing things and some capacity, one could still have monthly talks for the time when well-known issues have been solved. @ Yann to reduce workload one could start by implementing bots that automatically identify likely copyvios or likely vandalism and similar things (like auto-addition of inferrable categories) which is another thing that requires some will for technical development. Prototyperspective (talk) 22:54, 16 September 2024 (UTC)
- I was in many of those meeting at Wikimania,at @Ymblanter: yes video was discussed and identified as a choke point that we WMF support to work on. Firstly was the need to make reviewing, identifiing, and address issues within recordings much easier with support tools. The secondpart of that was once there were working tools was to look at formats and ease of putting videos under a free license. We are also challenge in who has what rights, audio tracks, interpretations of content, this really is complex area that needs its own skill set our saving grace is that video isnt as popular as it could be within other sister projects so the demand isnt there but its on the horizon as more people are becoming familiar with creating videos.
@Prototyperspective: Commons and other sister projects attract people who dont have technical skills those people have skills that are equally important in creating Commons. So yes you are right we of the nontechnical incling dont know the capabilities of the software, or any limitation but we do know what we'd like to see possible in the future across many other areas excluding stakeholders for any reason isnt the solution either. Selena is engaging in meaningful way and useful way to ensure tated he WMF knows what commons is, where commonalities of people who are building differents part exist, where there are gaps in capacity. Commons has for sometime lacked a large cohort of software developers who were attracted away to other more recently created sister project. I agree Bots that flag potential copyright vios would help and speed the processes up especially as some of them quack when given the standard duck test, and image search tools like Tinyeye & Google already exist, my question would be whats stoppping someone creating such a bot now?
Commons was created as a space saver for the servers ensuring that we didnt need 300 copies of the same file and that as better media file come along everyone can benefit from them. Withit every new idea was just tacked on, anything "media" was put here now we have lots of little pockets that are known only to different stakeholders it not until we get hit with a major issue like WikiZero created that these start to cross over. I think a monthly meeting at differing times or multiple times as Selena is proposing is a good starting to point to find every one of those little pockets, and bringing potential solutions like a copyright duck bot dont get lost but rather brought to a more public sphere where they can be created and in such away that the WMF can flag them as a critical tool which regardless of the creator is maintained long after the developer has moved on to other projects. Gnangarra 00:48, 17 September 2024 (UTC)
- I'd love to join a meeting (particularly if one is held at WCNA next month)! My perspective is primarily one of a Wikipedian who uses Commons images, and my top priority currently is collaborating with the WMF on the Upload Wizard improvements. Sdkb talk 01:39, 20 September 2024 (UTC)
- Here are some thoughts from my side, I was also at the meeting at Wikimedia. I like the idea to have a meeting on a regular basis. But such a meeting has one problem: At such a meeting there are often some interest groups over represented and some groups are not represented. And such a meeting can never be the place to make final decisions. That is the main problem that we do not have good process as community to make decisions. Proposals on Commons:Village pump/Proposals often to not have enough participation to be a good legitimization. If voting on proposals does not work to make decisions we should think about delegating certain decisions to a certain group of users they are elected and with quotas of certain interest groups. For the meeting maybe a solution could be the make an agenda for the meeting that focuses on the topics of a certain group. I also want to say something on one specific point: "There's a tension between admins and patrollers asking for more things to prevent vandalism, but also campaign creators who want lower barriers to entry." I think this is not really a problem. Most campaigns are organized by a group of active users giving guidance to new users and check the content uploaded in the campaign. Therefore the patrolling is done by the campaign organizers and no patrolling by admins or other patrollers is needed. And if people read the word "photo contest" they are aware that they are only allowed to upload their own works. In the last years of Wiki Loves Earth in Germany we did not have a single copyright violation uploaded by participants. There are problems with campaigns using the Commons:ISA Tool as not all organizers made a quality check. GPSLeo (talk) 06:10, 20 September 2024 (UTC)
- I don't want to point fingers but there's certainly been a lot of copyright violations coming from certain Wiki Loves Monuments events over the last couple of years. Although I agree your general point that more active users can give guidance to the less active ones in those situations, but it seems like they just don't care or aren't willing to in a lot of cases. I've certainly seen people who run Wiki Loves Monuments events from certain countries act rather ambivalent about it before. So it seems like there's a general sentiment that it's not on them to moderate people who participate in their events, which I guess I can understand. As they are mainly concerned with getting people to participate and it's easy to turn people off from doing that. But it does create a lot of needless hassle on our end. At this point I just see it at the cost of doing business, but it would be good if there was a more long-term solution then expecting users to sift through other people's uploads for copyright violations every time there's an event. Maybe things like better training or a guideline saying that they have to review files for copyright violations before uploading them would help though. Who knows. --Adamant1 (talk) 07:00, 20 September 2024 (UTC)
- But these copyright violations are related to lack of freedom of panorama and not false own work claims they are very common by users uploading photos while editing a Wikipedia article. GPSLeo (talk) 15:27, 20 September 2024 (UTC)
- I don't want to point fingers but there's certainly been a lot of copyright violations coming from certain Wiki Loves Monuments events over the last couple of years. Although I agree your general point that more active users can give guidance to the less active ones in those situations, but it seems like they just don't care or aren't willing to in a lot of cases. I've certainly seen people who run Wiki Loves Monuments events from certain countries act rather ambivalent about it before. So it seems like there's a general sentiment that it's not on them to moderate people who participate in their events, which I guess I can understand. As they are mainly concerned with getting people to participate and it's easy to turn people off from doing that. But it does create a lot of needless hassle on our end. At this point I just see it at the cost of doing business, but it would be good if there was a more long-term solution then expecting users to sift through other people's uploads for copyright violations every time there's an event. Maybe things like better training or a guideline saying that they have to review files for copyright violations before uploading them would help though. Who knows. --Adamant1 (talk) 07:00, 20 September 2024 (UTC)
September 11
Proposal: AI generated images must be clear they're AI in the file name
Now these are being used for a good purpose (reporting on the misuse of AI at en:Wikipedia:Signpost) so please don't just nominate them for deletion, but look at these filenames:
File:Amoeba moving.jpg File:Leukocytes.jpg
Now I've moved them to File:AI genetated image of... - but that's just actively setting people up to use semi-believable illustrations that have no scientific accuracy, and then making it relatively hard to catch what happened.
Should we be somewhat stricter about filenames for AI? There's cases where I think it matters less than these, but the capacity to mislead is high. Adam Cuerden (talk) 10:01, 11 September 2024 (UTC)
- Oh, there's also File:Cancer cell.jpg, now File:AI generated image of a cancer cell.jpg which is not used anywhere, and looks ridiculously misleading. That's just the AI giving the cell its own tumor. Actually, maybe that should be in that article about misleading AI Adam Cuerden (talk) 10:05, 11 September 2024 (UTC)
- File names for AI generated images not indicating that's what they are is definitely an issue. There's no reason there shouldn't be some indication in the file name that an image is AI generated. I think it would be in alignment with the changes to guideline on how to name files that was passed recently to. Regardless, file names should be as descriptive as possible and I can't see why that information shouldn't be included in the file name. --Adamant1 (talk) 11:02, 11 September 2024 (UTC)
- An alternative would be to add a tag to the media without requiring files to be moved or named with that in the title from the start. Just like any NSFW image has an indicator for that on sites like reddit. It would be shown on all files in Category:AI-generated images either at the end of the file-title or e.g. within a corner of the thumbnail. I think adding that automatically would be better. However, when uploading the file using the Upload Wizard and checking made with AI one could also automatically append (AI-generated) or (made using AI) to the file-title. Prototyperspective (talk) 11:29, 11 September 2024 (UTC)
- Is it really misleading if people cant be arsed to even read the template? Trade (talk) 11:30, 11 September 2024 (UTC)
- The template only shows on the file page. And even there it doesn't look very different from other common license templates which people only interested in the content usually probably don't look at either and many files like the linked examples don't even have these templates. Prototyperspective (talk) 12:47, 11 September 2024 (UTC)
- Support The file name and caption are the first things you see when you select an image that's used on Wikipedia for better viewing, right before you click through to the file's own page. For most people it'll probably be the only information they'll see. This information is absolutely important enough that it should be mentioned in the file name. ReneeWrites (talk) 13:03, 11 September 2024 (UTC)
- Support per ReneeWrites. I agree that having AI generated marked in the file name will give Wikipedia users much more transparency on the provenance of files. William Graham (talk) 15:36, 11 September 2024 (UTC)
- Oppose. Nice idea, but I prefer templates that can be translated and add properties. Latin letters in a filename are not a good clue in other scripts. We may have endless rename requests. File naming hacks are also not systematic; we do not routinely encode other properties in filenames. Glrx (talk) 15:47, 11 September 2024 (UTC)
- At the same time, an actively misleading filename is a problem. They are not leucocytes (for example. They don't even look much like cells. AI is very good at creating images that look like they're plausible depictions but really aren't, they just ape the - for lack of a better word - art style of real scientific illustrations, coloured electron microscope depictions, and so on. Adam Cuerden (talk) 16:05, 11 September 2024 (UTC)
- we do not routinely encode other properties in filenames - We routinely use naming systems like "Flag of [Country]" for other types of files. Using filenames to make important disclosures about the origin of files isn't a huge leap. Omphalographer (talk) 19:11, 11 September 2024 (UTC)
- @Omphalographer, Well said. -- Ooligan (talk) 17:11, 12 September 2024 (UTC)
- Support Update with caveat: my support is for this idea in principle, with the understanding that we would need an additional discussion about implementation to cover things like wording. — Rhododendrites talk | 12:32, 12 September 2024 (UTC) This is a good idea, and in line with the spirit of many off-wiki policies proposed for AI content. It also doesn't preclude a template. The question, though, is what label/language should be used. It would need to be something someone wouldn't choose accidentally for a non-AI image. Also, documentation for this rule would need to be clear that we're talking about media that is produced through generative AI models (as opposed to, say, a scientific visualization in which machine learning was used somewhere in the process). — Rhododendrites talk | 16:10, 11 September 2024 (UTC)
- I wouldn't be too concerned about language necessarily. If the filename is not in a language people speak, they're much more likely to check the decription. We don't need a perfect solution, just an improvement. Adam Cuerden (talk) 16:14, 11 September 2024 (UTC)
- Comment - This is a good idea, but it needs refinement. Besides Rhododendrites's caveat's above, I think it should only apply to images which depict something in a realistic manner. There's not much point requiring this for something like File:Portrait of a Unicorn.png. Otherwise, I would only support it as a recommendation, not a requirement. Nosferattus (talk) 18:21, 11 September 2024 (UTC)
- What harm would it be to apply it to that image, to make clear (a) this is not a human-made illustration, (b) this is not a Lisa Frank image, etc.? — Rhododendrites talk | 18:33, 11 September 2024 (UTC)
- We shouldn't be mass changing file titles without a compelling reason. It just generally causes chaos. I think files being factually misleading is a compelling reason, but otherwise, I'm not convinced. Nosferattus (talk) 16:57, 15 September 2024 (UTC)
- I'd say there's a class of images where it matters less. But a human-made illustration probably has some effort to get key aspects, whatever those might be. AI just tries to get something that looks like other images with similar key words, and might miss out important bits that a human wouldn't. Honestly, as a general rule, the higher the likelihood it'd be used on Wikipedia, the more that's an issue. Adam Cuerden (talk) 19:34, 11 September 2024 (UTC)
- Agreed on Nosferattus' conditional support: images that can be mistaken for something else, should be marked, and the filename is the most obvious place to do so. By the way, this also applies to photoshop fabrications of "real life flower elfs" etc. And from a filemover perspective: We are supposed to only rename files that are realistically going to be kept. Is there even a rationale to keep misleading non-scientific AI illustrations? I mean, beyond illustrating how you can't trust AI illustrations? --Enyavar (talk) 00:01, 13 September 2024 (UTC)
- Aye. These couple are useful to illustrate the problem, but we certainly don't need more. Adam Cuerden (talk) 15:22, 14 September 2024 (UTC)
- Oppose for now, unless proposal is substantially modified to address concerns above. Nosferattus (talk) 16:53, 15 September 2024 (UTC)
- What harm would it be to apply it to that image, to make clear (a) this is not a human-made illustration, (b) this is not a Lisa Frank image, etc.? — Rhododendrites talk | 18:33, 11 September 2024 (UTC)
- Support, as we should use any (and all) means to achieve maximum transparency for re-users about the non-authenticity of AI-generated images. --Túrelio (talk) 18:27, 11 September 2024 (UTC)
- Support: I don't see any downside. One remark, though: like everything else on Commons, this should not be restricted to English, and I don't imagine I would recognize something if it were marked in Chinese as AI. How do we intend to deal with the multilingual aspect of this? - Jmabel ! talk 19:32, 11 September 2024 (UTC)
- As I said above, I think that perfection isn't needed. If it's labelled in Chinese, as long as the whole filename is in Chinese, Anglosphere people will presumably go to the description. They might not for one that has a plausible English filename. Adam Cuerden (talk) 19:37, 11 September 2024 (UTC)
- Yep. In a Czech file name, the warning should be in Czech, and in a Japanese file name, in Japanese - tailored to the native languages these images are likely to get used for. And if I'm that determined to use a cool image with Tamil filename in the German WP, I the user must make sure to understand the filename and description. (GTranslate exists.) --Enyavar (talk) 00:10, 13 September 2024 (UTC)
- As I said above, I think that perfection isn't needed. If it's labelled in Chinese, as long as the whole filename is in Chinese, Anglosphere people will presumably go to the description. They might not for one that has a plausible English filename. Adam Cuerden (talk) 19:37, 11 September 2024 (UTC)
- Support: Let´s do it. Transparency first. Alexpl (talk) 20:11, 11 September 2024 (UTC)
- Support as a general idea. Would this extend to AI-upscaled images, which can get very strange at the deep end? Belbury (talk) 20:39, 11 September 2024 (UTC)
- How do I unsee that image?! Omphalographer (talk) 22:58, 11 September 2024 (UTC)
- Extremely disturbing heh Bedivere (talk) 04:49, 12 September 2024 (UTC)
- How do I unsee that image?! Omphalographer (talk) 22:58, 11 September 2024 (UTC)
- Comment from a filemover. If you want to apply this requirement to files after upload, you should amend Commons:File renaming to make it clear that lacking a statement of AI-generation in the filename is good cause for renaming. Either by adding a new numbered criterion or by finding a way to shoehorn it into an existing one (2 or 3, I'd guess). --bjh21 (talk) 21:01, 11 September 2024 (UTC)
- @Bjh21: You could argue, in clear cases like the ones I mentioned earlier, it's already covered by 2, since they aren't actually pictures of (say) leukocyctes, but I agree that adding an example would help. Adam Cuerden (talk) 20:40, 12 September 2024 (UTC)
- @Adam Cuerden: I think you mean 3 (obvious error), and I agree that would cover clear cases like those. But there are other cases that I don't think would be covered, like File:White generic hatchback.png or File:Wikimedia LGBT+ graphic illustration 1.png. --bjh21 (talk) 21:48, 12 September 2024 (UTC)
- It'd be nice to expand criterion 2 to allow adding information about the non-factual nature of an image in general (e.g. AI generated images, simulations, reenactments, historical reconstructions, artistic representations, etc). Omphalographer (talk) 22:40, 13 September 2024 (UTC)
- Aye. Certainly in the spirit of, but explicitly permitted never hurt. Adam Cuerden (talk) 15:23, 14 September 2024 (UTC)
- @Bjh21: You could argue, in clear cases like the ones I mentioned earlier, it's already covered by 2, since they aren't actually pictures of (say) leukocyctes, but I agree that adding an example would help. Adam Cuerden (talk) 20:40, 12 September 2024 (UTC)
- I believe that ALL substantially manipulated or fabricated images, that might be misunderstood as "real", should be very clearly marked. It shouldn't depend only on whether someone decides or rules that the fabrication method was "AI". ITookSomePhotos (talk) 21:37, 11 September 2024 (UTC)
- Fair, but this is an easy "bright line" case. We can easily spot AI by tags and upload tools, Adam Cuerden (talk) 15:23, 14 September 2024 (UTC)
- Support, probably difficult to enforce though given the backlogs of other bad file names needing renaming (screenshot, whatsapp, etc). Gnomingstuff (talk) 22:28, 11 September 2024 (UTC)
- Support AI generated images must have "AI" in the file name as a principle, perhaps even better would be "AI generated", which is more clear. And always at least in Latin letters. Yes, the backlog might be a problem, but we can start now for new uploads. --JopkeB (talk) 05:12, 12 September 2024 (UTC)
- What do people think about when uploading the file using the Upload Wizard and checking made with AI one could also automatically append (AI-generated) or (made using AI) to the file-title? (No replies on that above or on the idea of a tag displayed dynamically next to the file-title and in the thumbnail.) I think doing something automatically and in a standardized way would be better than just requiring this which many uploaders will not follow up on. Prototyperspective (talk) 10:49, 12 September 2024 (UTC)
- Given that the Upload Wizard already asks about AI tools, I think it would be appropriate for it to ensure that uploads using them follow whatever policy arises from this discussion. --bjh21 (talk) 17:26, 12 September 2024 (UTC)
- What do people think about when uploading the file using the Upload Wizard and checking made with AI one could also automatically append (AI-generated) or (made using AI) to the file-title? (No replies on that above or on the idea of a tag displayed dynamically next to the file-title and in the thumbnail.) I think doing something automatically and in a standardized way would be better than just requiring this which many uploaders will not follow up on. Prototyperspective (talk) 10:49, 12 September 2024 (UTC)
- Support All AI images must use this required prefix phrase: "AI generated image of..." -- Ooligan (talk) 05:58, 12 September 2024 (UTC)
- Support. on top of this, if we need to rename these files, i suggest requiring the new name to begin with " «AI generated» " or " ~AI generated ". this will make them appear behind all ascii letters when sorted alphabetically. RZuo (talk) 13:58, 12 September 2024 (UTC)
- If you want to change where something sorts, I think it's better to do it using
{{DEFAULTSORT}}
rather than by requiring a particular pattern in the filename. --bjh21 (talk) 15:46, 12 September 2024 (UTC)- @Bjh21, Can we do both? -- Ooligan (talk) 16:47, 12 September 2024 (UTC)
- @Ooligan: I can't see why you would want to, but you certainly can. --bjh21 (talk) 17:23, 12 September 2024 (UTC)
- this is just an idea that can be done with no extra cost, when the file will be renamed anyway. RZuo (talk) 20:50, 12 September 2024 (UTC)
- Probably doable through the AI templates. Something like {{DEFAULTSORT:«{{BASEPAGENAME}}}} Adam Cuerden (talk) 21:05, 12 September 2024 (UTC)
- @Ooligan: I can't see why you would want to, but you certainly can. --bjh21 (talk) 17:23, 12 September 2024 (UTC)
- @Bjh21, Can we do both? -- Ooligan (talk) 16:47, 12 September 2024 (UTC)
- Please no. This makes file names unnecessarily difficult to type - most keyboards don't have
«»
keys, and~
is difficult to find on many mobile devices. The goal is to label these files, not to make them difficult to use. Omphalographer (talk) 22:53, 12 September 2024 (UTC)- Agreed. DEFAULTSORT is the better solution for de-prioritizing AI images. ReneeWrites (talk) 07:23, 13 September 2024 (UTC)
- +3. Adding special characters to file names should be banned. --Adamant1 (talk) 07:42, 13 September 2024 (UTC)
- It really depends on the meaning of "Special", lest we ban, say, Korean file names, or accents. We have French filenames with French-style quotes in them, and we shouldn't change those. At the same time, we have default sort; let's not make it a policy to name AI images File:💩AI generated💩 Foobar.jpg Adam Cuerden (talk) 10:03, 13 September 2024 (UTC)
- That's not really what I'm talking about. I don't think arbitrary putting brackets in file names is useful though. Maybe circle brackets, but
«»
or~
. If for no other reason then most keyboards don't have them to begin with. I'm also super annoyed by file names with emojis them though. They should 100% be banned. I'd be totally fine with requiring people put (AI generated image) at the end of a file name though. --Adamant1 (talk) 10:50, 13 September 2024 (UTC)- Aye, merely trying to avoid bad policy coming out of this. Should we append characters at the start of filenames to deprioritise them? No. That's a job for {{DEFAULTSORT}}. But «» are the standard quotes used in French, so we shouldn't ban their use, lest we require bad French. I'm a little bit of a stickler for trying to avoid policy for one situation that screws up other situations. Adam Cuerden (talk) 15:38, 14 September 2024 (UTC)
- That's not really what I'm talking about. I don't think arbitrary putting brackets in file names is useful though. Maybe circle brackets, but
- It really depends on the meaning of "Special", lest we ban, say, Korean file names, or accents. We have French filenames with French-style quotes in them, and we shouldn't change those. At the same time, we have default sort; let's not make it a policy to name AI images File:💩AI generated💩 Foobar.jpg Adam Cuerden (talk) 10:03, 13 September 2024 (UTC)
- Keep it simple. Just put "AI" infront of the filename. Those who want to know more can check the summary / category of a file for details. Alexpl (talk) 11:06, 13 September 2024 (UTC)
- Please rather put it at the end of the filename. Moreover, "AI" is ambiguous and also included in many other files, so again I'd suggest (AI-generated) or (made using AI) and this could be appended to the initial file-titles automatically in the Upload Wizard. Prototyperspective (talk) 11:24, 13 September 2024 (UTC)
- the thing is, if you prepend filenames with A, it's counterproductive to your aim (discouraging use of ai files as illustrations) because then all the ai files will occupy the front rows in categories (unless you add defaultsort of a super "late" unicode to the ai template). RZuo (talk) 18:34, 16 September 2024 (UTC)
- If you want to change where something sorts, I think it's better to do it using
- Oppose Use a template, similar to {{Retouched}}. Note that we already have non-AI files with names beginning "Ai ". Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:17, 13 September 2024 (UTC)
- The examples people have brought up in support or as examples in this discussion include "AI generated", not just AI. Edit: Except for Alexpl's comment. ReneeWrites (talk) 14:22, 13 September 2024 (UTC)
- Support When images are used internally editors know to check the filepage, when outside people reuse the images they are usually not aware of the info on the files page. We have had people reusing images without complying to the terms of the license by naming the creator. I would like to see (AI generated) the way we use "(cropped)" in file names. --RAN (talk) 18:06, 13 September 2024 (UTC)
- Ooh, that's a good point - if a user saves an image from Commons (or another wiki) to their computer, for instance, there's a decent chance that its file name will remain intact as a warning. A template on the file's talk page certainly won't. Omphalographer (talk) 18:13, 13 September 2024 (UTC)
- Yes, I agree. @Richard Arthur Norton (1958- ) & @Omphalographer -- Ooligan (talk) 02:57, 14 September 2024 (UTC)
- Strong oppose no no no, file names shouldn't be used for that. This opens up Pandora's box for controversial subjects. This should be in the file description, not in the name. Multichill (talk) 12:53, 15 September 2024 (UTC)
- Comment Why do we even accept AI generated images to begin with? Most of them are misleading, useless for Wikipedia articles, fake-y look as standalone content, and can be barely trusted. AI images should only be limited to very specific scenarios, otherwise we end up with a bunch of superheros holding the Commons logo, which we can all agree is largely a set of very interesting trademark violations and not consistent with community practices. Moreover, there have been recently a number of court cases around copyright infringement for several of these AI companies, so I'm concerned that we can't distinguish the provenance from different models that may or may not be trained on infringing datasets. We have no idea how this is going to be regulated.
- Scann (talk) 00:12, 17 September 2024 (UTC)
- No thing such as "infringing datasets" exist Trade (talk) 20:42, 20 September 2024 (UTC)
- Support Helps a lot with the identification of the file as AI and not the real thing, it's immediate.-- Darwin Ahoy! 10:45, 18 September 2024 (UTC)
- Comment I generally think templates and/or structured data are a better way to do this, and more multilingual-friendly. But I'm not opposed to also requiring it in file names if the community agrees on that (and I will anyway include in the filename of any AI files I upload myself). Noting that I have made a related proposal below: #Proposal: de-prioritise AI images in search the wub "?!" 15:09, 19 September 2024 (UTC)
Working out changes
I'm sensing pretty widespread support, so let's plan out what would need changed:
- Commons:File renaming: #2 gains "To identify AI generated works" with a possible more general version of "to point out major manipulations" (colourization, etc). This is explicitly allowed to be in any language.
- 1b. Also Commons:File naming, under "Clear".
- Commons:AI-generated media notes that the AI-generation must be mentioned in the filename, ideally in the same language as the rest of the filename.
- File upload wizard appends "AI generated" if the AI creation option is ticked, with the option to change this after, but with a note saying that identifying AI art in filenames is important. Alternatively, this can just be a soft prompt, that suggests a new filename, but doesn't require. (Similar to others where you can click "ignore and upload file anyway)
- Possibly, {{PD-algorithm}} and similar can be edited to add a {{DEFAULTSORT}} to move AI works lower in categories.
Have I missed anything, and anyone have suggestions? Adam Cuerden (talk) 19:55, 14 September 2024 (UTC)
- I think #4 should either not be implemented or be for files in Category:AI misgeneration. Images shouldn't be sorted by how they were produced but by by where the user is expecting to find them / looking for them or generally the relevance and quality of the image as it relates to the category concept, not the method/techniques used to produce it. You may have missed an addition to Commons:File naming. Prototyperspective (talk) 20:40, 14 September 2024 (UTC)
- Added File naming, and you're probably right about #4. Wanted to pull all the suggestions made, but that may be too much (if nothing else, AI image categories wouldn't do the headers for first letter of filename). Adam Cuerden (talk) 11:44, 15 September 2024 (UTC)
- #3 and #4 are terrible ideas. #3 will cause uploader confusion, filename conflicts, language issues, etc. This needs to be done by humans, not machines. #4 will also be confusing as no one will expect {{PD-algorithm}} to mess with the sorting. Plus it's just unneeded and potentially unhelpful, as there may be other reasons an AI-generated file needs to be sorted in a particular way. Nosferattus (talk) 17:05, 15 September 2024 (UTC)
- I don't think they're terrible ideas, but I don't think these two are needed. We're not inundated with such a flood of AI-generated images being uploaded to Commons that these couldn't be done by hand, and a lot more people have filemover rights than admin rights, so this wouldn't add to the backlog of issues needing admin attention. ReneeWrites (talk) 09:46, 16 September 2024 (UTC)
- Okay. Then let's focus on points 1, 1b, and 2. For 1b, I'm thinking (under "Clear")
- "Where an image, either through method of creation or modifications, might mislead, this should be noted in the filename. This includes AI generation, colourization of a photograph, turning a sepia image black and white, upscaling an image, and other things that might not be immediately obvious. Simple, minor fixes do not need to be noted."
- Too much? Adam Cuerden (talk) 10:56, 16 September 2024 (UTC)
- This discussion is about AI, and I think we should stick with that. I've seen way too many discussions get killed the moment they gain any traction because people keep attaching stuff to it that is tangentially related that no consensus was reached on. ReneeWrites (talk) 15:22, 16 September 2024 (UTC)
- Okay. Let's get this implemented, and any further additions can be discussed on the talk pages after? Adam Cuerden (talk) 17:41, 16 September 2024 (UTC)
- This discussion is about AI, and I think we should stick with that. I've seen way too many discussions get killed the moment they gain any traction because people keep attaching stuff to it that is tangentially related that no consensus was reached on. ReneeWrites (talk) 15:22, 16 September 2024 (UTC)
- Okay. Then let's focus on points 1, 1b, and 2. For 1b, I'm thinking (under "Clear")
- I don't think they're terrible ideas, but I don't think these two are needed. We're not inundated with such a flood of AI-generated images being uploaded to Commons that these couldn't be done by hand, and a lot more people have filemover rights than admin rights, so this wouldn't add to the backlog of issues needing admin attention. ReneeWrites (talk) 09:46, 16 September 2024 (UTC)
- 1a: Support "To identify AI generated works" sounds good to me.
- 1b: Comment I think it would be more at home under "Descriptive", specifically the subheader "Correct". There's nothing particularly unclear about the filename "Cancer cell.jpg", but it leaves out a lot of pretty crucial context that makes it pretty misleading.
- I'd like to propose this change: Correct – The name should describe the file's content and convey what the subject is actually called. Inaccurate names for the file subject, although they may be common, should be avoided. The title given to a work of art by the artist that created it is considered appropriate, even if the name has nothing to do with what is depicted (for example, many works of Dadaism). The name should also be free of obvious errors, such as misspelled proper nouns, incorrect dates, and misidentified objects or organisms. Users are allowed to upload "unidentified" or "unknown" organisms but such files may be renamed upon identification. AI-generated images must disclose this fact in the file name.
- It's tempting to include a bit on the rationale as to why, but none of the other examples have that either, they simply state what is policy. So I think addressing this with just one line that's clear and unambiguous is both pragmatic and in line with how the rest is written. ReneeWrites (talk) 15:50, 16 September 2024 (UTC)
- Sounds good. Adam Cuerden (talk) 17:42, 16 September 2024 (UTC)
- Made a slight adjustment to the wording. A lot of the file names on Commons made with Dall-E or Midjourney have that in their file name, which should also cover this base. ReneeWrites (talk) 08:42, 17 September 2024 (UTC)
- Could maybe move it a sentence later to keep the talk about organisms together. Adam Cuerden (talk) 09:43, 17 September 2024 (UTC)
- Good idea, I moved the sentence. ReneeWrites (talk) 11:17, 17 September 2024 (UTC)
- Okay. If no-one else has suggestions after a couple days, let's bring 1, 1b (with your text), and 2 together, ping everyone involved in the original discussion, and implement. Secondary ideas can be considered after that. Adam Cuerden (talk) 17:21, 17 September 2024 (UTC)
- Good idea, I moved the sentence. ReneeWrites (talk) 11:17, 17 September 2024 (UTC)
- Could maybe move it a sentence later to keep the talk about organisms together. Adam Cuerden (talk) 09:43, 17 September 2024 (UTC)
- Made a slight adjustment to the wording. A lot of the file names on Commons made with Dall-E or Midjourney have that in their file name, which should also cover this base. ReneeWrites (talk) 08:42, 17 September 2024 (UTC)
- Sounds good. Adam Cuerden (talk) 17:42, 16 September 2024 (UTC)
Upload wizards capacity
I note in this AI discussion that the upload wizard asks the question of is it ai generated, so its possible that appending to file names or adding a template to identify AI generated media could be relatively easy to do automatically at upload. with a high degreee of consistance Gnangarra 01:06, 17 September 2024 (UTC)
- This was mentioned a couple of times, but there is currently not enough support (or opposition) to reach a consensus on this. ReneeWrites (talk) 09:42, 17 September 2024 (UTC)
- I'd say that's probably something to bring up after the policy changes go through. Though I am surprised a template isn't already auto-added. Adam Cuerden (talk) 17:20, 17 September 2024 (UTC)
- {{PD-algorithm}} is already added to any upload where this box is ticked, in addition to the licence template specified by the uploader.
- It's worth remembering that some users tick this box in error, fairly regularly. Any additional effects of ticking it will require additional steps of cleanup in that minority of cases. Belbury (talk) 13:46, 19 September 2024 (UTC)
- I'd say that's probably something to bring up after the policy changes go through. Though I am surprised a template isn't already auto-added. Adam Cuerden (talk) 17:20, 17 September 2024 (UTC)
- Support both suggestions, specially the automated AI template placement which should already be there. Darwin Ahoy! 10:47, 18 September 2024 (UTC)
- Pinging @Sannita (WMF) Darwin Ahoy! 13:30, 19 September 2024 (UTC)
- @DarwIn I already relayed the idea of adding automatically a template, if I remember correctly something is already added. Anyway, if there is community consensus to add a(nother) specific template, I can relay this too and discuss it with the team. It's going to take some time anyway. Sannita (WMF) (talk) 13:47, 19 September 2024 (UTC)
- Yes, {{PD-algorithm}} is already automatically added when the checkbox in UploadWizard is selected. I tested today by uploading this image. the wub "?!" 15:12, 19 September 2024 (UTC)
- @DarwIn I already relayed the idea of adding automatically a template, if I remember correctly something is already added. Anyway, if there is community consensus to add a(nother) specific template, I can relay this too and discuss it with the team. It's going to take some time anyway. Sannita (WMF) (talk) 13:47, 19 September 2024 (UTC)
- Pinging @Sannita (WMF) Darwin Ahoy! 13:30, 19 September 2024 (UTC)
- Is there also a way to insert "AI generated" into the metadata? --RAN (talk) 17:08, 18 September 2024 (UTC)
- The template exists for a reason Trade (talk) 20:41, 20 September 2024 (UTC)
September 12
Help in closing CfD
Hello, it would be extremely helpful if some users could read the CfD and weigh in their opinion at Commons:Categories_for_discussion/2024/09/Category:Towers_in_Iran#Resolution. —Matrix(!) {user - talk? - uselesscontributions} 17:29, 12 September 2024 (UTC)
- Just close it guys asap, any way to do so is good, as per the comment there: "To avoid further edit warring and loss of brain cells... " . It is utterly inconsequential how these categories are called, see the arbitrariness categorization theory here: Fuzzy_set. These (potentially mis-)categorized pics are searched for in (and by) multidimensional vector spaces anyway: LLMs; via image recognition engines: Computer_vision, and that is for the starters... Zezen (talk) 05:54, 14 September 2024 (UTC)
- Just an FYI, but the CfD doesn't have anything to do with what the categories are called. It revolves around if main categories for towers should be child categories of ones for buildings by shape. Since at least IMO, "tower" isn't a shape. Regardless, it's not about the names of the categories. --Adamant1 (talk) 06:38, 14 September 2024 (UTC)
- Re: 'if main categories for towers should be child categories of ones for buildings by shape' - same problem (or lack of it): the debate over whether “towers” should be categorized under “Buildings by shape” could be seen as a matter of degree rather than a strict yes-or-no decision. This approach (see again fuzzy sets, recursive ontology entities...) acknowledges that categories can overlap and that elements can belong to multiple categories to varying extents. In practice, see: Thomas Aquinas who discussed smth similar in Summa Theologiae, Part II (On Angels) Chapter LII § 3: "Whether Many Angels Can Be in the Same Place at the Same Time [e.g. at the needle's pin]"...
- The only practical answer: "😔 possibly..." Zezen (talk) 13:03, 15 September 2024 (UTC)
- Just an FYI, but the CfD doesn't have anything to do with what the categories are called. It revolves around if main categories for towers should be child categories of ones for buildings by shape. Since at least IMO, "tower" isn't a shape. Regardless, it's not about the names of the categories. --Adamant1 (talk) 06:38, 14 September 2024 (UTC)
Created a derivative work, unsure how to tag it when uploading
Good evening. I've created a derivative of the work File:Balanced_justice_scale_silhouette,_small.svg, featuring the same image, but overlaid with a red X. The use of this work will be to create a "No legal threats" template, for use on the English Vikidia website. Could you please tell me, while uploading, how I should tag a derivative file, or whether that comes as part of the upload process? Thanks! DaneGeld (talk) 20:05, 12 September 2024 (UTC)
- You can add the {{Derived from}} template to link the original file and its author. And add the {{Derivative versions}} template to the original file to link back. For example: File:Cumulus clouds in fair weather.jpeg and File:Cloud.jpg. William Graham (talk) 20:37, 12 September 2024 (UTC)
- Thank you! I have already found the "Derived from" template, which is on the derivative I've created - File:Balanced_justice_scale_silhouette_(crossed_out),_small.png. The other one you linked is the one I have been looking for, so I appreciate your help in linking me to it. Regards, DaneGeld (talk) 20:52, 12 September 2024 (UTC)
- @DaneGeld: What happened to the chains for the baskets? Nosferattus (talk) 16:47, 15 September 2024 (UTC)
- @Nosferattus: Now that's odd. I see the chains when I look at the file on my PC, and when I view the file on commons. I presume what I have is a cached version which I should have purged. My guess is that I've not grouped the chains with the rest of the image before saving the file, and then exporting it. I'm not too well at the moment, but I'll correct the error tomorrow, and re-upload the file. Thanks for pointing it out! DaneGeld (talk) 19:13, 15 September 2024 (UTC)
- @Nosferattus: - This is getting weird now. I repaired the svg that I produced this from, grouped all the files correctly, and exported it as a png file. The new version is now available - but I have a problem with it, that I can't solve. The preview in the file history, now shows the chains in place, and if you click to view the file, they show up too. BUT - They don't show up when you just go to view the file, and I'm mystified as to what the hell I've done. I've never had this much trouble getting an svg to png export to work. Any thoughts? Thanks. DaneGeld (talk) 21:13, 16 September 2024 (UTC)
- @Nosferattus: Now that's odd. I see the chains when I look at the file on my PC, and when I view the file on commons. I presume what I have is a cached version which I should have purged. My guess is that I've not grouped the chains with the rest of the image before saving the file, and then exporting it. I'm not too well at the moment, but I'll correct the error tomorrow, and re-upload the file. Thanks for pointing it out! DaneGeld (talk) 19:13, 15 September 2024 (UTC)
- @DaneGeld: What happened to the chains for the baskets? Nosferattus (talk) 16:47, 15 September 2024 (UTC)
- Thank you! I have already found the "Derived from" template, which is on the derivative I've created - File:Balanced_justice_scale_silhouette_(crossed_out),_small.png. The other one you linked is the one I have been looking for, so I appreciate your help in linking me to it. Regards, DaneGeld (talk) 20:52, 12 September 2024 (UTC)
Super-CfDs
Hi everyone, we recently had an alert about the CfD on the issue of “Georgia” here in the village pump, and now I realized that “Historical images” is eerily similar. The result of this second CfD was essentially: "delete and upmerge to Category:History". There is sound reasoning to do so, but we're talking about the previous parent category to numberless sub categories (I cannot even give estimations on a probable number... tens of thousands?), which are currently all in a waiting list for getting deleted and upmerged, one by one. This campaign is lasting for months already, I have no clue what the percentage of completion is on that front. Various angry protests of various kinds against this action have not been organized well, and the prevailing counter-argument is: "the CfD lasted many years, we reached consensus, there were no counter-proposals".
My concerns on this issue were close to nil previously (again: there was sound reasoning), but now I've checked Commons:CFD for guidelines on CfDs that process multiple categories. The guidelines imply that the concerned categories should be mentioned in the CfD, but that was not done in either of these two cases. Only the top-most parent category was put up for debate, and I'm fairly certain now: The reason it took five years to gather a meager ten votes was not because there was no interest. Rather, there were only ten votes because close to nobody was aware that this debate was ongoing. Not until the bulldozers arrived at the front doors.
Sound reasoning and good intentions or not: Not notifying those who are going feel the impact of your decision, is the behavior of the Vogons from "Hitchhiker to the Galaxis".
The particular CfDs I linked initially should not be debated here. I'd rather like to focus on our CfD procedures. In fundamental cases like these two mentioned above - can we agree that all concerned child categories (i.e. the ones that are going to be affected by the CfD, because of similar naming conventions) should have the same CfD notification in their header? --Enyavar (talk) 23:31, 12 September 2024 (UTC)
- Commons:Categories for discussion/2024/09/Category:Towers in Iran, mentioned a couple of threads above this, is another CfD that would affect a hundred categories but has only been raised against one (and not even the top one). Belbury (talk) 07:52, 13 September 2024 (UTC)
- I wouldn't say Commons:Categories for discussion/2024/09/Category:Towers in Iran is the same as the one for "historical images" because there's a pretty clear guideline I was following. The CfD was only started because someone decided to edit war me and a couple of admins over it. Otherwise the CfD wouldn't have been necessary. I don't really think it was necessary to notify everyone who came within 100 miles of a "towers" category in the last 20 years in that case though. Since again, 99% of the issue that led the CfD was caused by a single user.
- Really, the same goes for the "historical images" CfD. Although I do think it should have at least been announced on the Village Pump after it was closed. It clearly wouldn't have been workable for the closing admin to notifying everyone who was going to be impacted by it though. Same goes for the nominator putting a CfD notification in the header of every category involved. It's ridiculous to suggest people should notify hundreds of people or put a notification in the header of potentially thousands of categories just for the outcome of a CfD to be valid. Neither one is really a fair, workable way to do things. There should have at least been a notification about the outcome of the CfD for "historical images" on the Village Pump once it was closed though, but that's more then enough IMO. --Adamant1 (talk) 08:11, 13 September 2024 (UTC)
- I think it would be better if a large-scale CfD was announced at the village pump right after it was opened rather than closed, so more people can read the discussion and participate. Once a discussion is closed you're not supposed to start it up again. ReneeWrites (talk) 14:20, 13 September 2024 (UTC)
- Either way. The main issue for me is that I just don't think it's fair to expect whomever opens the CfD to add the notification template to hundreds or thousands of sub-categories. Although it's debatable what makes something a "super-CfD" to begin with and the expectation should be that people normally find them through Commons:Categories for discussion. To the degree that it's not a good way to find important CfDs is more about the backlog then anything. It wouldn't be as much of an issue if there wasn't such a huge backlog though. ---Adamant1 (talk) 06:53, 14 September 2024 (UTC)
- Of course, we have so many CfDs, and what I called here "Super-CfDs" are exceptionally rare. Just because a CfD is about a top-most super-category (like here) does not mean that all sub-categories need to be notified. In case of Category:Past, only the super-category itself will be affected, no cascading effects. So no need to notify, of course.
- But: Once it becomes clear that a CfD will have impact on its structured subcategories (like deletion or renaming), all these subcategories should get a notification. These rare and special discussions are long-lasting anyway, so if there are hundreds or more of impacted subcategories that all have the same naming structure (like "Historical images of" or "in/of Georgia"), a bot-job should be tasked with inserting the CfD-note on all these subcategories. This should not be the job of the original CfD-opener (for example, in case that the CfD is uncontroversially rejected or doesn't change anything in the sub-categories). But once it becomes clear that a large impact will happen as a result, then a wide-spread notification is necessary, before continuing the CfD for another appropriately long time. And I would hope that our guidelines can be amended, to include this as a future rule. --Enyavar (talk) 12:52, 14 September 2024 (UTC)
- I guess you could do it with a bot. But it seems like derailing to put the notifications on the pages once it becomes clear that a CfD will have impact on its structured subcategories. There's already going to be clear consensus to delete or rename the categories at that point. So the only reason to find more people to participate in the CfD would be to change the outcome. Especially since people who created the categories to begin with, which realistically are the being notified about the CfD at that point, are going to vote to a certain way. How is that any different then canvasing? --Adamant1 (talk) 22:36, 14 September 2024 (UTC)
- Either way. The main issue for me is that I just don't think it's fair to expect whomever opens the CfD to add the notification template to hundreds or thousands of sub-categories. Although it's debatable what makes something a "super-CfD" to begin with and the expectation should be that people normally find them through Commons:Categories for discussion. To the degree that it's not a good way to find important CfDs is more about the backlog then anything. It wouldn't be as much of an issue if there wasn't such a huge backlog though. ---Adamant1 (talk) 06:53, 14 September 2024 (UTC)
- I think it would be better if a large-scale CfD was announced at the village pump right after it was opened rather than closed, so more people can read the discussion and participate. Once a discussion is closed you're not supposed to start it up again. ReneeWrites (talk) 14:20, 13 September 2024 (UTC)
September 13
Feedback research
Mandatory settings must be made regarding image classification before uploading them. Otherwise, the site will be flooded with images and it will be difficult to find images when searching. Mohmad Abdul sahib 02:05, 13 September 2024 (UTC)
- It's unclear what you're asking for. Files have filetitles and categories and both are used (alongside descriptions) to find and organize images. Prototyperspective (talk) 21:43, 13 September 2024 (UTC)
Is there a categories need to be improved tag?
There is an uncategorized tag, but what about a categories need to be improved tag? Like something I would add in the event that I see a random photo on a farm somewhere in Japan but all it is categorized as is just "Japan" Immanuelle ❤️💚💙 (please tag me) 04:37, 13 September 2024 (UTC)
- @Immanuelle: yes, {{Check categories}} and the corresponding Category:Media needing category review. MKFI (talk) 06:28, 13 September 2024 (UTC)
- You can dissiminate it it easily into:
- Category:Unidentified plants
- Category:Unidentified landscapes
- (there’s no Category:Unidentified farms, maybe it should)
- Category:Unidentified locations in Japan
- It’s a start whence other people can pick from. -- Tuválkin ✉ ✇ 18:28, 13 September 2024 (UTC)
- There's also Category:Files needing categories (see Category:Commons needing categorization). Tools to easily/quickly add the mentioned tag would be useful as well as scripts adding it to files. Many of un/badly categorized files are problematic or simply low-quality. Prototyperspective (talk) 21:47, 13 September 2024 (UTC)
- I was going to mention Category:Unidentified locations in Japan. I'm not a super big fan of the whole "unidentified" category thing myself but there doesn't seem to be anything better at this point. --Adamant1 (talk) 21:53, 13 September 2024 (UTC)
Crop Tool SNAFU
Crop tool is not working, again. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:24, 13 September 2024 (UTC)
- What's SNAFU? --PantheraLeo1359531 😺 (talk) 14:23, 13 September 2024 (UTC)
- Situation Normal: All Fucked Up. Means the Crop tool is broken but this is not unusual. ReneeWrites (talk) 14:52, 13 September 2024 (UTC)
- Ah thank you :) --PantheraLeo1359531 😺 (talk) 16:32, 13 September 2024 (UTC)
- Situation Normal: All Fucked Up. Means the Crop tool is broken but this is not unusual. ReneeWrites (talk) 14:52, 13 September 2024 (UTC)
- "! 502 The CropTool backend is currently having problems." is one message I have gotten all day. -- Ooligan (talk) 00:42, 14 September 2024 (UTC)
Community Wishlist: Let’s discuss how to improve template discovery and reuse
Hello everyone,
The new Community Wishlist now has a focus area named Template recall and discovery. This focus area contains popular wishes gathered from previous Wishlist editions:
- Wish #1: Quickly add infobox – an easier way for newer editors to find and insert common templates such as infoboxes.
- Wish #2: Quickly add favorite and related templates – a way to show recently used templates and favorited templates for frequent template use.
- Wish #3: Select templates by categories – a way to find templates by "type".
- Wish #4: Easy access templates – help users configure the Template dialog to show common and favorite templates.
We have shared on the focus area page how we are seeing this problem, and approaching it. We also have some design mockups to show you.
We are inviting you all to discuss, hopefully support (or let us know what to improve) about the focus area. You can leave your feedback on the talkpage of the focus area.
On behalf of Community Tech, –– STei (WMF) (talk) 16:06, 13 September 2024 (UTC)
- @STei (WMF): : Just check the thread
aboutabove this one. Here’s one for the Community Wishlist: the WMF should drop their nonsensical makework, for it breaks tools maintained by the community. -- Tuválkin ✉ ✇ 18:10, 13 September 2024 (UTC)
Sorry, I noticed the typo only now: I meant to refer to the thread above this one, the one about the (still broken) CropTool. -- Tuválkin ✉ ✇ 20:36, 17 September 2024 (UTC)- @Tuvalkin, these wishes ranked 5th and 11th in the Community Wishlist Survey 2023, #74 in 2021, and #85 in 2022, respectively, and have been mentioned in different discussions off the Wishlist. This is not a nonsensical choice in my opinion. Is it that you'd rather see a focus area for Commons?
- This is also not make-work. Community Tech needs to actually implement a focus area and learn as soon as possible the merits and demerits of this focus area concept, as well as take community feedback, before the next annual planning season which plans to take in more focus areas. Templates is a classic example of a focus area to use for our learning.
- Please find below a much more formal background on what Community Tech is trying to do around the Community Wishlist and giving the community a clear path to influencing Product and Tech annual planning:
- To enhance the effectiveness of Product and Technology work, the overall goal for the Community Wishlist in the 2024-2025 Annual Plan is to better align movement requests and the Wikimedia Foundation’s future plans. Beginning the end of Q1 2024/25, the new Wishlist wants to do as follows:
- The Moderator Tools team, which serves editors with extended rights should identify at least two focus areas from the Community Wishlist in Q1.
- 3-5 related wishes regarding templates should be bundled, addressed and deployed as improved template features starting Q1 as a proof of concept, guiding the Foundation to incorporate more focus areas into the 2025-26 Annual Plan.
- By 2025-26 annual planning cycle, Community Tech must present three focus areas for inclusion in the Wikimedia Foundation’s 2025-26 Annual Plan.
- Please note that item #3 hasn't began yet. We are still with #1 and #2.
- If you don't mind, please see the focus areas list, perhaps one of them might resonate with you or other groups of users you are familiar with. –– STei (WMF) (talk) 15:07, 17 September 2024 (UTC)
- I think that focus area is far less important than other focus areas that would save contributors much time and also less relevant to WMC than other focus areas or proposals. Maybe sharing a link for all proposals that are WMC-related would be useful (such as the one about WMC media dumps or a WMC copyvio-detecting bot). These template wishes all just seem like nice-to-haves without much benefit and which aren't solving a real problem but can and are implemented quite well by editors and bots sooner or later. Prototyperspective (talk) 21:52, 13 September 2024 (UTC)
- Prototyperspective, this is good advise that focus areas relevant to WMC will be more useful. I will do that. Nonetheless the shared/intersecting Wikimedian ways of life made me consider notifying people here too. –– STei (WMF) (talk) 15:17, 17 September 2024 (UTC)
- This stuff is probably pretty far down on the priority list, but their relatively easy things to implement in the grand of scheme things and help the dev team get use to working with Commons. At least they are doing something. It seems like you guys expect them to tackle the hardest things on here first. That's not how it works. And seriously, how many people actually wanted data dumps when you suggested it and what problem does it actually solve? Like someone can't create a python script to scrape the site and re-upload the files to archive.org or something. At least IMO the teams shouldn't be wasting their time on things that can be done extremely easily by normal users through scripts. --Adamant1 (talk) 15:28, 17 September 2024 (UTC)
- That was just one example and just one of several. Among the problems it solves is server-load from people scraping the site. These things can't be done extremely easily if at all and this is more about a complete than some random small subset. So far, I don't see any reasons for why it would be relatively easy to implement. I also don't see that reasoning anywhere in the focus area or in the post above. I also don't see any reason why some other suggestions wouldn't also be easy to implement but more useful, such as showing categories on mobile. In any case, dumps, which are also backups, aren't wasted time and can't be done easily so your comment doesn't make sense anyway. Not even considering that this isn't about Commons at all so not even get use[d] to working with Commons makes sense. Prototyperspective (talk) 15:41, 17 September 2024 (UTC)
- I'm sure they have some kind of metric to decide what can or can't be implemented and how easily they do something. Although I agree that their reasons for working on these specific things aren't clear, but so what? Their clearly at the beginning of the planning phase here. So things will probably become clearer as they work on it. It seems like your just bike shedding though. And I'm not sure what your talking about in regards to it being hard to do data dumps of files. There's plenty of software out there that consumes images from Commons and there's plenty web scrapers that support other websites. A good portion of both of which are open source. So I'm sure it would be easy to create one.
- That was just one example and just one of several. Among the problems it solves is server-load from people scraping the site. These things can't be done extremely easily if at all and this is more about a complete than some random small subset. So far, I don't see any reasons for why it would be relatively easy to implement. I also don't see that reasoning anywhere in the focus area or in the post above. I also don't see any reason why some other suggestions wouldn't also be easy to implement but more useful, such as showing categories on mobile. In any case, dumps, which are also backups, aren't wasted time and can't be done easily so your comment doesn't make sense anyway. Not even considering that this isn't about Commons at all so not even get use[d] to working with Commons makes sense. Prototyperspective (talk) 15:41, 17 September 2024 (UTC)
- I'm actually kind of tempted to do it myself but there's to many things working on already. You might look into what's already out there and ask the developers to add support for Commons though. Like Gallery-dl supports a ton of websites. Not Commons, but it also allows for adding costume URLs through text files. You might give that a try and let me know if it works. Then I'll give it a try myself if you can't figure it out when I have the time. Or conversely you could just continue complaining about it for no reason. Whichever. Personally, I prefer the proactive DIY approach since at the end of the day we probably don't want the WMF or their developers having control over most of this. That could just be me though. --Adamant1 (talk) 16:26, 17 September 2024 (UTC)
- Nowhere do I see them talk about this being easier than the other. You just imagined some random reason in your head. You're talking offtopic. I recently listed available data-scrapers for WMC and there's few and many of them probably don't work or are very impartial. I'm not complaining, you're making offtopic points, I'll look into that software. Dumps was just one random example that is like 5% of my comment but 100% what you're talking about. Prototyperspective (talk) 16:32, 17 September 2024 (UTC)
- You're talking offtopic. No, I think your the one doing that by making this data dumps when it has nothing to do with the original topic. All I'm doing is responding to your off-topic complaining. Otherwise be my guest and point out where STei (WMF) said anything about data dumps in their original message. --Adamant1 (talk) 16:40, 17 September 2024 (UTC)
- Here I was only mentioning it as an example with 2 words, here you were making this all about this particular example. Case closed. I won't continue this wall of mostly offtopic text. Prototyperspective (talk) 16:45, 17 September 2024 (UTC)
- I asked where STei (WMF) brought up data dumps in their original message. Not where you did when this had nothing to do with that. I'm sure you get the difference. Anyway, I made one simple comment about it in response to you bringing it up. To the degree it's off-topic or a wall of text is totally on you for making this about that and getting defense. Maybe keep the conversation on topic, put your ego in check, and try not turn it into a debate next time huh? Personally, I'm just happy they are finally giving the project some attention and working on things. Maybe that's just me though. --Adamant1 (talk) 16:53, 17 September 2024 (UTC)
- Here I was only mentioning it as an example with 2 words, here you were making this all about this particular example. Case closed. I won't continue this wall of mostly offtopic text. Prototyperspective (talk) 16:45, 17 September 2024 (UTC)
- You're talking offtopic. No, I think your the one doing that by making this data dumps when it has nothing to do with the original topic. All I'm doing is responding to your off-topic complaining. Otherwise be my guest and point out where STei (WMF) said anything about data dumps in their original message. --Adamant1 (talk) 16:40, 17 September 2024 (UTC)
- Nowhere do I see them talk about this being easier than the other. You just imagined some random reason in your head. You're talking offtopic. I recently listed available data-scrapers for WMC and there's few and many of them probably don't work or are very impartial. I'm not complaining, you're making offtopic points, I'll look into that software. Dumps was just one random example that is like 5% of my comment but 100% what you're talking about. Prototyperspective (talk) 16:32, 17 September 2024 (UTC)
- I'm actually kind of tempted to do it myself but there's to many things working on already. You might look into what's already out there and ask the developers to add support for Commons though. Like Gallery-dl supports a ton of websites. Not Commons, but it also allows for adding costume URLs through text files. You might give that a try and let me know if it works. Then I'll give it a try myself if you can't figure it out when I have the time. Or conversely you could just continue complaining about it for no reason. Whichever. Personally, I prefer the proactive DIY approach since at the end of the day we probably don't want the WMF or their developers having control over most of this. That could just be me though. --Adamant1 (talk) 16:26, 17 September 2024 (UTC)
W. A. Schulenburg of Copenhagen
Can anyone find out more about this photographer File:Sophus_Theodor_Pihl_(1840-1888)_by_W._A._Schulenburg.png to fill out their Wikidata page. --RAN (talk) 18:09, 13 September 2024 (UTC)
- Hi. Schulenburg is mentioned very briefly in Bjørn Ochsner's book of Danish photographers before 1920 (in Danish). The book knows his first names and a probable year of birth. Wikidata has been updated with those data. Cheers Rsteen (talk) 04:19, 16 September 2024 (UTC)
September 14
Fate of image that is broken, wrong, unused and duplicated simultaneously
The image in question was brought to the Graphic Lab when User:Swiãtopôłk discovered it displays as a white rectangle. After looking at it, I found that:
- The image uses some exotic SVG features, but is easy to fix
- Once fixed, it is almost identical (except empty space) to File:OceanGate_logo.svg
- This is not what the flag of OceanGate looks like. Sources: [6][7]
- Nothing obvious uses it.
I can think of some solutions:
- Leaving it alone
- Overwriting with actual flag, once available (creation requested and I'll probably do it)
- Overwriting with fixed version
- Fixing and renaming
- Fixing, renaming and reusing the name for the actual flag
- Speedily deleting as broken and usused
- Speedily deleting as a duplicate
- Speedily deleting and reusing the name
etc. What's the proper way to deal with it? Gabuxae (talk) 07:42, 14 September 2024 (UTC)
- If fixed it's a duplicate, one would use {{Duplicate}} and it would end up being deleted and redirected.
- However, as "flag" would be misleading, I'd just have it deleted.
∞∞ Enhancing999 (talk) 22:34, 14 September 2024 (UTC)
- The file had a classic Inkscape bug, so it should display properly with Inkscape. It would have displayed properly on Commons until April 2024.
- The file has been around for a year, so I would not just delete it; there should be a redirect.
- For me, I would not bother deleting the file. It represents the OceanGate logo, so any errors are minor. If you believe the file does not represent the flag, then add {{Fact disputed}} with the reason.
- So I'm mostly in the fix the file and then leave it alone camp. Glrx (talk) 15:52, 15 September 2024 (UTC)
Requesting some simple category renaming
I believe his should be easy for someone who uses AWB.
The subcats of Category:Interchange Exit Direction Signs in the United States by state all need "Signs" capitalized. "Sign" is part of the proper noun, as documented by the link at Category:Interchange Exit Direction Signs in the United States. - Jmabel ! talk 09:13, 14 September 2024 (UTC)
- @Jmabel It might be better of you can add it to User:CommonsDelinker/commands so the bot can do that -- DaxServer (talk) 10:17, 14 September 2024 (UTC)
- That would be another way, though if renames are done, Cat-a-lot makes the rest easy.
- What I'm trying to avoid is 43 non-trivial by-hand actions, and putting it in User:CommonsDelinker/commands editing each category name into the correct format to make the request. - Jmabel ! talk 20:49, 14 September 2024 (UTC)
- @Jmabel: Done with a bit of AWB and mostly copypasta in Notepad++, see here. — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 16:53, 15 September 2024 (UTC)
- @Jeff G.: Thanks! - Jmabel ! talk 15:57, 16 September 2024 (UTC)
- @Jmabel: Done with a bit of AWB and mostly copypasta in Notepad++, see here. — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 16:53, 15 September 2024 (UTC)
September 15
Own work selfie upload with a contested "no permission" tag
COM:MYWORK says: Usually, stating that it is a selfie will suffice if that is really the case, though in some cases you may be asked for additional evidence.
I have stated that there is no reason to ask for additional evidence because (1) it very much is really the case that the photograph is a selfie, (2) there is no reason to ask for additional evidence relative to any given case of a user uploading a selfie from a pseudonymous account, which is the default and expected scenario, (3) there is mild circumstantial evidence (yes, mild evidence, but there is no evidence to the contrary as far as I can tell) that the uploader specifically is the subject and the author (explained on the linked talk page).
So there is no reason that this should be the case when additional evidence is requested.
Under these circumstances, as the tag is controversial, I believe that it is time for a COM:DR discussion to finally determine if additional evidence is genuinely needed or not. When one user considers the permission to be missing and another the permission to be present (present insofar as the uploader who is the author added the CC BY-SA 4.0 license at the time of the upload), surely the way to resolve the disagreement is a deletion discussion. However, instead of substantively discussing, one file mover threatens that they will delete the file, which they technically can accomplish using a combination of actions that they are technically privileged to as a file mover, but in terms of the deletion policy, it would be improper for them to so.
I admit that I could be wrong about how this should be handled, and I could be missing some fact or circumstance, in which case I apologize. I tried talking on that talk page, and one contributor supported the view that this is a legitimate own work selfie, but there is no dialogue and the file is simply heading for deletion as if the talk page had never existed. Therefore, I ask for a community review of this situation, and my suggested resolution is to remove the tag and start a deletion discussion.—Alalch E. (talk) 01:27, 15 September 2024 (UTC)
- It looks like a selfie. The question for me would be if she is the same person who uploaded it. Probably not. That can easily be dealt with though by having her or them send permission to the Volunteer Response Team. Although even at that point there would still scope/PROMO issues. But they probably don't matter since she's a politician. The more important thing is verifying that her and uploader are the same person or that they have permission to upload the image if they aren't her. --Adamant1 (talk) 02:03, 15 September 2024 (UTC)
- There are no scope issues because the pic is perfectly functional as illustration for her Wikipedia articles, and is being used so in various languages without issue. About the same person question: I really don't see why "probably not". What's the reason for suspicion? Alalch E. (talk) 02:12, 15 September 2024 (UTC)
- I meant as far as it being uploaded purely for promotional purposes. Which I don't think images get a pass on if they are being used on other projects. Especially if the image is of the person who uploaded and their the one's who added it to their own Wikipedia article. As to why I don't think the uploader is the same person, different names and the fact that the selfies are available on other websites. It's immaterial though. They should still be required to send VRT permission for the image anyway. Especially since again, the image is already out there on other places. What's your evidence that it was just uploaded from another website by a random PR person for Sanija Ameti or something? --Adamant1 (talk) 05:14, 15 September 2024 (UTC)
- We don't really need to get into who has what degree of suspicion. Given that some suspicion exists, COM:VRT would be the right way to sort this out. - Jmabel ! talk 15:54, 15 September 2024 (UTC)
- If possible, as that would solve the issue. However, to delete a file that is in scope (which I assume this clearly is, be it uploaded for promotional reasons or not), there needs to be serious doubt about the copyright. There may be, but "some suspicion" is not enough for deletion, and thus not for requiring VRT. Was the image on the internet before it was uploaded? –LPfi (talk) 09:16, 20 September 2024 (UTC)
- We don't really need to get into who has what degree of suspicion. Given that some suspicion exists, COM:VRT would be the right way to sort this out. - Jmabel ! talk 15:54, 15 September 2024 (UTC)
- I meant as far as it being uploaded purely for promotional purposes. Which I don't think images get a pass on if they are being used on other projects. Especially if the image is of the person who uploaded and their the one's who added it to their own Wikipedia article. As to why I don't think the uploader is the same person, different names and the fact that the selfies are available on other websites. It's immaterial though. They should still be required to send VRT permission for the image anyway. Especially since again, the image is already out there on other places. What's your evidence that it was just uploaded from another website by a random PR person for Sanija Ameti or something? --Adamant1 (talk) 05:14, 15 September 2024 (UTC)
- There are no scope issues because the pic is perfectly functional as illustration for her Wikipedia articles, and is being used so in various languages without issue. About the same person question: I really don't see why "probably not". What's the reason for suspicion? Alalch E. (talk) 02:12, 15 September 2024 (UTC)
Request to delete previous version of file
Hi. I like to delete older version of a file but I can't manage to find the correct way to do it. Should I go to deletion request, contact administrator or there are a another dedicated page for it ? — Preceding unsigned comment added by Nicolas22g (talk • contribs)
- You can add a speedy deletion template to the file with the request to delete the respective file version. You can highlight the file version, because I had a case where the file in total was accidentally deleted :D -- PantheraLeo1359531 😺 (talk) 06:10, 16 September 2024 (UTC)
- Thanks ! Total deletion is indeed my fear. I'm putting a G7 on these and crossing my fingers that my comment will be taken into account successfully. Nicolas22g (talk) 12:16, 16 September 2024 (UTC)
- If it happens, you can request Undeletion with your reasoning --PantheraLeo1359531 😺 (talk) 18:22, 16 September 2024 (UTC)
September 16
Problems in Commons:Upload
Hi. As a user of Commons:Upload (with the basic upload form) I have observed problems in the upload process. These problems have ocurred within the last week, and they are intermittent, not continuous. First problem is that there is sometimes no "preview" button. So you can not see the result of your upload and catch errors. You just have to upload. Second problem is, that after the upload you can sometimes not add or edit categories. They are locked, so you have to edit the file information manually. As written, these problems are rather new and do not occur all the time. Have anybody else noticed this? Cheers Rsteen (talk) 04:38, 16 September 2024 (UTC)
- This sounds like JavaScript issues, what is your operating system + version and browser + version? Sjoerd de Bruin (talk) 08:31, 16 September 2024 (UTC)
- Thanks for going into this. System is Windows 11 and browser is Firefox 130.0 (64 bit). Cheers Rsteen (talk) 12:32, 16 September 2024 (UTC)
Category:Unidentified subjects in Japan is a mess
The category Category:Unidentified subjects in Japan is a mess. There are a ton of images that are only tangentiall associated with Japan such as this image File:Forster dexter energy transfer.jpg which is just a diagram in Japanese, this one File:Circuit box.png which isn't even in Japanese but just seems to have an incorrectly labelled Japanese description and was possibly uploaded from Japan. This one File:Diversity of Archaea.jpg which I guess is technically a photo collage but no evidence any of the micrographs were taken in Jaspan. And this one which has straight up no connection File:Eye of the Crimson King.png Immanuelle ❤️💚💙 (please tag me) 08:05, 16 September 2024 (UTC)
- Similar situation to many other countries. It is better there than somewhere in Category:Media needing categories requiring human attention. Sometimes I give an image without category from, say, Indonesia the Category:Unidentified subjects in Indonesia when it is not clear from the filename, the image itself or the (often poetic) description what the essence is. Bad pictures I just leave without a category. For example, someone who speaks Indonesian can tackle a subcategory to put 10% of the best photos in the right category. Wouter (talk) 09:06, 16 September 2024 (UTC)
- I tried to nominate Category:Unidentified logos for deletion a few years ago but it got shot down. At the end of the day most of the categories are just used as dumps because people don't want to spend the effort better categorizing the images. 99% of the time the images aren't even unidentified to begin with though. So at least IMO the categories are essentially worthless. Their just an extremely low effort way to empty main categories. --Adamant1 (talk) 11:14, 16 September 2024 (UTC)
Category WSC contributors
Hello, because we have Category:WLE contributors and Category:WLM contributors, can we consider the creation of Category:WSC contributors for the Wiki Science Competition? Una tantum (talk) 11:11, 16 September 2024 (UTC)
Cleaning up the Category:Intel Core mess
Hi!
As the first generation after the Intel Core i 14th gen is approaching, I would like to hear some opinions on how we bring structure in the irritating Intel Core nomenclature. We have the Intel Core Duo, the Intel Core 2, the Intel Core i series (represented by Intel Core 3; 5; 7 and 9), and the Intel Core Ultra series (a possible name may be "Intel Core Ultra 9 285K"), including Arrow Lake CPUs (maybe another group will be revealed in the near future?). As the naming scheme itself already causes much irritation, I would like to see how we can prevent this with proper categorizing. Thanks!
--PantheraLeo1359531 😺 (talk) 13:05, 16 September 2024 (UTC)
- My suggestion would not to categorize the images down to that granular of a level to begin with. Otherwise your bound to run into issues. Regardless though there's only superficial differences between CPU models to begin with. Let alone when your talking about images of them. So it seems kind of pointless to categorize the images this granularly. --Adamant1 (talk) 07:45, 18 September 2024 (UTC)
- I don't see this as pointless, as the CPUs differ in core numbers and other specifications that can be structurized. And what about the upcoming Core Ultra series? --PantheraLeo1359531 😺 (talk) 09:43, 19 September 2024 (UTC)
- And it gives an overview of what CPUs are missing --PantheraLeo1359531 😺 (talk) 09:44, 19 September 2024 (UTC)
- I don't see this as pointless, as the CPUs differ in core numbers and other specifications that can be structurized. And what about the upcoming Core Ultra series? --PantheraLeo1359531 😺 (talk) 09:43, 19 September 2024 (UTC)
- You could categorize them by socket or by microarchitecture. -- Zache (talk) 10:23, 19 September 2024 (UTC)
Issue with Wikidata Item
Hello. I recently created Category:Cage of Agony and matched it to Wikidata item Q129951709. However, the commons category is not pulling data from the Wikipedia page, but rather is reading "Wikidata item Q129951709 has no claims. Please add some." I am not sure how to resolve this and would be grateful for any assistance. McPhail (talk) 15:36, 16 September 2024 (UTC)
- @McPhail: It won't pull from the Wikipedia page, it pulls from the Wikidata page, which indeed has no claims.
- Also: are the underscores really part of the name? If not, then the label on the Wikidata page is wrong. - Jmabel ! talk 16:03, 16 September 2024 (UTC)
- Thank you for this - it looks like the underscores were the issue. McPhail (talk) 18:13, 16 September 2024 (UTC)
According to Exif data
https://commons.wikimedia.org/w/index.php?sort=create_timestamp_desc&search=deepcategory%3A%22Uploaded_with_Mobile%2FAndroid%22 it seems all uploads through the android app use this template by default.
is this use good? (as better than just using the plain timestamp like uploadwizard does?)
imo there're two problems: #1 only the date without the time is used in the template, #2 uses an extra template. RZuo (talk) 18:58, 16 September 2024 (UTC)
- It's better than just the plain timestamp because it adds a time related category like "Photographs taken on 2024-09-16" while the plain timestamp does not add any categories. Though, personally, I'd prefer if the default template would be the Template:Taken on as this further allows for categorization by location and then result in more specific categories like "United States photographs taken on 2024-09-16". Time related categories are helpful for historians. Nakonana (talk) 22:07, 16 September 2024 (UTC)
- I agree, {{Taken on}} is better. Also, I've always felt that {{According to Exif data}} implies that there's some uncertainty about the date, like we're saying "we don't know the date, but Exif says this" — otherwise, why don't we add it to existing
|date=
values? Maybe that's just me though! Sam Wilson 06:20, 17 September 2024 (UTC)- Missing time is a bit odd even though this may be generally less reliable (#1).
- https://add-information.toolforge.org/ adds "according to Exif data" too (with some problems). Personally, I leave that unless the uploader specified the data somewhere (using "taken on" then.
∞∞ Enhancing999 (talk) 09:41, 17 September 2024 (UTC) - {{Taken on}} can also handle timestamps whereas {{According to Exif data}} doesn't. ReneeWrites (talk) 23:05, 19 September 2024 (UTC)
- I agree, {{Taken on}} is better. Also, I've always felt that {{According to Exif data}} implies that there's some uncertainty about the date, like we're saying "we don't know the date, but Exif says this" — otherwise, why don't we add it to existing
September 17
Free online ocr that is the best?
What free online ocr is the best? I have been using https://www.onlineocr.net/ but there must be better. See the poor results here: File:Viola_Gentry_in_the_Times_Herald_of_Olean,_New_York_on_December_24,_1928,_part_1.jpg. Someone was able to get a better quality ocr from their iPhone here: File:Joseph Henderson (1826-1890) obituary in the New York Herald on October 12, 1890.png, but I work from a desktop and hope there is something better online. Any suggestions? — Preceding unsigned comment added by Richard Arthur Norton (1958- ) (talk • contribs) 16:21, 17 September 2024 (UTC)
- I did not try onlineocr.net (I saw the text on the file page), but here is the transcription using WMF tools.
- It transcribes the text in the article to the left, but I cannot fault that. It could be fixed by rotating the image and then selecting the transcription area.
- Glrx (talk) 17:19, 17 September 2024 (UTC)
- @Glrx: That is awesome, I will fix up the few errors by hand. I wasn't even aware of this tool. I just added it to my Chrome toolbar. Thank you. I like that it can operate on the url so I don't have to download anymore. Take a peek at the transcription and do a search on the word "Stillman" and see if you can decipher the intention. I want to migrate the transcription to Wikisource if we can decipher that word--RAN (talk) 23:43, 17 September 2024 (UTC)
- @Richard Arthur Norton (1958- ): "I was aviation crazy — and still am." Glrx (talk) 01:57, 18 September 2024 (UTC)
Redaction request by the Indian Supreme Court
The Supreme Court of India has demanded that the WMF remove references to the victim of the recent rape and murder in Kolkata. [8] There seems to be an emerging consensus on Wikipedia to remove the name, not necessarily because of the court case, but just for general sensitivity reasons. There are a bunch of files in "Category:2024 Kolkata rape and murder" that either state the name explicitly in the file name ([9]), or the name is visible in the image [10]. Not really sure what should be done about this. On English Wikipedia, the WMF asked the community to make its own decision on the matter. [11] Is there a consensus on file deletion/redaction as a response to non-copyright related court requests? Hemiauchenia (talk) 22:06, 17 September 2024 (UTC)
- @Hemiauchenia: If a court wants the WMF to do something, they can use the proper channels, and WMF Legal can act. This is very similar to how the WMF responds to a DMCA takedown request. — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 22:28, 17 September 2024 (UTC)
- It's kind of an odd thing so I might be wrong, but it seems like there's really only two options here. Either we redact the name or delete the images. I do wonder what stops people from just re-adding the name or uploading new files that contain it in the future though. Since the amount of media related to her is probably essentially endless, or will be once this is all said and done. Compare that to Wikipedia where there's only one or two articles that can easily be edited and locked.
- So I guess my point is that realistically nothing can ultimately be done about it on Commons. Otherwise it's just going to be an endless game of Whac-A-Mole. Which I don't think is a good use of anyone's time. Especially considering her name is already out there to begin with. Are we seriously going to waste the time and energy enforcing something like that when anyone can get her just by doing a basic Google Search? Screw that. I don't think we should allow the WMF to do it either. Since it isn't a copyright issue. They can't just censor things on Commons because of an ill conceived, unenforceable court order. --Adamant1 (talk) 01:29, 18 September 2024 (UTC)
- Yes. In addition, yielding to the Indian Supreme Court opens a Pandora box about pressure from from external organizations and people, for reasons other than copyright. Yann (talk) 19:10, 18 September 2024 (UTC)
September 18
Categorization of old maps, by year (another round)
I was again asked about what the prevailing consensus is, when categorizing old maps. I asked before in 2021/12 and 2024/03 but maybe I misunderstood the outcomes, so let me rephrase it in a much shorter way:
The general attitude I took home beforewas that old maps should be categorized (=grouped) by the depicted location. A bunch of old maps gets grouped into "Old maps of X", and only if there are enough old maps of X, the "Old" category gets split up. Depending on the contents, either by (sub-)location again, or by century/decade. In short: Old maps get categorized by the displayed location first and foremost, the creation date is less important.
Now here is the argument. - Tl;dr: Is ":Cat:1892 maps of <country>" really obligatory? |
---|
As far as I know, there is no reason why for example this Old map of Exeter should also be assigned the "Category:1884 maps of the United Kingdom", just so that it's 'properly' categorized by year also. In my opinion, that practice leads to overcat, especially since a map of Exeter is not really the same as a map of the UK.
The other day, I re-evaluated the contents of the category "1892 maps of Boston", which contained a mixture of maps that showed various locations located in the greater area of Boston: Maps of Larger Boston and surroundings, tram maps, aerial maps, detail crops, building plans, suburb maps and so on. I sorted this eclectic collection into different, more meaningful groups and locations: "1890s bird's eye maps of Boston", "Details of 1890s maps...", "1890s maps of Boston and environs" and so on. Five maps from 1892 did not fit into these groupings, and these I upmerged into Category:1890s maps of Boston. Preferably in my opinion, the maps of Boston should eventually get sorted further by suburbs and quarters: The location has priority when grouping files together. |
So, I'm interested in reading a fuller argument by Pi in favor of these by-year map categories; and/or learn about the community consensus: Are "Maps-by-year" an integral/obligatory part of the category tree, or are they ultimately expendable? --Enyavar (talk) 00:20, 18 September 2024 (UTC)
- Support Getting rid of "by year" categories for images of maps. I have some experience with maps and cartography more broadly. Generally it's nearly impossible to tell the specific year a map was created in. Most of the time what people mean is the publication date, but even then it's purely based on that specific publication of the map. Maps can, and often are, republished multiple times over decades though. Especially older ones. So a "by year" category will usually be wrong, if not also totally pointless because no one looks for maps that way. Usually they find maps based on the subject. Like "maps of the Southern Pacific Railroad." Not "1953 maps of whatever." I don't there's enough images in most cases to justify "by year" categories in most cases anyway and at least from what I've seen the community seems to be going in a different direction with things then categorizing every single image in a "by year" category for it's own sake. --Adamant1 (talk) 02:31, 18 September 2024 (UTC)
- Seconding this. Maps aren't photographs; they aren't created at a specific instant in time, nor do they represent the world as it existed (or was perceived) at a single time either. The general historical period in which a map was created is, of course, still important, but the exact year is not. Omphalographer (talk) 20:49, 18 September 2024 (UTC)
- It is rare when we need to need map categories for a place down to a single year; mostly useful when we have a large set of maps from a particular year, and even then the particular map set may be a more useful category than something about the year as such. - 06:46, 18 September 2024 (UTC)
- I agree that year categories for maps are often not extremely useful, but decade and century are. Without a point in time, even if approximate, a map is mostly useless. OTOH, what means "old"? A 2005 map is old? I support getting rid of vague terms such as "old maps" and "historical images" (thankfully gone, now, at last!!) which have very subjective limits, and replacing them with the century & decade categories. Only in rare occasions one doesn't know which century is a map from, and those very rare cases can happily stay in the main category. Darwin Ahoy! 10:31, 18 September 2024 (UTC)
- @DarwIn: I don't have time to look for something to cite, but the meaning of "old" for maps has come up before: more than 50 years old, with the proviso that there are a few places that have changed so much so fast that we need to special-case them. - Jmabel ! talk 19:58, 18 September 2024 (UTC)
- @Jmabel Even so, doesn't the categorization by century replace with merit the "old" categorization? Darwin Ahoy! 13:25, 19 September 2024 (UTC)
- @DarwIn: I don't have time to look for something to cite, but the meaning of "old" for maps has come up before: more than 50 years old, with the proviso that there are a few places that have changed so much so fast that we need to special-case them. - Jmabel ! talk 19:58, 18 September 2024 (UTC)
- Hi Darwin, Jmabel, Category:Old maps defines Old maps, check out
{{Old maps meaning}}
Thanks to this definition, if you have 15 old maps of a town, from 5 different centuries, you do not need to establish 5 different century-categories that each only hold 2-4 files. And even better: undated maps or those which were created "between 1650 and 1750" can actually get categorized. (That is not the topic here, but needed to be explained apparently) --Enyavar (talk) 18:59, 19 September 2024 (UTC)
- Hi Darwin, Jmabel, Category:Old maps defines Old maps, check out
- I objected to Enyavar removing files from the subcategories of Category:Maps of Massachusetts by year, and similar by-year categories for Boston, without any discussion to establish consensus. These categories were long-established and well-populated (typically 5-20 maps per year). Most of the maps being moved have their dates well-established either by a date on the map itself or by the contents. More significantly, Enyavar has not been properly upmerging categories, so basic information is being removed. For example, File:Boston Street Map, 1871.jpg (which has the date printed on the map itself) is currently not in any subcategories of Category:1871 in Boston nor Category:1871 maps - even though it is a map of Boston clearly dated 1871! @Adamant1: If someone is specifically looking for a map from that year (which is a very common use case for historical research), they would not be able to find that map through the most obvious category trees. Pi.1415926535 (talk) 20:02, 18 September 2024 (UTC)
- @Pi.1415926535: I'm not going to claim it applies to every file of a map on here but from what I've seen most of them have the year in the file name. So I don't really see how people can't find them by year if their in decade categories instead. Its tangential, but tend to act like categories are the only, or most important, way to find images when there's others that work perfectly fine depending on the situation. There's nothing wrong with people having to read files names to find information about a file sometimes. That's what they exist for. --Adamant1 (talk) 23:40, 18 September 2024 (UTC)
- Disagreeing with User:Adamant1 here. Navigating the category hierarchy should be one way to find things. It should be usable independent of search. Relevant categories should be used, even if the information is in the title or description. Similarly, the fact that something is in the title is not normally a reason to leave it out of the description. - Jmabel ! talk 05:15, 19 September 2024 (UTC)
- The fact that something is in the title is not normally a reason to leave it out of the description. Where did say that? I have zero issue with repeating facts in the description that are already in the file name. Since their inherently suppose to be descriptions. There's no reason to turn category names into a complete listing of everything related to a file though and I don't think it's necessarily important or useful to repeat facts that already in the file names. Otherwise it just gets to obtuse.
- Disagreeing with User:Adamant1 here. Navigating the category hierarchy should be one way to find things. It should be usable independent of search. Relevant categories should be used, even if the information is in the title or description. Similarly, the fact that something is in the title is not normally a reason to leave it out of the description. - Jmabel ! talk 05:15, 19 September 2024 (UTC)
- @Pi.1415926535: I'm not going to claim it applies to every file of a map on here but from what I've seen most of them have the year in the file name. So I don't really see how people can't find them by year if their in decade categories instead. Its tangential, but tend to act like categories are the only, or most important, way to find images when there's others that work perfectly fine depending on the situation. There's nothing wrong with people having to read files names to find information about a file sometimes. That's what they exist for. --Adamant1 (talk) 23:40, 18 September 2024 (UTC)
- Like realistically how many times does there need to be a specific word on a page before it just becomes needless duplication? There's file names, category names, descriptions, section headers, infoboxes, Etc. Etc. Seriously does anyone think that Category:Tom Cruise isn't a category for images of Tom Cruise? You (and by that I mean the developers) are just treating users like their stupid children who can't read and need their hands held at every turn. It's just garbage UX design regardless. --Adamant1 (talk) 05:25, 19 September 2024 (UTC)
- @Adamant1: I think you might have misinterpreted my comment. The main problem isn't finding maps in categories that they are in - it's that the maps aren't in the category trees whatsoever. For the example I gave - which is just one of hundreds of maps recently recategorized by Enyavar - it has been completely removed from two of the three most obvious category trees. A user looking through those category trees would not be able to find that map. Pi.1415926535 (talk) 17:33, 19 September 2024 (UTC)
- Any users who use the category system at all, should know enough about the workings of time and space, to check the surrounding categories and not give up immediately. There is no 1819 map of Luxembourg? Well, have you checked "19th-century maps of Luxembourg", or "1810s maps of Belgium"? If you search for anything, you usually don't have much to choose from anyway: there was never a "1762 maps of Boston"-category, but the "1760s maps of Boston" has a map of just a few years later. --Enyavar (talk) 18:59, 19 September 2024 (UTC)
- @Adamant1: I think you might have misinterpreted my comment. The main problem isn't finding maps in categories that they are in - it's that the maps aren't in the category trees whatsoever. For the example I gave - which is just one of hundreds of maps recently recategorized by Enyavar - it has been completely removed from two of the three most obvious category trees. A user looking through those category trees would not be able to find that map. Pi.1415926535 (talk) 17:33, 19 September 2024 (UTC)
- Like realistically how many times does there need to be a specific word on a page before it just becomes needless duplication? There's file names, category names, descriptions, section headers, infoboxes, Etc. Etc. Seriously does anyone think that Category:Tom Cruise isn't a category for images of Tom Cruise? You (and by that I mean the developers) are just treating users like their stupid children who can't read and need their hands held at every turn. It's just garbage UX design regardless. --Adamant1 (talk) 05:25, 19 September 2024 (UTC)
- Oppose I'm not sure if users misunderstood or will misunderstand this to be about year categories for maps which are e.g. useful to find uses of outdated datagraphic maps (like a world map of number of automobiles per capita per country where it matters whether it's year 2000 or 2020). Second reason and specifically re old maps: not sure if I understood the proposal correctly but the years are very useful there as well such as for context. People in the past didn't know or have maps of the whole Earth in the past and accuracies also vary by date. It's key contextual information and I don't see any reason to remove it which seems like an unprecedented removal of useful information. Prototyperspective (talk) 11:56, 19 September 2024 (UTC)
- Hi, this is only about old maps, not maps from the 21st century. Nor are we talking about any information getting removed from file descriptions, that would be horrible. This is about keeping decade-categories like "1890s-produced maps" and not picking apart the content artificially by single years. And anyway, maps should get categorized by location first, rather than by the production year. This one here is much better categorized as a "19th-century maps of Jefferson County, NY|1835 maps" (with the year in the sorting key, there), instead of "1835 map of the United States". --Enyavar (talk) 18:59, 19 September 2024 (UTC)
- So your proposal is that old maps should only be organized by decade and not by year? Maybe that makes sense and maybe something like see also links could be used to address the issue raised by Pi.1415926535 where the files are missing in category trees where people may look for them. In any case I think the proposal should be clearer. Also files should be in all main category trees where they belong into, not just one and then removed from the parent due to OVERCAT and missing in the other. Prototyperspective (talk) 19:47, 19 September 2024 (UTC)
- This was less of a proposal for some new rule, more the wish for clarification that this general categorization principle still applies (I provided links to the previous discussions). And also, this is not a hard rule either, there are several cases of categorizing maps where the year is indeed critical. But I think that Pi's concerns can be adressed if I add the respective "<year> in <location>" category to the respective files. Best, --Enyavar (talk) 04:41, 20 September 2024 (UTC)
- So your proposal is that old maps should only be organized by decade and not by year? Maybe that makes sense and maybe something like see also links could be used to address the issue raised by Pi.1415926535 where the files are missing in category trees where people may look for them. In any case I think the proposal should be clearer. Also files should be in all main category trees where they belong into, not just one and then removed from the parent due to OVERCAT and missing in the other. Prototyperspective (talk) 19:47, 19 September 2024 (UTC)
- Hi, this is only about old maps, not maps from the 21st century. Nor are we talking about any information getting removed from file descriptions, that would be horrible. This is about keeping decade-categories like "1890s-produced maps" and not picking apart the content artificially by single years. And anyway, maps should get categorized by location first, rather than by the production year. This one here is much better categorized as a "19th-century maps of Jefferson County, NY|1835 maps" (with the year in the sorting key, there), instead of "1835 map of the United States". --Enyavar (talk) 18:59, 19 September 2024 (UTC)
- Strong oppose Maps by year is useful as I have uploaded some historical maps that clearly show the situation as it was in a particular year. By Decade just fills up categories where By Year is more manageable. Abzeronow (talk) 17:12, 19 September 2024 (UTC)
- Hi, which historical maps are you referring to here? Please check your own upload File:Iowa, Primary Road System, 1928.jpg, which you sorted into "Category:1920s maps of Iowa" instead of creating a new "1928 maps of Iowa" for the single file to rattle around in. Furthermore, if you take the whole current content from that decade in Iowa, you find a total of two 1920s maps of Iowa, and a bunch of 1920s detail maps of singular counties and building plans, which actually should be categorized into "... maps of Winnebago County, Iowa" etc., instead of cluttering the 1920s maps of Iowa. --Enyavar (talk) 18:59, 19 September 2024 (UTC)
- I disagree somewhat. A map with a specific year, is usually made for a specific incident, usu, a battle. Therefore it's already filed in the battle category, which comes with a year. Most developed countries old maps, were decades in the making, and names don't change often. Filing by decade is almost superfluous for Europe. Hoewver countries mapped for the first time by explorers and invaders, are the exception to that. India for example is a nightmare of name changing, still going on today.
- It's against the rules to file in multiple cats, but perhaps maps should be an exception. They were expensive, and would have been used over decades by individuals (authors) in India, where names and spelling were in perpetual flux. Filing by any date is an arbitrary process here, if I want to research South Carolina in 1838, I would look at an 1866 map first, and then go backwards. Filing by year is by publication date. I'd be happy by decade as a compromise. Broichmore (talk) 12:04, 20 September 2024 (UTC)
- Hi, which historical maps are you referring to here? Please check your own upload File:Iowa, Primary Road System, 1928.jpg, which you sorted into "Category:1920s maps of Iowa" instead of creating a new "1928 maps of Iowa" for the single file to rattle around in. Furthermore, if you take the whole current content from that decade in Iowa, you find a total of two 1920s maps of Iowa, and a bunch of 1920s detail maps of singular counties and building plans, which actually should be categorized into "... maps of Winnebago County, Iowa" etc., instead of cluttering the 1920s maps of Iowa. --Enyavar (talk) 18:59, 19 September 2024 (UTC)
Photo challenge July results
Rank | 1 | 2 | 3 |
---|---|---|---|
image | |||
Title | African bush squirrel eating bread gotten from tourists in Namibia |
A young man putting discs of bread into the oven in the Islamic Cairo section of Cairo, Egypt. |
Bread being kneaded in a bakery in Aalen, Baden-Württemberg, Germany |
Author | Lusi Lindwurm | FischerFotos | Mozzihh |
Score | 24 | 12 | 9 |
Rank | 1 | 2 | 3 |
---|---|---|---|
image | |||
Title | A lion is coming from a manhole, in León, Spain |
Manhole opened for cleaning, Heidelberg |
Child coming out of a sump |
Author | Guy Delsaut | Foeniz | Saral Shots |
Score | 15 | 13 | 9 |
Congratulations to Lusi Lindwurm, FischerFotos, Mozzihh, Guy Delsaut, Foeniz and Saral Shots. -- Jarekt (talk) 03:49, 18 September 2024 (UTC)
- I find the challenge on bread interesting. However, I think it would make sense if the photo challenge focused more on areas where photos are missing rather than making this some Art photography contest.
- I thought this site was not mainly for amateur art but for useful/educational images (for example images that can illustrate things on Wikipedia) with there being separate Art-sharing sites for the former. Few people participate in selecting the themes, maybe more people could and also consider where the gaps in media on WMC actually are. See Commons talk:Photo challenge/themes.
Prototyperspective (talk) 12:01, 19 September 2024 (UTC)
New round of proposed changes to UploadWizard: we are looking for your feedback!
Hi all! The Structured Content team will continue, in the following months, to improve the current user experience with UploadWizard. For this reason, we published on our project page the mockups of our proposed changes to the "Release rights" step.
In short, starting from your feedback received with our first round of improvements carried on in Fiscal Year 2023-2024, we suggest making some more changes to the step in which users select if their media is an “own work” or “not own work”. This includes also changes requested by you regarding adding custom public domain tags or license options, as well as a space to clarify AI prompts for AI-generated media.
We are looking for your feedback on our project's talk page. What do you think of our proposed changes?
Thanks in advance! --Sannita (WMF) (talk) 13:58, 18 September 2024 (UTC)
September 19
MM/DD/YYYY vs DD/MM/YYYY recognition
There is an example in https://commons.wikimedia.org/wiki/File:Zoo_Heidelberg,_Sumatra-Tiger.jpg where the "Created" tag in the page says it was created on 7 December 2024 (that means: in the future) and clicking on the "More details" we find that the Exif data are interpreted as 12 July 2024 (which is nice/ok) and that the Date is 12.07.2024 (which is the most common European way of writing DD.MM.YYYY) So there is some sort of ambiguity or bug that toggles days and months in between these formats. --13:42, 19 September 2024 R. J. Mathar
- while it's definitely a bug of the mediaviewer, pattypan should also be modified so that users dont input dates in strange formats instead of iso yyyy-mm-dd.--RZuo (talk) 13:53, 19 September 2024 (UTC)
Proposal: de-prioritise AI images in search
Related to the thread above (#Proposal: AI generated images must be clear they're AI in the file name). One issue with AI images is that they can appear high up in searches and crowd out original images. You can see this by searching for Javan rhinoceros or Wikimedia Commons logo.
However it should be possible to de-prioritise such images by adding Template:PD-algorithm (which all AI generated images are supposed to be tagged with) to MediaWiki:Cirrussearch-boost-templates. I would suggest a ranking of 50%, which would rank AI files below most other files, but above those nominated for deletion or otherwise tagged as low quality/superseded.
(Noting I have also submitted a related feature request at the Community Wishlist to allow filtering such images on Special:MediaSearch) the wub "?!" 14:35, 19 September 2024 (UTC)
- Strong oppose Images made using some AI tools should either not be deboosted or only files in Category:AI misgeneration should be sorted to be further down in general: images shouldn't be sorted by how or by which techniques and tools they were produced but by where the user is expecting to find them / looking for them or generally the relevance and quality of the image as it relates to the category concept, not the method/techniques used to produce it. Additionally, this functionality only incentivizes people to hide the fact they used AI to make an image which we should encourage people to specify. There are many cases where AI images are some of the best for the concept searched for and they should be shown fairly high up in such cases, especially when they're in use. The example search results look like that because nearly all images for a useful Wikibook used rhino images. They indeed should not show high up there (for this search term) but that doesn't mean all AI images should be downranked in general. Some other improvements to the search results should be made that are not specific to AI...for example only showing used images high up if they are used on Wimedia items (like Wikipedia articles) that are actually related to the search term. I will attach a screenshot that shows how bad the search results often are, this is a general issue and lots of other media crowd out useful media. Some other issues include that it often shows very outdated charts at the top when there are more up-to-date charts. This is just some discrimination of the use of a novel tool to produce often high-quality useful images but not a reasonable effective measure to actually improve the search results. Prototyperspective (talk) 15:06, 19 September 2024 (UTC) Added screenshot --Prototyperspective (talk) 15:27, 19 September 2024 (UTC)
- " or only files in Category:AI misgeneration should be sorted to be further down in general" I added that a whole month ago already Trade (talk) 18:25, 20 September 2024 (UTC)
- Support -- this is a huge problem currently with Google and other image searches and we should not replicate it. Gnomingstuff (talk) 22:01, 19 September 2024 (UTC)
- IMO we should just ban all this AI stuff from the Commons with the exceptions in cases in which AI itself is the topicof the image. Matthiasb (talk) 22:54, 19 September 2024 (UTC)
- Instead we could have a filter that excludes AI images. Moreover, these aren't a problem here the example given is one of very very few cases where the results are indeed cluttered by AI images. Third, again the tools and techniques used for producing an image is not the important part...rather whether it's high-quality, useful and relevant to the search term. I don't see this problem on Image search engines but I also don't know what (and how) you searched for. What's next – downranking images made or edited with Photoshop or Krita? Prototyperspective (talk) 11:00, 20 September 2024 (UTC)
- Neutral I'm not sure this is the answer to dealing with the problems caused by AI images. I think another proposal for putting "AI generated" in file names of AI generated artwork would be a better solution. Since it's not so much where the images show up in search results, but that people often don't know the images are AI generated to begin with. People can still accidently get an AI generated image if they are pushed down in the search results. What we need is better ways to clearly identify and deal with AI generated artwork to begin with. I feel like it's mostly a curation issue at this point though. --Adamant1 (talk) 00:32, 20 September 2024 (UTC)
- Support but it may not be sufficient. Yann (talk) 09:55, 20 September 2024 (UTC)
- Why discriminate against the use of one particular tool? I don't see any rationale there only that you apparently want to continue to subjectively semi-censor images made with a tool you apparently don't like. Prototyperspective (talk) 11:02, 20 September 2024 (UTC)
- AI generated artwork isn't a tool, its a specific type of art. Stop trying to act like we're just discussing Microsoft Paint every time this comes up. Its getting rather tendentious. --Adamant1 (talk) 11:08, 20 September 2024 (UTC)
- It's the AI tool. One can use the result and edit it in Krita/Photoshop or create a draft and use an AI tool to improve it via img2img. AI-generated art is a type of art that makes use of these tools, similar to photography by definition makes use of cameras or paintings characteristically make use of paintbrushes. The level of involvement of the human is often far lower than in manual paintings. I doubt people voting here have much experience with prompting around for a few hours to get an image to look nearly exactly like you want it to or seen cases where AI images are the most illustrative. Stop trying to act like it's not discrimination against one particular novel tool (or art-technique/artform). Prototyperspective (talk) 11:18, 20 September 2024 (UTC)
- Don't get me wrong, I'm not acting like it isn't singling out a specific type of art. It has to be due to the nature of the thing. "Artwork" isn't the problem here. AI generated artwork is. You know as well as do that no one is uploading personal drawings of cells made in Photoshop and trying to pass them off as actual images. Let alone that anyone will be fooled by said images like they clearly are for AI generated artwork. Your just being disingenuous and choosing to ignore the issue instead of admiting that AI artwork has its own problems that inherently don't exist with other artforms. --Adamant1 (talk) 11:31, 20 September 2024 (UTC)
- It's the AI tool. One can use the result and edit it in Krita/Photoshop or create a draft and use an AI tool to improve it via img2img. AI-generated art is a type of art that makes use of these tools, similar to photography by definition makes use of cameras or paintings characteristically make use of paintbrushes. The level of involvement of the human is often far lower than in manual paintings. I doubt people voting here have much experience with prompting around for a few hours to get an image to look nearly exactly like you want it to or seen cases where AI images are the most illustrative. Stop trying to act like it's not discrimination against one particular novel tool (or art-technique/artform). Prototyperspective (talk) 11:18, 20 September 2024 (UTC)
- AI generated artwork isn't a tool, its a specific type of art. Stop trying to act like we're just discussing Microsoft Paint every time this comes up. Its getting rather tendentious. --Adamant1 (talk) 11:08, 20 September 2024 (UTC)
- Why discriminate against the use of one particular tool? I don't see any rationale there only that you apparently want to continue to subjectively semi-censor images made with a tool you apparently don't like. Prototyperspective (talk) 11:02, 20 September 2024 (UTC)
- "(which all AI generated images are supposed to be tagged with)" Says whom? Trade (talk) 18:24, 20 September 2024 (UTC)
- Oppose - This should be handled with a filtering option, not arbitrary de-prioritization. The search should be as content-neutral as possible. Nosferattus (talk) 23:20, 20 September 2024 (UTC)
This is a new page to request categorizations of files. Some people have some time but may not know many or more things to do while some are drowning in potential tasks or aren't using sophisticated/time-efficient ways (like cat-a-lot + SpecialSearch) to implement them or would implement them in suboptimal ways (like not creating subcategories when these would be due). It could be quite constructive to bring these together there so people looking for things to do can (also) go there and find a task they're interested in by which they can contribute to WMC.
I added several of my categorization-todos as examples. The page also includes a section for categories missing many items. A user going to such a category may think this is all there is on WMC for that subject when only the category has not yet been populated properly or is missing many files. --Prototyperspective (talk) 16:40, 19 September 2024 (UTC)
Wikimedia Commons group on Flickr
(cross posted from Commons talk:Flickr files)
I'm an admin of the Wikimedia Commons Group on Flickr, a group that has, up until now, had no moderation for numerous years. Given my uptick in activity on Flickr (resulting from my uptick on Commons) I'm hoping to change that.
Photographs in the group out to be able to be readily able to be ported to Commons, thereby having a free license. People who sign up for the group agree to that rule. However, currently numerous people are just sharing photographs that they share on numerous other groups for exposure.
I do not have the bandwidth to moderate completely on my own. Is anyone else currently a regular Flickr user that would be willing and able to help out? —Bastique ☎ appelez-moi! 20:02, 19 September 2024 (UTC)
- I'm a pretty regular user of Flickr and am pretty active on Commons as well. So I'd probably be willing to help out. What exactly does it entail doing though? --Adamant1 (talk) 00:25, 20 September 2024 (UTC)
- Basically scan the incoming pictures to see if they're using a legitimate license. Remove them if not. Not sure if I want to include a message on their image suggesting they change the license to a free one, many of these people are just submitting everything to groups. Bastique ☎ appelez-moi! 00:35, 20 September 2024 (UTC)
- Cool. I could totally do that. I'm pretty knowledgeable about the different licenses out there and what's acceptable or not. You want me to email you my screen name on Flickr or something? --Adamant1 (talk) 00:48, 20 September 2024 (UTC)
- Yes, and join this group! Wikimedia Commons | Flickr. Bastique ☎ appelez-moi! 01:03, 20 September 2024 (UTC)
- Cool. I joined the group and sent you an email with my user name. --Adamant1 (talk) 02:19, 20 September 2024 (UTC)
- Yes, and join this group! Wikimedia Commons | Flickr. Bastique ☎ appelez-moi! 01:03, 20 September 2024 (UTC)
- Cool. I could totally do that. I'm pretty knowledgeable about the different licenses out there and what's acceptable or not. You want me to email you my screen name on Flickr or something? --Adamant1 (talk) 00:48, 20 September 2024 (UTC)
- Basically scan the incoming pictures to see if they're using a legitimate license. Remove them if not. Not sure if I want to include a message on their image suggesting they change the license to a free one, many of these people are just submitting everything to groups. Bastique ☎ appelez-moi! 00:35, 20 September 2024 (UTC)
September 20
Your wiki will be in read-only soon
Read this message in another language • Please help translate to your language
The Wikimedia Foundation will switch the traffic between its data centers. This will make sure that Wikipedia and the other Wikimedia wikis can stay online even after a disaster.
All traffic will switch on 25 September. The switch will start at 15:00 UTC.
Unfortunately, because of some limitations in MediaWiki, all editing must stop while the switch is made. We apologize for this disruption, and we are working to minimize it in the future.
A banner will be displayed on all wikis 30 minutes before this operation happens. This banner will remain visible until the end of the operation.
You will be able to read, but not edit, all wikis for a short period of time.
- You will not be able to edit for up to an hour on Wednesday 25 September 2024.
- If you try to edit or save during these times, you will see an error message. We hope that no edits will be lost during these minutes, but we can't guarantee it. If you see the error message, then please wait until everything is back to normal. Then you should be able to save your edit. But, we recommend that you make a copy of your changes first, just in case.
Other effects:
- Background jobs will be slower and some may be dropped. Red links might not be updated as quickly as normal. If you create an article that is already linked somewhere else, the link will stay red longer than usual. Some long-running scripts will have to be stopped.
- We expect the code deployments to happen as any other week. However, some case-by-case code freezes could punctually happen if the operation require them afterwards.
- GitLab will be unavailable for about 90 minutes.
This project may be postponed if necessary. You can read the schedule at wikitech.wikimedia.org. Any changes will be announced in the schedule.
Please share this information with your community.Trizek_(WMF), 09:36, 20 September 2024 (UTC)
File (apparently) broken
I noticed that in Joint European standard for size labelling of clothes the File:EN-13402-pictogram.png is not shown. But the thumbnail system for it seems working. ZandDev (talk) 16:44, 20 September 2024 (UTC)
I somehow doubt that the Taliban asked Commons user Falerístico to design their coat of arms for them. Any idea if there is a more suitable license we can use?--Trade (talk) 22:13, 20 September 2024 (UTC)
- @Trade: The uploader was indeffed later that year to "prevent project disruption". However, nothing I found with Google Lens or Tineye predates our 2011 upload. — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 22:43, 20 September 2024 (UTC)
- Clearly he had to source the coat of arms from somewhere Trade (talk) 22:55, 20 September 2024 (UTC)
- @Trade: My guess would be an official source that eschews free culture, has no copyright relations with the US, or both. — 🇺🇦Jeff G. ツ please ping or talk to me🇺🇦 23:05, 20 September 2024 (UTC)
- Clearly he had to source the coat of arms from somewhere Trade (talk) 22:55, 20 September 2024 (UTC)